Important
This functionality is currently under review and may be removed in a future version of Sitemap Creator. If you currently use this feature, we would be grateful if you could email [email protected] and explain your use case for the feature.
For some sites, you may have a link to a folder, and others might link to the default document of that folders. Sitemap Creator would class these as two separate entries and generate additional elements accordingly.
If you define default documents, Sitemap Creator will try and link page-less URLs to URLs containing the default document name.
To configure default documents
- From the Project Properties dialogue, expected the Deprecated category and select the Default Documents option.
- Enter the default document in the Default documents field.
You can specify multiple default documents by entering each document on a new line
See Also
Configuring the Crawler
Working with local files
- Extracting inline data
- Remapping extensions
- Remapping local files
- Updating local time stamps
- Using query string parameters in local filenames
Controlling the crawl
- Content types
- Crawling above the root URL
- Crawling additional root URLs
- Including additional domains
- Including sub and sibling domains
- Limiting downloads by file count
- Limiting downloads by size
- Limiting scans by depth
- Limiting scans by distance
- Scanning data attributes
- Setting speed limits
- Working with Rules
JavaScript
Security
- Crawling private areas
- Manually logging into a website
- TLS/SSL certificate options
- Working with Forms
- Working with Passwords
Modifying URLs
Advanced
- Aborting the crawl using HTTP status codes
- Cookies
- Defining custom headers
- Following redirects
- HEAD vs GET for preliminary requests
- HTTP Compression
- Modifying page titles
- Origin reports
- Overwriting read only files
- Saving link data in a Crawler Project
- Setting the web page language
- Specifying a User Agent
- Specifying accepted content types
- Using Keep-Alive