Crawling outside the base URL

If set the crawl address for a project to be a nested URL, by default WebCopy will not crawl outside of this base path.In some circumstances it may be desirable to allow this without changing the root URL to be at a higher level.

To enable or disable crawling outside the base URL

From the Project Properties dialogue, select the Advanced category
Check or uncheck the Crawl above the root URL option

Examples with outer URL crawling disabled

The following example table demonstrates which URLs would be copied when the Crawl above the root URL setting is disabled (default), assuming a base URL of /features/.

Address	Skip
`/auth/`	Yes
`/elements/`	Yes
`/features/`	No
`/features/sub_feature`	No
`/resources/`	Yes

Examples with outer URL crawling enabled

The following example table demonstrates which URLs would be copied when the Crawl above the root URL setting enabled, assuming a base URL of /features/.

Address	Skip
`/auth/`	No
`/elements/`	No
`/features/`	No
`/features/sub_feature`	No
`/resources/`	No

Cyotek WebCopy Help

Crawling outside the base URL

To enable or disable crawling outside the base URL

Examples with outer URL crawling disabled

Examples with outer URL crawling enabled

See Also

Configuring the Crawler

Working with local files

Controlling the crawl

JavaScript

Security

Modifying URLs

Creating a site map

Advanced

Deprecated features