As a simpler alternative to creating rules, you can give WebCopy a list of content types that want to download, and it will scan the website, downloading the allowed types and ignoring everything else.

Including all content types

To reset WebCopy to the default behaviour and include all resources regards of type

  1. From the Project Properties dialog, select the Content Types category
  2. In the Content Types group, select Include all

Including only the selected content types

Important

This functionality does not work correctly in WebCopy 1.8 and lower if text/html or text/css is excluded. Please update to version 1.9 or higher.

To automatically download only a given set of content types and ignore all others

  1. From the Project Properties dialog, select the Content Types category
  2. In the Content Types group, select Include only resources with the content types listed below
  3. In the Types to include field, enter each content type you wish to include, one per line

Tip

Click Select Types to display a dialogue box for selecting content types either from those detected in the site to be copied, or from a global database

Including everything except selected content types

To automatically download all documents except those matching specific content types

  1. From the Project Properties dialog, select the Content Types category
  2. In the Content Types group, select Include all resources except for the content types listed below
  3. In the Types to exclude field, enter each content type you wish to exclude, one per line

Tip

Click Select Types to display a dialogue box for selecting content types either from those detected in the site to be copied, or from a global database

See Also

Configuring the Crawler

Working with local files

Controlling the crawl

JavaScript

Security

Modifying URLs

Creating a site map

Advanced

Deprecated features