WebCopy has a fixed set of rules which govern how it crawls a document such as a HTML page or style sheet for additional resources. With the rise of responsive websites, these default rules might not always be sufficient - for example custom data attributes may be applied to the img tag in order to support retina images. WebCopy allows you to specify additional attributes to scan, either by using simple names or more complex XPath expressions.

  1. From the Project Properties dialogue, expand the Advanced category and select Custom Attributes
  2. In the edit field, enter each additional custom attribute you wish to scan

For more advanced scenarios, you can use XPath expressions. For example, if a document contained a p tag and an img tag, each with a custom attribute named data-original, you can scan only those on the img tag by using the expression //img/@data-original.

Enter only one attribute name per line

WebCopy does not currently support custom attributes where multiple URLs are contained in a single attribute, or the attribute value includes additional content around the URL.

See Also

Configuring the Crawler

Working with local files

Controlling the crawl

JavaScript

Security

Modifying URLs

Creating a site map

Advanced

Deprecated features

© 2010-2024 Cyotek Ltd. All Rights Reserved.
Documentation version 1.10 (buildref #186.15944), last modified 2024-08-18. Generated 2024-08-18 08:00 using Cyotek HelpWrite Professional version 6.20.0