WebCopy has a fixed set of rules that govern how it crawls a document, such as an HTML page or style sheet, for additional resources. With the rise of responsive websites, these default rules might not always be sufficient; for example, custom attributes may be applied to the img tag in order to support retina images. WebCopy now allows you to specify additional attributes to scan, using either simple names or more complex XPath expressions.
To scan custom attributes for links to other resources
- From the Project Properties dialog, click the Custom Attributes category
- In the edit field, enter each additional custom attribute you wish to scan
- Click OK to save your changes and close the options dialog
For more advanced scenarios, you can use XPath expressions. For example, if a document contained a p tag and an img tag, each with a custom attribute named data-original, you can scan only those on the img tag by using the expression //img/@data-original.
Caution: Enter only one attribute name per line.
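
The difference between a simple attribute name and the expression //img/@data-original can be illustrated outside WebCopy with a short Python sketch. This is not how WebCopy itself is implemented; the sample document, the file paths in it, and the attribute_values helper are all hypothetical. It also assumes that a simple name matches the attribute on any tag. Python's standard xml.etree.ElementTree module supports only a subset of XPath and cannot select attribute nodes directly, so the sketch matches the elements that carry the attribute and then reads the value from each one.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample: both tags carry a data-original attribute.
SAMPLE = """
<html>
  <body>
    <p data-original="notes/paragraph.txt">Some text</p>
    <img src="small.png" data-original="images/large@2x.png" />
  </body>
</html>
""".strip()


def attribute_values(markup, tag, attribute):
    """Return the values of `attribute` found on `tag` elements.

    Passing tag=None considers every element, which approximates
    scanning by a simple attribute name (an assumption of this sketch).
    """
    root = ET.fromstring(markup)
    # ElementTree cannot evaluate //tag/@attribute, so select the
    # elements that have the attribute and read its value from each.
    path = f".//{tag or '*'}[@{attribute}]"
    return [element.get(attribute) for element in root.findall(path)]


# Simple name: the attribute is picked up on both the p and img tags.
all_values = attribute_values(SAMPLE, None, "data-original")

# Approximation of //img/@data-original: only img tags are considered.
img_values = attribute_values(SAMPLE, "img", "data-original")

print(all_values)
print(img_values)
```

In this sketch the simple name returns the values from both tags, while the img-restricted query returns only the image path, which mirrors the narrowing effect that the XPath expression is described as having.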
See Also
Customizing Projects
- Specifying the web site
- Specifying the save folder
- Remapping extensions
- Specifying how a website is crawled
- Specifying default documents
- Crawling additional root URL's
- Updating local time stamps
- Fixing sites using mixed prefixes
- Extracting inline data