WebScraper uses the Integrity v8 engine to quickly scan a website, and can currently output the extracted data as CSV or JSON. It can also download images to a folder.
- Easy to scan a site - just enter the starting URL and press "Go"
- Easy to export - choose the columns you want
- Plenty of extraction options, including HTML elements with certain classes or IDs, regular expressions, or the entire content in a number of formats (HTML, plain text, Markdown)
- Since v4.1, can download all discovered images to a folder
- Configurable limits on the crawl and the output file size
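
To give a feel for what class-based and regular-expression extraction with CSV export mean in practice, here is a minimal Python sketch. It is not WebScraper's own code and says nothing about how the Integrity engine works internally; the sample page, the `title` class, the price pattern and the helper names (`ClassTextExtractor`, `extract_rows`, `to_csv`) are all assumptions made up for illustration.

```python
import csv
import io
import re
from html.parser import HTMLParser

# Tags that never have a closing tag; skipped so nesting depth stays correct.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class ClassTextExtractor(HTMLParser):
    """Collect the text of every element carrying a given class name."""

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name
        self.depth = 0        # > 0 while inside a matching element
        self.matches = []     # text of each finished matching element
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        if self.depth or self.class_name in classes:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.depth:
            return
        self.depth -= 1
        if self.depth == 0:
            self.matches.append("".join(self._buffer).strip())
            self._buffer = []

    def handle_data(self, data):
        if self.depth:
            self._buffer.append(data)


# Hypothetical page content, standing in for a crawled page.
SAMPLE_HTML = """
<html><body>
  <h2 class="title">Blue Widget</h2>
  <p>Price: $19.99</p>
  <h2 class="title">Red Widget</h2>
  <p>Price: $4.50</p>
</body></html>
"""


def extract_rows(html):
    """Pair class-based matches (titles) with regex matches (prices)."""
    parser = ClassTextExtractor("title")
    parser.feed(html)
    prices = re.findall(r"\$(\d+\.\d{2})", html)
    return list(zip(parser.matches, prices))


def to_csv(rows):
    """Render the chosen columns as CSV text."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["title", "price"])
    writer.writerows(rows)
    return out.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE_HTML)), end="")
```

The sketch assumes reasonably well-formed markup and pairs titles with prices purely by order of appearance, which a real crawl would need to handle more carefully.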
Homepage: http://peacockmedia.software
Compatibility: OS X 10.8 or later, 64-bit