- HTTrack
HTTrack is a free and
open source website copier andoffline browser byXavier Roche , licensed under theGNU General Public License . It allows one to downloadWorld Wide Web sites from theInternet to a local computer. By default, HTTrack arranges the downloaded site by the original site's relative link-structure. The downloaded (or "mirrored") website can be browsed by opening a page of the site in a browser.HTTrack can also update an existing mirrored site and interrupted downloads. HTTrack is fully configurable by options and by filters (include/exclude), and has an integrated help system. There is a basic command line version and two
GUI versions (WinHTTrack and WebHTrack); the former can be part of scripts and cron jobs.HTTrack uses a
web crawler to download a website. Some parts of the website may not be downloaded by default due to the robots exclusion protocol unless disabled during the program. HTTrack can follow links that are generated with basicJavaScript and insideApplet s or Flash, but not complex links (generated using functions or expressions) or server-sideimage maps .See also
*
Robots Exclusion Standard
*Web crawler External links
* [http://www.httrack.com/ Official site]
* [http://youtube.com/watch?v=RdYaSjXAeAk Demonstration of WinHTTrack in use]
* [https://addons.mozilla.org/firefox/1616/ SpiderZilla Add-on for Mozilla FireFox]
Wikimedia Foundation. 2010.