Download of heritrix-1.8.0.jar (heritrix-1.8.0.jar ( external link: SF.net): 1,171,911 bytes) will begin shortly. If not so, click link on the left.
The archive-crawler project is building Heritrix: a flexible, extensible, robust, and scalable web crawler capable of fetching, archiving, and analyzing the full diversity and breadth of internet-accesible content.