“Hello, I’d like a backup of the internet, on a floppy*”
The British Library and the Bibliothèque nationale de France are embarking on a programme to archive resources on the World Wide Web in their respective national domains. To deliver this programme, the British Library, as lead partner, wishes to put out to tender a contract for multiple suppliers to provide development services and/or software technology for a Smart Archiving Crawler. This will comprise a framework controlling and interacting with Heritrix, the Internet Archive’s open source archiving web crawler, and modules which provide prioritisation capabilities using document thematic analysis and link weighting.
http://www.bonchurchmanor.com/web/blogs/mark/2004_10_01_archive.html
*I made that bit up







