![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
|||||||||||||||||||||||||||||||||
![]() |
![]() |
You are in: UK | |||||||||||||||||||||||||||||||||||
![]() |
![]() |
Wednesday, 27 March, 2002, 12:46 GMT
The internet. Volume One
![]() Books are one thing, but try cataloging the net
![]() Many thousands of sites go up and come down, and are hardly even noticed. The average webpage disappears after just 60 days, according to the Digital Preservation Society.
But now the British Library is having a stab at preserving websites, as it has done with books for about 200 years. Since June last year, the library has been running a pilot project to archive 100 British websites. Now it wants to expand the scheme, called Domain UK, to permanently keep tabs on 10,000 sites of social and historical importance. Tidal wave of data The digital age has been a big headache to librarians.
Before computers, the job of archiving information was relatively straightforward. Publishers of books, magazines, periodicals and newspapers sent copies of everything they produced to the British Library, which filed them away for future reference. But no such obligation has been established for digital information. The result is that thousands of web pages and websites, which one day could be seen as invaluable documents of our time, are disappearing. Lost forever "It's hard to know which sites we've missed, because of course we don't know about them," says Dr Field. "But some of the early pages put up after 11 September, those of bereaved relatives and friends, have gone already.
The British Library had been talking for some time about the need to save web material, but it finally sprang into action during last year's general election campaign. "We found 79 sites dedicated to the election and we thought we've got to hold on to these somehow," says Stephen Bury, who runs the Domain UK pilot project. Rich variety "The debate that goes on during an election will always be historically and culturally important, and increasingly this is going on on the net."
Mr Bury was in charge of selecting the 100 sites that have been monitored for the pilot project, which helps explain the presence of rovers.co.uk - the internet home of Mr Bury's favourite football team, Blackburn Rovers. Others selected include Cabgallery.com, a site dedicated to a London taxi which used to double up as a mobile art gallery; and shadoof.net, a literary site. Snapshots "We were looking for a cross section of British sites, ones that represented a wide variety of interests and subjects. So, for example, we chose the Soil Association site and also the Monsanto site, to see how the debates on GM foods matched up."
But if the expansion plans go ahead - this depends on a £600,000 grant from the government - he could have his work cut out. As well as regularly archiving 10,000 sites, the plan is to take half-yearly snapshots of the whole ".uk" domain - which currently runs to about 25 million web pages. The challenge then is to catalogue them all so future generations will know how to navigate this mass of information. |
![]() |
Internet links:
![]() The BBC is not responsible for the content of external internet sites Top UK stories now:
![]() ![]() Links to more UK stories are at the foot of the page.
![]() |
![]() |
Links to more UK stories
|
![]() |
![]() |
^^ Back to top News Front Page | World | UK | UK Politics | Business | Sci/Tech | Health | Education | Entertainment | Talking Point | In Depth | AudioVideo ---------------------------------------------------------------------------------- To BBC Sport>> | To BBC Weather>> ---------------------------------------------------------------------------------- © MMIII | News Sources | Privacy |