British Library plans to archive whole UK Web

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

NEWS
A trial project to archive 6,000 UK Web sites was announced on Tuesday by the UK Web Archiving Consortium. The consortium, led by the British Library, includes the Wellcome Trust, the National Archives and the Scottish and Welsh national libraries.

Each member of the consortium will choose content relevant to its subject. All types of Web content will be included, from government documents to blogs.

Richard Boulderstone, director of e-strategy at the British Library, said that all types of material will be collected including "informal material" such as discussion forums. "Letters and other informal works tell us how society is actually operating," he said.

The British Library will not censor the material because it does not want to restrict what people can find out about in the future.

"We would like to take a snapshot of every year, as a sample of what the Web looked like", said Boulderstone, suggesting that in the future people could look back to 2004 and see the swear words that Web users were using.

Only a limited number of Web sites will be archived initially but "ultimately, we would like to archive the whole UK Web," said Boulderstone.

One of the problems faced by the consortium is that, due to UK copyright law, permission is needed before a site can be archived. The British Library is working with the government to extend the law to allow them blanket access to all Web sites because "there are 4 million sites that we would like to capture -- we cannot ask everyone for permission," said Boulderstone.

The UK Web Archiving Consortium is not the first to archive the Web. The Wayback Machine, run by US-based Internet Archive, is a service that allows people to visit archived versions of Web sites.

According to Boulderstone, the British Library's approach differs from that of the Internet Archive because his organisation seeks permission from Web sites. In the future, the British Library hopes to improve on Wayback by archiving more frequently and with more depth, and through providing metadata so that information can be found more easily.

Talkback

I think its absolutely appalling that the British Library can ride roughshod over peoples copyright in such a fashion - by basically altering the law to allow them to do what they want with other people's web content.

via Facebook 24 June, 2004 16:46
Reply

This seems to be not only rather pointless, but unless they are planning to also archive every database and server side code on the web (or I am missing something) then most of these archived sites will not actually do anything.

via Facebook 24 June, 2004 17:29
Reply

What a wonderful idea. I don't think any webmaster producing original work will object to having their contributions preserved for the future. My only worry is how will the sites be stored? CD based storage will not last that long.

via Facebook 9 August, 2004 20:42
Reply

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

Marcus Karlsson

Any update on this, considering the claimed "first week of February"?

55 minutes ago by Marcus Karlsson via Facebook on Archos confirms G9 Ice Cream Sandwich update schedule
apexwm

Bill Goodrich : Just as al_langevin pointed out, with Windows Server 2008 there is no Services for Macintosh anymore. It's gone, not available....

9 hours ago by apexwm on Windows Server 2008 drops the ball for Mac compatibility
txtrainguy

Replying to an old topic that I'm currently facing with my CEO (who is on a Mac). Our servers are primarily Windows Servers, office is about...

16 hours ago by txtrainguy on Windows Server 2008 drops the ball for Mac compatibility
k0tcs3

Sure, that makes perfect sense. Pay wrong-doers money and thank them for breaching your security and pointing out your flaws, that would surely...

16 hours ago by k0tcs3 on US indicts Romanian over NASA climate change hack
Random_Error

I think he's referring specifically to Android apps, as Apple do regulate their App Store, but Google seem to let any old crap onto the Android store!

16 hours ago by Random_Error on RIM: BlackBerry will keep 'garbage' apps out of store
Paul Fezziwig

Keep the crap apps out?! How will they compete with Android and Apple's claim to fame of having so many life changing apps? I wonder if the media...

22 hours ago by Paul Fezziwig via Facebook on RIM: BlackBerry will keep 'garbage' apps out of store
Aigars Mahinovs

It has been shown time after time that if there is an author store that sells the songs at even 1$ per song and gives you a high-quality digital...

22 hours ago by Aigars Mahinovs via Facebook on Copyright isn't working, says European Commission
awbMaven

""As a result of Butyka's alleged conduct, researchers were unable to use the computers for more than two months while NASA removed the malicious...

1 day ago by awbMaven on US indicts Romanian over NASA climate change hack
subhorup

It simultaneously worries me and uplifts me that a self-proclaimed group of internet activists name themselves after Indian mythical figures....

1 day ago by subhorup on Anonymous activists release PCAnywhere source code
naviathan

It's actually far easier to work anonymously on the internet than you think. With tools like Tor bouncing your traffic around the world before...

2 days ago by naviathan on Anonymous activists release PCAnywhere source code
Agnostic_OS

1000272134 and bluedalmatian with you both there but then I'm still in 10.04 land (and happy with it)

2 days ago by Agnostic_OS on Ten factors that make Ubuntu 11.10 a hit
apexwm

Interesting article and definitely see your points on the products mentioned. One of the top products for our Help Desk (approximately 20% of all...

2 days ago by apexwm on Ten flawed products that derail productivity
Paul Hutchinson

Absolutely - this should obviously not be handled my isp - but handled by their hosting operator. What's been suggested here is that my isp police...

2 days ago by Paul Hutchinson via Facebook on MPs urge ISPs to take down terrorist material
Techs UK

Looks like a great phone. I don't notice any deficiencies in WP7. used IOS before, that's pretty good. I don't spend much time in Apps, all i need...

2 days ago by Techs UK on Nokia pins US 're-entry' hopes on Lumia 900
Larry Bloggy

Now with the help of these apps you are always synced with MS outlook while on the move. Just download apps like xobni or outlookreflex and get...

2 days ago by Larry Bloggy via Facebook on Outlook Social Connector beta 2 and the LinkedIn connector
mike40g123

Your details are wrong. The version currently being made is the one with 2 USB ports, 256MB RAM and a network port. This is the Model B. The...

2 days ago by mike40g123 on Raspberry Pi boards set to go on sale
Moley

The thing that has been puzzling me for quite a while is how Anonymous can remain anonymous whilst not only being active on the Internet but also...

3 days ago by Moley on Anonymous activists release PCAnywhere source code
Don Dilly

If what Semantec is saying is rue, that is even worse and shows a complete disregard for thier users. If what Anonymous claims is true and the...

3 days ago by Don Dilly via Facebook on Anonymous activists release PCAnywhere source code
MattChurchy

Didn't seem particularly biased to me either. Oh though you might have mentioned some other competitors with free search and email services...

3 days ago by MattChurchy on Time for an evil umpire: Google, Microsoft & privacy
Simon Bisson and Mary Branscombe

James - exactly as much as anyone paid you for your comment; I don't feel that I need to say that I'm independant and unbiased, but just for you...

3 days ago by Simon Bisson and Mary Branscombe on Time for an evil umpire: Google, Microsoft & privacy