HP, MIT delve deep with digital library

The Massachusetts Institute of Technology and Hewlett-Packard on Monday unveiled a system for electronically archiving books, lecture notes and scientific data that potentially will serve as a model for academic libraries in the future.

Called DSpace, the new system is essentially a centralised, electronic repository for the massive amounts of intellectual property created by research institutions, said Mackenzie Smith, associate director of MIT Libraries and the DSpace project director.

Preserving data in an accessible manner is increasingly becoming a problem for a number of universities and government agencies. MIT itself produces an estimated 10,000 pieces of digital content a year, a figure that includes conference papers and technical reports. Some of the data is also quite large and difficult to access. One faculty member has generated ocean floor maps that take up 30 terabytes of data.

"We began this to get some kind of territorial control over all of this research," Smith said. "If you're lucky, you can get some of it on Google, but most of the stuff we are talking about is not indexed in any way you can get it."

Potentially, DSpace will lead to the creation of a virtual library that meshes the collections of several research universities. MIT is already discussing using the system to link to the libraries of Cambridge and Cornell, she said. Corporations and government agencies have also been in contact with MIT.

The heart of the DSpace system is an open-source storage and retrieval system. Each academic department has been assigned a customised portal for submitting materials, Smith said. Professors and researchers can then deposit information directly into the system through a portal, or after a peer review, depending on the departmental regulations. To retrieve documents, researchers can consult an index. Author and text searches will come in later versions, she said.
"Part of the reason for doing this is that the faculty says, 'My stuff is too hard to find,'" she said.

Using open-source software also cut costs, Smith added. The MIT system, which currently can hold two terabytes (2 trillion bytes) of data, can be replicated for $100,000 to $500,000 (£64,000 to £320,000), with most of the expense deriving from hardware. The software will be licensed freely under the Berkeley licence.

The system can also be expanded. Eventually, MIT's system will hold more than a petabyte, or a quadrillion bytes, of data. Over time, academics and librarians will then have to go through the arduous process of determining what to keep and what to eliminate. The system will ideally let universities cut the costs associated with housing documents and research findings, but electronic storage isn't free, so culling is inevitable, she said.

The project started about 18 months ago and was jointly developed by MIT and HP. The company and the university have collaborated on a number of projects. Recently, the two digitised all of the output of MIT Press, including out-of-print textbooks, and put it into a searchable database.
