Google to unlock libraries

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

NEWS

Google will expand its ability for searching books by working with Oxford, Stanford and Harvard Universities, among others, to digitise out-of-print and copyrighted works.

On Tuesday, the Mountain View, California-based company is expected to announce relationships with five major libraries, including the Oxford University and the New York Public Library, to create digital copies of some books so that they may be searchable using Google. Also on Tuesday, the company will begin sampling some works already scanned for Google Print, the company's searchable index of books that it formally unveiled in October.

Susan Wojcicki, Google's director of product management, said the project will evolve over several years.

"Libraries have been the keepers of information for centuries," she said. "We're excited to unlock that wealth of information."

For now, the scope of Google's relationship with each institution varies. For example, Harvard Publications Director Peter Kosewski said the university is in a pilot programme with Google to scan only 40,000 randomly selected books from its collection of 15 million, the largest academic library in the United States dating to the 1630s. By going through the process, Harvard will be able to vet issues such as care of the books and copyright concerns and determine whether it's appropriate to proceed, he said.

Google has long said it plans to make the world's information accessible and searchable, and a cornerstone to its mission would be to bring libraries to life online. Google itself was borne out of a library digitisation project at Stanford, Wojcicki said, and its founders had planned all along to build a vast searchable index of books. Only now has the company found the technology and resources to work with libraries to scan their volumes, she said.

Faced with increasing competition from Microsoft, Yahoo and others, Google is also trying to continually differentiate itself in Web search and make its service vital to consumer in new ways. The task is not only in making it easy for consumers to find an obscure travel site on Zimbabwe or track a UPS package, but now it's also in helping a visitor call up and read a work of Shakespeare.

Still, the company must navigate tricky issues of copyright. Because libraries own only copies of copyrighted books and don't hold the rights to reproduce those works for wide distribution, Google will likely have to deal with publishers to share revenue on advertising, excerpt only a small portion of material or promote the purchase of books on third-party sites such as Amazon, all of which Google said it plans to do. The company said that at first, it will only display biographical information for copyrighted works.

For books in the public domain -- books no longer protected by copyright -- Google will allow people to search and read the entirety of the work. Oxford, for example, has agreed to let Google scan all of its books published in and before 1900.

New York Public Library has agreed to a pilot programme with Google, granting rights to scan an undisclosed number of books. Stanford and the University of Michigan have given Google the go-ahead to digitise their entire libraries, which Google estimated at seven million volumes each.

Many universities tout exclusive collections of books or letters, and for this reason, Google may also run into trouble obtaining clearances down the road to meet its goals. Harvard's Kosewski said that its test is only with a small number of books and that it would require an entirely new set of considerations if the university were to grant Google or others the ability to scan such works.

"The potential to serve people worldwide is without question," Kosewski said. "We have to ensure that the collections can be taken very good care of."

Google's project coincides with another academic pursuit. The company only recently introduced Google Scholar, a service for searching academic papers such as theses or abstracts. A commercial outfit that sells access to similar materials recently sued Google over its new programme.

The library project builds on Google's previously released print service, which when launched, focused largely on digitising works from publishers, including Random House and Knopf Publishing Group. The company recently invited any publisher to scan their books for inclusion in the index.

The service lets Web surfers call up brief excerpts from books, critic reviews, bibliographic and author's notes and, in some cases, a picture of the book's cover.

Google makes money from the service by displaying related ads alongside book text, and the company shares the majority of the ad revenue with publishers.

Rivals are jockeying for similar utility. Microsoft, for example, has built encyclopaedia answers from its Encarta software into search results for its new proprietary engine. Last year, Yahoo began a content-acquisition project to digitise more searchable material. And Amazon.com features a search-inside-the-book tool so that people can browse works digitally before buying.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

Moley

@kevinmchapman. The discussion here reflects the very significant number of users who really do like the traditional menu system and who wish to...

26 minutes ago by Moley on A tale of two distros: Ubuntu and Linux Mint
kevinmchapman

Er, no... It is an efficient means of finding the application/file/setting you need in one place. The icons are a simply a fallback for when you...

2 hours ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
TerryRK

Isn't the provision of a text based search an admission by the developers that the mass of icons approach does not work? I don't need to use a...

3 hours ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
kevinmchapman

"Unity and GNOME 3 both abandon the old text-based cascading menus in favour of a graphical icon-driven system." Point truly missed. Both use a...

4 hours ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
TerryRK

whs001 - Thank you, I'm glad you liked the article. I absolutely agree with you on your first point. I should perhaps have made it clearer that...

4 hours ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
Dennis Nilsson

If we allow corporate interest to dictate the way our government circumvents due process against foreign entities then we should accept the same...

5 hours ago by Dennis Nilsson via Facebook on ACTA stumbles in Germany
GHar123

I totally dislike pirating of works, I fear that artists will be deterred from creating works if they think that they are going to get ripped off....

7 hours ago by GHar123 on ACTA stumbles in Germany
JCB33

How dare film makers, artists or anybody that invests in creativity stop us pirating their works for free. I want to be able to walk into my local...

12 hours ago by JCB33 on ACTA stumbles in Germany
Moley

@GrueMaster. I prefer horses for courses rather than one size fits all. I, and I suspect most other computer users, do not really wish to have...

15 hours ago by Moley on A tale of two distros: Ubuntu and Linux Mint
greycynic

The product that scares me every time I have to use it is the Office 2007 version of Excel. The first bug that I found was applying the median...

15 hours ago by greycynic on Ten flawed products that derail productivity
GrueMaster

Nice review and very informative. One thing I'd like to add (in reply to whs001's 1st question), the main reason to have the same interface from...

16 hours ago by GrueMaster on A tale of two distros: Ubuntu and Linux Mint
Frederick Wrigley

I'be been using Mint 12 since the RC came out, and I am far more happy with the Cinnamon, the Mate, and, yes (with extensions), theGnome 3...

17 hours ago by Frederick Wrigley via Facebook on A tale of two distros: Ubuntu and Linux Mint
bdantas

Excellent article. One small correction, though--although a fresh installation of Linux Mint 12 will, indeed, provide the user with a version of...

18 hours ago by bdantas on A tale of two distros: Ubuntu and Linux Mint
Alan Ralph

In related news, the ISPs club together to get the members of the Home Affairs Select Committee (ya goofed on that part, ZDNet UK) copies of "The...

18 hours ago by Alan Ralph via Facebook on MPs urge ISPs to take down terrorist material
Alan Ralph

In related news, the ISPs club together to get the members of the Home Affairs Select Committee (ya goofed on that part, ZDNet UK) copies of "The...

18 hours ago by Alan Ralph via Facebook on MPs urge ISPs to take down terrorist material
Moley

For Gnome 2 die-hards, it is possible to add icons to the bottom panel (or top top panel, if you prefer) which provide the exact Gnome 2...

19 hours ago by Moley on A tale of two distros: Ubuntu and Linux Mint
ramwellian

Your comments would seem pretty naive and immature. Your 'solution' appears to be, "gee, let's all just give in to the hackers and give them...

19 hours ago by ramwellian on Cloud computing security: no more oxymoron?
BugStalker

"Interesting thought ... If you installed Win7 as a dual boot on a machine that previously only had Linux, and it wrecked your Linux installation,...

19 hours ago by BugStalker on Windows 7 Declares War on GRUB
whs001

This is an excellent summary of Ubuntu and Mint and the interface differences between them. Most such articles take a very partisan position for...

19 hours ago by whs001 on A tale of two distros: Ubuntu and Linux Mint
Moley

@ewallace. Not so clear. Anyone can obtain the text, for example from here http://www.ustr.gov/webfm_send/2379. I support ACTA so long as it and...

20 hours ago by Moley on ACTA: Facts, misconceptions and questions