Google to unlock libraries

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

NEWS

Google will expand its ability for searching books by working with Oxford, Stanford and Harvard Universities, among others, to digitise out-of-print and copyrighted works.

On Tuesday, the Mountain View, California-based company is expected to announce relationships with five major libraries, including the Oxford University and the New York Public Library, to create digital copies of some books so that they may be searchable using Google. Also on Tuesday, the company will begin sampling some works already scanned for Google Print, the company's searchable index of books that it formally unveiled in October.

Susan Wojcicki, Google's director of product management, said the project will evolve over several years.

"Libraries have been the keepers of information for centuries," she said. "We're excited to unlock that wealth of information."

For now, the scope of Google's relationship with each institution varies. For example, Harvard Publications Director Peter Kosewski said the university is in a pilot programme with Google to scan only 40,000 randomly selected books from its collection of 15 million, the largest academic library in the United States dating to the 1630s. By going through the process, Harvard will be able to vet issues such as care of the books and copyright concerns and determine whether it's appropriate to proceed, he said.

Google has long said it plans to make the world's information accessible and searchable, and a cornerstone to its mission would be to bring libraries to life online. Google itself was borne out of a library digitisation project at Stanford, Wojcicki said, and its founders had planned all along to build a vast searchable index of books. Only now has the company found the technology and resources to work with libraries to scan their volumes, she said.

Faced with increasing competition from Microsoft, Yahoo and others, Google is also trying to continually differentiate itself in Web search and make its service vital to consumer in new ways. The task is not only in making it easy for consumers to find an obscure travel site on Zimbabwe or track a UPS package, but now it's also in helping a visitor call up and read a work of Shakespeare.

Still, the company must navigate tricky issues of copyright. Because libraries own only copies of copyrighted books and don't hold the rights to reproduce those works for wide distribution, Google will likely have to deal with publishers to share revenue on advertising, excerpt only a small portion of material or promote the purchase of books on third-party sites such as Amazon, all of which Google said it plans to do. The company said that at first, it will only display biographical information for copyrighted works.

For books in the public domain -- books no longer protected by copyright -- Google will allow people to search and read the entirety of the work. Oxford, for example, has agreed to let Google scan all of its books published in and before 1900.

New York Public Library has agreed to a pilot programme with Google, granting rights to scan an undisclosed number of books. Stanford and the University of Michigan have given Google the go-ahead to digitise their entire libraries, which Google estimated at seven million volumes each.

Many universities tout exclusive collections of books or letters, and for this reason, Google may also run into trouble obtaining clearances down the road to meet its goals. Harvard's Kosewski said that its test is only with a small number of books and that it would require an entirely new set of considerations if the university were to grant Google or others the ability to scan such works.

"The potential to serve people worldwide is without question," Kosewski said. "We have to ensure that the collections can be taken very good care of."

Google's project coincides with another academic pursuit. The company only recently introduced Google Scholar, a service for searching academic papers such as theses or abstracts. A commercial outfit that sells access to similar materials recently sued Google over its new programme.

The library project builds on Google's previously released print service, which when launched, focused largely on digitising works from publishers, including Random House and Knopf Publishing Group. The company recently invited any publisher to scan their books for inclusion in the index.

The service lets Web surfers call up brief excerpts from books, critic reviews, bibliographic and author's notes and, in some cases, a picture of the book's cover.

Google makes money from the service by displaying related ads alongside book text, and the company shares the majority of the ad revenue with publishers.

Rivals are jockeying for similar utility. Microsoft, for example, has built encyclopaedia answers from its Encarta software into search results for its new proprietary engine. Last year, Yahoo began a content-acquisition project to digitise more searchable material. And Amazon.com features a search-inside-the-book tool so that people can browse works digitally before buying.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

apexwm

NanWag : A Windows Server 2008 is being used because the environment that the Macs are in is a heavy Windows environment. I am proposing that...

19 minutes ago by apexwm on Windows Server 2008 drops the ball for Mac compatibility
BellamysIT

Really good article. You bring to light a few really good things. However, isn't it true that over 70% of fortune 500 companies use sharepoint?...

21 minutes ago by BellamysIT on Designing a SharePoint farm: Tiers before bedtime
annonymous2

If Piratebay is a crime then so is borrowing a dvd you purchased to a family member or a friend. Why should we not be aloud to share. Most of the...

2 hours ago by annonymous2 on UK ISPs ordered to block Pirate Bay website
NanWag

File Services For Macintosh was causing Excel to prompt for Overwriting changes or Save Another Copy because it was changing the timestamp on the...

3 hours ago by NanWag on Windows Server 2008 drops the ball for Mac compatibility
Regis Machado

creative cloud $48/month in the USA, £48/month in the UK ($79). good for the competitors

5 hours ago by Regis Machado via Facebook on Adobe move promotes piracy
Tom Espiner

Hello KosGirl, Good question. I've asked Belfius for a response. The latest post I can find on Pastebin about it is here:...

5 hours ago by Tom Espiner on Hackers hold bank to ransom over stolen data
KosGirl

Have there been any further updates to this story? I can't find any information on whether the hackers released the data or not.

6 hours ago by KosGirl on Hackers hold bank to ransom over stolen data
SandJ

I have done 7 speed tests this morning on different speed test tools. They tell me my download speed is: 12.3, 12.3, 12.3, 11.1, 12.7, 12.7, 11.7...

7 hours ago by SandJ on Watchdog: TalkTalk's broadband speed test misled users
Jack Schofield

@Mary Microsoft could always send Mozilla a spec sheet and oblige them to meet the same standards as IE. Then Mozilla can spend millions of...

10 hours ago by Jack Schofield on Windows RT browsers and the point of Windows RT
goth1csnake3

Not before time, that people making films,dvd's get whats coming to them. Well done, Virgin Media.

12 hours ago by goth1csnake3 on Virgin Media: Spotify deal will bring down piracy
Simon Bisson and Mary Branscombe

Apex - the question then is what about letting the user choose to have a tablet where they don't have to have that responsibility? why can't the...

22 hours ago by Simon Bisson and Mary Branscombe on Windows RT browsers and the point of Windows RT
Simon Bisson and Mary Branscombe

Moley, Apex, thanks; I think there's an interesting other dimension of choice - the choice to have a platform that is 'locked down' in the sense...

22 hours ago by Simon Bisson and Mary Branscombe on Mozilla accuses Microsoft of shutting Firefox out of WOA
Yellowcave

Not surprised. I once used the methods to let my firewall just notify me of breaches. Not one single logged event was genuine. Once, we all...

1 day ago by Yellowcave on Mobile porn filters catch innocent content, says report
duplex

live realy sucks in facebook becuase people hack your profile

1 day ago by duplex on Irish watchdog: Facebook privacy still falls short
Ed Macnair

If only it was that simple. When you start accessing Cloud applications you are stuck with the security model the vendor provides...........unless...

1 day ago by Ed Macnair via Facebook on IT security? You're doing it wrong!
Phil at Cloud4

Another good updaet, I have enjoyed going on the journey reading this series on SharePoint 2010 and have learned alot. Great writing.

1 day ago by Phil at Cloud4 on Designing a SharePoint farm: Tiers before bedtime
muteen

roumers of an ipad Mini, isnt that just an iTouch!?

1 day ago by muteen on Apple rebrands iPad 4G as 'Wi-Fi + Cellular' for UK
apexwm

Thanks for this article and bringing this issue to light. Unfortunately this type of activity is common not only with Adobe, but many other...

1 day ago by apexwm on Adobe move promotes piracy
Andy Bolstridge

there's a very thin line between tax avoidance and tax efficiency - earning £850 a month and claiming dividends to bring my income up to normal...

1 day ago by Andy Bolstridge via Facebook on The Idle Self-employed
Andy Bolstridge

I see that they are happy to announce these numbers.. but no-one will take any notice until they start announcing sales numbers too.

1 day ago by Andy Bolstridge via Facebook on Microsoft's score card for Smoked by Windows Phone