LookSmart draws on desktop power

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

NEWS
LookSmart, a US-based online directory is hoping to spin a small acquisition into a big project that will use distributed computing to improve its Web search results. In January, LookSmart quietly bought the assets of Grub, an Oklahoma-based developer of technology that lets people donate their computers' otherwise unused processing power to run spiders, or software programs that continually crawl the Net, indexing pages and words. This collective, or distributed, computing power could be used to find new, outdated or updated Web pages daily. LookSmart, which licenses editorial and commercial directory listings to Microsoft's MSN and other Web sites, paid $1.3m (£830,500) in cash and stock for Grub, according to a recent filing with the Securities and Exchange Commission. LookSmart said it is testing the Grub system and plans to unveil the distributed computing project in early April. "Most engines only update their entire document catalogue once a month, because there's an inherent computing problem: They can't do it any faster," said Pete Adams, chief technology officer of LookSmart. "The goal of this technology is to be able to crawl every document on the Internet every day. We can only do that if we can grow the number of people that are running the software -- the computing power we would use is a function of how many people we have donating their computer power." The Grub buyout underscores growing interest in distributed computing, in which computing jobs are farmed out in small chunks across the Internet to the otherwise untapped processing cycles of ordinary PCs. The movement has had grand ambitions -- to find a cure for cancer or find signs of intelligent life in the universe, among other things. But thus far, its chief successes have been curiosities such as the discovery of gigantic prime numbers. LookSmart's long-shot bet on Grub highlights the race to innovate in the search engine arena. A handful of companies are vying for control in the niche, one of the few areas of the Net economy to have generated strong revenue and profit growth since the bursting of the dot-com bubble. In December, Yahoo paid $235m for search assets of Inktomi, while Overture snapped up some of the assets of Norway's Fast Search & Transfer as well as AltaVista. Meanwhile, Disney has suggested that it might be interested in selling its Infoseek search engine. Though it has a history as an editorial guide for the Internet, LookSmart has modified its business in recent years in order to survive the dot-com downturn. It still operates a volunteer-staffed directory, but the company has largely turned its focus to small-business listings, in which marketers pay for Web site reviews. It also sells commercial listings related to keyword searches. The formula helped the company to reach profitability under Generally Accepted Accounting Principles (GAAP) for the first time in its fourth quarter last year. At the same time, LookSmart has expanded its arsenal of search services to challenge the growing popularity of Google, the Web's best-loved search engine. Last year, LookSmart bought WiseNut, an emerging technology company that uses automated crawlers to index the Web, for about $9.25m in stock. LookSmart has yet to fully push the service. In the meantime, it says that it's expanded its index to more than a billion documents and is improving WiseNut's algorithms for calculating the relevance of Web pages in relation to keywords. Distributed digging
Google itself has experimented with distributed computing. Last year, the search leader invited 500 people to try out a new version of a toolbar that lets Windows users donate their computers' unused processing power to the Folding@home scientific research project at Stanford University. That experiment resulted in a small success when Stanford published a scientific paper based on the Folding@home calculations last year. However, the idea of using distributed computing to boost search results remains is still in its early stages. Grub has operated under the radar since 2000, when it was founded in Oklahoma by Kord Campbell. Since Grub was acquired, the company's four-person team has moved to LookSmart's San Francisco headquarters. Previous attempts to harness distributed computing models to update search listings have so far failed to produce useful results, according to search experts. For example, Infoseek has a patent on a system under which sites feed their content to a search index, in order to keep it updated and comprehensive -- but the company never did anything with it, said Danny Sullivan, editor of the industry newsletter Search Engine Watch. Sullivan said that setting up a collective effort to catalogue the Web could lead to some improvements in efficiency, but it could open the door to other problems. "If you allow anyone to just send you information, a small number of people will try to manipulate the system," he said. "Suddenly, you'll have someone that says: 'Surprise! I'm an Amazon.com affiliate and I have a million-page Web site, each page duplicates an exact page at Amazon, add me to (your) index.' When it comes to Web search, some people will abuse this, because there are monetary reasons to do it." Many pages on the Web are static and so don't need to be indexed frequently, said Sullivan. Instead, search engines need to be more intelligent about directing people specifically to information relevant to their searches. "If it were just that the system was going to harness the collective computing power of Web users, I think it would be useful. But it comes back to...spammers. When you drop the barriers completely, will the experience (for search consumers) be great? The conventional wisdom would be no." Sullivan speculated that LookSmart might use the Grub system to start a "trusted feed" service for inclusion into its WiseNut index. Marketers could send updated Web pages to the index to refresh it for a fee -- or what's known as paid inclusion. Search engines, including Inktomi and Fast's AlltheWeb, already use such a service to keep indexes of product-related sites and catalogues fresh, and to augment revenue. Pulling in participants
So far, only about 130 volunteers are participating in the test to donate their computer's processing power to crawl the Web. They do so by downloading software to their PC. The company's success will hinge on the number of people it signs up to donate computer resources to the cause. As part of the project, Grub promotes the benefits of "local searching," in which Webmasters can index their own sites and submit changed pages to the Grub directory, a process that can help save on network resources. LookSmart also plans to introduce a Web application programming interface (API), with which Webmasters will be able to query documents contained in the registry. Charles King, research director for the Sageza Group, a Mountain View, California-based information technology analysis firm, said that the real challenge that LookSmart faces is in recruiting devoted volunteers. King said that projects such as Seti@Home, a distributed computing search for extraterrestrial life, have an "inbuilt geek factor" that draws Web surfers to donate their computer resources to the cause. Similarly, Intel hosts a project to research cures for cancer, luring people who have been touched by the disease. Though he doubted whether Web indexing would be a big attractor, he said that it could be a good application for distributed computing. "The Web is growing at such a phenomenal rate that charting an index of it is an ongoing process," said King. "There are so many pages added on a daily basis, that a snapshot today will be inaccurate tomorrow. But this is the kind of thing that will be successful only as long as they can inspire interest and keep it."
For a round-up of the latest tech business coverage, see the Business News Section. Let the editors know what you think in the Mailroom.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

bootlegger

Make that 13 people now - I got refused today at Manchester airport. I thought I was up to date on this legislation - I knew of the EU ruling from...

2 hours ago by bootlegger on UK airport body scans will not be opt out
tinycg

Don't forget to check out apps like GoodReader or SlideShark either, they're indispensible for people on the go in presentation situations. Best...

5 hours ago by tinycg on Four top iPad apps for people on the move
TerryRK

Well it seems there is something a number of us agree on. Why is the Ubuntu Unity launcher so ugly? I thought perhaps it was something to do with...

9 hours ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
Freebies202

Duplicate comments are not made intentionally. Its very good to know that now you are keeping check on this problem because sometimes a commenter...

19 hours ago by Freebies202 on Microsoft fixes blog comments, speeds up blogs with open source
kevinmchapman

"the very significant number of users" and "many (most) of us" - you have no evidence for these statements. It is a fact that most users are saying...

1 day ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
Marg Menzies Harrison

Another grammar faux pas is the improper use of "you". When sitting down down in a restaurant, for example, I get cringe when the waitress...

1 day ago by Marg Menzies Harrison via Facebook on 10 flagrant grammar mistakes that make you look stupid
zdnetukuser

And NOW, folks, for Canonical's next trick... Kubuntu is late. Here's a pencil. Draw your own conclusions. cf.:...

1 day ago by zdnetukuser on Linux Minterface
Moley

@kevinmchapman. The discussion here reflects the very significant number of users who really do like the traditional menu system and who wish to...

1 day ago by Moley on A tale of two distros: Ubuntu and Linux Mint
kevinmchapman

Er, no... It is an efficient means of finding the application/file/setting you need in one place. The icons are a simply a fallback for when you...

1 day ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
TerryRK

Isn't the provision of a text based search an admission by the developers that the mass of icons approach does not work? I don't need to use a...

1 day ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
kevinmchapman

"Unity and GNOME 3 both abandon the old text-based cascading menus in favour of a graphical icon-driven system." Point truly missed. Both use a...

1 day ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
TerryRK

whs001 - Thank you, I'm glad you liked the article. I absolutely agree with you on your first point. I should perhaps have made it clearer that...

1 day ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
Dennis Nilsson

If we allow corporate interest to dictate the way our government circumvents due process against foreign entities then we should accept the same...

1 day ago by Dennis Nilsson via Facebook on ACTA stumbles in Germany
GHar123

I totally dislike pirating of works, I fear that artists will be deterred from creating works if they think that they are going to get ripped off....

2 days ago by GHar123 on ACTA stumbles in Germany
JCB33

How dare film makers, artists or anybody that invests in creativity stop us pirating their works for free. I want to be able to walk into my local...

2 days ago by JCB33 on ACTA stumbles in Germany
Moley

@GrueMaster. I prefer horses for courses rather than one size fits all. I, and I suspect most other computer users, do not really wish to have...

2 days ago by Moley on A tale of two distros: Ubuntu and Linux Mint
greycynic

The product that scares me every time I have to use it is the Office 2007 version of Excel. The first bug that I found was applying the median...

2 days ago by greycynic on Ten flawed products that derail productivity
GrueMaster

Nice review and very informative. One thing I'd like to add (in reply to whs001's 1st question), the main reason to have the same interface from...

2 days ago by GrueMaster on A tale of two distros: Ubuntu and Linux Mint
Frederick Wrigley

I'be been using Mint 12 since the RC came out, and I am far more happy with the Cinnamon, the Mate, and, yes (with extensions), theGnome 3...

2 days ago by Frederick Wrigley via Facebook on A tale of two distros: Ubuntu and Linux Mint
bdantas

Excellent article. One small correction, though--although a fresh installation of Linux Mint 12 will, indeed, provide the user with a version of...

2 days ago by bdantas on A tale of two distros: Ubuntu and Linux Mint