Net not as interconnected as you think

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

NEWS
If you think the World Wide Web is an information superhighway system, think again: The Web's most extensive mapping project shows that Internet traffic tends to flow in a strong one-way direction -- and for most sites, online users would find that "you can't get there from here." The study, conducted by researchers at IBM, Compaq and AltaVista, is to be presented at scientific conferences next week. It builds on previous research into the structure of the World Wide Web and argues against the widely held impression that the entire Internet is highly interconnected. The researchers used AltaVista's Web crawler to trace more than 200m Web pages in May and October 1999, following the 1.5bn on links embedded in those pages. That sample is just a fraction of the estimated billion-plus pages on the Web, but it dwarfs the 40m pages used for previous studies. On the basis of their analysis, the researchers set out a "Bow Tie Theory" of Web structure:
  • The central core, the knot of the bow tie, represents Web pages that are interconnected so well that you can eventually get from any page in the core to any other page just by following Internet links. Examples of core pages would include the home pages for IBM.com and MSNBC.com, said Nam LaMore, an IBM spokesman. This "strongly connected core" makes up just 30 percent of the entire Web sample.
  • Another 24 percent represents "origination pages." These are pages with links that you can eventually follow into the core -- but which cannot be accessed through links from the core. One example is a personal Web page about your pet that includes links to online pet stores. "You point to them, but no one (in the strongly connected core) is pointing back at you," LaMore said.
  • Yet another 24 percent consists of "destination pages" that can be accessed from links in the connected core but do not link back to the core. One example are research papers buried deep on university or corporate web sites. Such a page "could be on IBM.com/research/projects/almaden and on and on -- and finally here's where it dumps you," La More said.
  • The other 22 percent is completely disconnected from the central core: These pages are either "tendrils," connected by links only to pages in one of the other categories; "tubes," which link origination and destination pages without going through the core; or "islands" not linked to the rest of the Internet at all. An example of an "island" would be a group of student or family Web pages that link only to one another. The only way to find such pages would be to know the address in advance. Even most search engines would not be able to find such an island, unless it was linked to the rest of the Internet at some point in the past. Moreover, the researchers found, the proportions for these four categories remained constant between the May and October surveys, even though the number of Web pages grew substantially.
To Part II What do you think? Tell the Mailroom. And read what others have said.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

bordero

ike fuelband is great for every healthminded person ! to work out! theres this website called textme4free.com that you can use to text anywhere in...

4 hours ago by bordero on Nike's FuelBand wristband gamifies exercise
BrownieBoy

> I'm told it's somewhat annoying when people have their Macs stolen > and Apple stores treat the thief as the owner, but there you go. Ouch,...

6 hours ago by BrownieBoy on AMD Ultrathins to challenge Intel Ultrabooks
Moley

@kevinmchapman. OK, I acknowledge that 'most' was a gratuitous throwaway comment as an afterthought and too presumptuous. As to proof, as you...

10 hours ago by Moley on A tale of two distros: Ubuntu and Linux Mint
Jack Schofield

@BrownieBoy > Works really well for thieves.... >> Nice attempt to deflect the argument by tossing in a point that's totally >> irrelevant, even...

11 hours ago by Jack Schofield on AMD Ultrathins to challenge Intel Ultrabooks
raskolnikof

fantastic that the so called piracy bills have been withdrawn. however, these anti-democracy supporters are still in the shadows so lets be alert...

12 hours ago by raskolnikof on SOPA, Protect IP support wavers in face of online protest
Tony Douglas

Please God no; teach them anything you like - thinking rationally, the uses and misuses of data, what data is and what it's not - but leave the...

14 hours ago by Tony Douglas via Facebook on Kids are the future. Teach ’em to code.
BrownieBoy

@Jack, > Works really well for thieves.... Nice attempt to deflect the argument by tossing in a point that's totally irrelevant, even it were...

1 day ago by BrownieBoy on AMD Ultrathins to challenge Intel Ultrabooks
bootlegger

Make that 13 people now - I got refused today at Manchester airport. I thought I was up to date on this legislation - I knew of the EU ruling from...

1 day ago by bootlegger on UK airport body scans will not be opt out
tinycg

Don't forget to check out apps like GoodReader or SlideShark either, they're indispensible for people on the go in presentation situations. Best...

1 day ago by tinycg on Four top iPad apps for people on the move
TerryRK

Well it seems there is something a number of us agree on. Why is the Ubuntu Unity launcher so ugly? I thought perhaps it was something to do with...

2 days ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
Freebies202

Duplicate comments are not made intentionally. Its very good to know that now you are keeping check on this problem because sometimes a commenter...

2 days ago by Freebies202 on Microsoft fixes blog comments, speeds up blogs with open source
kevinmchapman

"the very significant number of users" and "many (most) of us" - you have no evidence for these statements. It is a fact that most users are saying...

2 days ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
Marg Menzies Harrison

Another grammar faux pas is the improper use of "you". When sitting down down in a restaurant, for example, I get cringe when the waitress...

2 days ago by Marg Menzies Harrison via Facebook on 10 flagrant grammar mistakes that make you look stupid
zdnetukuser

And NOW, folks, for Canonical's next trick... Kubuntu is late. Here's a pencil. Draw your own conclusions. cf.:...

2 days ago by zdnetukuser on Linux Minterface
Moley

@kevinmchapman. The discussion here reflects the very significant number of users who really do like the traditional menu system and who wish to...

3 days ago by Moley on A tale of two distros: Ubuntu and Linux Mint
kevinmchapman

Er, no... It is an efficient means of finding the application/file/setting you need in one place. The icons are a simply a fallback for when you...

3 days ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
TerryRK

Isn't the provision of a text based search an admission by the developers that the mass of icons approach does not work? I don't need to use a...

3 days ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
kevinmchapman

"Unity and GNOME 3 both abandon the old text-based cascading menus in favour of a graphical icon-driven system." Point truly missed. Both use a...

3 days ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
TerryRK

whs001 - Thank you, I'm glad you liked the article. I absolutely agree with you on your first point. I should perhaps have made it clearer that...

3 days ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
Dennis Nilsson

If we allow corporate interest to dictate the way our government circumvents due process against foreign entities then we should accept the same...

3 days ago by Dennis Nilsson via Facebook on ACTA stumbles in Germany