How does XML measure up?

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

Can you describe what Antarctica is doing and how XML fits in?
The premise of Antarctica is that enterprises are typically really good at collecting information, but Wall Street doesn't reward you for collecting information. Many people in chief information officer and chief technology officer roles would share the perception that the enterprise world in general is not doing a good enough job in getting value out of the large inventory of information that's built up.

So I became interested in mining -- getting more value out of all this data that's built up. I concluded that one of the main pain points that's preventing people from getting adequate return on investment from data inventories is the user interface. There's an analogy with the advent of the graphical user interface for PCs. Before that, computers were something of a very small minority of the population. Once everybody got a GUI (graphical user iterface) with a desktop metaphor, the use of the computer became much more widespread.

It's Antarctica's hypothesis that by putting a graphic interface somewhat in the spirit of the desktop metaphor on complex information spaces, we can open up the value in there. In our case, the metaphor isn't a desktop -- it's a map.

On the XML front, we're using it on the periphery. Our back end consists of either an SQL database you can talk to directly, or send us an XML input file and we'll read that. Interestingly, in our actual deployments, there's been only one case in which somebody's elected to talk to the database. Everyone else has been happy with XML.

There's a lot of business interest in search now. Do you think companies would be better off focusing on user interface issues than algorithms?
Absolutely. There's no reason to expect that search is going to get that much better.

The basic algorithms by which search is done have not improved much since about 1975. The only way to improve the situation is by enhancing search engines with more deterministic metadata, essentially adding knowledge management techniques that give you more information from which to draw connections. If you look at the victory of Google in the search engine business, it wasn't because they had better search techniques. It's because they deployed one key metadata value -- how many pages are linked to this one -- to enhance the relevancy of their results. The same concepts need to be applied to the enterprise.

There are really two ways to get information: search and browse. And browse has a lot of potential. But to work, the drill down has to be intuitive. It cannot be stupid. You have to be really aggressive about bringing the relevant stuff to the top. You can't force the person to go through multiple levels to get to what they want.

And visual representation plays a big role in that?
Right. It turns out that the display technique that returns the most amount of data per square inch is cartography. That's why we're using a map metaphor. The whole notion of the search engine results list, for which I'm partly responsible, is terribly information-thin. Google creates the illusion that the results list is somewhat one-dimensional: this is the most interesting; this is less so.

But if I type "bicycle" into Google, it doesn't know what I'm looking for. I may be looking for bicycle racing results, or I may be interested in that song by Queen. With a map, I get started off right away: Here are the matches for bicycle in music; here are the matches for outdoor sports; here are the ones for shopping.

You've been a major contributor to the W3C. What's your view of the standards process? It seems that in some cases, like XML, it works very well, but there are others like SVG (Scalable Vector Graphics) where it's been quite slow.
Standards processes don't do well in dealing with new technologies, so I disagree that being ahead of the market is a good thing. The standards process works best when you've got a problem that's already been solved, and we have a consensus on what the right way to go is, and you just need to write down the rules.

That's totally what XML was. There had been 15 years of SGML, so there was a really good set of knowledge as to how markup and text should work. And the Web had been around for five years, so we knew how URLs (Uniform Resource Locators) worked, and Unicode had been around, so we knew how to do internationalisation. XML just took those solved problems, packaged them up neatly and got consensus on it all.

SVG is a different thing. I haven't given up on SVG; I think it has a bright future, because it really is better than the alternatives. And they didn't invent much stuff. People from Adobe and Microsoft know this stuff.

XML probably couldn't be done anymore at W3C. The reason XML was so successful is that nobody noticed -- we came in low, fast and under the radar, and it was already finished by the time the big vendors noticed it. Now, any time there's a new initiative around XML, there are instantly 75 vendors who want to go on the working group.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

songmaster

SHleG: Do you remember building a clockwork scorpion kit (I'm pretty sure I have a photo of it somewhere) — I think it was called something like...

48 minutes ago by songmaster on Software with everything
Chris Wortman

Good I love Yahoo! Their search engine is getting better than Google as of late. I find more of what I want on the first page, and usually within...

1 hour ago by Chris Wortman via Facebook on Linux Mint 13 ramps up for KDE release
PatrickG

openhgs has made the point for Windows 8 multiple monitors without realising it! With Windows 7 you have to switch the mouse and so your focus...

3 hours ago by PatrickG on Windows 8 could speed multi-monitor uptake
Leslie Satenstein

Mozilla has threatened to stop supporting Linux. I guess that UBUNTU is going with another browser. I indicated that if Mozilla stops supporting...

4 hours ago by Leslie Satenstein via Facebook on Firefox rapid release improves Fedora Linux
Andy Bolstridge

Much as I abhor Microsoft's licensing practices, this is almost certainly down to purchasing IT equipment via 3rd party consultants - you get the...

5 hours ago by Andy Bolstridge via Facebook on 6 million wasted licences and £1,200 PCs: welcome to government IT
Jack Schofield

@openhgs Windows users have had multiple desktops since Linus started writing Linux. They just haven't shipped as standard because not enough...

21 hours ago by Jack Schofield on Windows 8 could speed multi-monitor uptake
Jack Schofield

@Phil at Cloud4 What, Microsoft gets £1,200 per PC and £1,622 per server? Gosh, I'm amazed....

21 hours ago by Jack Schofield on 6 million wasted licences and £1,200 PCs: welcome to government IT
craigsc

You guys have no idea what is going on at Autonomy. Autonomy could have been a much more profitable organization. The sales operations at Autonomy...

23 hours ago by craigsc on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Moley

How does this impact on dual or multi booting? Seems to me to more or less prohibit this, from Windows 8 anyway. Will Grub 2 recognise Windows 8,...

23 hours ago by Moley on Windows 8 start-up speed forces USB boot workaround
apexwm

I don't understand why there cannot be a slight pause during the boot process so the user can press a key. Many operating systems do this, even if...

1 day ago by apexwm on Windows 8 start-up speed forces USB boot workaround
Gavin Goodman

You can now buy the Xi3 modular computer in the UK at http://www.ocdistribution.com . This can be bought with the Tand3m software, pricing and...

1 day ago by Gavin Goodman on CES 2012: Xi3 microSERV3R
Phil at Cloud4

I agree: Mike Lynch can clearly build a business and manage strategy. I suspect the exit of Mike is more likely the end of a planned handover...

1 day ago by Phil at Cloud4 on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Phil at Cloud4

This is unbeleivable government wastage with only one winner... Microsoft 1 - Tax payer Nil!

1 day ago by Phil at Cloud4 on 6 million wasted licences and £1,200 PCs: welcome to government IT
Mispam

So what do you do when you can't boot into windows? Why can't I just hold Shift while I power up instead of having to boot into windows and click a...

1 day ago by Mispam on Windows 8 start-up speed forces USB boot workaround
apexwm

I've also seen that Mac OS X for Intel machines is supposed to run in VirtualBox, which would also be a nice solution. I've never tried it though.

1 day ago by apexwm on xTreme Triple Booting: Linux, Mac & Windows
dave heasman

What I wonder is why when companies are caught bang to rights in not providing contracted services, people bend over to smear the customers? Surely...

1 day ago by dave heasman on Virgin throttles broadband for high-speed customers
pjc158

Strange statement from HP regarding Mike Lynch and not capable of scaling a company. Autonomy was a $7bn purchase which started as a small company...

1 day ago by pjc158 on HP cuts 27,000 staff as Autonomy chief Lynch leaves
lojolondon

Or - possibly, they will destroy business by ensuring people do not invest where there is no return. Another socialist idea, well beyond it's...

1 day ago by lojolondon on Open Data Institute will act as biz incubator
J.A. Watson

Good stuff Jake, very interesting. Thanks. jw

1 day ago by J.A. Watson on xTreme Triple Booting: Linux, Mac & Windows
openhgs

"the cost of a second LCD screen is about the same as one day of an office worker's time, so this should soon be recouped in extra productivity."...

2 days ago by openhgs on Windows 8 could speed multi-monitor uptake