Are supercomputers just better liars?

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

COMMENT

Supercomputers might be better at providing the right answers — or they could just be providing the wrong answers in far greater detail, says Andrew Jones.

Supercomputers enable simulations to use much higher resolutions, more detailed physics, and a greater amount of input data. Yet does all that extra simulation power ensure their predictions are more accurate — or more likely to be right than a simpler model? In other words, how do you know your supercomputer is telling the truth?

It is a fact of modern life that many things we deal with daily are influenced by computer modelling in some way. Phones, soft-drink cans, vehicles, computers, weather forecasts, food containers, healthcare products — all are designed with input from computer modelling. Those products are shipped to us with logistics supported by computer modelling. Everyday life is powered by energy found or generated by computer modelling.

Increasingly, computer simulations are replacing physical testing for most design work — jet-engine failures are now tested in computer simulations many times before a single final physical test is conducted. That testing saves huge amounts of time and money for designers of jet engines.

The computer says...
In fact, so relied on are these tests that the physical test is only carried out when the computer says it will be completely successful. Similar stories can be found elsewhere — the Airbus 380 was famously designed almost entirely through computer simulations.

In other situations, the computer model is the only way to get a prediction. We can't pop into tomorrow to check the weather, so we have to ask a computer for its predictions — not to mention climate-change assessments. With lightning-strike protection, even if we could find lightning with repeatability, persuading pilots to be the first to try a design would be problematic. Equally, with high-speed crash testing, you would be hard pushed to assure a test driver that the safety system will work.

Real lives or business decisions with large economic impact will often rest on computer predictions. So how do we know they are right?

Model comparisons
Thankfully, most developers of computer models test them before releasing them for real design work. They may compare the predictions with those from other models — perhaps using different principles or algorithms — or with physical testing, historical data, or other known data points.

Supercomputers are often called into use for this purpose. In fact, one of the biggest roles of supercomputer simulations in science and engineering is exploring the validity of the models. Models are pushed to extreme scales, data sets and boundary conditions, to help establish the confidence that they will be safe with less extreme parameters.

Read this

How to stand out in the supercomputing crowd

HPC's key business benefit may be to differentiate an organisation from its rivals, but that shouldn't rule out the use of commodity products

Read more +

Many users of models are rigorous about validating their predictions, especially those users with a strong link to the advancement of the model or its underpinning science. But, unfortunately, not all users of models are so scrupulous.

They think the model must be right — after all, it is running at a higher resolution than before, or with physics algorithm v2.0, or some other enhancement, so the answers must be more accurate. Or they assume it is the model supplier's job to make sure it is correct. And yes, it is — but how often do users check that their prediction relies on a certified part of parameter space?

Misleading assumptions
Even for developers, the assumptions of increasing scale can be misleading. Higher resolutions do not guarantee more accurate results. For example, is the code numerically capable of handling the smaller or larger floating-point numbers involved? Do the algorithms remain stable over the larger number of iterations? Does the code use a subroutine or library call that may not have been designed or certified for this regime?

What about correctness errors unique to parallel processing, such as race conditions? At the higher end of supercomputing, fault tolerance is becoming critical — not just in node failures, but in softer errors such as data corruption in memory or in the interconnect.

What to do? As the use of high-performance computing becomes more widespread, we need to be wary of the assumptions that more powerful simulations are more accurate. In most cases they will be. In many cases they will explore parts of a design that are not possible any other way. But we need to check.

Building in safety
I don't have the space here to describe how to build safety into model development and validate its use; but I'll leave you with two thoughts.

First, more computational power, through high-performance computing, will make better modelling possible — but be sure to use some of that computational power to validate the modelling.

Second, don't make the dangerous assumption that physical testing is always better than computer predictions. Physical testing has its own sources of errors, assumptions, and regions of validity. Disagreement between computer and measurement does not mean the computer is wrong. You should explore the accuracy of both.

As vice president of HPC at the Numerical Algorithms Group, Andrew Jones leads the company's HPC services and consulting business, providing expertise in parallel, scalable and robust software development. Jones is well known in the supercomputing community. He is a former head of HPC at the University of Manchester and has more than 10 years' experience in HPC as an end user.

Talkback

Since many years it is known that computers produce numerical results of questionable accuracy. This is an effect of the finite length of numbers in storage.

Yet it is also known that this can be overcome by the proper use of interval arithmetic. Mathematics, algorithms and programming languages were developed at the Institute of Applied Mathematics of Karlsruhe University by the group of Prof. Kulisch in the 80-ies.

At present interval arithmetic executes slowly because there is no proper hardware support. This would cost only little, but no manufacturer of processors found it worth the effort till today.

So it is a fact: Supercomputers produce wrong numbers faster than ordinary computers!

If anybody (Intel?, AMD?, IBM?, SUN?) wants to know more on hardware support for interval arithmetic, send mail to
user: kirchner
host: informatik.uni-kl.de

kirchner-r 26 March, 2009 11:26
Reply

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

Roberto_Store

Now On Sale, Unlocked iPhone 4S / Galaxy Note In Factory Box. Roberto-Techie(UK) ”Now on Sales” Smartphone, Android,Tablets,Gadget &...

2 hours ago by Roberto_Store on Samsung Galaxy S III lined up for sale
Paul Smyth

Is this classic FUD? One thing I would definitely have notice is a Mozilla threat to stop supporting GNU/Linux.

4 hours ago by Paul Smyth via Facebook on Firefox rapid release improves Fedora Linux
UnderINK

I agree with the previous commenter wholeheartedly. I couldn't say it better myself. This is very 'Big Brother'. And while I agree with protecting...

8 hours ago by UnderINK on European e-identity plan to be unveiled this month
Simon Bisson and Mary Branscombe

Nice to see that Turing's idea of a general purpose computer doing once-hardware-powered tasks in software is now universal ;-) Mary

14 hours ago by Simon Bisson and Mary Branscombe on Software with everything
Jason Burchell

seriously now. I've only bothered to read a small bit of the comments. do me and the rest of the world a favour. stop saying it does not work or...

18 hours ago by Jason Burchell via Facebook on Music industry negotiating over 24-bit downloads
Philip Charles Cohen

Read about it and weep, John Donahoe ... In addition to Visa’s V.me, there is now MasterCard’s PayPass digital wallet soon to arrive; another...

22 hours ago by Philip Charles Cohen via Facebook on PayPal takes phone-based payments to the high street
apexwm

Leslie Satenstein : Where have you ever seen Mozilla even mention this? Firefox is the most popular browser in the GNU/Linux OS, so I don't see...

22 hours ago by apexwm on Firefox rapid release improves Fedora Linux
songmaster

SHleG: Do you remember building a clockwork scorpion kit (I'm pretty sure I have a photo of it somewhere) — I think it was called something like...

1 day ago by songmaster on Software with everything
Chris Wortman

Good I love Yahoo! Their search engine is getting better than Google as of late. I find more of what I want on the first page, and usually within...

1 day ago by Chris Wortman via Facebook on Linux Mint 13 ramps up for KDE release
PatrickG

openhgs has made the point for Windows 8 multiple monitors without realising it! With Windows 7 you have to switch the mouse and so your focus...

1 day ago by PatrickG on Windows 8 could speed multi-monitor uptake
Leslie Satenstein

Mozilla has threatened to stop supporting Linux. I guess that UBUNTU is going with another browser. I indicated that if Mozilla stops supporting...

1 day ago by Leslie Satenstein via Facebook on Firefox rapid release improves Fedora Linux
Andy Bolstridge

Much as I abhor Microsoft's licensing practices, this is almost certainly down to purchasing IT equipment via 3rd party consultants - you get the...

1 day ago by Andy Bolstridge via Facebook on 6 million wasted licences and £1,200 PCs: welcome to government IT
Jack Schofield

@openhgs Windows users have had multiple desktops since Linus started writing Linux. They just haven't shipped as standard because not enough...

2 days ago by Jack Schofield on Windows 8 could speed multi-monitor uptake
Jack Schofield

@Phil at Cloud4 What, Microsoft gets £1,200 per PC and £1,622 per server? Gosh, I'm amazed....

2 days ago by Jack Schofield on 6 million wasted licences and £1,200 PCs: welcome to government IT
craigsc

You guys have no idea what is going on at Autonomy. Autonomy could have been a much more profitable organization. The sales operations at Autonomy...

2 days ago by craigsc on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Moley

How does this impact on dual or multi booting? Seems to me to more or less prohibit this, from Windows 8 anyway. Will Grub 2 recognise Windows 8,...

2 days ago by Moley on Windows 8 start-up speed forces USB boot workaround
apexwm

I don't understand why there cannot be a slight pause during the boot process so the user can press a key. Many operating systems do this, even if...

2 days ago by apexwm on Windows 8 start-up speed forces USB boot workaround
Gavin Goodman

You can now buy the Xi3 modular computer in the UK at http://www.ocdistribution.com . This can be bought with the Tand3m software, pricing and...

2 days ago by Gavin Goodman on CES 2012: Xi3 microSERV3R
Phil at Cloud4

I agree: Mike Lynch can clearly build a business and manage strategy. I suspect the exit of Mike is more likely the end of a planned handover...

2 days ago by Phil at Cloud4 on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Phil at Cloud4

This is unbeleivable government wastage with only one winner... Microsoft 1 - Tax payer Nil!

2 days ago by Phil at Cloud4 on 6 million wasted licences and £1,200 PCs: welcome to government IT