When supercomputing benchmarks fail to add up

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

COMMENT

Benchmarking is either an invaluable procurement tool or just a pointless attempt to quantify the immeasurable, says Andrew Jones.

Benchmarking is valuable for many reasons. It can measure the performance of a system architecture or new algorithms, or evaluate progress in application development and identify hotspots to optimise. But the highest-profile use of benchmarking is to help with high-performance computing (HPC) buying decisions.

Benchmarking is so important in procurement that it would be foolish to try doing without it. Yet it is also something that can be hard to get right, because of complexity, the plethora of options and user applications, and the difficulty of keeping the benchmarking in proportion to the scale of investment.

In supercomputing, benchmarking is an area of specialism in its own right, both on the vendor and user sides. It is a discipline that can help make the most of your investment, but it can also make your head whirl in the process.

Using benchmarks correctly
For example, people often assume the system with the best benchmark will win the order. Sometimes bidders make this assumption, seeing the benchmarking as the most concrete aspect of the proposal evaluation process, and sometimes it is buyers who think the benchmark will provide an unambiguous winner.

Of course, in any sensible procurement, benchmarks are no more than a supporting element. They are one method of determining the business benefit. But other business aspects also matter — for example, reliability, service partnership and price.

Benchmarks will not — or should not — exclusively pick the winner, but they can and should be used to narrow the field and help avoid buying a turkey.

Excessive demands
However, nearly as bad as those procurements that use no benchmarking are those that require each bidder to benchmark a large number of applications under specific rules on their tendered systems. Sometimes the benchmarking effort by the bidder and buyer is out of proportion to the scale of the proposed procurement.

The efforts of bidders under these excessive conditions raise costs and reduce margins, potentially affecting the price-performance available to the customer, or the long-term viability of the bidder's business. In some cases, the size of the benchmark effort compared with the investment on offer simply means the vendor cannot even afford to bid, therefore reducing competition and options for the customer.

The effort from the procurement team is also significant. They have to define the benchmarks, support the vendors running them, collate and compare the results, and investigate anomalies. All these activities take time, raise the cost of the procurement and, again, potentially reduce the money available for investing in the proposed system.

Get the balance right
So, the trick is to balance the benchmarking with the scale of the procurement. For small procurements, or for clearly defined user needs — for example, a few applications with known dataset patterns — buyers could use a very lightweight benchmarking process or publicly available and verified benchmarks. Or they could employ a benchmarking specialist with experience of the applications and systems to provide independent advice.

If the procurement is larger, high profile, involves public money or a diverse set of users and applications, a wider benchmarking exercise becomes viable. It is also important to get the size of benchmark dataset right. If you are buying a bigger machine to cope with workload growth, don't just use today's problem size.

Prove it
Perhaps the most important use of benchmarks is in acceptance tests. These are tests the buyer conducts after the winning system has been delivered and the vendor wants to hand it over to the customer — followed by an invoice, of course. The tests check that the vendor has supplied a system to match the bid and that it works correctly.

Benchmarks need to be a key part of this process. If the system delivered cannot match the bid and rejects it, then negotiate a remedy in the form of a discount or extra performance, or, if necessary, make a business decision to accept the solution as-is, knowing the risk.

Read this

What to do if your supercomputing supplier fails

Supercomputing providers often live on the edge — technologically and financially. But if your supplier fails, it need not be a disaster

Read more +

Benchmarks should not be the only feature of an acceptance test, and the set used for acceptance may be either more or less thorough than the procurement benchmarks. But to be fair and to get the right result, it must be made clear to bidders what they will have to commit to in terms of benchmarks.

If the procurement benchmarks are only a performance guide by the vendors, then their value to you in comparing solutions is low. If the procurement benchmarks are part of the acceptance suite, then bidders must be held to them, and their effort and the impact on price and risk in the bid should be commensurate.

It is surprising that so many HPC procurements are conducted without benchmarking playing any role at all. In these cases, the buyer is relying on the vendor's sales pitch and subjective assumptions about value for money. How can the return on investment be evaluated under those conditions?

As vice-president of HPC at the Numerical Algorithms Group, Andrew Jones leads the company's HPC services and consulting business, providing expertise in parallel, scalable and robust software development. Jones is well known in the supercomputing community. He is a former head of HPC at the University of Manchester and has more than 10 years' experience in HPC as an end user.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

Roberto_Store

Now On Sale, Unlocked iPhone 4S / Galaxy Note In Factory Box. Roberto-Techie(UK) ”Now on Sales” Smartphone, Android,Tablets,Gadget &...

3 hours ago by Roberto_Store on Samsung Galaxy S III lined up for sale
Paul Smyth

Is this classic FUD? One thing I would definitely have notice is a Mozilla threat to stop supporting GNU/Linux.

5 hours ago by Paul Smyth via Facebook on Firefox rapid release improves Fedora Linux
UnderINK

I agree with the previous commenter wholeheartedly. I couldn't say it better myself. This is very 'Big Brother'. And while I agree with protecting...

9 hours ago by UnderINK on European e-identity plan to be unveiled this month
Simon Bisson and Mary Branscombe

Nice to see that Turing's idea of a general purpose computer doing once-hardware-powered tasks in software is now universal ;-) Mary

14 hours ago by Simon Bisson and Mary Branscombe on Software with everything
Jason Burchell

seriously now. I've only bothered to read a small bit of the comments. do me and the rest of the world a favour. stop saying it does not work or...

18 hours ago by Jason Burchell via Facebook on Music industry negotiating over 24-bit downloads
Philip Charles Cohen

Read about it and weep, John Donahoe ... In addition to Visa’s V.me, there is now MasterCard’s PayPass digital wallet soon to arrive; another...

22 hours ago by Philip Charles Cohen via Facebook on PayPal takes phone-based payments to the high street
apexwm

Leslie Satenstein : Where have you ever seen Mozilla even mention this? Firefox is the most popular browser in the GNU/Linux OS, so I don't see...

23 hours ago by apexwm on Firefox rapid release improves Fedora Linux
songmaster

SHleG: Do you remember building a clockwork scorpion kit (I'm pretty sure I have a photo of it somewhere) — I think it was called something like...

1 day ago by songmaster on Software with everything
Chris Wortman

Good I love Yahoo! Their search engine is getting better than Google as of late. I find more of what I want on the first page, and usually within...

1 day ago by Chris Wortman via Facebook on Linux Mint 13 ramps up for KDE release
PatrickG

openhgs has made the point for Windows 8 multiple monitors without realising it! With Windows 7 you have to switch the mouse and so your focus...

1 day ago by PatrickG on Windows 8 could speed multi-monitor uptake

Mozilla has threatened to stop supporting Linux. I guess that UBUNTU is going with another browser. I indicated that if Mozilla stops supporting...

1 day ago by via Facebook on Firefox rapid release improves Fedora Linux

Much as I abhor Microsoft's licensing practices, this is almost certainly down to purchasing IT equipment via 3rd party consultants - you get the...

1 day ago by via Facebook on 6 million wasted licences and £1,200 PCs: welcome to government IT
Jack Schofield

@openhgs Windows users have had multiple desktops since Linus started writing Linux. They just haven't shipped as standard because not enough...

2 days ago by Jack Schofield on Windows 8 could speed multi-monitor uptake
Jack Schofield

@Phil at Cloud4 What, Microsoft gets £1,200 per PC and £1,622 per server? Gosh, I'm amazed....

2 days ago by Jack Schofield on 6 million wasted licences and £1,200 PCs: welcome to government IT
craigsc

You guys have no idea what is going on at Autonomy. Autonomy could have been a much more profitable organization. The sales operations at Autonomy...

2 days ago by craigsc on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Moley

How does this impact on dual or multi booting? Seems to me to more or less prohibit this, from Windows 8 anyway. Will Grub 2 recognise Windows 8,...

2 days ago by Moley on Windows 8 start-up speed forces USB boot workaround
apexwm

I don't understand why there cannot be a slight pause during the boot process so the user can press a key. Many operating systems do this, even if...

2 days ago by apexwm on Windows 8 start-up speed forces USB boot workaround
Gavin Goodman

You can now buy the Xi3 modular computer in the UK at http://www.ocdistribution.com . This can be bought with the Tand3m software, pricing and...

2 days ago by Gavin Goodman on CES 2012: Xi3 microSERV3R
Phil at Cloud4

I agree: Mike Lynch can clearly build a business and manage strategy. I suspect the exit of Mike is more likely the end of a planned handover...

2 days ago by Phil at Cloud4 on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Phil at Cloud4

This is unbeleivable government wastage with only one winner... Microsoft 1 - Tax payer Nil!

2 days ago by Phil at Cloud4 on 6 million wasted licences and £1,200 PCs: welcome to government IT