What to do if your supercomputing supplier fails

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

COMMENT

High-performance computing operates at extremes of technological innovation and risk. That combination makes supplier failure a distinct possibility, but it needn't be the end of the world, says supercomputing expert Andrew Jones.

High-performance computing (HPC) delivers some of its greatest benefits because niche providers, or niche divisions of large suppliers, push the limits of technology and business. But rapid technological innovation, the shortage of HPC experts and the need for providers to operate at times near the limits of the possible mean the risk of suppliers disappearing is inevitable.

Users of HPC have always worried about supplier viability; those concerns are not new to the present economic situation, nor will they cease when things improve. The trick for the customer is to realise that supplier failure doesn't matter — providing it is properly managed. Indeed, turnover in HPC providers is a consequence of the innovation and technology-chasing that helps make the industry so potent in the first place.

Stable vendors only, please
I am fortunate enough to work for a healthy HPC company, but not all HPC suppliers are in that situation. Some appear to be perpetually on the brink. Many more are moving closer to it because of the downturn.

Customers naturally worry whether a prospective supplier will stay in business for the duration of a contract. That worry often translates into an overemphasis in the procurement process on the long-term financial position of suppliers, rather than the overall business relationship and the technology. Bidders are often made to jump through financial hoops far more than in other areas.

But the reality for most HPC services is that the most likely risk to smooth delivery will come from technology — especially relating to service installations or refreshes — or from people, especially towards the end of a contract.

Even in cases where most of the hardware and integration work will come from a single risky supplier, the main question is usually: "Will they survive long enough to complete my installation and establish a stable service?", rather than: "Will they survive the whole service lifetime?".

Alternative sources of support
That focus is because the technology is usually based on commodity hardware, perhaps with a specific differentiation, as discussed in my last column for ZDNet UK. Being aware of alternative sources of support for this technology is both good competitive business practice and sensible risk management.

Indeed, the concern should be over committing to a single source of technology support rather than supplier viability. And in any case, many people would contend that any piece of hardware is obsolete within a few months of installation.

Read this

How to stand out in the supercomputing crowd

HPC's key business benefit may be to differentiate an organisation from its rivals, but that shouldn't rule out the use of commodity products

Read more +

The key message is that with proper risk management, the risk of individual suppliers in certain areas of an overall provider mix going bust is no more or less a threat to the service delivery than the many, more-likely technology risks, ranging from roadmap and product delivery delays through architecture limitations and component failures to integration issues.

OK, sometimes it matters
For some aspects of the service, the end-user organisation will have concerns throughout the contract, and not just during the phase of creating a stable operation. For example, user and application support require a sustained relationship between customer and provider to be most effective.

The risk here is as much about people leaving as companies going bust. Indeed the movement of people in the HPC industry, especially between the big players, is probably a greater threat to continuity and quality of service than the risk of a smaller dedicated provider disappearing.

One obvious risk-mitigation strategy in the event of the disappearance of a smaller provider is to hire the recently redundant staff to work for you directly rather than via the ideal managed service — not perfect, but it is a good backup plan.

Clearly, though, the service lifetime relationship is core to the quality of service to users, so a stronger emphasis would be placed on the track record and viability of the prospective supplier.

Protecting against the personnel risk can also be managed by selecting a provider that can deliver support or other services, such as technology or procurement consulting, with a sufficient depth of staff. But some turnover of staff is good, because it can help with innovation and in sustaining the enthusiasm of the service provider.

Risk makes it possible
The inherent instability of the HPC industry — some companies operating near the edge of viability, others with good margins, new companies emerging, some disappearing, different people coming to the fore, and of course the technology race — is what makes HPC able to offer such an edge to its users' business.

In the HPC industry, as elsewhere, reward and risk are closely intertwined. Indeed, clichés are often simply well-repeated truths — "risk brings rewards", or "nothing ventured, nothing gained". Not to mention "cover your a**".

As vice president of HPC at the Numerical Algorithms Group, Andrew Jones leads the company's HPC services and consulting business, providing expertise in parallel, scalable and robust software development. Jones is well known in the supercomputing community. He is a former head of HPC at the University of Manchester and has more than 10 years' experience in HPC as an end user.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

Roberto_Store

Now On Sale, Unlocked iPhone 4S / Galaxy Note In Factory Box. Roberto-Techie(UK) ”Now on Sales” Smartphone, Android,Tablets,Gadget &...

2 hours ago by Roberto_Store on Samsung Galaxy S III lined up for sale
Paul Smyth

Is this classic FUD? One thing I would definitely have notice is a Mozilla threat to stop supporting GNU/Linux.

4 hours ago by Paul Smyth via Facebook on Firefox rapid release improves Fedora Linux
UnderINK

I agree with the previous commenter wholeheartedly. I couldn't say it better myself. This is very 'Big Brother'. And while I agree with protecting...

8 hours ago by UnderINK on European e-identity plan to be unveiled this month
Simon Bisson and Mary Branscombe

Nice to see that Turing's idea of a general purpose computer doing once-hardware-powered tasks in software is now universal ;-) Mary

14 hours ago by Simon Bisson and Mary Branscombe on Software with everything
Jason Burchell

seriously now. I've only bothered to read a small bit of the comments. do me and the rest of the world a favour. stop saying it does not work or...

17 hours ago by Jason Burchell via Facebook on Music industry negotiating over 24-bit downloads
Philip Charles Cohen

Read about it and weep, John Donahoe ... In addition to Visa’s V.me, there is now MasterCard’s PayPass digital wallet soon to arrive; another...

22 hours ago by Philip Charles Cohen via Facebook on PayPal takes phone-based payments to the high street
apexwm

Leslie Satenstein : Where have you ever seen Mozilla even mention this? Firefox is the most popular browser in the GNU/Linux OS, so I don't see...

22 hours ago by apexwm on Firefox rapid release improves Fedora Linux
songmaster

SHleG: Do you remember building a clockwork scorpion kit (I'm pretty sure I have a photo of it somewhere) — I think it was called something like...

1 day ago by songmaster on Software with everything
Chris Wortman

Good I love Yahoo! Their search engine is getting better than Google as of late. I find more of what I want on the first page, and usually within...

1 day ago by Chris Wortman via Facebook on Linux Mint 13 ramps up for KDE release
PatrickG

openhgs has made the point for Windows 8 multiple monitors without realising it! With Windows 7 you have to switch the mouse and so your focus...

1 day ago by PatrickG on Windows 8 could speed multi-monitor uptake
Leslie Satenstein

Mozilla has threatened to stop supporting Linux. I guess that UBUNTU is going with another browser. I indicated that if Mozilla stops supporting...

1 day ago by Leslie Satenstein via Facebook on Firefox rapid release improves Fedora Linux
Andy Bolstridge

Much as I abhor Microsoft's licensing practices, this is almost certainly down to purchasing IT equipment via 3rd party consultants - you get the...

1 day ago by Andy Bolstridge via Facebook on 6 million wasted licences and £1,200 PCs: welcome to government IT
Jack Schofield

@openhgs Windows users have had multiple desktops since Linus started writing Linux. They just haven't shipped as standard because not enough...

2 days ago by Jack Schofield on Windows 8 could speed multi-monitor uptake
Jack Schofield

@Phil at Cloud4 What, Microsoft gets £1,200 per PC and £1,622 per server? Gosh, I'm amazed....

2 days ago by Jack Schofield on 6 million wasted licences and £1,200 PCs: welcome to government IT
craigsc

You guys have no idea what is going on at Autonomy. Autonomy could have been a much more profitable organization. The sales operations at Autonomy...

2 days ago by craigsc on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Moley

How does this impact on dual or multi booting? Seems to me to more or less prohibit this, from Windows 8 anyway. Will Grub 2 recognise Windows 8,...

2 days ago by Moley on Windows 8 start-up speed forces USB boot workaround
apexwm

I don't understand why there cannot be a slight pause during the boot process so the user can press a key. Many operating systems do this, even if...

2 days ago by apexwm on Windows 8 start-up speed forces USB boot workaround
Gavin Goodman

You can now buy the Xi3 modular computer in the UK at http://www.ocdistribution.com . This can be bought with the Tand3m software, pricing and...

2 days ago by Gavin Goodman on CES 2012: Xi3 microSERV3R
Phil at Cloud4

I agree: Mike Lynch can clearly build a business and manage strategy. I suspect the exit of Mike is more likely the end of a planned handover...

2 days ago by Phil at Cloud4 on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Phil at Cloud4

This is unbeleivable government wastage with only one winner... Microsoft 1 - Tax payer Nil!

2 days ago by Phil at Cloud4 on 6 million wasted licences and £1,200 PCs: welcome to government IT