Supercomputing: Small firms making a big impact

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

Sophisticated, but affordable
The move away from monolithic machines stuffed with proprietary hardware and software built by companies such as IBM, Cray and NEC has come about through a combination of technological sophistication and appealing prices.

While computers such as NEC's Earth Simulator are still preferred for some tasks, such as weather prediction, researchers have found that most applications can be run on clusters of two- and four-processor assemblies from Intel or Advanced Micro Devices running Linux.

(Large clusters, including a much heralded one at Virginia Polytechnic Institute, have also been built out of Apple PCs running IBM chips. Srinidhi Varadarajan, the brains behind Virginia Tech's cluster, also serves as California Digital's chief technology officer.)

Many of these cluster contracts still go to large computer makers. Dell and IBM installed the first two NCSA clusters, and Hewlett-Packard has also signed several contracts. But small companies are undeniably winning many prestigious projects.

Increased familiarity at the labs with these types of systems has cut implementation times and costs. Lawrence Livermore, for example, has hired its own Linux kernel and compiler experts to speed the shift to clusters.

"We're not buying a solution. We buy the pieces individually and act as the general contactor," said Mark Seager, assistant department head for advance technology at Lawrence Livermore. "By doing this, we are getting a huge price-performance boost, by a factor of two or three."

Although the core servers are built around standardised components, heavy-duty technical expertise is required to jump into this market. Linux Networx helps customers decide how many processors and how much memory to install, as well as what sort of interconnect technology -- InfiniBand, Gigabit Ethernet, Quadrics' QSNet, or Myricom's MyriNet -- will be most appropriate. And it assembles and tests the servers and interconnects in its own factory before shipping them for easy reassembly at the customer site.

Software also is a huge component of the contracts.

"As the watermark of commodity, off-the-shelf technology rises, the area of our differentiation also moves up. More and more of the percentage of system value [is] from those things that go around the hardware," said Bernard Daines, founder and CEO of Linux Networx.

Verari in April acquired MPI Software, a small company specialising in the Message Passing Interface software that lets different nodes in a cluster communicate.

"There is a fair amount of sophistication needed to truly understand what is going on, on the software side of a cluster and to eliminate latency on the network," said Bone of California Digital. "There is no technical reason why someone could not figure out the intricacies of clusters, but it requires a methodological accumulation of expertise of low-latency interconnects, management tools and other things."

Technical expertise aside, these smaller companies are also getting a boost from government policies designed to help the little guy.

"As a government organisation, we have to help small corporations," said Lawrence Livermore's Seager. "We feel this is something we should encourage. Being small isn't a killer as long as they are qualified."

A short history of clusters
The cluster paradigm -- independent machines connected with a high-speed network -- has been around for more than 15 years, according to Dave Turek, leader of IBM's "Deep Computing" team. "What's changed in recent years is that they can be assembled using Linux, Intel or AMD processors and conventional networks instead of exotic, rare or customised technology."

Many say the turnover began in 1999 and 2000. At that time, mass-produced and relatively inexpensive Intel chips began to surpass RISC chips in performance, according to Intel executives -- a gap that has continued to widen. Linux and Beowulf clustering technology, which allows Linux boxes to be tied together, have also become more widespread.

Lawrence Livermore created its first Linux-Intel cluster in the late 1990s. The machine contained Pentium II chips, which limited memory bandwidth, but Seager said the lab knew the setup would evolve. The introduction of the Pentium 4 became a watershed moment for the lab by expanding bandwidth from 800 megabytes per second to 2.4 gigabytes per second, Seager said.

These smaller companies to some degree followed the lead of what the labs were doing.

"We were building these clusters by hand. They were offering to put these things together for us. They would do the assembly but not provide much support," said the NCSA's Pennington.

California Digital was founded in 1994 and initially specialised in relatively generic Intel servers. In June 2001, it bought the hardware unit of VA Linux and refocused itself on the high-performance computing market.

The Thunder project then came along after Lawrence Livermore researchers were impressed with manageability tools California Digital released to the open-source community. By chance, the company had just completed an Itanium 2 project for a large corporate client -- "one of the biggest corporations in the world", Bone said -- that required California Digital to build a cluster that could run 21 different applications, an unusually large amount.

Breaking into this market isn't easy, though. "This is a very tight-knit community," said Jason Waxman, director of multiprocessor platform marketing at Intel. Not only do companies have to have technical sophistication, "they have to know what it takes to win a bid and deal with government contracts."

"This is not a field where you can walk in off the street. You have to have some credibility," Pennington added.

The tight-knit nature of the market also involves to some extent the secrecy of various computing projects. Waxman knows of a relatively new start-up formed by refugees from one of the classic supercomputer makers, but he said he couldn't reveal the name. California Digital's Bone said he couldn't reveal the names of any corporate customers. Pennington, meanwhile, refused to divulge the scope of NCSA's current proposal request.

Despite these challenges, however, the growth opportunities for these companies seem strong.

"In the future, we will be getting more Linux clusters," said Seager.

CNET News.com's Stephen Shankland contributed to this report.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

Jack Schofield

@openhgs Windows users have had multiple desktops since Linus started writing Linux. They just haven't shipped as standard because not enough...

12 hours ago by Jack Schofield on Windows 8 could speed multi-monitor uptake
Jack Schofield

@Phil at Cloud4 What, Microsoft gets £1,200 per PC and £1,622 per server? Gosh, I'm amazed....

12 hours ago by Jack Schofield on 6 million wasted licences and £1,200 PCs: welcome to government IT
craigsc

You guys have no idea what is going on at Autonomy. Autonomy could have been a much more profitable organization. The sales operations at Autonomy...

14 hours ago by craigsc on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Moley

How does this impact on dual or multi booting? Seems to me to more or less prohibit this, from Windows 8 anyway. Will Grub 2 recognise Windows 8,...

14 hours ago by Moley on Windows 8 start-up speed forces USB boot workaround
apexwm

I don't understand why there cannot be a slight pause during the boot process so the user can press a key. Many operating systems do this, even if...

15 hours ago by apexwm on Windows 8 start-up speed forces USB boot workaround
Gavin Goodman

You can now buy the Xi3 modular computer in the UK at http://www.ocdistribution.com . This can be bought with the Tand3m software, pricing and...

15 hours ago by Gavin Goodman on CES 2012: Xi3 microSERV3R
Phil at Cloud4

I agree: Mike Lynch can clearly build a business and manage strategy. I suspect the exit of Mike is more likely the end of a planned handover...

19 hours ago by Phil at Cloud4 on HP cuts 27,000 staff as Autonomy chief Lynch leaves
Phil at Cloud4

This is unbeleivable government wastage with only one winner... Microsoft 1 - Tax payer Nil!

19 hours ago by Phil at Cloud4 on 6 million wasted licences and £1,200 PCs: welcome to government IT
Mispam

So what do you do when you can't boot into windows? Why can't I just hold Shift while I power up instead of having to boot into windows and click a...

20 hours ago by Mispam on Windows 8 start-up speed forces USB boot workaround
apexwm

I've also seen that Mac OS X for Intel machines is supposed to run in VirtualBox, which would also be a nice solution. I've never tried it though.

21 hours ago by apexwm on xTreme Triple Booting: Linux, Mac & Windows
dave heasman

What I wonder is why when companies are caught bang to rights in not providing contracted services, people bend over to smear the customers? Surely...

22 hours ago by dave heasman on Virgin throttles broadband for high-speed customers
pjc158

Strange statement from HP regarding Mike Lynch and not capable of scaling a company. Autonomy was a $7bn purchase which started as a small company...

22 hours ago by pjc158 on HP cuts 27,000 staff as Autonomy chief Lynch leaves
lojolondon

Or - possibly, they will destroy business by ensuring people do not invest where there is no return. Another socialist idea, well beyond it's...

1 day ago by lojolondon on Open Data Institute will act as biz incubator
J.A. Watson

Good stuff Jake, very interesting. Thanks. jw

1 day ago by J.A. Watson on xTreme Triple Booting: Linux, Mac & Windows
openhgs

"the cost of a second LCD screen is about the same as one day of an office worker's time, so this should soon be recouped in extra productivity."...

1 day ago by openhgs on Windows 8 could speed multi-monitor uptake
Thomas Gellhaus

I also installed the KDE version; I also will probably try out razorqt since I really haven't had a chance to before. I'm looking forward to the...

2 days ago by Thomas Gellhaus via Facebook on Mageia 2 Released
francisabigail

Acquiring when reinvention/cannibalization is too challenging for a large organization can be an excellent strategy- still, so many mergers stumble...

2 days ago by francisabigail on Ariba buy parks SAP on Oracle's cloud turf
apexwm

All of the feedback regarding using a touch monitor for a desktop PC is right on. Several months ago, we installed a "demo" multitouch all-in-one...

2 days ago by apexwm on Windows 8 could speed multi-monitor uptake
191706

anyone wanting to triple boot *their* own Mac

2 days ago by 191706 on xTreme Triple Booting: Linux, Mac & Windows
SoapyTablet

Cont.. Biggest Bugbear: Win7's stop-animate-go approach to work, you develop a staggered (not in the above alchohol sense of the word) approach to...

2 days ago by SoapyTablet on Windows 8 could speed multi-monitor uptake