Amazon blames outage on complicated systems

Amazon.com appears to be blaming its complicated infrastructure for the outage that left it inaccessible to many US visitors for more than an hour and a half on Friday.

Amazon declared itself clear of the problem on Friday afternoon. "The Amazon retail site was down for approximately two hours earlier today beginning around 10.25am. The site [is] back up," the company said in a statement following the outage. "Amazon's systems are very complex and, on rare occasions, despite our best efforts, they may experience problems. We work to minimise any disruption and to get the site back as quickly as possible." Amazon declined to comment further.

The site, which is held up as an exponent of cloud computing due to the large number and complexity of web services used by partner sites, went offline completely by 10.21am PDT on Friday. Efforts to restore it appeared to be taking effect at about noon, said Keynote Systems, which monitors website responsiveness. As of 12.45pm, the site was working intermittently, with many product pages functioning but others still broken.

"At noon PDT, we started to see the site getting better," said Shawn White, director of external operations for Keynote. "We [were] seeing about 70 percent availability."

Sustained outages can be a serious problem. EBay suffered outages in 1999 that outraged users and sent the stock down; even a backup system didn't ward off more problems in 2002.

For major commerce sites, the problem can have a ripple effect. Both Amazon and eBay provide a commercial foundation used by many partners and entrepreneurs.

Based on last quarter's revenue of $4.13bn (£2.09bn) globally, a full-scale global outage would cost Amazon more than $31,000 per minute, on average. For North America, it would be more than $16,000 per minute. Those figures do not include revenue from other sources, such as search or contextual advertisements or Amazon Web Services.
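The global per-minute figure can be checked with simple arithmetic. The sketch below is only an illustration: it assumes a 91-day (13-week) quarter and revenue spread evenly across every minute, neither of which the article states, and the function name is made up for this example.

```python
# Rough check of the per-minute revenue figure quoted above.
# Assumptions (not from the article): a 91-day quarter and revenue
# spread evenly over every minute of that quarter.

QUARTER_REVENUE_USD = 4.13e9  # last quarter's global revenue, per the article
DAYS_IN_QUARTER = 91          # assumed length of the quarter


def revenue_per_minute(quarter_revenue: float, days: int = DAYS_IN_QUARTER) -> float:
    """Average revenue per minute over a quarter of the given length."""
    minutes_in_quarter = days * 24 * 60
    return quarter_revenue / minutes_in_quarter


per_minute = revenue_per_minute(QUARTER_REVENUE_USD)
print(f"Average global revenue: ${per_minute:,.0f} per minute")
```

Under those assumptions the result comes out a little above $31,000 per minute, in line with the figure cited. Real losses would depend on the time of day and on how many shoppers simply returned later.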

It appeared that Amazon Web Services such as the S3 storage and EC2 computing services continued to function at least for some customers, though the Amazon Web Services page at Amazon.com wasn't working.

"S3 and EC2 continue to function for us as normal," said Don MacAskill, chief executive of photo-sharing site Smugmug. Mashery.com chief executive Oren Michels, who uses Amazon Web Services for several functions and who has several customers who use Amazon Web Services, reported no problems on Friday.

As for an explanation of the outage, the company hinted only that its complicated computing infrastructure was the culprit.

In White's estimation, the most likely culprit was simple human error.

"Some engineer might have made a particular change, not knowing it could cause a trickle-down effect [that eventually brought down the site]," said White.

For example, he said, somebody in charge of maintenance might have been directing internet traffic to a particular group of servers, but selected the wrong group.

"What I find still so surprising is that it happened in the middle of the day. Typically, you do that in off-peak hours," White said. "[Amazon] ranks on the top with performance and availability, consistently, time and time again."

Another possible explanation is an attack such as the distributed denial-of-service (DDoS) attack that struck Amazon and other high-profile sites in 2000. White said he thinks it unlikely, though, that a crushing load of network traffic brought Amazon down.

"These guys are experts at dealing with flash floods of users", including those that routinely arrive during peak shopping days, said White. "Usually, when you see a site going under because of traffic issues or a denial-of-service attack, you see a gradual slowdown in performance and drop in availability. Here, we saw at 10.16am that it completely dropped off — 100 percent."

Soups Ranjan, a senior member of the technical staff of network protection and management company Narus, has not yet found any evidence of an attack.

"It doesn't seem to be the result of a network-initiated attack, at least from my preliminary analysis from our probes," Ranjan said.

Human error may not sound as gripping a tale as a network attack, but there's plenty of drama for the people responsible. And it's the career-limiting variety of drama, said Illuminata analyst Gordon Haff, who hazarded a guess that Amazon's problem involved its front-end web servers.

The security group of WebSense, a website and communications protection company, also saw no evidence that Amazon's problem was security-related.

CNET News.com's Robert Vamosi contributed to this report.

Talkback

Hello,
I am a student at Grenoble School of Business, and I'm doing research on "business dependence on the Internet". Can you help me by answering three quick questions, and then by sending this message to your competent colleagues or friends?
The questionnaire is here: http://pasczoon.free.fr/blogen.html

Thank you in advance.
Pascal, email: Internet.Dependence.Study –at- gmail.com

PascZoon 9 June, 2008 19:52

Technology has changed our lives. Now it's high time that we updated our systems to be always connected.

Richards

Richards 6 August, 2008 16:44
