BlackBerry outage: Was it inevitable?

Daily Newsletters

Sign up to ZDNet UK's daily newsletter.

NEWS

RIM's massive BlackBerry email outage this week highlights how vulnerable the company's network has become as it tries to keep up with demand for its popular service.

RIM did not provide details of what caused the outage, which left millions of BlackBerry subscribers without access to email on Tuesday evening and into Wednesday morning. The company said in a statement released early on Wednesday that it was still reviewing the situation.

But analysts say that judging from the nature of the outage and who was affected, the problem falls squarely on RIM's shoulders. For one, the outage only affected data services, including email and mobile web-browsing. Subscribers were still able to make phone calls and send and receive SMS text messages.

All of this points to some kind of technical issue within one of RIM's Network Operations Centres (NOCs), which act as an intermediary between corporate mail servers and recipients.

The email outage, first reported by WNBC, began at around 17:00 (PDT) on Tuesday and lasted until the small hours of the morning on Wednesday when email began trickling into inboxes to users across North America and parts of Europe and Asia. The widespread disruption highlights just how vulnerable RIM's network has become, especially as the company's subscriber base grows.

Over the years, RIM has built a good reputation as a reliable service provider attracting bankers, lawyers and even congressional lawmakers as subscribers. Lately the company has also been trying to attract more mainstream customers with new handsets such as the BlackBerry Pearl and the BlackBerry 8800, both of which include media players and mobile browsers for web surfing.

The result has been a spike in subscriber growth. In the company's latest quarter, it reported it had added 1.02 million new subscribers, taking its total to eight million. This is a huge increase from the two million subscribers the company reported a year ago when it settled its patent infringement case with NTP. The company expects to add between 1.125 million and 1.15 million subscribers during the current quarter.

Dan Taylor, managing director of the Mobile Enterprise Alliance, a not-for-profit trade organisation that promotes enterprise mobility, said: "With all the recent subscriber growth the company has seen, it's not surprising that they would have network problems.

"They've just about quadrupled their subscriber base in the last 12 to 16 months. In some ways it was an accident waiting to happen. I'm sure the people running the NOC were aware that something could happen, and I'm sure they are working to get it fixed."

How it works
While it's not known for sure what caused RIM's outage, it's not difficult to see how the very nature of RIM's network could potentially lead to a major service outage. RIM's service is centralised and it works by routing all BlackBerry emails through one of two main NOCs, which are essentially large data centres. One NOC is located in Canada and it primarily services the Western Hemisphere as well as parts of Asia, said analysts familiar with the company. The other data centre, located in the UK, handles email traffic in Europe, Africa and the Middle East.

The BlackBerry Enterprise Server, which sits on the corporate network, receives emails from the company's Exchange or Lotus email server and forwards those emails in an encrypted tunnel to one of the NOCs. The NOC then acts as an efficient delivery system that authenticates users and forwards the messages to the appropriate handheld device.

Because user authentication is handled by RIM away from the corporate network, it protects companies from hackers who may try to obtain information through email servers, which sit inside the company's firewall. RIM's approach also means corporate IT departments don't have to juggle relationships with multiple mobile operators because RIM handles all of that for them in the NOC.

The flipside of RIM's approach is that with only two NOCs handling emails from eight million subscribers, there are two major points of potential failure. And when something goes wrong in one or both of these data centres, it can result in an outage like the one that occurred on Tuesday night and Wednesday morning, which technologically paralysed users.

Gene Signorini, vice president of enterprise research at the Yankee Group, said: "Anytime you have a situation where traffic is flowing through a single data centre, there is potential for a catastrophic outage. But, that said, the RIM architecture also provides a lot of benefits to its corporate customers. It's just the nature of the beast."

Some of the most common issues that can result in an outage are power failures, failure of a critical component that takes down a larger component, software bugs, viruses and other attacks from the outside, or patches that fail. RIM hasn't identified which issue caused this particular outage but Todd Kort, principal analyst at Gartner said the outage may have been caused by a software bug.

He said: "If the RIM outage is affecting other parts of the globe, this fact most likely points to some type of software bug."

While Motorola's Good Technology also uses NOC architecture to push email to subscribers, competing mobile email solutions like the ones sold by Microsoft and Nokia through Intellisync do not route emails through a centralised data centre and are therefore immune to this kind of outage. With these architectures, a single company or a single mobile operator might experience an outage, but it would be nothing in comparison to the magnitude of what was witnessed with BlackBerry.

That said, companies managing their own corporate wireless email do not get the management and security benefits that those using RIM's BlackBerry service get.

Investors seemed to take the BlackBerry outage in stride, with the company's share price changing little on Wednesday as service began to come back to life. Gartner's Kort said he believes that whatever infrastructure problems the company's quick growth may be causing, he is confident that RIM will quickly fix them.

"RIM has already recovered nicely from paying $600m to NTP in its patent case," he said. "It has about $1.4bn in cash, so I'm sure they can buy whatever state-of-the-art equipment they need to keep their network solid."

Talkback

Whatever can go wrong, WILL go wrong.

A lesson for all network administrators, of all network sizes.

julian 19 April, 2007 14:08
Reply

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

Tony Douglas

Please God no; teach them anything you like - thinking rationally, the uses and misuses of data, what data is and what it's not - but leave the...

2 hours ago by Tony Douglas via Facebook on Kids are the future. Teach ’em to code.
BrownieBoy

@Jack, > Works really well for thieves.... Nice attempt to deflect the argument by tossing in a point that's totally irrelevant, even it were...

16 hours ago by BrownieBoy on AMD Ultrathins to challenge Intel Ultrabooks
bootlegger

Make that 13 people now - I got refused today at Manchester airport. I thought I was up to date on this legislation - I knew of the EU ruling from...

19 hours ago by bootlegger on UK airport body scans will not be opt out
tinycg

Don't forget to check out apps like GoodReader or SlideShark either, they're indispensible for people on the go in presentation situations. Best...

22 hours ago by tinycg on Four top iPad apps for people on the move
TerryRK

Well it seems there is something a number of us agree on. Why is the Ubuntu Unity launcher so ugly? I thought perhaps it was something to do with...

1 day ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
Freebies202

Duplicate comments are not made intentionally. Its very good to know that now you are keeping check on this problem because sometimes a commenter...

2 days ago by Freebies202 on Microsoft fixes blog comments, speeds up blogs with open source
kevinmchapman

"the very significant number of users" and "many (most) of us" - you have no evidence for these statements. It is a fact that most users are saying...

2 days ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
Marg Menzies Harrison

Another grammar faux pas is the improper use of "you". When sitting down down in a restaurant, for example, I get cringe when the waitress...

2 days ago by Marg Menzies Harrison via Facebook on 10 flagrant grammar mistakes that make you look stupid
zdnetukuser

And NOW, folks, for Canonical's next trick... Kubuntu is late. Here's a pencil. Draw your own conclusions. cf.:...

2 days ago by zdnetukuser on Linux Minterface
Moley

@kevinmchapman. The discussion here reflects the very significant number of users who really do like the traditional menu system and who wish to...

2 days ago by Moley on A tale of two distros: Ubuntu and Linux Mint
kevinmchapman

Er, no... It is an efficient means of finding the application/file/setting you need in one place. The icons are a simply a fallback for when you...

2 days ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
TerryRK

Isn't the provision of a text based search an admission by the developers that the mass of icons approach does not work? I don't need to use a...

2 days ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
kevinmchapman

"Unity and GNOME 3 both abandon the old text-based cascading menus in favour of a graphical icon-driven system." Point truly missed. Both use a...

2 days ago by kevinmchapman on A tale of two distros: Ubuntu and Linux Mint
TerryRK

whs001 - Thank you, I'm glad you liked the article. I absolutely agree with you on your first point. I should perhaps have made it clearer that...

2 days ago by TerryRK on A tale of two distros: Ubuntu and Linux Mint
Dennis Nilsson

If we allow corporate interest to dictate the way our government circumvents due process against foreign entities then we should accept the same...

2 days ago by Dennis Nilsson via Facebook on ACTA stumbles in Germany
GHar123

I totally dislike pirating of works, I fear that artists will be deterred from creating works if they think that they are going to get ripped off....

2 days ago by GHar123 on ACTA stumbles in Germany
JCB33

How dare film makers, artists or anybody that invests in creativity stop us pirating their works for free. I want to be able to walk into my local...

3 days ago by JCB33 on ACTA stumbles in Germany
Moley

@GrueMaster. I prefer horses for courses rather than one size fits all. I, and I suspect most other computer users, do not really wish to have...

3 days ago by Moley on A tale of two distros: Ubuntu and Linux Mint
greycynic

The product that scares me every time I have to use it is the Office 2007 version of Excel. The first bug that I found was applying the median...

3 days ago by greycynic on Ten flawed products that derail productivity
GrueMaster

Nice review and very informative. One thing I'd like to add (in reply to whs001's 1st question), the main reason to have the same interface from...

3 days ago by GrueMaster on A tale of two distros: Ubuntu and Linux Mint