Disaster recovery -- a checklist

ANALYSIS
Real world computing accepts that things go wrong with technology. Whether it's a solitary disk read error or a disaster that turns your HQ into ash and rubble, a breakdown in systems can ruin your business -- while a sensible recovery plan may be the one thing that keeps it going. Here are the basic steps to creating such a plan and making sure it will work when needed. Make disaster recovery an integral part of the way your business runs. Someone at the top needs explicit responsibility for overseeing the plan, as it is too easy to make dangerous economies when times are tough. Prioritise the data and systems that need to be recovered first. Each department thinks that theirs is the most important, but the decision has to be made -- and that usually ends up with the IT department, which may not have the appropriate business insight. Don't forget to look outside the data centre for things that need protecting. If employees have heavily customised desktops to do their work, how will it affect them if they have to start from scratch? Paper records are also always important. Make sure you have redundancy for critical systems, whether it's a RAID storage system, server mirroring or even a complete duplicate data centre. There should be no one point of failure, including power supplies, telecommunications or even the office building itself, that will disrupt your business for any length of time. Most companies will not survive an unplanned outage of critical systems that exceeds four days, and even lapses substantially less than that can be disproportionately damaging. Along with redundancy, backup is the most important part of disaster recovery. Once you know what you need to backup, decide when and how you will do your backups. A common scheme is to do a full backup at the beginning of each week, followed by deltas -- backups of changes -- at least daily if not more often. These can be differential backups, where the entire difference from the starting state is copied each time, or incremental, where the difference since the last backup is stored. Incremental backups take less time but produce more individual backups that have to be restored in order; with differential, you have just two restorations to make. Offsite backups are essential, but difficult to manage -- especially for the smaller company. Where teleworking is common, it may be possible to automate the keeping of remote copies of information as part of the standard access arrangements. Whatever the backup process -- and floppy disks, CD-Rs, removable hard disks, tapes, leased lines and VPNs are all common -- ensure that access to offsite backups isn't dependent on just one person. It is common to duplicate the weekly backup and keep it offsite, and also to keep monthly backups. Don't neglect security. If you need to make backups of sensitive information, is it adequately protected from attack if someone gets access to -- or steals -- the backup? Conversely, if you have a secure backup protected by encryption or severe access controls, is it possible to retrieve the information if key employees are missing? Run regular tests to shake the bugs out of your plan -- and that means testing absolutely everything. Countless businesses have suffered because the regular backup procedure seemed to be working perfectly until the time came to retrieve information in earnest. Tests that produce no errors aren't tough enough: you're not testing to make sure it works, but to find out when it doesn't. This will also tell you if your recovery procedure is working but too slow or cumbersome -- a system that comes back but takes two days to rebuild may be inappropriate. Deciding backup and restoration strategies should be part of the initial architectural planning of any major system and should influence bus types, storage devices and the segmentation of the network. When your business processes change, reassess your plans. An acquisition, new operating system installation or reorganisation can trigger this. Also, when you change an underlying system and migrate data over make sure you can recover to the old system for as long as may be necessary -- it's no good having old data you desperately need if you no longer have a system that will read it. Make sure your critical suppliers also have strict disaster recovery plans. There's no point in having your data in the hands of a company that is itself struggling to get back on its feet after a problem. And keep your own dull stuff up to date -- lists of employees with addresses and mobile phone numbers, supplier contacts, and making everyone's role in the recovery plan part of their basic training.
Have your say instantly in the Tech Update forum. Find out what's where in the new Tech Update with our Guided Tour. Let the editors know what you think in the Mailroom.

Post your comment

In order to post a comment you need to be registered and logged in.

You can also log in with Facebook. Log in or create your ZDNet UK account below

  • Login

Will not be displayed with your comment

By signing up for this service, you indicate that you agree to our Terms and Conditions and have read and understood our Privacy Policy. Questions about membership? Find the answers in the Community FAQ

Get ZDNet UK's daily newsletter

Enter your email address to sign up

ZDNet UK Live

GrueMaster

Nice review and very informative. One thing I'd like to add (in reply to whs001's 1st question), the main reason to have the same interface from...

19 minutes ago by GrueMaster on A tale of two distros: Ubuntu and Linux Mint
Frederick Wrigley

I'be been using Mint 12 since the RC came out, and I am far more happy with the Cinnamon, the Mate, and, yes (with extensions), theGnome 3...

1 hour ago by Frederick Wrigley via Facebook on A tale of two distros: Ubuntu and Linux Mint
bdantas

Excellent article. One small correction, though--although a fresh installation of Linux Mint 12 will, indeed, provide the user with a version of...

2 hours ago by bdantas on A tale of two distros: Ubuntu and Linux Mint
Alan Ralph

In related news, the ISPs club together to get the members of the Home Affairs Select Committee (ya goofed on that part, ZDNet UK) copies of "The...

2 hours ago by Alan Ralph via Facebook on MPs urge ISPs to take down terrorist material
Alan Ralph

In related news, the ISPs club together to get the members of the Home Affairs Select Committee (ya goofed on that part, ZDNet UK) copies of "The...

2 hours ago by Alan Ralph via Facebook on MPs urge ISPs to take down terrorist material
Moley

For Gnome 2 die-hards, it is possible to add icons to the bottom panel (or top top panel, if you prefer) which provide the exact Gnome 2...

3 hours ago by Moley on A tale of two distros: Ubuntu and Linux Mint
ramwellian

Your comments would seem pretty naive and immature. Your 'solution' appears to be, "gee, let's all just give in to the hackers and give them...

3 hours ago by ramwellian on Cloud computing security: no more oxymoron?
BugStalker

"Interesting thought ... If you installed Win7 as a dual boot on a machine that previously only had Linux, and it wrecked your Linux installation,...

4 hours ago by BugStalker on Windows 7 Declares War on GRUB
whs001

This is an excellent summary of Ubuntu and Mint and the interface differences between them. Most such articles take a very partisan position for...

4 hours ago by whs001 on A tale of two distros: Ubuntu and Linux Mint
Moley

@ewallace. Not so clear. Anyone can obtain the text, for example from here http://www.ustr.gov/webfm_send/2379. I support ACTA so long as it and...

4 hours ago by Moley on ACTA: Facts, misconceptions and questions
45283

I think WinRT is fantastic. I just wish it was an option for people that didn't want to go through Microsoft's App Store with its attendant...

7 hours ago by 45283 on Why Windows 8 needs architectural hygiene for WOA
Burn-IT

Nine people? £30m? Who's back pocket is that lot going in? And IF they say it is for new buildings, what about all the ones the government has...

8 hours ago by Burn-IT on Police set to launch three £30m e-crime hubs
ewallace

Just to be clear, nobody knows what is in the text of ACTA, here is a photograph of the text of ACTA http://twitpic.com/8h9iju as submitted to the...

8 hours ago by ewallace on ACTA: Facts, misconceptions and questions
fgvrg56

Unfortunately main issue is that ASUS is refusing to accept that they make some mistake on this version of asus Transformer prime. 1 - GPS sensor...

10 hours ago by fgvrg56 on Asus Eee Pad Transformer Prime Wi-Fi & GPS problems?
Ben Woods

@Marcus A fair question. Just talked with Archos which said it was working on an announcement for next week....

11 hours ago by Ben Woods on Archos confirms G9 Ice Cream Sandwich update schedule
Marcus Karlsson

Any update on this, considering the claimed "first week of February"?

12 hours ago by Marcus Karlsson via Facebook on Archos confirms G9 Ice Cream Sandwich update schedule
apexwm

Bill Goodrich : Just as al_langevin pointed out, with Windows Server 2008 there is no Services for Macintosh anymore. It's gone, not available....

20 hours ago by apexwm on Windows Server 2008 drops the ball for Mac compatibility
txtrainguy

Replying to an old topic that I'm currently facing with my CEO (who is on a Mac). Our servers are primarily Windows Servers, office is about...

1 day ago by txtrainguy on Windows Server 2008 drops the ball for Mac compatibility
k0tcs3

Sure, that makes perfect sense. Pay wrong-doers money and thank them for breaching your security and pointing out your flaws, that would surely...

1 day ago by k0tcs3 on US indicts Romanian over NASA climate change hack
Random_Error

I think he's referring specifically to Android apps, as Apple do regulate their App Store, but Google seem to let any old crap onto the Android store!

1 day ago by Random_Error on RIM: BlackBerry will keep 'garbage' apps out of store