Data Storage - eWeek


Data Storage: Data Center Disaster Preparedness: 10 Tips for Minimizing the Damage

By Chris Preimesberger on 2011-10-18


Tsunamis, hurricanes, earthquakes, floods, extreme cold, extreme heat, human error, power surges—the causes of data center disasters are numerous. Do you really know if your data center is prepared for the next natural disaster? This is not a multiple-choice question. The answer is either "yes" or "no," and if it's no, then you may have some serious work to do. Anyone can back up data automatically. Storage is relatively inexpensive, and automated software makes it almost a no-brainer to keep everything stored. But in the event of a major outage, can you restart your systems quickly and efficiently with little or no loss of data? This is where regular preparedness testing comes in. Tedious though it is, it’s nonetheless an imperative component of the whole disaster-recovery process. If you don’t know how well your system works, how can you ever depend on it? The insights in this slide show are offered by Brad Wurtz, president and CEO of Power Assure Inc.

  • of

Forecast as Best You Can

Use real-time monitoring, trend analysis and forecasting tools to plan responses to events: Understand system utilization and application tiers, and spot patterns and behaviors that help mitigate better peak demand and unforeseen failures and outages.

Resource Redundancy Is a Must

Structure your data center redundancy as appropriate for your application service levels. Set up backup energy supplies from various sources to respond to unplanned events. Build in the appropriate redundancy levels for the application priority tier your data center is expected to continue to deliver through the duration of the emergency.

Test Triage Procedure for Power Outage

Use run-book automation to implement the power capping, throttling or powering up or down of servers based on demand for computing resources and available capacity. Continue operations of most critical applications during outages.

Check UPS System Regularly

Design and maintain your power backup uninterruptible power supply to ensure reliability: Size your generators and build your UPS/battery setup according to your data center needs and flexibility for shifting to another site.

Establish Application Priorities

Identify application tiers as follows: 4=mission critical, 3=highly desirable, and 2=non-essential. There is no No. 1 in this priority framework. Adjust storage cluster sizes and resource allocation based on application tiers. Define emergency procedures and priorities based on application tiers.

Plan for Server Capacity Triage

Create specific steps that transfer applications to and from available systems and locations, and implement shedding and shifting and automated failover.

Keep Disaster-Recovery Apparatus Up-to-Date

Maintain an up-to-date disaster-recovery and business-continuity plan for multiple scenarios, ranging from a brief brownout to an extended blackout, including weeklong outages when even fuel maintenance services will no longer be available.

Consider Automating Disaster Recovery

Implement disaster-recovery plan automation. Automate processes to move most critical applications to failover sites without service interruption, including storage and networking.

Test Failover for Reliability

Regularly schedule failover testing to ensure reliability; verify redundancy mechanisms while the system is under load in preparation of a potential failure.

Appoint a Team to Own Business Continuity

Form a cross-functional team to own business continuity and develop new processes to improve business continuity.

  • More slideshows

Advertisement

FEATURED SPONSOR MESSAGE

Microsoft Sponsored Resource Center

Windows Azure is a public cloud platform for building, hosting and scaling applications. Try Windows Azure free for 90 days and get 20GB outbound and unlimited inbound data transfer.

Learn more

Brought to you by

 
eWEEK Quick LInks

 
Close this advertisement