Amazon Still Resolving Problems on Second Day of Outage

 
 
By Chris Preimesberger  |  Posted 2011-04-22 Email Print this article Print
 
 
 
 
 
 
 

Numerous Websites were still offline in some systems by late afternoon April 22-more than 40 hours after the initial outage.

Amazon.com reported that problems that shut down some of its Web services-namely its Elastic Compute Cloud, Relational Database Service and Elastic Beanstalk-still hadn't been completely resolved by late afternoon April 22, more than 40 hours after they went offline.

The most critical outage started at 1:41 a.m. PDT April 21 at an AWS (Amazon Web Services) data center in Northern Virginia and caused disruptions in its EC2 (Elastic Compute Cloud) hosting service, knocking thousands of Websites-including such popular ones as Foursquare, Reddit, Quora and Hootsuite-off the Internet.

These Websites and numerous smaller sites were still offline in some systems by late afternoon April 22. Businesses that depend on the AWS hosting service lost money during that window of time-income that cannot be regained.

The AWS Elastic Beanstalk, which software developers use for deploying and managing applications in the AWS cloud, was running but was experiencing performance problems, Amazon said. Elastic Beanstalk automatically handles the deployment details of capacity provisioning, load balancing, auto-scaling and application health monitoring.

Amazon reported April 22 on its status Website that it has made progress in fixing the outage.

"We continue to see progress in recovering volumes, and have heard many additional customers confirm that they're recovering. Our current estimate is that the majority of volumes will be recovered over the next 5 to 6 hours," Amazon said Friday morning. "As we mentioned in our last post, a smaller number of volumes will require a more time-consuming process to recover, and we anticipate that those will take longer to recover."

By late afternoon Pacific Time on Friday, Amazon had issued a total of 19 updates on its status page since the outages began. EC2, Relational Database Service and Elastic Beanstalk were still having problems.

SLA Not Violated, However

Lydia Leong of Gartner Research wrote in an advisory that Amazon EC2 didn't actually violate its service-level agreement when the outage occurred.

"Amazon's SLA for EC2 is 99.95 percent for multi-AZ deployments," Leong wrote. "That means that you should expect that you can have about 4.5 hours of total region downtime each year without Amazon violating its SLA.

"Note, by the way, that this outage does not actually violate their SLA. Their SLA defines unavailability as a lack of external connectivity to EC2 instances, coupled with the inability to provision working instances. In this case, EC2 was just fine by that definition. It was Elastic Block Store [EBS] and Relational Database Service [RDS] which weren't, and neither of those services have SLAs."

An Amazon spokeswoman didn't respond to an eWEEK query by end of business April 22.

Mixed Reaction from Customers

Reaction to the outage from cloud customers was mixed.

"Proponents of cloud computing aren't going to like the fact that Amazon had issues that resulted in outages among its customers' sites, but the fact is that most insurers have their own outages when they host applications internally, in some cases with more frequency and severity than we're seeing here with Amazon," Craig Weber, a senior vice president of the Insurance Group at Celent, a Boston-based financial research and consulting firm.

"This outage should focus the discussion on the relative reliability of various approaches and the tradeoffs between them. Of course, there are also lessons about being aware of the capabilities of your business partners.

"Engaging with an SAAS [software as a service] vendor requires understanding things like their architecture, their disaster-recovery capability and similar issues, because worst-case scenarios always seem to emerge eventually."

Morphlabs, which was one of the first AWS solution providers when it launched Morph Appspace in 2007, now has more than 4,000 users. Founder and CEO Winston Damarillo told eWEEK that organizations need to focus on two things to help get a better understanding of their cloud solutions: diversity and control.

"A multi-vendor approach to the cloud means that an organization is not relying on one company or solution to keep its cloud in working order," Damarillo wrote.

"With the hybrid cloud model, companies are able to extend existing infrastructure resources without isolating themselves. When the time comes that they have maxed out their hardware compute capabilities behind a firewall, they can easily make use of the public cloud, as well."

 

 

 
 
 
 
Chris Preimesberger Chris Preimesberger was named Editor-in-Chief of Features & Analysis at eWEEK in November 2011. Previously he served eWEEK as Senior Writer, covering a range of IT sectors that include data center systems, cloud computing, storage, virtualization, green IT, e-discovery and IT governance. His blog, Storage Station, is considered a go-to information source. Chris won a national Folio Award for magazine writing in November 2011 for a cover story on Salesforce.com and CEO-founder Marc Benioff, and he has served as a judge for the SIIA Codie Awards since 2005. In previous IT journalism, Chris was a founding editor of both IT Manager's Journal and DevX.com and was managing editor of Software Development magazine. His diverse resume also includes: sportswriter for the Los Angeles Daily News, covering NCAA and NBA basketball, television critic for the Palo Alto Times Tribune, and Sports Information Director at Stanford University. He has served as a correspondent for The Associated Press, covering Stanford and NCAA tournament basketball, since 1983. He has covered a number of major events, including the 1984 Democratic National Convention, a Presidential press conference at the White House in 1993, the Emmy Awards (three times), two Rose Bowls, the Fiesta Bowl, several NCAA men's and women's basketball tournaments, a Formula One Grand Prix auto race, a heavyweight boxing championship bout (Ali vs. Spinks, 1978), and the 1985 Super Bowl. A 1975 graduate of Pepperdine University in Malibu, Calif., Chris has won more than a dozen regional and national awards for his work. He and his wife, Rebecca, have four children and reside in Redwood City, Calif.Follow on Twitter: editingwhiz
 
 
 
 
 
 
 

Submit a Comment

Loading Comments...
 
Manage your Newsletters: Login   Register My Newsletters























 
 
 
 
 
 
 
 
 
 
 
Rocket Fuel