FierceCIOFierceCIOTechWatchFierceMobileITFierceContentManagementFierceGovernmentIT   FierceVoIPFierceHealthITFierceFinanceIT

Why your cloud service will eventually fail

Tools
Tags
no doubt
Google
Gmail
amazon


I'm sure you've heard about it by now--considering the indignation and anger that rippled through the Internet--Google's Gmail went down again earlier this week, this time for a total of 100 minutes. The official Gmail blog has the technical details of what exactly happened, though I thought fellow editor of FierceCIO Judi Hasson wrote a succinct report pertaining to the important facts of this outage.

For my opinion on the debacle, feel free to check out "Gmail goes down; is it really cause for concern?." For today's commentary though, I wanted to focus on the bigger picture related to the very hot topic of cloud computing.

Expectations over cloud computing

There is no doubt that everyone wants a slice of the cloud computing pie. Just this week alone, we see VMware announcing a new service called vCloud Express, a thinly-veiled Amazon competitor. Even IBM has plans to roll out its own cloud-based desktop, which can be accessed via nothing more than a standard web browser with Java support.

So why were those 100 minutes of downtime on Gmail such a huge deal? I know, business got disrupted and all, but you can't tell me that none of your enterprise servers have ever crashed. Indeed, I never even noticed the downtime myself, as I use an auto-forward rule on my Gmail account, which effectively bypassed the affected web interface.

It is at this point that we arrive at the crux of the matter: That nobody expects a cloud-based architecture to fail. It's that simple. So even though Gmail was never ever taken down for system maintenance, shutdown temporarily for hardware "upgrades," or experienced any of the "scheduled downtime" that is so common in the enterprise, everyone's expectation on the matter is still unanimous and unyielding. You see, cloud services are not supposed to go down.

The downside of cloud computing

Now that we are clear on our expectations for cloud computing, I want to show you why it is not a realistic expectation to have perfect uptime, in spite of everything you have read.

You see, in our haste to build the smartest, most fault-redundant network, and the best hardware, what many forget is the somber fact that the human "fallibility quotient" remains unchanged. By that, I mean the ability for the best of us to make mistakes, or for crucial oversights to happen.

Ironically, while human errors committed under the old paradigm are limited at most to a single corporation or department, errors in the cloud are amplified many times over. With large services like Gmail, even the smallest downtime has the potential to be felt around the globe.

In addition, consider this: A smaller company would just have flipped the reboot switch, probably taking only 10 to 15 minutes to get things up and running again. This option is clearly not possible for Google's Gmail. Alas, the behemoth and immensely powerful system created to host millions of Gmail users is also necessarily complex.

Just as it takes kilometers of sea for an immense, oil-carrying supertanker just to come to a stop, problems can take time to rectify even if remedied practically immediately. This was the case with the recent outage--all of 100 minutes in fact. - Paul Mah

Bookmark and Share
Get Your FREE FierceCIO:TechWatch Email Newsletter:
Comments (5) | Post a comment

Comments

This is a great article regarding just how unrealistic some expectations have been set. When AWS went out for a while earlier this year, a lot of people started crying "The sky is falling" then as well.

It's a shame that 100 minutes of downtime would cause such a big reaction but, you're absolutely correct in pointing out that because of the size and usage of Gmail, even the smallest amount of down time reverberates around the world. In most cases, if an Enterprise website was down for only 100 minutes, most of the people involved would be happy. Kind of ironic.

More to the point, you're absolutely correct, the various Cloud platforms will go down for some length of time, sometime in the future. It's an imperfect world we live in. We can only rely on the professionalism of the technicians on the other end that work dilligently to bring us back to an operating state as quickly as possible.

Great Article! The more people who feel that going to the cloud is a great solution need to remember that the more of your services you pile into the cloud, the more risk you are taking by having unknown sources manage it, unknown down time, unknown qualifications of those managing it, unknown quality / stability of the equipment, unplanned outages, and several other key unknown factors you have no direct control over. Also the fact that you will have to step through a million hoops of IT support to get some lame answer when a problem occurs can be quite troublesome as well. At least by having an IT Department in house, you can walk to them or call them direct to get status updates and information you need with direct answers. I think going to the cloud is a risky maneuver for any company or IT Director to bet their company or career on. Just say NO to cloud for me.....

Not sure about the Cloud craze yet. I do know one thing, being down almost 2 hours in some businesses would mean loss of jobs.

You might not feel so comfortable with that kind of downtime if you were a patient in the hospital with a critical problem and the on-call resident couldn't get to your chart because all the patient records had been moved into the cloud, as Google Health would recommend.

in addition to "the more risk you are taking by having unknown sources manage it, unknown down time, unknown qualifications of those managing it,"
there s the #1 issue of reduced security. brands depedent on privacy and security who fall for a pitch from amazon, et al are putting their business model at risk, for the sake of making Jeff Bezos richer.
once there was back ip, co-location, akamaisation. But those allowed too many competitors, too many entrepreneurs.

Post new comment

The content of this field is kept private and will not be shown publicly.

More information about formatting options

To combat spam, please enter the code in the image.