Showing posts with label Amazon. Show all posts
Showing posts with label Amazon. Show all posts

19 August 2013

Reuters: Amazon Offline for 15 Minutes

Reuters reported that U.S. and Canadian customers of Amazon experienced outages of up to 15 minutes on Amazon.com. While the Reuters headline referred to the outage as "rare," Amazon outages receive considerable attention. The Amazon S3 cloud architecture is thought to be one of the more robust in the rapidly developing cloud computing market.

Amazon did not respond inquiries from Reuters reporters.

The Amazon dashboard showing system status provided a message similar to this one (for Flexible Payment System) for a number of its services earlier today:
12:52 PM PDT Between 11:45 AM and 12:32 PM PDT, FPS experienced elevated error rates and latencies impacting CBUI pipeline, ASP, Pay, Reserve and Settle APIs in the North America region. The issue has been resolved and the service is now operating normally.

23 April 2011

Cloudy Computing: Amazon EC2 2011 Outage Added to 2008, 2009 Events

Cakewalk.com ECommerce Site Explains EC2 Outage to Customers
The net's collective memory can offset even the short memories of consumers and investors.Search for Amazon web service outages, and one is reminded that it was around the time that rumblings of the 2008 Big Recession began to make themselves felt, in July to be exact, that Amazon's S3 storage service suffered an outage. The July outage affected Twitter, which used S3 to store images. Keep reading through the search results, and one is further reminded that it was the second outage that year; earlier in February, Amazon explained the outage as an overloading of authentication requests. That problem turned out to be a problem with server-to-server system health reporting in Amazon's farm. In June of 2009, a lightning strike on one of Amazon's data centers took down some resources for more than four hours. Amazon's health dashboard described the latest problem as "connectivity and latency issues with RDS database instances in the US-East-1 region." In other words, for some customers, the Elastic Compute Cloud (EC2) stretched and snapped again in April 2011.

The screenshot above is taken from Cakewalk, a music technology firm based in the Boston Greater Metro area. Cakewalk is an Amazon customer and its ecommerce site was directly affected by the outage. As can be seen from their message, the outage resulted in an interruption to an existing web campaign, and obviously no ecommerce revenues were possible during the approximately two day outage. In addition to the lost revenue, as web staff scrambles to compensate for the outage, errors can be made in reconfiguring sites with workarounds. There's no way for the GlitchReporter to know the cause for sure, but Cakewalk's ecommerce store is now getting a 404 on the store's home page (see below).

These failures are infrequent, and failures will happen in the cloud just as they happen to internal data centers, but there are important differences, too. As the GlitchReporter noted not long ago with a Gmail cloud outage, the scale and complexity of cloud resources may reduce the frequency, but perhaps not the cascading effects of failures across enterprises. Perhaps more importantly, because technical communications from big company cloud firms (Microsoft, Google, Amazon) tend to be sketchy during such outages, customers could not be blamed for thinking (even if erroneously) that they might be better able to work around and plan for failures that occur inside their own data centers.


11 June 2009

Tweeted: Amazon Merchandise "Unavailable" Glitch Hits Some

At least some Amazon.com visitors experienced a curious glitch of some kind. According to @mit429, @LeighEllwood, @vagari, @WeberBooks -- to name a few, Amazon showed some or all titles as "unavailable." According to @Chris_Rotella, when he phoned Amazon (you can do that?), they told him that systems were being "updated" at the time.

According to the Twitter timestamps, the problem occurred sometime after 11A Eastern.

The Amazon AWS status page reported an ECS power failure that affected certain instances running in a specific affected server rack, but that had been remedied by 1:20A PDT. I don't know whether Amazon proper is running on the Amazon Web Services infrastructure, (despite use of the inset AWS image) but it may not be coincidental. The buzz would have been much larger if all of Amazon everywhere had been affected.