This document discusses the importance of preparing applications to handle traffic surges and scaling for cloud deployments. It provides examples of outages experienced by companies like Netflix and Best Buy due to applications not being able to scale up during periods of high demand. The document then outlines strategies companies can use to avoid such outages, including using a cloud management platform to quickly provision resources, implement auto-scaling, and build resilient applications. It highlights how RightScale was able to help American Girl deploy a new application to AWS within 14 days using these approaches.
3. #
Super-Bowled: Coke Chase Site Outage
• Problem: Micro site to complement
ad-campaign goes down
• Length: 4 hours
• Impact: Lost engagement opportunity
for a large ad-campaign
• Cause: Site not prepared to handle
traffic
4. #
Bad Santa: Netflix Christmas Eve Outage
• Problem: AWS outage takes down
Netflix
• Length: 23 hours 41 minutes
• Impact: 27 million Netflix users
• Cause: Infrastructure issues at AWS
5. #
Very Black Black Friday for Best Buy
• Problem: Checkout delays when
sales tax calculator can’t scale
• Length: 1 hour
• Impact: Best Buy Loyalty
members can’t check out during
special promo. Best Buy pays for
expedited shipping.
• Cause: Application doesn’t scale
6. #
Click Frenzy Meets #ClickFail
• Problem: Heavily promoted
Australian alternative to Cyber
Monday fails almost immediately
• Length: 3 hours
• Impact: #ClickFail hashtag gets
28,000 mentions in two days
• Cause: Site not prepared to handle
traffic
9. 9
#
Success at American Girl
Situation:
• New virtual world for doll owners
• Challenges in data center
• Holiday catalog drop promoting virtual
world was imminent
RightScale Solution:
• Up and running with RightScale using
AWS in 14 Days
• Scaled to meet performance
• Stable system
• Reduced costs
Cloud Management Platform
10. 10
#
We Have Many More Such Stories
Web Campaigns
Mobile Apps
Games
Cloud Management Platform
11. 11
#
We Have A Lot Of Experience Doing This
7
10
5M+
10K+
years
clouds
servers
Scale to
servers
Cloud Management Platform
12. 12
#
Behind The Scenes: Elasticity And Scaling
Production Stack
Development
LOAD BALANCERS
LOAD BALANCERS
APP SERVERS
MASTER DB
Scale
Up
and
Down
APP SERVERS
SLAVE DB
Launch an
environment
in minutes
SNAPSHOTS
CLOUD
STORAGE
MASTER DB
SLAVE DB
Testing
LOAD BALANCERS
SNAPSHOTS
APP SERVERS
CLOUD
STORAG
E
MASTER DB
SLAVE DB
Test at
massive
scale
SNAPSHOTS
CLOUD
STORAGE
Cloud Management Platform
13. 13
#
Behind The Scenes: Resilience
DNS
US-EAST
US-WEST
LOAD BALANCERS
LOAD BALANCERS
APP SERVERS
APP SERVERS
MASTER DB
SLAVE DB
SLAVE DB
REPLICATE
REPLICATE
ELB
SNAPSHOTS
SNAPSHOTS
S3
Cloud Management Platform
14. 14
#
Behind The Scenes: Resilience – Multi-Cloud
DNS
Dallas
US-EAST
LOAD BALANCERS
LOAD BALANCERS
APP SERVERS
APP SERVERS
MASTER DB
SLAVE DB
SLAVE DB
REPLICATE
REPLICATE
CBS
SNAPSHOTS
SNAPSHOTS
CLOUD
FILES
Cloud Management Platform