Soumettre la recherche
Mettre en ligne
System Availability Talk
•
Télécharger en tant que PPT, PDF
•
1 j'aime
•
908 vues
m_richardson
Suivre
Talk i gave on HA, resiliency and recovery of systems
Lire moins
Lire la suite
Technologie
Business
Signaler
Partager
Signaler
Partager
1 sur 33
Télécharger maintenant
Recommandé
Top ten secret weapons for performance testing in an agile environment
Top ten secret weapons for performance testing in an agile environment
alistairjones
GWAVACon 2013: Novell GroupWise
GWAVACon 2013: Novell GroupWise
GWAVA
How to measure the business impact of web performance
How to measure the business impact of web performance
SOASTA
[Rakuten TechConf2014] [F-6] Changing the Behavior of IT
[Rakuten TechConf2014] [F-6] Changing the Behavior of IT
Rakuten Group, Inc.
Interns as ft es for employer penalty
Interns as ft es for employer penalty
Patti Goldfarb, CSA
Filosofiametafisica 140310174603-phpapp01
Filosofiametafisica 140310174603-phpapp01
sebas andy
Cooking with Chef
Cooking with Chef
m_richardson
Sales Hacker Series San Francisco - Elay Cohen - To Sell Is To Be Human
Sales Hacker Series San Francisco - Elay Cohen - To Sell Is To Be Human
Sales Hacker
Recommandé
Top ten secret weapons for performance testing in an agile environment
Top ten secret weapons for performance testing in an agile environment
alistairjones
GWAVACon 2013: Novell GroupWise
GWAVACon 2013: Novell GroupWise
GWAVA
How to measure the business impact of web performance
How to measure the business impact of web performance
SOASTA
[Rakuten TechConf2014] [F-6] Changing the Behavior of IT
[Rakuten TechConf2014] [F-6] Changing the Behavior of IT
Rakuten Group, Inc.
Interns as ft es for employer penalty
Interns as ft es for employer penalty
Patti Goldfarb, CSA
Filosofiametafisica 140310174603-phpapp01
Filosofiametafisica 140310174603-phpapp01
sebas andy
Cooking with Chef
Cooking with Chef
m_richardson
Sales Hacker Series San Francisco - Elay Cohen - To Sell Is To Be Human
Sales Hacker Series San Francisco - Elay Cohen - To Sell Is To Be Human
Sales Hacker
MTBF / MTTR - Energized Work TekTalk, Mar 2012
MTBF / MTTR - Energized Work TekTalk, Mar 2012
Energized Work
Emc sql server 2012 overview
Emc sql server 2012 overview
solarisyougood
Musings of an MSP - Why Some Things Never Change and Others Have To - Datacom
Musings of an MSP - Why Some Things Never Change and Others Have To - Datacom
Amazon Web Services
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
Strangeloop
Disaster Recovery with MySQL and Tungsten
Disaster Recovery with MySQL and Tungsten
Jeff Mace
Walmart pagespeed-slide
Walmart pagespeed-slide
Bitsytask
Walmart Web Performance Circa 2013
Walmart Web Performance Circa 2013
Cliff Crocker
Presentation virtualizing oracle unlocked enterprise wide benefits
Presentation virtualizing oracle unlocked enterprise wide benefits
solarisyourep
O'Reilly webcast: Joshua Bixby on Mobile Performance Trends and Predictions
O'Reilly webcast: Joshua Bixby on Mobile Performance Trends and Predictions
Strangeloop
Scaling mature systems
Scaling mature systems
HanMorten
Why You Should Move to the Cloud
Why You Should Move to the Cloud
Eric (Sonny) Wilson, BSc.
Automation & Cloud Evolution - Long View VMware Forum Calgary January 21 2014
Automation & Cloud Evolution - Long View VMware Forum Calgary January 21 2014
James Charter
Executing the Digital Strategy
Executing the Digital Strategy
Ben Turner
Optimizing Browser Rendering
Optimizing Browser Rendering
michael.labriola
How to Choose the Right Cloud for Continuity
How to Choose the Right Cloud for Continuity
marketingunitrends
Works on my machine, your problem now? - QCon 2014
Works on my machine, your problem now? - QCon 2014
Wolfgang Gottesheim
At bruxelles scaling agile - v1.5 slideshare
At bruxelles scaling agile - v1.5 slideshare
Herve Lourdin
Scaling CQ5
Scaling CQ5
connectwebex
Dev talks Cluj 2018 : Java in the 21 Century: Are you thinking far enough ahead?
Dev talks Cluj 2018 : Java in the 21 Century: Are you thinking far enough ahead?
Steve Poole
Oracle primavera and bpm the power of integration ppt
Oracle primavera and bpm the power of integration ppt
p6academy
Persistence in the cloud with bosh
Persistence in the cloud with bosh
m_richardson
bootstrapping containers with confd
bootstrapping containers with confd
m_richardson
Contenu connexe
Similaire à System Availability Talk
MTBF / MTTR - Energized Work TekTalk, Mar 2012
MTBF / MTTR - Energized Work TekTalk, Mar 2012
Energized Work
Emc sql server 2012 overview
Emc sql server 2012 overview
solarisyougood
Musings of an MSP - Why Some Things Never Change and Others Have To - Datacom
Musings of an MSP - Why Some Things Never Change and Others Have To - Datacom
Amazon Web Services
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
Strangeloop
Disaster Recovery with MySQL and Tungsten
Disaster Recovery with MySQL and Tungsten
Jeff Mace
Walmart pagespeed-slide
Walmart pagespeed-slide
Bitsytask
Walmart Web Performance Circa 2013
Walmart Web Performance Circa 2013
Cliff Crocker
Presentation virtualizing oracle unlocked enterprise wide benefits
Presentation virtualizing oracle unlocked enterprise wide benefits
solarisyourep
O'Reilly webcast: Joshua Bixby on Mobile Performance Trends and Predictions
O'Reilly webcast: Joshua Bixby on Mobile Performance Trends and Predictions
Strangeloop
Scaling mature systems
Scaling mature systems
HanMorten
Why You Should Move to the Cloud
Why You Should Move to the Cloud
Eric (Sonny) Wilson, BSc.
Automation & Cloud Evolution - Long View VMware Forum Calgary January 21 2014
Automation & Cloud Evolution - Long View VMware Forum Calgary January 21 2014
James Charter
Executing the Digital Strategy
Executing the Digital Strategy
Ben Turner
Optimizing Browser Rendering
Optimizing Browser Rendering
michael.labriola
How to Choose the Right Cloud for Continuity
How to Choose the Right Cloud for Continuity
marketingunitrends
Works on my machine, your problem now? - QCon 2014
Works on my machine, your problem now? - QCon 2014
Wolfgang Gottesheim
At bruxelles scaling agile - v1.5 slideshare
At bruxelles scaling agile - v1.5 slideshare
Herve Lourdin
Scaling CQ5
Scaling CQ5
connectwebex
Dev talks Cluj 2018 : Java in the 21 Century: Are you thinking far enough ahead?
Dev talks Cluj 2018 : Java in the 21 Century: Are you thinking far enough ahead?
Steve Poole
Oracle primavera and bpm the power of integration ppt
Oracle primavera and bpm the power of integration ppt
p6academy
Similaire à System Availability Talk
(20)
MTBF / MTTR - Energized Work TekTalk, Mar 2012
MTBF / MTTR - Energized Work TekTalk, Mar 2012
Emc sql server 2012 overview
Emc sql server 2012 overview
Musings of an MSP - Why Some Things Never Change and Others Have To - Datacom
Musings of an MSP - Why Some Things Never Change and Others Have To - Datacom
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
2012 Annual State of the Union for Mobile Ecommerce Performance [Velocity EU]
Disaster Recovery with MySQL and Tungsten
Disaster Recovery with MySQL and Tungsten
Walmart pagespeed-slide
Walmart pagespeed-slide
Walmart Web Performance Circa 2013
Walmart Web Performance Circa 2013
Presentation virtualizing oracle unlocked enterprise wide benefits
Presentation virtualizing oracle unlocked enterprise wide benefits
O'Reilly webcast: Joshua Bixby on Mobile Performance Trends and Predictions
O'Reilly webcast: Joshua Bixby on Mobile Performance Trends and Predictions
Scaling mature systems
Scaling mature systems
Why You Should Move to the Cloud
Why You Should Move to the Cloud
Automation & Cloud Evolution - Long View VMware Forum Calgary January 21 2014
Automation & Cloud Evolution - Long View VMware Forum Calgary January 21 2014
Executing the Digital Strategy
Executing the Digital Strategy
Optimizing Browser Rendering
Optimizing Browser Rendering
How to Choose the Right Cloud for Continuity
How to Choose the Right Cloud for Continuity
Works on my machine, your problem now? - QCon 2014
Works on my machine, your problem now? - QCon 2014
At bruxelles scaling agile - v1.5 slideshare
At bruxelles scaling agile - v1.5 slideshare
Scaling CQ5
Scaling CQ5
Dev talks Cluj 2018 : Java in the 21 Century: Are you thinking far enough ahead?
Dev talks Cluj 2018 : Java in the 21 Century: Are you thinking far enough ahead?
Oracle primavera and bpm the power of integration ppt
Oracle primavera and bpm the power of integration ppt
Plus de m_richardson
Persistence in the cloud with bosh
Persistence in the cloud with bosh
m_richardson
bootstrapping containers with confd
bootstrapping containers with confd
m_richardson
Docker Service Registration and Discovery
Docker Service Registration and Discovery
m_richardson
Puppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdb
m_richardson
Node collaboration - sharing information between your systems
Node collaboration - sharing information between your systems
m_richardson
Node collaboration - Exported Resources and PuppetDB
Node collaboration - Exported Resources and PuppetDB
m_richardson
Serverspec and Sensu - Testing and Monitoring collide
Serverspec and Sensu - Testing and Monitoring collide
m_richardson
Chef - managing yours servers with Code
Chef - managing yours servers with Code
m_richardson
Open Source Monitoring Tools
Open Source Monitoring Tools
m_richardson
Plus de m_richardson
(9)
Persistence in the cloud with bosh
Persistence in the cloud with bosh
bootstrapping containers with confd
bootstrapping containers with confd
Docker Service Registration and Discovery
Docker Service Registration and Discovery
Puppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdb
Node collaboration - sharing information between your systems
Node collaboration - sharing information between your systems
Node collaboration - Exported Resources and PuppetDB
Node collaboration - Exported Resources and PuppetDB
Serverspec and Sensu - Testing and Monitoring collide
Serverspec and Sensu - Testing and Monitoring collide
Chef - managing yours servers with Code
Chef - managing yours servers with Code
Open Source Monitoring Tools
Open Source Monitoring Tools
Dernier
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Radu Cotescu
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Delhi Call girls
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
sudhanshuwaghmare1
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
apidays
Evaluating the top large language models.pdf
Evaluating the top large language models.pdf
ChristopherTHyatt
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
naman860154
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
The Digital Insurer
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Enterprise Knowledge
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
wesley chun
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
Product Anonymous
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
ThousandEyes
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
jfdjdjcjdnsjd
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Martijn de Jong
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Michael W. Hawkins
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Igalia
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
Safe Software
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
UK Journal
Dernier
(20)
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Evaluating the top large language models.pdf
Evaluating the top large language models.pdf
How to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
System Availability Talk
1.
Michael Richardson Twitter: @Mr_SPB 1©
2011 Energized Work - www.energizedwork.com Availability and Recoverability
2.
So what is
High Availability? • Five 9s? • No Single point of failure? • Multiple Data Centre’s? • Fault Tolerance? • Load Balancing? • Uptime? 2© 2012 Energized Work - www.energizedwork.com
3.
The 9’s of
Availability 3© 2012 Energized Work - www.energizedwork.com 9 9
4.
The 9’s of
Availability 4© 2012 Energized Work - www.energizedwork.com Availability Downtime per Year One nine (90%) 36.5 days Two nines (99%) 3.65 days Three nines (99.9%) 8.76 hours Four nines (99.99%) 52.56 minutes Five nines (99.999%) 5.26 minutes
5.
Problem with the
9’s 5© 2012 Energized Work - www.energizedwork.com • What do they mean? • Guaranteed or just an SLA • Multiplicity (99.9% * 99.9% * 99.9% = 99.7%)
6.
SLA availability numbers: just
aim to provide a level of confidence in a website’s service 6© 2012 Energized Work - www.energizedwork.com
7.
No Single Point
of Failure (SPOF) 7© 2012 Energized Work - www.energizedwork.com
8.
two of everything? 8©
2012 Energized Work - www.energizedwork.com
9.
Start with this 9©
2012 Energized Work - www.energizedwork.com Index.html Users
10.
End with this 10©
2012 Energized Work - www.energizedwork.com WEB1 switch 1 switch 2 WEB2 APP1 APP2 DB1 DB2 Firewall 1 Firewall 2 Users
11.
• It’s expensive
££ • Where do you draw the line? • Are failures independent • Can you guarantee No SPOF? • Increased complexity 11© 2012 Energized Work - www.energizedwork.com Problems with eliminating SPOF
12.
Problem: Data Centre’s
Fail 12© 2012 Energized Work - www.energizedwork.com
13.
Solution: Get a
2nd Data Centre 13© 2012 Energized Work - www.energizedwork.com
14.
Hot/Hot Multisite 14© 2012
Energized Work - www.energizedwork.com • Full range of services available in multiple locations. • Easy to automate failover of sites • Data Consistency is hard. • Capacity Planning concerns +
15.
Hot/Warm Multisite 15© 2012
Energized Work - www.energizedwork.com • Simpler than Hot/Hot • Read/write ratio dependant • Synchronous or Asynchronously replicate data? +
16.
Hot/Cold Multisite 16© 2012
Energized Work - www.energizedwork.com • Easy to setup • Will it work? • Can it be trusted? • Cold site rapidly become stale • Is it actually valuable? +
17.
DR Multisite 17© 2012
Energized Work - www.energizedwork.com • Fingers crossed you never need it. • How can/should you test it? • Cloud? +
18.
Problems with Multiple
sites 18© 2012 Energized Work - www.energizedwork.com • ££ - it’s expensive • Managing more systems • Managing consistency of Data • Managing Capacity • Is it still fail proof? • Unless you test it, it’s just a plan
19.
19© 2012 Energized
Work - www.energizedwork.com We now have a Complex System
20.
• More redundancy
and automation leads to more complexity. • More complexity often adds more points of failure. 20© 2012 Energized Work - www.energizedwork.com Complex Systems
21.
Author: Dr. Richard
Cook 21© 2012 Energized Work - www.energizedwork.com “How Complex Systems fail” • Catastrophe is always just around the corner. • Human Operators have dual roles. • Change introduces new forms of failure
22.
Failure and Recovery 22©
2012 Energized Work - www.energizedwork.com
23.
Questions for the
Customer 23© 2012 Energized Work - www.energizedwork.com • What is the cost of downtime? • What are the RTO and RPO?
24.
24© 2012 Energized
Work - www.energizedwork.com RTO = Recovery Time Objective RPO = Recovery Point Objective
25.
Aggressive RTO &
RPO is expensive and has a performance impact. 25© 2012 Energized Work - www.energizedwork.com
26.
RTO / RPO
example 26© 2012 Energized Work - www.energizedwork.com problem •Simple DB •Business can tolerate up to 15 minutes downtime •10 minute window of data lose.
27.
RTO / RPO
example 27© 2012 Energized Work - www.energizedwork.com Possible solution 1.Continuously replicate data to 2nd host 2.Continue with nightly backups and also copy DB transaction logs from the primary host to another system.
28.
So what’s more
important? 28© 2012 Energized Work - www.energizedwork.com Increasing Availability Or Reducing Recovery Time
29.
29© 2012 Energized
Work - www.energizedwork.com MTBF Or MTTR What about MTTD??
30.
30© 2012 Energized
Work - www.energizedwork.com Answer? It Depends
31.
31© 2012 Energized
Work - www.energizedwork.com Failure is inevitable
32.
32© 2012 Energized
Work - www.energizedwork.com Ask anyone
33.
33© 2011 Energized
Work - www.energizedwork.com Thank you The End Twitter - @Mr_SPB
Notes de l'éditeur
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Ask any business how much downtime is acceptable and you will get a consistent answer. 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Found more in Marketing literature than technical literature 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
An SLA is just an instrument that makes business people comfortable (just like insurance) 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
1 & 2 Diminishing returns Paradoxically, adding more components to an overall system design can undermine efforts to achieve high availability Cascading failures 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Read & Write anywhere Global Server Load Balancing with DNS 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Read intensive apps are well suited to this – Reads Hot/Hot 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Cold site is so untrusted that perhaps spending hours restoring the primary DC is a better and safer bet. 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Cold site is so untrusted that perhaps spending hours restoring the primary DC is a better and safer bet. 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Talk about capacity planning Hot/Hot – config switches Most companies don ’ t thoroughly test DC failover. When failure occurs many companies will often focus on restoring the failure in the primary DC rather attempt a failover. So why bother having a 2 nd DC anyway. If you plan on having multiple DC ’ s or DR then test your procedures when you ’ re not in an emergency situation. Game Day events 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Mention John Alspaw ’ s Qcon talk 2. Dual roles of humans Defenders against failure Producers of failure 3. Introduce a technology change To prevent low-consequence, but high frequency failures May introduce low frequency, but high consequence failure Introduce new pathways to large-scale, catastrophic failures. Focus of humans is on the beneficial charactistics of the change. New failure ’ s maybe difficult to foresee. Give config management example Knife Resolv.conf 3. Also covers maintenance and why many find it difficult. Build and forget mentality. 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Cost of downtime – easy or difficult to measure Can downtime actually be equated to lost revenue. Give online shopping example 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
RTO and RPO are often in competition Give eg of replication lag between 2 sites. Zero RPO example - If replication lags between systems and you have an aggressive RPO you maybe better off taking a few hours outage and focusing on restoring your primary site. Zero RTO example – if replication lags between DC ’ s you may decide to failover immediately and take the data loss for some inflight transactions Aggressive RTO & RPO is expensive and has a performance 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Typical nightly backups aren ’ t going to cut it. Common practice is to backup systems nightly. Is your business happy to lose up to 24 hours of data? Probably not. 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Covers you for any catastrophic hardware failure 2 nd host has independent storage infrastructure. Data corruption would however result in 2 copies of crap 2. Covers you for data corruption Playing back transaction logs will also allow you to identify the place where corruption occurred. 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
What about MTTD? 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
My experience tells me most companies focus on availability How many companies take nightly tape backups but have never bothered trying to restore or test them? If you think you can built a completely fail-proof system you are kidding yourself. How many companies have game days? 28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
28/10/10 © Energized Work Limited 2010 Agile Evangelists - LEANING
Télécharger maintenant