SlideShare une entreprise Scribd logo
1  sur  45
Télécharger pour lire hors ligne
Resilience
in
Data Management
1
2015 Davide P. Carioni
Resiliency in data management
A case for Redundant Arrays of Inexpensive Disks
when: 1988
where: Chicago
who: D. A. Patterson, G. Gibson, and R. H. Katz
thesis: a top performing mainframe disk drive can be beaten on performance by an array
of inexpensive drives developed for personal computer market.
abstract:
2
2015 Davide P. Carioni
Resiliency in data management
RAID
3
2015 Davide P. Carioni
Resiliency in data management
Redundant Arrays of Independent Disks
In disks array several independent disks are considered as a single, large, high-
performance logical disk.
The data are striped across several disks accessed in parallel:
• high data transfer rate: large data accesses (heavy I/O op.)
• high I/O rate: small but frequent data accesses (light I/O op.)
• load balancing across the disks
Two orthogonal techniques:
• redundancy: to improve reliability
• data striping: to improve performance
4
2015 Davide P. Carioni
Resiliency in data management
redundancy in a I/O operation (a simple example)
5
ADAPTER
ABCDEFGH
ABCDEFGH
ABCDEFGH
2015 Davide P. Carioni
Resiliency in data management
Data striping in a I/O operation (a simple example)
6
AB
CD
EF
ADAPTER
GH
GH
GH
ABCDEFGH
2 byte interleaving (stripe unit)
2015 Davide P. Carioni
Resiliency in data management
virtualization in a I/O operation (a simple example)
7
ADAPTERABCDEFGH
single large logical disk
2015 Davide P. Carioni
Resiliency in data management
Data striping
striping: data are written sequentially in units on multiple disks according to a
cyclic algorithm (round robin)
stripe unit: dimension of the unit of data that are written on a single disk
stripe width: number of disks considered by the striping algorithm (does not
necessarily coincide with the number of physical disks in the array – there can
be “hot spares”)
Performance gains:
• multiple independent I/O requests will be executed in parallel by several
disks decreasing the queue length (and time) of the disks
• single multiple-block I/O requests will be executed by multiple disks in
parallel increasing of the transfer rate of a single request
8
2015 Davide P. Carioni
Resiliency in data management
Parallelism and reliability
the more physical disks in the array
the larger the size and performance gains
but …
the larger the probability of failure of a disk
⇓
this is the main motivation for the introduction of
redundancy
9
2015 Davide P. Carioni
Resiliency in data management
Parallelism and reliability
The probability of a failure (assuming independent failures) in an array of 100
disks is 100 higher the probability of a failure of a single disk
Redundancy: error correcting codes (stored on disks different from the ones
with the data) are computed to tolerate loss due to disk failures
Performance: since write operations must update also the redundant
information, their performance is worse than the one of the traditional writes
10
« if a disk has an Mean Time To Failure (MTTF) of 200,000 hours (~23 years)
an array of 100 disks will show a MTTF of 2000 hours (~ 3 months) »
2015 Davide P. Carioni
Resiliency in data management
Data reconstruction (a simple example)
11
12 8
data data checksum
2015 Davide P. Carioni
Resiliency in data management
Data reconstruction (a simple example)
12
12 8 20+ =
data data checksum
2015 Davide P. Carioni
Resiliency in data management
Data reconstruction (a simple example)
13
12 8 20
12 20
+ =
data data checksum
data data checksum
2015 Davide P. Carioni
Resiliency in data management
Data reconstruction (a simple example)
14
12 8 20
12 20
20 12 8
+ =
- =
data data checksum
data data checksum
checksum data data
2015 Davide P. Carioni
Resiliency in data management
RAID standard levels
RAID 0 striping only
RAID 1 mirroring only
RAID 2 bit interleaving (not used)
RAID 3 byte interleaving - redundancy (parity disk)
RAID 4 block interleaving - redundancy (parity disk)
RAID 5 block interleaving - redundancy (parity block distributed) – highly utilized
RAID 6 greater redundancy (tolerates up to two failed disks)
15
2015 Davide P. Carioni
Resiliency in data management
RAID 0 and RAID 1
16
RAID 0
A7
A5
A3
A1
A8
A6
A4
A2
RAID 1
A4
A3
A2
A1
A4
A3
A2
A1
2015 Davide P. Carioni
Resiliency in data management
RAID 3
17
RAID 3
B5
B2
A5
A2
B6
B3
A6
A3
B4
B1
A4
A1
BP(4-6)
BP(1-3)
AP(4-6)
AP(1-3)
2015 Davide P. Carioni
Resiliency in data management
RAID 4
18
RAID 4
D2
C2
B2
A2
D3
C3
B3
A3
D1
C1
B1
A1
DP
CP
BP
AP
2015 Davide P. Carioni
Resiliency in data management
RAID 5
19
RAID 5
D1
CP
B2
A2
D2
C2
BP
A3
DP
C1
B1
A1
D3
C3
B3
AP
2015 Davide P. Carioni
Resiliency in data management
RAID 6
20
RAID 6
DQ
CP
B2
A2
D1
CQ
BP
A3
DP
C1
B1
A1
D2
C2
BQ
AP
D3
C3
B3
AQ
2015 Davide P. Carioni
Resiliency in data management
Nested levels
21
RAID 1
A7
A5
A3
A1
A7
A5
A3
A1
RAID 1
A8
A6
A4
A2
A8
A6
A4
A2
RAID 0
2015 Davide P. Carioni
Resiliency in data management
Overview
RAID level Utilization
ratio
Reliability R/W performance Rebuild
performance
0 1 N/A very good good
1 0.5 excellent very good/good good
3 (n-1)/n good good/fair fair
5 (n-1)/n good good/fair poor
6 (n-2)/n excellent very good/poor poor
1+0 0.5 excellent very good/good good
5+0 (n-1)/n excellent very good/good fair
22
Nota Bene: RAID technology should not be intended as a substitute for a suitable
backup procedure
2015 Davide P. Carioni
Resiliency in data management
Data Mirroring
23
2015 Davide P. Carioni
Resiliency in data management
Synchronous VS Asynchronous mirroring
Synchronous mirroring: provides a consistent copy of a source disk on a
target disk. Data is synchronously written to the target disk after it is written to
the source virtual disk, so that the copy is continuously updated.
Asynchronous mirroring: provides a consistent copy of a source disk on a
target disk. Data is asynchronously written to the target virtual disk, so that the
copy is continuously updated, but the copy might not contain the last few
updates in the event that a disaster recovery operation is performed.
24
2015 Davide P. Carioni
Resiliency in data management
Synchronous VS Asynchronous mirroring
25
1. write 2. write
3. ack4. ack
primary secondary
1. write a. write
b. ack2. ack
primary secondary
Sinchronization clock
2015 Davide P. Carioni
Resiliency in data management
Multipath
26
2015 Davide P. Carioni
Resiliency in data management
Multipath
In computer storage, multipath I/O is a fault-tolerance technique that defines
more than one physical path between the CPU in a computer system and its
mass-storage devices through the buses, controllers, switches, and bridge
devices connecting them.
Multipath software layers can also leverage the redundant paths to provide
performance-enhancing features, including dynamic load balancing and
trunking.
27
Path 1
Path 2
2015 Davide P. Carioni
Resiliency in data management
Backup
28
2015 Davide P. Carioni
Resiliency in data management
Backup
Backup technologies provide effective recovery options for systems subject to
data loss from human error, hardware failure or major natural disasters. They
are ideally suited for quick restoration of large amounts of lost information and
can return complete systems to full operational capacity in a short period of time.
Two orthogonal techniques:
• Incremental backup: saves the data that has changed since the last backup.
• PROs: fast backup, small space occupancy
• CON: slow recovery
• Differential backup: saves the data that has changed since the last full
backup.
• PRO: fast recovery
• CONs: slow backup, big space occupancy
29
2015 Davide P. Carioni
Resiliency in data management
Incremental VS differential Backup (a simple example)
Backup frequency: daily
Full backup day: sunday
30
sun
mon
tue
sat
fri
thu
wed
sun
mon
tue
sat
fri
thu
wed
incremental differential
2015 Davide P. Carioni
Resiliency in data management
Point in time copy
A point in time copy is a logical image of the content of an associated base
volume created at a specific moment. A snapshot image can be thought of as a
restore point. Snapshot images are useful any time you need to be able to roll
back to a known good data set at a specific point in time.
For example, before performing a risky operation on a volume, you can create a
snapshot image to enable “undo” capability for the entire volume. A snapshot
image is created almost instantaneously, and initially uses no disk space,
because it stores only the incremental changes needed to roll the volume back
to the point-in-time when the snapshot image was created.
Two alternative approaches:
• copy on write
• redirect on write
31
2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
32
A B C D
S
Disk
Volatile memory
2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
33
A B C D
C
S
Disk
Volatile memory
read
2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
34
A B C D C
C
S
Disk
Volatile memory
copy
2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
35
A B C D C
Cʹ
S
Disk
Volatile memory
modify
2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
36
A B Cʹ D C
S
Disk
Volatile memory
write
2015 Davide P. Carioni
Resiliency in data management
Redirect on write
Task: modify C
37
A B C D
S
Disk
Volatile memory
2015 Davide P. Carioni
Resiliency in data management
Redirect on write
Task: modify C
38
A B C D
C
S
Disk
Volatile memory
read
2015 Davide P. Carioni
Resiliency in data management
Redirect on write
Task: modify C
39
A B C D
Cʹ
S
Disk
Volatile memory
modify
2015 Davide P. Carioni
Resiliency in data management
Copy on write
Task: modify C
40
A B C D Cʹ
Cʹ
S
Disk
Volatile memory
write
2015 Davide P. Carioni
Resiliency in data management
Archive
41
2015 Davide P. Carioni
Resiliency in data management
Archive
Archive technologies typically store a version of a file that's no longer changing,
or shouldn't be changing. Data archiving is intended as a repository for data that
needs to be stored for periods that may extend to decades.
Speed is less important in archives; even if the event is a legal action, you
typically only have a few days to respond. Searchabilty is more critical in
archives. In addition, importance is placed on the ability to scale data integrity
and data retention over a long period of time, possibly decades.
To effectively manage data, file archiving systems discover all files on a network
and provide an inventory of unstructured data. During the discovery process, the
systems collect file system metadata and extract file contents, building a
foundation for data classification and application of information governance
policies.
42
2015 Davide P. Carioni
Resiliency in data management
Archive rule (a simple example)
43
Legal
requirement
Business
related
5 years retention
10 years
retention
3 years retention
S
Y
Y
2015 Davide P. Carioni
Resiliency in data management
Resilient storage infrastructure (a simple example)
44
Site A Site B
Site C
Sync
mirroring
Multipath
Backup
Async
mirroring
Archive
Vault
2015 Davide P. Carioni
Resiliency in data management
Davide Carioni wrote this file.
As long as you retain this notice you can do whatever you want with
this stuff. If we meet some day, and you think this stuff is worth it,
you can buy me a beer in return.
45
Delivery truck image – by Freepik - Own work - Licensed under CC BY 3.0 via flaticon.com
BeerWare Logo - by Kita59 - Own work - Licensed under CC BY-SA 3.0 via Wikimedia Commons

Contenu connexe

Similaire à Resilience in data management

Storage, San And Business Continuity Overview
Storage, San And Business Continuity OverviewStorage, San And Business Continuity Overview
Storage, San And Business Continuity OverviewAlan McSweeney
 
disk structure and multiple RAID levels .ppt
disk structure and multiple  RAID levels .pptdisk structure and multiple  RAID levels .ppt
disk structure and multiple RAID levels .pptRAJASEKHARV10
 
S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2Tony Pearson
 
Hu Yoshida - Storage Trends and Directions (Storage Expo 2010)
Hu Yoshida - Storage Trends and Directions (Storage Expo 2010)Hu Yoshida - Storage Trends and Directions (Storage Expo 2010)
Hu Yoshida - Storage Trends and Directions (Storage Expo 2010)VNU Exhibitions Europe
 
Transforming Backup and Recovery in VMware environments with EMC Avamar and D...
Transforming Backup and Recovery in VMware environments with EMC Avamar and D...Transforming Backup and Recovery in VMware environments with EMC Avamar and D...
Transforming Backup and Recovery in VMware environments with EMC Avamar and D...CTI Group
 
RAID: High-Performance, Reliable Secondary Storage
RAID: High-Performance, Reliable Secondary StorageRAID: High-Performance, Reliable Secondary Storage
RAID: High-Performance, Reliable Secondary StorageUğur Tılıkoğlu
 
BGOUG "Agile Data: revolutionizing database cloning'
BGOUG  "Agile Data: revolutionizing database cloning'BGOUG  "Agile Data: revolutionizing database cloning'
BGOUG "Agile Data: revolutionizing database cloning'Kyle Hailey
 
DevOps, Databases and The Phoenix Project UGF4042 from OOW14
DevOps, Databases and The Phoenix Project UGF4042 from OOW14DevOps, Databases and The Phoenix Project UGF4042 from OOW14
DevOps, Databases and The Phoenix Project UGF4042 from OOW14Kyle Hailey
 
040419 san forum
040419 san forum040419 san forum
040419 san forumThiru Raja
 
Unitrends Sales Presentation 2010
Unitrends Sales Presentation 2010Unitrends Sales Presentation 2010
Unitrends Sales Presentation 2010lincolng
 
Data Virtualization: revolutionizing database cloning
Data Virtualization: revolutionizing database cloningData Virtualization: revolutionizing database cloning
Data Virtualization: revolutionizing database cloningKyle Hailey
 
Storage Virtualization Challenges
Storage Virtualization ChallengesStorage Virtualization Challenges
Storage Virtualization ChallengesRandy Weis
 
BigData as a Platform: Cassandra and Current Trends
BigData as a Platform: Cassandra and Current TrendsBigData as a Platform: Cassandra and Current Trends
BigData as a Platform: Cassandra and Current TrendsMatthew Dennis
 
Kscope 14 Presentation : Virtual Data Platform
Kscope 14 Presentation : Virtual Data PlatformKscope 14 Presentation : Virtual Data Platform
Kscope 14 Presentation : Virtual Data PlatformKyle Hailey
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY
 
10 Reasons To Virtulaize Your Storage
10 Reasons To Virtulaize Your Storage10 Reasons To Virtulaize Your Storage
10 Reasons To Virtulaize Your Storagerpsprowl
 
Four Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage WebinarFour Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage WebinarStorage Switzerland
 
EMC Isilon Solutions for Data Archives
EMC Isilon Solutions for Data ArchivesEMC Isilon Solutions for Data Archives
EMC Isilon Solutions for Data Archivessolarisyougood
 

Similaire à Resilience in data management (20)

Storage, San And Business Continuity Overview
Storage, San And Business Continuity OverviewStorage, San And Business Continuity Overview
Storage, San And Business Continuity Overview
 
disk structure and multiple RAID levels .ppt
disk structure and multiple  RAID levels .pptdisk structure and multiple  RAID levels .ppt
disk structure and multiple RAID levels .ppt
 
S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2S de2784 footprint-reduction-edge2015-v2
S de2784 footprint-reduction-edge2015-v2
 
Hu Yoshida - Storage Trends and Directions (Storage Expo 2010)
Hu Yoshida - Storage Trends and Directions (Storage Expo 2010)Hu Yoshida - Storage Trends and Directions (Storage Expo 2010)
Hu Yoshida - Storage Trends and Directions (Storage Expo 2010)
 
Transforming Backup and Recovery in VMware environments with EMC Avamar and D...
Transforming Backup and Recovery in VMware environments with EMC Avamar and D...Transforming Backup and Recovery in VMware environments with EMC Avamar and D...
Transforming Backup and Recovery in VMware environments with EMC Avamar and D...
 
RAID: High-Performance, Reliable Secondary Storage
RAID: High-Performance, Reliable Secondary StorageRAID: High-Performance, Reliable Secondary Storage
RAID: High-Performance, Reliable Secondary Storage
 
BGOUG "Agile Data: revolutionizing database cloning'
BGOUG  "Agile Data: revolutionizing database cloning'BGOUG  "Agile Data: revolutionizing database cloning'
BGOUG "Agile Data: revolutionizing database cloning'
 
DevOps, Databases and The Phoenix Project UGF4042 from OOW14
DevOps, Databases and The Phoenix Project UGF4042 from OOW14DevOps, Databases and The Phoenix Project UGF4042 from OOW14
DevOps, Databases and The Phoenix Project UGF4042 from OOW14
 
040419 san forum
040419 san forum040419 san forum
040419 san forum
 
Unitrends Sales Presentation 2010
Unitrends Sales Presentation 2010Unitrends Sales Presentation 2010
Unitrends Sales Presentation 2010
 
NoSQL
NoSQLNoSQL
NoSQL
 
Data Virtualization: revolutionizing database cloning
Data Virtualization: revolutionizing database cloningData Virtualization: revolutionizing database cloning
Data Virtualization: revolutionizing database cloning
 
Storage Virtualization Challenges
Storage Virtualization ChallengesStorage Virtualization Challenges
Storage Virtualization Challenges
 
Raid
Raid Raid
Raid
 
BigData as a Platform: Cassandra and Current Trends
BigData as a Platform: Cassandra and Current TrendsBigData as a Platform: Cassandra and Current Trends
BigData as a Platform: Cassandra and Current Trends
 
Kscope 14 Presentation : Virtual Data Platform
Kscope 14 Presentation : Virtual Data PlatformKscope 14 Presentation : Virtual Data Platform
Kscope 14 Presentation : Virtual Data Platform
 
HPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big DataHPC DAY 2017 | HPE Storage and Data Management for Big Data
HPC DAY 2017 | HPE Storage and Data Management for Big Data
 
10 Reasons To Virtulaize Your Storage
10 Reasons To Virtulaize Your Storage10 Reasons To Virtulaize Your Storage
10 Reasons To Virtulaize Your Storage
 
Four Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage WebinarFour Assumptions Killing Backup Storage Webinar
Four Assumptions Killing Backup Storage Webinar
 
EMC Isilon Solutions for Data Archives
EMC Isilon Solutions for Data ArchivesEMC Isilon Solutions for Data Archives
EMC Isilon Solutions for Data Archives
 

Dernier

Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
HLH PPT.ppt very important topic to discuss
HLH PPT.ppt very important topic to discussHLH PPT.ppt very important topic to discuss
HLH PPT.ppt very important topic to discussDrMSajidNoor
 
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...amitlee9823
 
Introduction-to-4x4-SRAM-Memory-Block.pptx
Introduction-to-4x4-SRAM-Memory-Block.pptxIntroduction-to-4x4-SRAM-Memory-Block.pptx
Introduction-to-4x4-SRAM-Memory-Block.pptxJaiLegal
 
Develop Keyboard Skill.pptx er power point
Develop Keyboard Skill.pptx er power pointDevelop Keyboard Skill.pptx er power point
Develop Keyboard Skill.pptx er power pointGetawu
 
SM-N975F esquematico completo - reparación.pdf
SM-N975F esquematico completo - reparación.pdfSM-N975F esquematico completo - reparación.pdf
SM-N975F esquematico completo - reparación.pdfStefanoBiamonte1
 
Call Girls in Vashi Escorts Services - 7738631006
Call Girls in Vashi Escorts Services - 7738631006Call Girls in Vashi Escorts Services - 7738631006
Call Girls in Vashi Escorts Services - 7738631006Pooja Nehwal
 
Makarba ( Call Girls ) Ahmedabad ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Rea...
Makarba ( Call Girls ) Ahmedabad ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Rea...Makarba ( Call Girls ) Ahmedabad ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Rea...
Makarba ( Call Girls ) Ahmedabad ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Rea...Naicy mandal
 
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...MOHANI PANDEY
 
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...motiram463
 
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...Amil baba
 
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Call Girls in Nagpur High Profile
 
Shikrapur Call Girls Most Awaited Fun 6297143586 High Profiles young Beautie...
Shikrapur Call Girls Most Awaited Fun  6297143586 High Profiles young Beautie...Shikrapur Call Girls Most Awaited Fun  6297143586 High Profiles young Beautie...
Shikrapur Call Girls Most Awaited Fun 6297143586 High Profiles young Beautie...tanu pandey
 
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证tufbav
 
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
↑Top celebrity ( Pune ) Nagerbazar Call Girls8250192130 unlimited shot and al...
↑Top celebrity ( Pune ) Nagerbazar Call Girls8250192130 unlimited shot and al...↑Top celebrity ( Pune ) Nagerbazar Call Girls8250192130 unlimited shot and al...
↑Top celebrity ( Pune ) Nagerbazar Call Girls8250192130 unlimited shot and al...Call Girls in Nagpur High Profile
 

Dernier (20)

Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
Call Now ≽ 9953056974 ≼🔝 Call Girls In Yusuf Sarai ≼🔝 Delhi door step delevry≼🔝
 
HLH PPT.ppt very important topic to discuss
HLH PPT.ppt very important topic to discussHLH PPT.ppt very important topic to discuss
HLH PPT.ppt very important topic to discuss
 
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
 
CHEAP Call Girls in Hauz Quazi (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Hauz Quazi  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Hauz Quazi  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Hauz Quazi (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Introduction-to-4x4-SRAM-Memory-Block.pptx
Introduction-to-4x4-SRAM-Memory-Block.pptxIntroduction-to-4x4-SRAM-Memory-Block.pptx
Introduction-to-4x4-SRAM-Memory-Block.pptx
 
Develop Keyboard Skill.pptx er power point
Develop Keyboard Skill.pptx er power pointDevelop Keyboard Skill.pptx er power point
Develop Keyboard Skill.pptx er power point
 
SM-N975F esquematico completo - reparación.pdf
SM-N975F esquematico completo - reparación.pdfSM-N975F esquematico completo - reparación.pdf
SM-N975F esquematico completo - reparación.pdf
 
Call Girls in Vashi Escorts Services - 7738631006
Call Girls in Vashi Escorts Services - 7738631006Call Girls in Vashi Escorts Services - 7738631006
Call Girls in Vashi Escorts Services - 7738631006
 
Makarba ( Call Girls ) Ahmedabad ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Rea...
Makarba ( Call Girls ) Ahmedabad ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Rea...Makarba ( Call Girls ) Ahmedabad ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Rea...
Makarba ( Call Girls ) Ahmedabad ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Rea...
 
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
Get Premium Pimple Saudagar Call Girls (8005736733) 24x7 Rate 15999 with A/c ...
 
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
 
(ISHITA) Call Girls Service Aurangabad Call Now 8617697112 Aurangabad Escorts...
(ISHITA) Call Girls Service Aurangabad Call Now 8617697112 Aurangabad Escorts...(ISHITA) Call Girls Service Aurangabad Call Now 8617697112 Aurangabad Escorts...
(ISHITA) Call Girls Service Aurangabad Call Now 8617697112 Aurangabad Escorts...
 
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
 
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
 
Shikrapur Call Girls Most Awaited Fun 6297143586 High Profiles young Beautie...
Shikrapur Call Girls Most Awaited Fun  6297143586 High Profiles young Beautie...Shikrapur Call Girls Most Awaited Fun  6297143586 High Profiles young Beautie...
Shikrapur Call Girls Most Awaited Fun 6297143586 High Profiles young Beautie...
 
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
 
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Kothrud Call Me 7737669865 Budget Friendly No Advance Booking
 
↑Top celebrity ( Pune ) Nagerbazar Call Girls8250192130 unlimited shot and al...
↑Top celebrity ( Pune ) Nagerbazar Call Girls8250192130 unlimited shot and al...↑Top celebrity ( Pune ) Nagerbazar Call Girls8250192130 unlimited shot and al...
↑Top celebrity ( Pune ) Nagerbazar Call Girls8250192130 unlimited shot and al...
 

Resilience in data management

  • 2. 2015 Davide P. Carioni Resiliency in data management A case for Redundant Arrays of Inexpensive Disks when: 1988 where: Chicago who: D. A. Patterson, G. Gibson, and R. H. Katz thesis: a top performing mainframe disk drive can be beaten on performance by an array of inexpensive drives developed for personal computer market. abstract: 2
  • 3. 2015 Davide P. Carioni Resiliency in data management RAID 3
  • 4. 2015 Davide P. Carioni Resiliency in data management Redundant Arrays of Independent Disks In disks array several independent disks are considered as a single, large, high- performance logical disk. The data are striped across several disks accessed in parallel: • high data transfer rate: large data accesses (heavy I/O op.) • high I/O rate: small but frequent data accesses (light I/O op.) • load balancing across the disks Two orthogonal techniques: • redundancy: to improve reliability • data striping: to improve performance 4
  • 5. 2015 Davide P. Carioni Resiliency in data management redundancy in a I/O operation (a simple example) 5 ADAPTER ABCDEFGH ABCDEFGH ABCDEFGH
  • 6. 2015 Davide P. Carioni Resiliency in data management Data striping in a I/O operation (a simple example) 6 AB CD EF ADAPTER GH GH GH ABCDEFGH 2 byte interleaving (stripe unit)
  • 7. 2015 Davide P. Carioni Resiliency in data management virtualization in a I/O operation (a simple example) 7 ADAPTERABCDEFGH single large logical disk
  • 8. 2015 Davide P. Carioni Resiliency in data management Data striping striping: data are written sequentially in units on multiple disks according to a cyclic algorithm (round robin) stripe unit: dimension of the unit of data that are written on a single disk stripe width: number of disks considered by the striping algorithm (does not necessarily coincide with the number of physical disks in the array – there can be “hot spares”) Performance gains: • multiple independent I/O requests will be executed in parallel by several disks decreasing the queue length (and time) of the disks • single multiple-block I/O requests will be executed by multiple disks in parallel increasing of the transfer rate of a single request 8
  • 9. 2015 Davide P. Carioni Resiliency in data management Parallelism and reliability the more physical disks in the array the larger the size and performance gains but … the larger the probability of failure of a disk ⇓ this is the main motivation for the introduction of redundancy 9
  • 10. 2015 Davide P. Carioni Resiliency in data management Parallelism and reliability The probability of a failure (assuming independent failures) in an array of 100 disks is 100 higher the probability of a failure of a single disk Redundancy: error correcting codes (stored on disks different from the ones with the data) are computed to tolerate loss due to disk failures Performance: since write operations must update also the redundant information, their performance is worse than the one of the traditional writes 10 « if a disk has an Mean Time To Failure (MTTF) of 200,000 hours (~23 years) an array of 100 disks will show a MTTF of 2000 hours (~ 3 months) »
  • 11. 2015 Davide P. Carioni Resiliency in data management Data reconstruction (a simple example) 11 12 8 data data checksum
  • 12. 2015 Davide P. Carioni Resiliency in data management Data reconstruction (a simple example) 12 12 8 20+ = data data checksum
  • 13. 2015 Davide P. Carioni Resiliency in data management Data reconstruction (a simple example) 13 12 8 20 12 20 + = data data checksum data data checksum
  • 14. 2015 Davide P. Carioni Resiliency in data management Data reconstruction (a simple example) 14 12 8 20 12 20 20 12 8 + = - = data data checksum data data checksum checksum data data
  • 15. 2015 Davide P. Carioni Resiliency in data management RAID standard levels RAID 0 striping only RAID 1 mirroring only RAID 2 bit interleaving (not used) RAID 3 byte interleaving - redundancy (parity disk) RAID 4 block interleaving - redundancy (parity disk) RAID 5 block interleaving - redundancy (parity block distributed) – highly utilized RAID 6 greater redundancy (tolerates up to two failed disks) 15
  • 16. 2015 Davide P. Carioni Resiliency in data management RAID 0 and RAID 1 16 RAID 0 A7 A5 A3 A1 A8 A6 A4 A2 RAID 1 A4 A3 A2 A1 A4 A3 A2 A1
  • 17. 2015 Davide P. Carioni Resiliency in data management RAID 3 17 RAID 3 B5 B2 A5 A2 B6 B3 A6 A3 B4 B1 A4 A1 BP(4-6) BP(1-3) AP(4-6) AP(1-3)
  • 18. 2015 Davide P. Carioni Resiliency in data management RAID 4 18 RAID 4 D2 C2 B2 A2 D3 C3 B3 A3 D1 C1 B1 A1 DP CP BP AP
  • 19. 2015 Davide P. Carioni Resiliency in data management RAID 5 19 RAID 5 D1 CP B2 A2 D2 C2 BP A3 DP C1 B1 A1 D3 C3 B3 AP
  • 20. 2015 Davide P. Carioni Resiliency in data management RAID 6 20 RAID 6 DQ CP B2 A2 D1 CQ BP A3 DP C1 B1 A1 D2 C2 BQ AP D3 C3 B3 AQ
  • 21. 2015 Davide P. Carioni Resiliency in data management Nested levels 21 RAID 1 A7 A5 A3 A1 A7 A5 A3 A1 RAID 1 A8 A6 A4 A2 A8 A6 A4 A2 RAID 0
  • 22. 2015 Davide P. Carioni Resiliency in data management Overview RAID level Utilization ratio Reliability R/W performance Rebuild performance 0 1 N/A very good good 1 0.5 excellent very good/good good 3 (n-1)/n good good/fair fair 5 (n-1)/n good good/fair poor 6 (n-2)/n excellent very good/poor poor 1+0 0.5 excellent very good/good good 5+0 (n-1)/n excellent very good/good fair 22 Nota Bene: RAID technology should not be intended as a substitute for a suitable backup procedure
  • 23. 2015 Davide P. Carioni Resiliency in data management Data Mirroring 23
  • 24. 2015 Davide P. Carioni Resiliency in data management Synchronous VS Asynchronous mirroring Synchronous mirroring: provides a consistent copy of a source disk on a target disk. Data is synchronously written to the target disk after it is written to the source virtual disk, so that the copy is continuously updated. Asynchronous mirroring: provides a consistent copy of a source disk on a target disk. Data is asynchronously written to the target virtual disk, so that the copy is continuously updated, but the copy might not contain the last few updates in the event that a disaster recovery operation is performed. 24
  • 25. 2015 Davide P. Carioni Resiliency in data management Synchronous VS Asynchronous mirroring 25 1. write 2. write 3. ack4. ack primary secondary 1. write a. write b. ack2. ack primary secondary Sinchronization clock
  • 26. 2015 Davide P. Carioni Resiliency in data management Multipath 26
  • 27. 2015 Davide P. Carioni Resiliency in data management Multipath In computer storage, multipath I/O is a fault-tolerance technique that defines more than one physical path between the CPU in a computer system and its mass-storage devices through the buses, controllers, switches, and bridge devices connecting them. Multipath software layers can also leverage the redundant paths to provide performance-enhancing features, including dynamic load balancing and trunking. 27 Path 1 Path 2
  • 28. 2015 Davide P. Carioni Resiliency in data management Backup 28
  • 29. 2015 Davide P. Carioni Resiliency in data management Backup Backup technologies provide effective recovery options for systems subject to data loss from human error, hardware failure or major natural disasters. They are ideally suited for quick restoration of large amounts of lost information and can return complete systems to full operational capacity in a short period of time. Two orthogonal techniques: • Incremental backup: saves the data that has changed since the last backup. • PROs: fast backup, small space occupancy • CON: slow recovery • Differential backup: saves the data that has changed since the last full backup. • PRO: fast recovery • CONs: slow backup, big space occupancy 29
  • 30. 2015 Davide P. Carioni Resiliency in data management Incremental VS differential Backup (a simple example) Backup frequency: daily Full backup day: sunday 30 sun mon tue sat fri thu wed sun mon tue sat fri thu wed incremental differential
  • 31. 2015 Davide P. Carioni Resiliency in data management Point in time copy A point in time copy is a logical image of the content of an associated base volume created at a specific moment. A snapshot image can be thought of as a restore point. Snapshot images are useful any time you need to be able to roll back to a known good data set at a specific point in time. For example, before performing a risky operation on a volume, you can create a snapshot image to enable “undo” capability for the entire volume. A snapshot image is created almost instantaneously, and initially uses no disk space, because it stores only the incremental changes needed to roll the volume back to the point-in-time when the snapshot image was created. Two alternative approaches: • copy on write • redirect on write 31
  • 32. 2015 Davide P. Carioni Resiliency in data management Copy on write Task: modify C 32 A B C D S Disk Volatile memory
  • 33. 2015 Davide P. Carioni Resiliency in data management Copy on write Task: modify C 33 A B C D C S Disk Volatile memory read
  • 34. 2015 Davide P. Carioni Resiliency in data management Copy on write Task: modify C 34 A B C D C C S Disk Volatile memory copy
  • 35. 2015 Davide P. Carioni Resiliency in data management Copy on write Task: modify C 35 A B C D C Cʹ S Disk Volatile memory modify
  • 36. 2015 Davide P. Carioni Resiliency in data management Copy on write Task: modify C 36 A B Cʹ D C S Disk Volatile memory write
  • 37. 2015 Davide P. Carioni Resiliency in data management Redirect on write Task: modify C 37 A B C D S Disk Volatile memory
  • 38. 2015 Davide P. Carioni Resiliency in data management Redirect on write Task: modify C 38 A B C D C S Disk Volatile memory read
  • 39. 2015 Davide P. Carioni Resiliency in data management Redirect on write Task: modify C 39 A B C D Cʹ S Disk Volatile memory modify
  • 40. 2015 Davide P. Carioni Resiliency in data management Copy on write Task: modify C 40 A B C D Cʹ Cʹ S Disk Volatile memory write
  • 41. 2015 Davide P. Carioni Resiliency in data management Archive 41
  • 42. 2015 Davide P. Carioni Resiliency in data management Archive Archive technologies typically store a version of a file that's no longer changing, or shouldn't be changing. Data archiving is intended as a repository for data that needs to be stored for periods that may extend to decades. Speed is less important in archives; even if the event is a legal action, you typically only have a few days to respond. Searchabilty is more critical in archives. In addition, importance is placed on the ability to scale data integrity and data retention over a long period of time, possibly decades. To effectively manage data, file archiving systems discover all files on a network and provide an inventory of unstructured data. During the discovery process, the systems collect file system metadata and extract file contents, building a foundation for data classification and application of information governance policies. 42
  • 43. 2015 Davide P. Carioni Resiliency in data management Archive rule (a simple example) 43 Legal requirement Business related 5 years retention 10 years retention 3 years retention S Y Y
  • 44. 2015 Davide P. Carioni Resiliency in data management Resilient storage infrastructure (a simple example) 44 Site A Site B Site C Sync mirroring Multipath Backup Async mirroring Archive Vault
  • 45. 2015 Davide P. Carioni Resiliency in data management Davide Carioni wrote this file. As long as you retain this notice you can do whatever you want with this stuff. If we meet some day, and you think this stuff is worth it, you can buy me a beer in return. 45 Delivery truck image – by Freepik - Own work - Licensed under CC BY 3.0 via flaticon.com BeerWare Logo - by Kita59 - Own work - Licensed under CC BY-SA 3.0 via Wikimedia Commons