SlideShare une entreprise Scribd logo
1  sur  29
Truly non-intrusive cinder backup for
mission critical systems
Lightening talk by
Dipak Kumar Singh &
Deepak Gupta
on 06 Nov 2017 at OpenStack Summit, Sydney, Australia
Table of Content
 Challenges of Backup of Live system
Example of sanity error of data, Implication of OS buffer cache on
sanity at crash or live backup etc.
 Current Approach & Proposed Approach
Current approach, Idea on which proposed solution is based etc.
 Proposed Solution & POC
Design information of POC, result of POC’s validation, Known
Limitations, Next Step etc.
 Appendix
Common Questions, References, Experimental Data etc.
3
Background
▌Reliability of data and its availability is one of the key
requirements for Mission critical systems.
▌Openstack ensures data availability by keeping multiple copies of
data in storage nodes. It also facilitates backup for Disaster
Recovery.
▌However, when it comes to Point-in-Time backup of live system,
Openstack relies on volume snapshot and VM pause. Those
solutions are not fool proof because impact of OS buffer cache is
not accounted. Secondly, file system journaling & fsck does not
always work.
▌This presentation talks about problem scenarios and their
probable solutions for truly non-intrusive backup in Openstack.
Challenges of Backup of live system
5
Simple example of Sanity of Data (1/3)
Let’s understand sanity of data first in context of backup.
During a backup process, an application makes two changes shown below
Name: Rahul
Id: 11755
Blood Test Status: Wait
Blood Test File:
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 200 bytes
Name: Rahul
Id: 11755
Blood Test Status: NA
Blood Test File:
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 200 bytes
...
File: Test_79.pdf
Size: 200 bytes
Name: Rahul
Id: 11755
Blood Test Status: OK
Blood Test File: Test79.pdf
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 209 bytes
...
File: Test79.pdf
Size: 200 bytes
TIME State0(initially) State1(at T1) State2(at T2)
Second Change:
Two lines modified
at time T2
First Change:
A new file created
at time T1
One of change
at T2 points to
file created at
T1
6
Simple example of Sanity of Data (2/3)
Total of three states created by the application during backup process as
shown below.
Name: Rahul
Id: 11755
Blood Test Status: Wait
Blood Test File:
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 200 bytes
Name: Rahul
Id: 11755
Blood Test Status: NA
Blood Test File:
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 200 bytes
...
File: Test_79.pdf
Size: 200 bytes
Name: Rahul
Id: 11755
Blood Test Status: OK
Blood Test File: Test79.pdf
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 209 bytes
...
File: Test79.pdf
Size: 200 bytes
TIME State0(initially) State1(at T1) State2(at T2)
Data restore
must bring back
to any one of
these three
states created
by application.
Any other data
restore is a
sanity error.
7
Simple example of Sanity of Data (3/3)
What if second change is saved in backup but not first. Recovered File-
system will look like as shown in RHS.
Name: Rahul
Id: 11755
Blood Test Status: Wait
Blood Test File:
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 200 bytes
Name: Rahul
Id: 11755
Blood Test Status: NA
Blood Test File:
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 200 bytes
...
File: Test_79.pdf
Size: 200 bytes
Name: Rahul
Id: 11755
Blood Test Status: OK
Blood Test File: Test79.pdf
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 209 bytes
...
File: Test79.pdf
Size: 200 bytes
TIME State0(initially) State1(at T1) State2(at T2) Restored(at T3)
Name: Rahul
Id: 11755
Blood Test Status: OK
Blood Test File: Test79.pdf
Name: Vicky
Id: 11755
…
File: PatientsRecord.txt
Size: 209 bytes
Restored a state
which was never
generated by the
application.
Data Restore
8
Point In Time Backup & Snapshot
▌Backup must be taken for any point of time in the history. For
example state0,state1 or state2 in the example of previous slide.
▌Technically, this concept is called ‘Point In Time’ Backup. PIT is
commonly used in context of Disaster Recovery. Backup software
are expected to take PIT backup.
▌Snapshot of volume ensures PIT data of the volume in use.
Then backup is taken from snapshot.
▌Since snapshot is created for the volume, data in Operating
Systems buffer cache is not captured. So snapshot is not
enough for PIT backup of live system.
9
Implication of Buffer Cache on sanity of data
▌Data from buffer cache is written to disk with goal of better I/O.
Invariably, the order in which applications writes to OS is different than order in
which data is written to Disk.
This is technically called out-or-order write. Example shown in diagram in next
slide. Therefore, sanity of data is lost.
▌Journaling is used to ensure sanity of data.
Journaling has its cost too in I/O. Therefore, default ext4 journaling mode
ensures file system consistency only, not file’s data.
An experiment has shown that file size became zero for new files created
within 30 seconds of crash in ext4 mounted with default options.
Refer to Appendix section for experimental data.
Note that in OS, file-system integrity is tradeoff against performance.
10
Example of Out-of-Order Write of OS buffer cache data
TIME S0 at T0 S1 at T1 S2 at T2 BadState1 at T2’ S2 at T2’’
AsinVolume
Aswrittenby
Applicationand
availableinOS
No such state
ever in OS
What if this goes
in snapshot.
Name: Rahul
Id: 11755
Test Status:
Test File:
…
File: Patients.txt
Size: 209 bytes
Name: Rahul
Id: 11755
Test Status:
Test File:
…
File: Patients.txt
Size: 209 bytes
...
File: Test_79.pdf
Size: 200 bytes
Name: Rahul
Id: 11755
Test Status: OK
Test File: Test79.pdf
…
File: Patients.txt
Size: 209 bytes
...
File: Test_79.pdf
Size: 200 bytes
Name: Rahul
Id: 11755
Test Status: OK
Test File: Test79.pdf
…
File: Patients.txt
Size: 209 bytes
Name: Rahul
Id: 11755
Test Status: OK
Test File: Test79.pdf
…
File: Patients.txt
Size: 209 bytes
...
File: Test79.pdf
Size: 200 bytes
Out of Order
write to disk
Second application
write flushed to
Volume before first
write
Both writes
flushed. OK.
An application’s
state S2 reached
Name: Rahul
Id: 11755
Test Status:
Test File:
…
File: Patients.txt
Size: 209 bytes
Current Approach & Proposed Approach
12
Current approach to take backup of Live system
▌The most common solution to take backup of live volume is a two
step process
a) Create a snapshot of live volume. That might involve momentarily
stopping the volume effectively the VM.
b) Take backup from snapshot.
▌Pause time by (a) is usually imperceptible in current implementations
of Virtual machines and Storage making it practically non-intrusive.
▌This approach is same as removing power cable from a machine,
then take backup of disks attached to the machine.
▌Journaling would ensure sanity of file-system but not data
unless performance costly journaling of ‘metadata+data’ is used.
13
Idea of Proposed Solution – Briefly stop buffer caching of OS
▌Proposed solution is based on very simple idea of
disabling buffer cache briefly for taking snapshot.
▌That is effectively making OS write-through
▌Simple CLI call on Linux will do this job
▌Proposed design and POC has been shared in subsequent slides
of this presentation.
▌Feedback on its benefit in real world, proposed design and POC is
solicited from the audience so that this idea can see the light of
the day.
14
How to disable buffer cache on Linux?
▌/proc/sys/vm/dirty_bytes
defines max dirty bytes in Linux buffer cache.
▌When this value of dirty pages is reached, subsequent write()
call will become write-through.
▌dirty_bytes can be changed to low value, not zero, in running
Linux using CLI sysctl
▌Lower dirty_bytes value increases write() latency to the tune of
milliseconds. Therefore, original value is reverted back as soon
as snapshot is created.
Proposed Solution & POC
16
Design Guidelines
▌Disable buffer cache of Guest Linux OS temporarily
▌Use other standard steps of Cinder
Cinder’s snapshot and cinder backup from snapshot are used in solution so that
code change is minimal.
▌Output file:
Backup file produced by live system backup is exactly same as cinder’s regular
backup files.
Therefore, exactly same standard restore process of OpenStack is used for
restoring the data.
▌Proposed Use Model
 Adding new option in CLI cinder looks a good solution
$ cinder backup-create –-livebackup <instanceId> ...
17
Sequence of Steps for Backup
1. Retrieve current buffer
cache config
2. Make buffer cache zero
4. Restore back buffer
cache config
Guest OSCinder
3. Generate snapshot of
Volume(s) attached to
Guest OS
5. Take backup from
snapshot(s)
6. Delete snapshot(s)
Live Backup
Controller
Access to
Guest OS is
required.
Only impact is
on write I/O
latency in this
phase.
Guest OS
continue to run
18
Entity Relationship Diagram
Controller
Node
POC – Overview
▌POC was based on “Sequence of Steps for Backup” mentioned in
previous slide.
User
POC
Code
Guest
OS
(Linux)
Cinder
SWIFT
Storage
User Input
1. Instance Id,
2. Guest OS login
& password
Backup is
produced on
SWIFT as
usualDatabase
19
POC Validation - Using count of ‘new files with zero size’
▌The POC was able to take backup and restore data.
▌POC was also validated by checking impact of ‘delayed block
allocation’ in ext4 in the new approach.
▌Steps of Validation
Take backup of Guest OS when following script is running
for i in {0..600} # Create new files every second
do
echo hello > NewFile_$i.txt # Creating new file of 6 bytes
sleep 1
done
Restore backup. Count number of zero size files.
Does count decrease in new approach? Observation shared in next slide.
20
POC Validation – Result of ‘new files with zero size’
▌ Number of zero size files seen after restore of live backup of Linux Guest OS.
▌ At most one file of zero size is expected
Run 1 Run 2 Run 3 Run 4 Run a Run b Run c Run d
Zero size file count 32 6 34 30 1 1 1 1
32
6
34
30
1 1 1 1
Run Environment
Guest OS ubunto16 4.4.0-97-generic
File-system Default ext4 mount option used (barrier=1,data=ordered)
Run environment No load on the Hypervisor and Guest OS during testing
Steps Standard OpenStack forced
snapshot then backup used
POC enabled ( dirty_bytes made
very low during snapshot)
▌ Result: Magnitude of files with zero size has drastically reduced.
21
Known Limitation & Next step
▌Known Limitation
Min values supported by dirty_bytes is two pages only, not zero. It can’t be
disabled completely.
This is documented behavior of Linux. Reason of limit is being explored to find
way to make it absolute zero.
▌Next Step
Based on feedback received on idea and POC result, next course of AIs will
have to be decided.
Appendix
Common Questions, References etc.
23
Common Questions
▌ Can all volumes attached to a Guest OS be backed up together?
Solution does facilitate it. However, snapshot feature of OpenStack
should support it too.
▌ Can multiple Guest OSes be backup up together?
Same answer as above.
▌ Is it truly non-intrusive?
 Yes. Guest OS continues to run
 However, IO write latency would go high for some time.
 Depending of snapshot feature of Cinder, Guest OS might be
momentarily paused. If SAN’s hardware level snapshot is used, then
no pause would be involved.
24
References and Useful reading (1/2)
▌Impact of ‘Delayed Block Allocation’ on ext4’s sanity
 Linus Torvalds Upset over Ext3 and Ext4 http://www.linux-
magazine.com/Online/News/Linus-Torvalds-Upset-over-Ext3-and-Ext4
 ext4 and data loss by Jonathan Corbet https://lwn.net/Articles/322823/
▌Filesystem Journal
 Description ‘2.1 File-system consistency’ of
https://www.usenix.org/system/files/conference/fast12/chidambaram.pdf
 Ext4 journal options at section ‘3. Options’ of
https://www.kernel.org/doc/Documentation/filesystems/ext4.txt
▌Linux buffer cache
 http://www.tldp.org/LDP/sag/html/buffer-cache.html
 https://www.kernel.org/doc/Documentation/sysctl/vm.txt
 Section ‘14.3.2 Writeback Parameters’
https://doc.opensuse.org/documentation/leap/tuning/html/book.sle.tuning/cha.tuning.me
mory.html#cha.tuning.memory.vm
25
References and Useful reading (2/2)
▌Alternative solutions to take backup of live systems
 Volume Shadow Copy Service (VSS) on Windows
 Vmsync on VMware Hypervisor
▌Openstack’s backup and restore
 Back up and restore volumes and snapshots
https://docs.openstack.org/cinder/latest/admin/blockstorage-volume-backups.html
 Cinder CLI https://docs.openstack.org/python-cinderclient/latest/cli/details.html
▌Code used in experiments
At https://github.com/saurabh0095/Unix-IO-test
26
Experiment – Impact of ‘Delayed Block Allocation’ on ext4 (1/2)
▌Test Objective :
To demonstrate magnitude of data loss at OS crash due to delayed block
allocation, an experiment was performed.
▌Test Steps & Observations
Create New files with small amount of data every second.
The system is crashed by removing power cable.
At system recovery, 30-35 recent files are seen of zero size.
Expectation is that at most one file, which was getting written at the time of
crash, should be of zero size.
▌Cause
Zero file size is seen because default journaling mode of ext4 write metadata .
But actual data write is delayed due to ‘Delayed Block Allocation’ leading to
inconsistency by losing data of file.
27
Experimental Data – ‘Delayed Block Allocation’ on ext4 (2/2)
▌ Number of zero size files seen after Guest OS was crashed by simulating power
cable removal on two different virtual machines.
▌ At most one file of zero size is expected. Around 30 files were seen.
Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10
Zero size file count 35 30 32 35 33 36 30 35 33 29
35
30
32
35
33
36
30
35
33
29
Two Run Environments - same result
Hypervisor VirtualBox 4.3.28 on Window Microsoft Hyper-V (2016 Standard)
Guest OS ubunto16 4.4.0-97-generic RHEL 7 3.10.0-123.el7.x86_64
Default ext4 mount option used (barrier=1,data=ordered)
No load on the Hypervisor and Guest OS during testing
28
Contact Information of Authors
Dipak Kumar Singh Deepak Gupta
Senior Solutions Architect, IT Platforms Deputy General Manager, IT Platform
NEC Technologies India Pvt. Ltd.
dipak.singh@india.nec.com deepak.gupta@india.nec.com
dipak123@gmail.com dkumargupta@gmail.com
http://linkedin.com/in/dipak123 https://www.linkedin.com/in/dkumargupta/
• https://www.openstack.org/summit/sydney-2017/summit-schedule/events/19305/truly-non-intrusive-openstack-cinder-
backup-for-mission-critical-systems
• https://www.openstack.org/assets/presentation-media/OpenStack-Truly-non-intrusive-Cinder-backup-1.1.pptx
Truly non-intrusive OpenStack Cinder backup for mission critical systems

Contenu connexe

Tendances

제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustreTommy Lee
 
EMC Dteata domain advanced command troubleshoot
EMC Dteata domain advanced command troubleshootEMC Dteata domain advanced command troubleshoot
EMC Dteata domain advanced command troubleshootsolarisyougood
 
Oracle Exadata Exam Dump
Oracle Exadata Exam DumpOracle Exadata Exam Dump
Oracle Exadata Exam DumpPooja C
 
TECHNICAL BRIEF▶NetBackup Appliance AutoSupport for NetBackup 5330
TECHNICAL BRIEF▶NetBackup Appliance AutoSupport for NetBackup 5330TECHNICAL BRIEF▶NetBackup Appliance AutoSupport for NetBackup 5330
TECHNICAL BRIEF▶NetBackup Appliance AutoSupport for NetBackup 5330Symantec
 
Exadata Patching Demystified
Exadata Patching DemystifiedExadata Patching Demystified
Exadata Patching DemystifiedEnkitec
 
point in time recovery
point in time recoverypoint in time recovery
point in time recoveryssuser1eca7d
 
Oracle Exadata 1Z0-485 Certification
Oracle Exadata 1Z0-485 CertificationOracle Exadata 1Z0-485 Certification
Oracle Exadata 1Z0-485 CertificationExadatadba
 
Erasure codes fast 2012
Erasure codes fast 2012Erasure codes fast 2012
Erasure codes fast 2012Accenture
 
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...Severalnines
 
IBM Transaction Analysis Workbench for z/OS: Combining analytic and deep-dive...
IBM Transaction Analysis Workbench for z/OS: Combining analytic and deep-dive...IBM Transaction Analysis Workbench for z/OS: Combining analytic and deep-dive...
IBM Transaction Analysis Workbench for z/OS: Combining analytic and deep-dive...Graham Hannington
 
SSD WhitePaper by Houman Shabani
SSD WhitePaper  by Houman ShabaniSSD WhitePaper  by Houman Shabani
SSD WhitePaper by Houman ShabaniHouman Shabani
 
Facebook's Approach to Big Data Storage Challenge
Facebook's Approach to Big Data Storage ChallengeFacebook's Approach to Big Data Storage Challenge
Facebook's Approach to Big Data Storage ChallengeDataWorks Summit
 
Trivadis TechEvent 2017 ACFS Replication as of 12 2 by Mathias Zarick
Trivadis TechEvent 2017 ACFS Replication as of 12 2 by Mathias ZarickTrivadis TechEvent 2017 ACFS Replication as of 12 2 by Mathias Zarick
Trivadis TechEvent 2017 ACFS Replication as of 12 2 by Mathias ZarickTrivadis
 
Increase density and performance with upgrades from Intel and Microsoft
Increase density and performance with upgrades from Intel and MicrosoftIncrease density and performance with upgrades from Intel and Microsoft
Increase density and performance with upgrades from Intel and MicrosoftPrincipled Technologies
 

Tendances (20)

NetApp against ransomware
NetApp against ransomwareNetApp against ransomware
NetApp against ransomware
 
Backups
BackupsBackups
Backups
 
제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre제3회난공불락 오픈소스 인프라세미나 - lustre
제3회난공불락 오픈소스 인프라세미나 - lustre
 
EMC Dteata domain advanced command troubleshoot
EMC Dteata domain advanced command troubleshootEMC Dteata domain advanced command troubleshoot
EMC Dteata domain advanced command troubleshoot
 
Oracle Exadata Exam Dump
Oracle Exadata Exam DumpOracle Exadata Exam Dump
Oracle Exadata Exam Dump
 
TECHNICAL BRIEF▶NetBackup Appliance AutoSupport for NetBackup 5330
TECHNICAL BRIEF▶NetBackup Appliance AutoSupport for NetBackup 5330TECHNICAL BRIEF▶NetBackup Appliance AutoSupport for NetBackup 5330
TECHNICAL BRIEF▶NetBackup Appliance AutoSupport for NetBackup 5330
 
Exadata Patching Demystified
Exadata Patching DemystifiedExadata Patching Demystified
Exadata Patching Demystified
 
ZFSperftools2012
ZFSperftools2012ZFSperftools2012
ZFSperftools2012
 
point in time recovery
point in time recoverypoint in time recovery
point in time recovery
 
Zfs intro v2
Zfs intro v2Zfs intro v2
Zfs intro v2
 
Oracle Exadata 1Z0-485 Certification
Oracle Exadata 1Z0-485 CertificationOracle Exadata 1Z0-485 Certification
Oracle Exadata 1Z0-485 Certification
 
ZFS
ZFSZFS
ZFS
 
Erasure codes fast 2012
Erasure codes fast 2012Erasure codes fast 2012
Erasure codes fast 2012
 
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
Webinar slides: The Holy Grail Webinar: Become a MySQL DBA - Database Perform...
 
IBM Transaction Analysis Workbench for z/OS: Combining analytic and deep-dive...
IBM Transaction Analysis Workbench for z/OS: Combining analytic and deep-dive...IBM Transaction Analysis Workbench for z/OS: Combining analytic and deep-dive...
IBM Transaction Analysis Workbench for z/OS: Combining analytic and deep-dive...
 
SSD WhitePaper by Houman Shabani
SSD WhitePaper  by Houman ShabaniSSD WhitePaper  by Houman Shabani
SSD WhitePaper by Houman Shabani
 
Facebook's Approach to Big Data Storage Challenge
Facebook's Approach to Big Data Storage ChallengeFacebook's Approach to Big Data Storage Challenge
Facebook's Approach to Big Data Storage Challenge
 
distcom-short-20140112-1600
distcom-short-20140112-1600distcom-short-20140112-1600
distcom-short-20140112-1600
 
Trivadis TechEvent 2017 ACFS Replication as of 12 2 by Mathias Zarick
Trivadis TechEvent 2017 ACFS Replication as of 12 2 by Mathias ZarickTrivadis TechEvent 2017 ACFS Replication as of 12 2 by Mathias Zarick
Trivadis TechEvent 2017 ACFS Replication as of 12 2 by Mathias Zarick
 
Increase density and performance with upgrades from Intel and Microsoft
Increase density and performance with upgrades from Intel and MicrosoftIncrease density and performance with upgrades from Intel and Microsoft
Increase density and performance with upgrades from Intel and Microsoft
 

Similaire à Truly non-intrusive OpenStack Cinder backup for mission critical systems

Combining IBM Real-time Compression and IBM ProtecTIER Deduplication
Combining IBM Real-time Compression and IBM ProtecTIER DeduplicationCombining IBM Real-time Compression and IBM ProtecTIER Deduplication
Combining IBM Real-time Compression and IBM ProtecTIER DeduplicationIBM India Smarter Computing
 
DevOps, Databases and The Phoenix Project UGF4042 from OOW14
DevOps, Databases and The Phoenix Project UGF4042 from OOW14DevOps, Databases and The Phoenix Project UGF4042 from OOW14
DevOps, Databases and The Phoenix Project UGF4042 from OOW14Kyle Hailey
 
BGOUG "Agile Data: revolutionizing database cloning'
BGOUG  "Agile Data: revolutionizing database cloning'BGOUG  "Agile Data: revolutionizing database cloning'
BGOUG "Agile Data: revolutionizing database cloning'Kyle Hailey
 
Deep Dive on Delivering Amazon EC2 Instance Performance
Deep Dive on Delivering Amazon EC2 Instance PerformanceDeep Dive on Delivering Amazon EC2 Instance Performance
Deep Dive on Delivering Amazon EC2 Instance PerformanceAmazon Web Services
 
High Availability And Oracle Data Guard 11g R2
High Availability And Oracle Data Guard 11g R2High Availability And Oracle Data Guard 11g R2
High Availability And Oracle Data Guard 11g R2Mario Redón Luz
 
Choosing the Right EC2 Instance and Applicable Use Cases - AWS June 2016 Webi...
Choosing the Right EC2 Instance and Applicable Use Cases - AWS June 2016 Webi...Choosing the Right EC2 Instance and Applicable Use Cases - AWS June 2016 Webi...
Choosing the Right EC2 Instance and Applicable Use Cases - AWS June 2016 Webi...Amazon Web Services
 
Real time database compression optimization using iterative length compressio...
Real time database compression optimization using iterative length compressio...Real time database compression optimization using iterative length compressio...
Real time database compression optimization using iterative length compressio...csandit
 
REAL TIME DATABASE COMPRESSION OPTIMIZATION USING ITERATIVE LENGTH COMPRESSIO...
REAL TIME DATABASE COMPRESSION OPTIMIZATION USING ITERATIVE LENGTH COMPRESSIO...REAL TIME DATABASE COMPRESSION OPTIMIZATION USING ITERATIVE LENGTH COMPRESSIO...
REAL TIME DATABASE COMPRESSION OPTIMIZATION USING ITERATIVE LENGTH COMPRESSIO...cscpconf
 
A Step-By-Step Disaster Recovery Blueprint & Best Practices for Your NetBacku...
A Step-By-Step Disaster Recovery Blueprint & Best Practices for Your NetBacku...A Step-By-Step Disaster Recovery Blueprint & Best Practices for Your NetBacku...
A Step-By-Step Disaster Recovery Blueprint & Best Practices for Your NetBacku...Symantec
 
Denver devops : enabling DevOps with data virtualization
Denver devops : enabling DevOps with data virtualizationDenver devops : enabling DevOps with data virtualization
Denver devops : enabling DevOps with data virtualizationKyle Hailey
 
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...VMware Tanzu
 
Deep Dive on Delivering Amazon EC2 Instance Performance
Deep Dive on Delivering Amazon EC2 Instance PerformanceDeep Dive on Delivering Amazon EC2 Instance Performance
Deep Dive on Delivering Amazon EC2 Instance PerformanceAmazon Web Services
 
Fundamentals of storage Unit III Backup and Recovery.ppt
Fundamentals of storage Unit III Backup and Recovery.pptFundamentals of storage Unit III Backup and Recovery.ppt
Fundamentals of storage Unit III Backup and Recovery.pptRahul Borate
 
Data Virtualization: Revolutionizing data cloning
Data Virtualization: Revolutionizing data cloningData Virtualization: Revolutionizing data cloning
Data Virtualization: Revolutionizing data cloning Kyle Hailey
 
R1Soft CDP 3.0 Key Features
R1Soft CDP 3.0 Key FeaturesR1Soft CDP 3.0 Key Features
R1Soft CDP 3.0 Key FeaturesR1Soft
 

Similaire à Truly non-intrusive OpenStack Cinder backup for mission critical systems (20)

Combining IBM Real-time Compression and IBM ProtecTIER Deduplication
Combining IBM Real-time Compression and IBM ProtecTIER DeduplicationCombining IBM Real-time Compression and IBM ProtecTIER Deduplication
Combining IBM Real-time Compression and IBM ProtecTIER Deduplication
 
DevOps, Databases and The Phoenix Project UGF4042 from OOW14
DevOps, Databases and The Phoenix Project UGF4042 from OOW14DevOps, Databases and The Phoenix Project UGF4042 from OOW14
DevOps, Databases and The Phoenix Project UGF4042 from OOW14
 
BGOUG "Agile Data: revolutionizing database cloning'
BGOUG  "Agile Data: revolutionizing database cloning'BGOUG  "Agile Data: revolutionizing database cloning'
BGOUG "Agile Data: revolutionizing database cloning'
 
Deep Dive on Delivering Amazon EC2 Instance Performance
Deep Dive on Delivering Amazon EC2 Instance PerformanceDeep Dive on Delivering Amazon EC2 Instance Performance
Deep Dive on Delivering Amazon EC2 Instance Performance
 
Systore07 V4
Systore07 V4Systore07 V4
Systore07 V4
 
High Availability And Oracle Data Guard 11g R2
High Availability And Oracle Data Guard 11g R2High Availability And Oracle Data Guard 11g R2
High Availability And Oracle Data Guard 11g R2
 
Choosing the Right EC2 Instance and Applicable Use Cases - AWS June 2016 Webi...
Choosing the Right EC2 Instance and Applicable Use Cases - AWS June 2016 Webi...Choosing the Right EC2 Instance and Applicable Use Cases - AWS June 2016 Webi...
Choosing the Right EC2 Instance and Applicable Use Cases - AWS June 2016 Webi...
 
Real time database compression optimization using iterative length compressio...
Real time database compression optimization using iterative length compressio...Real time database compression optimization using iterative length compressio...
Real time database compression optimization using iterative length compressio...
 
REAL TIME DATABASE COMPRESSION OPTIMIZATION USING ITERATIVE LENGTH COMPRESSIO...
REAL TIME DATABASE COMPRESSION OPTIMIZATION USING ITERATIVE LENGTH COMPRESSIO...REAL TIME DATABASE COMPRESSION OPTIMIZATION USING ITERATIVE LENGTH COMPRESSIO...
REAL TIME DATABASE COMPRESSION OPTIMIZATION USING ITERATIVE LENGTH COMPRESSIO...
 
Evaluator Group on TS7680 ProtecTIER for z/OS
Evaluator Group on TS7680 ProtecTIER for z/OSEvaluator Group on TS7680 ProtecTIER for z/OS
Evaluator Group on TS7680 ProtecTIER for z/OS
 
IBM ProtecTIER Deduplication for z/OS
IBM ProtecTIER Deduplication for z/OSIBM ProtecTIER Deduplication for z/OS
IBM ProtecTIER Deduplication for z/OS
 
A Step-By-Step Disaster Recovery Blueprint & Best Practices for Your NetBacku...
A Step-By-Step Disaster Recovery Blueprint & Best Practices for Your NetBacku...A Step-By-Step Disaster Recovery Blueprint & Best Practices for Your NetBacku...
A Step-By-Step Disaster Recovery Blueprint & Best Practices for Your NetBacku...
 
Denver devops : enabling DevOps with data virtualization
Denver devops : enabling DevOps with data virtualizationDenver devops : enabling DevOps with data virtualization
Denver devops : enabling DevOps with data virtualization
 
optimizing_ceph_flash
optimizing_ceph_flashoptimizing_ceph_flash
optimizing_ceph_flash
 
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
Start Counting: How We Unlocked Platform Efficiency and Reliability While Sav...
 
Deep Dive on Delivering Amazon EC2 Instance Performance
Deep Dive on Delivering Amazon EC2 Instance PerformanceDeep Dive on Delivering Amazon EC2 Instance Performance
Deep Dive on Delivering Amazon EC2 Instance Performance
 
11g R2
11g R211g R2
11g R2
 
Fundamentals of storage Unit III Backup and Recovery.ppt
Fundamentals of storage Unit III Backup and Recovery.pptFundamentals of storage Unit III Backup and Recovery.ppt
Fundamentals of storage Unit III Backup and Recovery.ppt
 
Data Virtualization: Revolutionizing data cloning
Data Virtualization: Revolutionizing data cloningData Virtualization: Revolutionizing data cloning
Data Virtualization: Revolutionizing data cloning
 
R1Soft CDP 3.0 Key Features
R1Soft CDP 3.0 Key FeaturesR1Soft CDP 3.0 Key Features
R1Soft CDP 3.0 Key Features
 

Dernier

How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 

Dernier (20)

How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 

Truly non-intrusive OpenStack Cinder backup for mission critical systems

  • 1. Truly non-intrusive cinder backup for mission critical systems Lightening talk by Dipak Kumar Singh & Deepak Gupta on 06 Nov 2017 at OpenStack Summit, Sydney, Australia
  • 2. Table of Content  Challenges of Backup of Live system Example of sanity error of data, Implication of OS buffer cache on sanity at crash or live backup etc.  Current Approach & Proposed Approach Current approach, Idea on which proposed solution is based etc.  Proposed Solution & POC Design information of POC, result of POC’s validation, Known Limitations, Next Step etc.  Appendix Common Questions, References, Experimental Data etc.
  • 3. 3 Background ▌Reliability of data and its availability is one of the key requirements for Mission critical systems. ▌Openstack ensures data availability by keeping multiple copies of data in storage nodes. It also facilitates backup for Disaster Recovery. ▌However, when it comes to Point-in-Time backup of live system, Openstack relies on volume snapshot and VM pause. Those solutions are not fool proof because impact of OS buffer cache is not accounted. Secondly, file system journaling & fsck does not always work. ▌This presentation talks about problem scenarios and their probable solutions for truly non-intrusive backup in Openstack.
  • 4. Challenges of Backup of live system
  • 5. 5 Simple example of Sanity of Data (1/3) Let’s understand sanity of data first in context of backup. During a backup process, an application makes two changes shown below Name: Rahul Id: 11755 Blood Test Status: Wait Blood Test File: Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 200 bytes Name: Rahul Id: 11755 Blood Test Status: NA Blood Test File: Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 200 bytes ... File: Test_79.pdf Size: 200 bytes Name: Rahul Id: 11755 Blood Test Status: OK Blood Test File: Test79.pdf Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 209 bytes ... File: Test79.pdf Size: 200 bytes TIME State0(initially) State1(at T1) State2(at T2) Second Change: Two lines modified at time T2 First Change: A new file created at time T1 One of change at T2 points to file created at T1
  • 6. 6 Simple example of Sanity of Data (2/3) Total of three states created by the application during backup process as shown below. Name: Rahul Id: 11755 Blood Test Status: Wait Blood Test File: Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 200 bytes Name: Rahul Id: 11755 Blood Test Status: NA Blood Test File: Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 200 bytes ... File: Test_79.pdf Size: 200 bytes Name: Rahul Id: 11755 Blood Test Status: OK Blood Test File: Test79.pdf Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 209 bytes ... File: Test79.pdf Size: 200 bytes TIME State0(initially) State1(at T1) State2(at T2) Data restore must bring back to any one of these three states created by application. Any other data restore is a sanity error.
  • 7. 7 Simple example of Sanity of Data (3/3) What if second change is saved in backup but not first. Recovered File- system will look like as shown in RHS. Name: Rahul Id: 11755 Blood Test Status: Wait Blood Test File: Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 200 bytes Name: Rahul Id: 11755 Blood Test Status: NA Blood Test File: Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 200 bytes ... File: Test_79.pdf Size: 200 bytes Name: Rahul Id: 11755 Blood Test Status: OK Blood Test File: Test79.pdf Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 209 bytes ... File: Test79.pdf Size: 200 bytes TIME State0(initially) State1(at T1) State2(at T2) Restored(at T3) Name: Rahul Id: 11755 Blood Test Status: OK Blood Test File: Test79.pdf Name: Vicky Id: 11755 … File: PatientsRecord.txt Size: 209 bytes Restored a state which was never generated by the application. Data Restore
  • 8. 8 Point In Time Backup & Snapshot ▌Backup must be taken for any point of time in the history. For example state0,state1 or state2 in the example of previous slide. ▌Technically, this concept is called ‘Point In Time’ Backup. PIT is commonly used in context of Disaster Recovery. Backup software are expected to take PIT backup. ▌Snapshot of volume ensures PIT data of the volume in use. Then backup is taken from snapshot. ▌Since snapshot is created for the volume, data in Operating Systems buffer cache is not captured. So snapshot is not enough for PIT backup of live system.
  • 9. 9 Implication of Buffer Cache on sanity of data ▌Data from buffer cache is written to disk with goal of better I/O. Invariably, the order in which applications writes to OS is different than order in which data is written to Disk. This is technically called out-or-order write. Example shown in diagram in next slide. Therefore, sanity of data is lost. ▌Journaling is used to ensure sanity of data. Journaling has its cost too in I/O. Therefore, default ext4 journaling mode ensures file system consistency only, not file’s data. An experiment has shown that file size became zero for new files created within 30 seconds of crash in ext4 mounted with default options. Refer to Appendix section for experimental data. Note that in OS, file-system integrity is tradeoff against performance.
  • 10. 10 Example of Out-of-Order Write of OS buffer cache data TIME S0 at T0 S1 at T1 S2 at T2 BadState1 at T2’ S2 at T2’’ AsinVolume Aswrittenby Applicationand availableinOS No such state ever in OS What if this goes in snapshot. Name: Rahul Id: 11755 Test Status: Test File: … File: Patients.txt Size: 209 bytes Name: Rahul Id: 11755 Test Status: Test File: … File: Patients.txt Size: 209 bytes ... File: Test_79.pdf Size: 200 bytes Name: Rahul Id: 11755 Test Status: OK Test File: Test79.pdf … File: Patients.txt Size: 209 bytes ... File: Test_79.pdf Size: 200 bytes Name: Rahul Id: 11755 Test Status: OK Test File: Test79.pdf … File: Patients.txt Size: 209 bytes Name: Rahul Id: 11755 Test Status: OK Test File: Test79.pdf … File: Patients.txt Size: 209 bytes ... File: Test79.pdf Size: 200 bytes Out of Order write to disk Second application write flushed to Volume before first write Both writes flushed. OK. An application’s state S2 reached Name: Rahul Id: 11755 Test Status: Test File: … File: Patients.txt Size: 209 bytes
  • 11. Current Approach & Proposed Approach
  • 12. 12 Current approach to take backup of Live system ▌The most common solution to take backup of live volume is a two step process a) Create a snapshot of live volume. That might involve momentarily stopping the volume effectively the VM. b) Take backup from snapshot. ▌Pause time by (a) is usually imperceptible in current implementations of Virtual machines and Storage making it practically non-intrusive. ▌This approach is same as removing power cable from a machine, then take backup of disks attached to the machine. ▌Journaling would ensure sanity of file-system but not data unless performance costly journaling of ‘metadata+data’ is used.
  • 13. 13 Idea of Proposed Solution – Briefly stop buffer caching of OS ▌Proposed solution is based on very simple idea of disabling buffer cache briefly for taking snapshot. ▌That is effectively making OS write-through ▌Simple CLI call on Linux will do this job ▌Proposed design and POC has been shared in subsequent slides of this presentation. ▌Feedback on its benefit in real world, proposed design and POC is solicited from the audience so that this idea can see the light of the day.
  • 14. 14 How to disable buffer cache on Linux? ▌/proc/sys/vm/dirty_bytes defines max dirty bytes in Linux buffer cache. ▌When this value of dirty pages is reached, subsequent write() call will become write-through. ▌dirty_bytes can be changed to low value, not zero, in running Linux using CLI sysctl ▌Lower dirty_bytes value increases write() latency to the tune of milliseconds. Therefore, original value is reverted back as soon as snapshot is created.
  • 16. 16 Design Guidelines ▌Disable buffer cache of Guest Linux OS temporarily ▌Use other standard steps of Cinder Cinder’s snapshot and cinder backup from snapshot are used in solution so that code change is minimal. ▌Output file: Backup file produced by live system backup is exactly same as cinder’s regular backup files. Therefore, exactly same standard restore process of OpenStack is used for restoring the data. ▌Proposed Use Model  Adding new option in CLI cinder looks a good solution $ cinder backup-create –-livebackup <instanceId> ...
  • 17. 17 Sequence of Steps for Backup 1. Retrieve current buffer cache config 2. Make buffer cache zero 4. Restore back buffer cache config Guest OSCinder 3. Generate snapshot of Volume(s) attached to Guest OS 5. Take backup from snapshot(s) 6. Delete snapshot(s) Live Backup Controller Access to Guest OS is required. Only impact is on write I/O latency in this phase. Guest OS continue to run
  • 18. 18 Entity Relationship Diagram Controller Node POC – Overview ▌POC was based on “Sequence of Steps for Backup” mentioned in previous slide. User POC Code Guest OS (Linux) Cinder SWIFT Storage User Input 1. Instance Id, 2. Guest OS login & password Backup is produced on SWIFT as usualDatabase
  • 19. 19 POC Validation - Using count of ‘new files with zero size’ ▌The POC was able to take backup and restore data. ▌POC was also validated by checking impact of ‘delayed block allocation’ in ext4 in the new approach. ▌Steps of Validation Take backup of Guest OS when following script is running for i in {0..600} # Create new files every second do echo hello > NewFile_$i.txt # Creating new file of 6 bytes sleep 1 done Restore backup. Count number of zero size files. Does count decrease in new approach? Observation shared in next slide.
  • 20. 20 POC Validation – Result of ‘new files with zero size’ ▌ Number of zero size files seen after restore of live backup of Linux Guest OS. ▌ At most one file of zero size is expected Run 1 Run 2 Run 3 Run 4 Run a Run b Run c Run d Zero size file count 32 6 34 30 1 1 1 1 32 6 34 30 1 1 1 1 Run Environment Guest OS ubunto16 4.4.0-97-generic File-system Default ext4 mount option used (barrier=1,data=ordered) Run environment No load on the Hypervisor and Guest OS during testing Steps Standard OpenStack forced snapshot then backup used POC enabled ( dirty_bytes made very low during snapshot) ▌ Result: Magnitude of files with zero size has drastically reduced.
  • 21. 21 Known Limitation & Next step ▌Known Limitation Min values supported by dirty_bytes is two pages only, not zero. It can’t be disabled completely. This is documented behavior of Linux. Reason of limit is being explored to find way to make it absolute zero. ▌Next Step Based on feedback received on idea and POC result, next course of AIs will have to be decided.
  • 23. 23 Common Questions ▌ Can all volumes attached to a Guest OS be backed up together? Solution does facilitate it. However, snapshot feature of OpenStack should support it too. ▌ Can multiple Guest OSes be backup up together? Same answer as above. ▌ Is it truly non-intrusive?  Yes. Guest OS continues to run  However, IO write latency would go high for some time.  Depending of snapshot feature of Cinder, Guest OS might be momentarily paused. If SAN’s hardware level snapshot is used, then no pause would be involved.
  • 24. 24 References and Useful reading (1/2) ▌Impact of ‘Delayed Block Allocation’ on ext4’s sanity  Linus Torvalds Upset over Ext3 and Ext4 http://www.linux- magazine.com/Online/News/Linus-Torvalds-Upset-over-Ext3-and-Ext4  ext4 and data loss by Jonathan Corbet https://lwn.net/Articles/322823/ ▌Filesystem Journal  Description ‘2.1 File-system consistency’ of https://www.usenix.org/system/files/conference/fast12/chidambaram.pdf  Ext4 journal options at section ‘3. Options’ of https://www.kernel.org/doc/Documentation/filesystems/ext4.txt ▌Linux buffer cache  http://www.tldp.org/LDP/sag/html/buffer-cache.html  https://www.kernel.org/doc/Documentation/sysctl/vm.txt  Section ‘14.3.2 Writeback Parameters’ https://doc.opensuse.org/documentation/leap/tuning/html/book.sle.tuning/cha.tuning.me mory.html#cha.tuning.memory.vm
  • 25. 25 References and Useful reading (2/2) ▌Alternative solutions to take backup of live systems  Volume Shadow Copy Service (VSS) on Windows  Vmsync on VMware Hypervisor ▌Openstack’s backup and restore  Back up and restore volumes and snapshots https://docs.openstack.org/cinder/latest/admin/blockstorage-volume-backups.html  Cinder CLI https://docs.openstack.org/python-cinderclient/latest/cli/details.html ▌Code used in experiments At https://github.com/saurabh0095/Unix-IO-test
  • 26. 26 Experiment – Impact of ‘Delayed Block Allocation’ on ext4 (1/2) ▌Test Objective : To demonstrate magnitude of data loss at OS crash due to delayed block allocation, an experiment was performed. ▌Test Steps & Observations Create New files with small amount of data every second. The system is crashed by removing power cable. At system recovery, 30-35 recent files are seen of zero size. Expectation is that at most one file, which was getting written at the time of crash, should be of zero size. ▌Cause Zero file size is seen because default journaling mode of ext4 write metadata . But actual data write is delayed due to ‘Delayed Block Allocation’ leading to inconsistency by losing data of file.
  • 27. 27 Experimental Data – ‘Delayed Block Allocation’ on ext4 (2/2) ▌ Number of zero size files seen after Guest OS was crashed by simulating power cable removal on two different virtual machines. ▌ At most one file of zero size is expected. Around 30 files were seen. Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Zero size file count 35 30 32 35 33 36 30 35 33 29 35 30 32 35 33 36 30 35 33 29 Two Run Environments - same result Hypervisor VirtualBox 4.3.28 on Window Microsoft Hyper-V (2016 Standard) Guest OS ubunto16 4.4.0-97-generic RHEL 7 3.10.0-123.el7.x86_64 Default ext4 mount option used (barrier=1,data=ordered) No load on the Hypervisor and Guest OS during testing
  • 28. 28 Contact Information of Authors Dipak Kumar Singh Deepak Gupta Senior Solutions Architect, IT Platforms Deputy General Manager, IT Platform NEC Technologies India Pvt. Ltd. dipak.singh@india.nec.com deepak.gupta@india.nec.com dipak123@gmail.com dkumargupta@gmail.com http://linkedin.com/in/dipak123 https://www.linkedin.com/in/dkumargupta/ • https://www.openstack.org/summit/sydney-2017/summit-schedule/events/19305/truly-non-intrusive-openstack-cinder- backup-for-mission-critical-systems • https://www.openstack.org/assets/presentation-media/OpenStack-Truly-non-intrusive-Cinder-backup-1.1.pptx