Data deduplication is a technique for reducing the amount of storage space an organization needs to save its data. Several types of data deduplication are used in data backup and storage processes.
Data deduplication and its different types
1. What Is Data Deduplication Technology?
Data deduplication is a technique for reducing the amount of
storage space an organization needs to save its data.
2. Benefits of Data Deduplication
It reduces the amount of disk or tape that organizations need to buy.
It can reduce storage requirements by up to 95 percent, depending on how redundant the data is.
It can reduce the amount of network bandwidth required for backup
processes.
It can speed up the backup and recovery process.
It can save money and time.
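To put the "up to 95 percent" figure in context, the short sketch below relates space savings to the deduplication ratio. The function names and byte counts are illustrative assumptions, not figures from any product.

```python
def space_savings_percent(logical_bytes: int, stored_bytes: int) -> float:
    """Percentage of capacity saved: logical data size vs. what is actually stored."""
    if logical_bytes <= 0:
        raise ValueError("logical_bytes must be positive")
    return (1 - stored_bytes / logical_bytes) * 100


def dedup_ratio(logical_bytes: int, stored_bytes: int) -> float:
    """Deduplication ratio, e.g. 20.0 means 20:1."""
    if stored_bytes <= 0:
        raise ValueError("stored_bytes must be positive")
    return logical_bytes / stored_bytes


# Hypothetical example: 20,000 bytes of backups stored in 1,000 bytes.
savings = space_savings_percent(20_000, 1_000)
ratio = dedup_ratio(20_000, 1_000)
print(f"{savings:.1f}% saved at {ratio:.0f}:1")
```

Under these assumptions, a 95 percent reduction corresponds to a 20:1 ratio, which is typically only reached with highly repetitive data such as repeated full backups.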
3. Types of Data Deduplication
Source Deduplication
Target Deduplication
Inline Deduplication
Post-Process Deduplication
Global Deduplication
4. Source Deduplication
Source deduplication is the removal of redundancies from data before transmission to
the backup target.
It uses client software to compare new data blocks on the primary storage
device with previously backed-up data blocks.
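The client-side comparison can be sketched as follows. This is a minimal illustration, assuming fixed-size blocks and SHA-256 fingerprints; `source_dedup`, `known_hashes`, and the block size are hypothetical names and values, and real backup clients often use variable-size chunking instead.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed fixed block size, for illustration only


def split_blocks(data: bytes, size: int = BLOCK_SIZE) -> list:
    """Cut the data into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def source_dedup(data: bytes, known_hashes: set, size: int = BLOCK_SIZE):
    """Run on the client before transmission.

    Returns (recipe, to_send):
      recipe  - ordered block fingerprints needed to rebuild the data
      to_send - only the blocks the backup target does not already hold
    """
    recipe = []
    to_send = {}
    for block in split_blocks(data, size):
        digest = hashlib.sha256(block).hexdigest()
        recipe.append(digest)
        if digest not in known_hashes and digest not in to_send:
            to_send[digest] = block
    return recipe, to_send
```

Only `to_send` and the small `recipe` cross the network, which is where the bandwidth saving comes from.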
5. Target Deduplication
Target deduplication removes redundant data in the backup appliance, most often
a virtual tape library (VTL) or a NAS device.
It reduces the storage capacity required for backup data but does not reduce the amount
of data sent across the LAN or WAN.
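A minimal sketch of this behavior, assuming fixed-size blocks and SHA-256 fingerprints: the hypothetical `DedupTarget` class stands in for the backup appliance, which receives every byte over the network but stores each unique block only once.

```python
import hashlib


class DedupTarget:
    """Toy backup appliance that deduplicates after receiving the full stream."""

    def __init__(self, block_size: int = 4096):
        self.block_size = block_size
        self.blocks = {}         # fingerprint -> unique block bytes
        self.backups = {}        # backup name -> ordered list of fingerprints
        self.bytes_received = 0  # everything sent across the LAN or WAN

    def ingest(self, name: str, data: bytes) -> None:
        self.bytes_received += len(data)
        recipe = []
        for i in range(0, len(data), self.block_size):
            block = data[i:i + self.block_size]
            digest = hashlib.sha256(block).hexdigest()
            self.blocks.setdefault(digest, block)  # keep first copy only
            recipe.append(digest)
        self.backups[name] = recipe

    def bytes_stored(self) -> int:
        return sum(len(b) for b in self.blocks.values())

    def restore(self, name: str) -> bytes:
        return b"".join(self.blocks[d] for d in self.backups[name])
```

Comparing `bytes_received` with `bytes_stored()` after a few similar backups shows the trade-off: storage shrinks, network traffic does not.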
6. Inline Deduplication
Inline deduplication is the removal of redundancies from data before or as it is being
written to a backup device.
Inline deduplication reduces the amount of redundant data in an application and the
capacity needed for the backup disk targets.
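As a rough sketch of the inline approach, the hypothetical `InlineDedupStore` below checks a fingerprint index on the write path, so a duplicate block never reaches the simulated backup disk. The class, its fields, and the use of SHA-256 are assumptions made for illustration.

```python
import hashlib


class InlineDedupStore:
    """Toy write path that deduplicates each block before it is written."""

    def __init__(self):
        self.index = {}           # fingerprint -> position of the block on disk
        self.disk = []            # simulated backup disk holding unique blocks
        self.physical_writes = 0  # blocks actually written

    def write(self, block: bytes) -> int:
        """Return the disk position that represents this block."""
        digest = hashlib.sha256(block).digest()
        pos = self.index.get(digest)
        if pos is None:
            # First time this content is seen: write it and index it.
            pos = len(self.disk)
            self.disk.append(block)
            self.index[digest] = pos
            self.physical_writes += 1
        return pos
```

The index lookup sits in the critical path of every write, which is the usual cost of doing the work inline.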
7. Post-Process Deduplication
Post-process deduplication writes the backup data into the disk cache before it starts the
dedupe process.
It is mostly used in backup applications, virtual tape libraries, and similar systems where
a shorter backup time is required, because the backup is not slowed by deduplication as it is written.
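The two phases can be sketched as follows. `PostProcessStore` is a hypothetical class: `land` writes blocks straight to the cache with no fingerprinting, and `dedupe` runs later as a separate pass. SHA-256 and the in-memory structures are illustrative assumptions.

```python
import hashlib


class PostProcessStore:
    """Toy store that lands raw data first and deduplicates afterwards."""

    def __init__(self):
        self.cache = []   # raw blocks exactly as they arrived
        self.blocks = {}  # fingerprint -> unique block, filled by dedupe()
        self.recipe = []  # ordered fingerprints for restoring the data

    def land(self, block: bytes) -> None:
        """Fast path during the backup window: no hashing, just append."""
        self.cache.append(block)

    def dedupe(self) -> None:
        """Deferred pass: fingerprint cached blocks, keep one copy of each."""
        for block in self.cache:
            digest = hashlib.sha256(block).hexdigest()
            self.blocks.setdefault(digest, block)
            self.recipe.append(digest)
        self.cache.clear()

    def restore(self) -> bytes:
        return b"".join(self.blocks[d] for d in self.recipe)
```

Note that the cache must be large enough to hold the full, undeduplicated backup until `dedupe` has run.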
8. Global Deduplication
Global data deduplication is a method of eliminating redundant data when backing up data
to multiple deduplication devices.
It removes redundant backup data across multiple systems, not just within a single device.
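One way to picture this is several deduplication devices consulting a shared fingerprint index. The sketch below is an assumption-laden toy: `DedupNode` and the plain dictionary used as the shared index are hypothetical, and real systems distribute and protect this index in more elaborate ways.

```python
import hashlib


class DedupNode:
    """Toy deduplication device that consults an index shared by all nodes."""

    def __init__(self, name: str, shared_index: dict):
        self.name = name
        self.shared_index = shared_index  # fingerprint -> name of owning node
        self.blocks = {}                  # blocks physically held by this node

    def store(self, block: bytes) -> str:
        """Store the block unless any node already has it; return the owner."""
        digest = hashlib.sha256(block).hexdigest()
        if digest not in self.shared_index:
            self.shared_index[digest] = self.name
            self.blocks[digest] = block
        return self.shared_index[digest]


# Two devices sharing one index keep a single copy of identical data.
index = {}
node_a = DedupNode("a", index)
node_b = DedupNode("b", index)
node_a.store(b"same block")
owner = node_b.store(b"same block")  # already held by node "a"
```

Without the shared index, each device would keep its own copy of the block.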