Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
The future of tape april 16
1. IBM Future of Tape
Tape Storage Solutions
TAPE $AVES: COST • ENERGY • DATA • COMPANY
Josef (Sepp) Weingand
Business Development Leader DACH – Data Protection & Retention
Infos / Find me on: weingand@de.ibm.com, +49 171 5526783
http://sepp4backup.blogspot.de/
http://www.linkedin.com/pub/josef-weingand/2/788/300
http://www.facebook.com/josef.weingand
http://de.slideshare.net/JosefWeingand
https://www.xing.com/profile/Josef_Weingand
https://www.xing.com/net/ibmdataprotection
2. IBM SystemsIBM Systems
Why using Tape – Past, Current and in the
Future
Very energy efficient: no power needed once data is recorded
Very secure:
• Data is inaccessible when cartridge is not mounted
• Drive level encryption
• Portable
Very long expected media lifetime (30+ years)
Very reliable: Typically no data loss in case of drive failure
Main net advantage of tape for archival storage is cost
Capacity – future improvements in Capacity / Roadmap
| 2
3. IBM SystemsIBM Systems
The data deluge
80% of all files created are inactive
no access in at least 3 months!
=> NAS: Never Access Storage
| 3
Source: D. Anderson, 2013 IEEE Conf. on Massive Data Storage
4. IBM SystemsIBM Systems
HDD ?!?
HDD has reached the limit of (known) materials to produce
larger write fields:
• Areal density/capacity scaling achieved by shrinking the same basic
technology to write smaller and smaller bits on disk
Technologies to go beyond the superparamagnetic limit:
• Two dimensional magnetic recording (TDMR)
• Heat Assisted Magnetic Recording (HAMR)
• Microwave Assisted Magnetic Recording (MAMR)
• Bit Patterned Media (BPM)
Recent Capacity Scaling of HDD: Volumetric Density
• Slow down in areal density scaling partially compensated by adding
more disks: conventional technology has reached space limit (~5
platters)
• Helium filled drive less turbulence thinner disks higher capacity
• WD 6TB (2013) 6 platters
• HGST 10TB Drive (2015) 7 platters w/ shingled magnetic recording
• Doesn’t scale: No space for more heads and platters!
| 4
Magnetic Media “Trilemma”:
6. IBM SystemsIBM Systems
NAND Area Density
NAND has three strategies for increasing bits per unit area in a silicon chip
1. Lithographic scaling of the bit cell (x, y) dimensions by reducing F*
• - 20nm to 16nm 1.56X more density
• - 16nm to 13nm 1.51X more density
2. Increasing the number of bits per cell
• - 1 bit per cell to MLC (2 bits per cell) 2.00x increase
• - MLC to TLC (3 bits per cell) 1.50x increase
3. 3D stacking (larger bit cell but multiple layers of cells)
• Example: 16 nm goes to 48 nm and cell design goes from 4F2 to 6F2 so cell area increases 13.5x But by using 27 layers the
effective density on the surface of the wafer increases by 2.00x (27/13.5)
| 6
3D Design Example
- Basic Cell 2F x 3F (F is minimum feature)
- 12 cells per layer
- 4 layers
- 2 bits or 3 bits per cell
F
7. IBM SystemsIBM Systems
Flash Challenges
Flash reliability becoming worse with technology scaling – endurance constantly decreasing (19nm: 3k
cycles for c-MLC, 10k cycles for e-MLC)
Reliability targets are met with increasingly higher effort and cost:
• exponentially higher ECC effort
• significant latency/bandwidth penalty
2D Floating Gate Flash faces significant scaling challenges below
15nm node
3D NAND allows ‘Effective Flash scaling’ into sub-10nm
• 3D-NAND offers cost-effective path for sub-10nm Flash scaling:
non-EUV-based solution, lithography ground rule relaxed to ~40nm
| 7
Flash scaling effects on endurance, ECC
Technology Node (nm)
Very challenging to maintain same performance, write endurance, and retention specs beyond planar 1x nm.
Solution is 3D cell NAND technology !
Next Step: Storage Class Memory – e.g. PCM
… The $1B question is… when?
When will DRAM and NAND truly stop scaling?
When will manufacturers make large-scale investments in a new
technology?
8. IBM SystemsIBM Systems
Physical Bit Cells (2015): NAND, HDD and TAPE
Bit cells shown at scale
NAND struggles with bit stability below 19nm for MLC/TLC Trend is
to transition to larger cells with vertical (3d) stacking
HDD struggles with difficulty writing small magnetic grains needed
to enable future areal density growth
• shingling for now, HAMR for the future
Tape’s large bit cell suggests there is room to grow
| 8
TAPE
7 Gbit/in²
2000nm x 100nm
HDD
840 Gbit/in²
60nm x 11nm
NAND - MLC
1024 Gbit/in²
23nm x 23nm
NAND - TLC
1500 Gbit/in²
19nm x 19nm
9. IBM SystemsIBM Systems
Tape Technology Demonstration IBM Rüschlikon – May 2015
Areal recording density :
123 Gb/in2
88x LTO6 areal density
220 TB cartridge capacity
This demonstration shows that tape technology has the potential for significant capacity increase for years
to come!
= 9777 h@50Mbps – 407 d@50Mpbs
HDD Technology:
• No room to continue adding platters
• HDD capacity will be driven by areal density scaling (10-20% /a)
Cost advantage of tape will continue to grow!
10. IBM SystemsIBM Systems
Demo Technologies
Focus on aggressive track density scaling
Require:
• dramatic improvement in track following enables track width reduction
• reduce reader width from a few microns to 90 nm
Ultra narrow reader results in a dramatic loss in read back signal
that must be compensated for with
• improved media technology require improved writer technology
• Based on current BaFe Media
The coercivity of metal particles smaller than 3000 nm³ decrease with size
The coercivity of BaFe particles can be tuned independently of size enabling small particle media
with long archival lifetime
• improved signal processing and coding
• improved reader technology
Improved SNR
| 10
11. IBM SystemsIBM Systems
IBM-FujiFilm demonstration of 123 Gb/in2 on BaFe tape
| 11
123 Gbit/in2 demo
(Source: INSIC 2015-2025 International Magnetic Tape Storage Roadmap)
http://www.insic.org/news/2015%20roadmap/15pdfs/INSIC%20Areal%20Density%20Trend%20Chart.pdf
12. IBM Systems
INSIC 2015-2025 Tape Roadmap
http://www.insic.org/news/2015%20roadmap/15pdfs/2015%20Technical%20Roadmap.pdf
13. IBM Systems
Any statements regarding IBM's future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
IBM Tape Drive Roadmaps
TS1100
Generations
Gen-1
3592 J1A
Gen-2
TS1120
Gen-3
TS1130
Gen-4
TS1140
Gen-5
TS1150
Gen-6 Gen-7
Max Native
Capacity
300 GB
(JA)
700 GB
(JB)
1.0 TB
(JB)
4.0 TB
(JC)
10 TB
(JD)
15-24 TB 30-40 TB
Other Native
Capacities with
Media Reuse
500 GB
JA
640 GB JA 1.6 TB
JB
640 GB JA
R/O
7 TB JC 8-10 TB JC
15-20 TB JD
8-10 TB JC
15-20 TB
JD
30-40 TB JE
Native Data Rate 40 MB/s 100 MB/s 160 MB/s 250 MB/s Up to 360
MB/s
Up to 540 MB/s 1000 MB/s
2003
2005 2007 2009 2012
2006 2008 2011 2014
LTO
Generations
LTO-3 LTO-4 LTO-5 LTO-6 LTO-7 LTO-8 LTO-9 LTO-10
Max Native
Capacity
400 GB
(L3)
800 GB
(L4)
1.5 TB
(L5)
2.5 TB (L6) 6 TB (L7) Up to
12.8 TB
(L8)
Up to 25
TB (L9)
Up to 48
TB (L10)
Other Native
Capacities
200 GB
L2
100 GB L1 R/O
400 GB L3
200 GB L2 R/O
800 GB
L4
400 GB L3 R/O
1.5 TB L5
800 GB L4 R/O
2.5 TB L6
1.5 TB L5
R/O
6.4 TB L7
2.5 TB L6
R/O
12 TB L8
6.4 TB L7 R/O
25 TB L9
12.8 TB L8 R/O
Native Data
Rate
80 MB/s 120 MB/s 140 MB/s 160 MB/s 300 MB/s Up to 472
MB/s
Up to 708
MB/s
Up to 1100
MB/s
* Data Compression engine enhancement from 2:1 to 2.5:1
2015
14. IBM SystemsIBM Systems
Data Growth and the GAP with HDD Technology
Until 2014 disk capacity growth outperform data growth.
Now Data Growth is much higher than disk capacity growth.
Close the GAP with TAPE
Stop the discussion „Tape is dead“
15. IBM SystemsIBM Systems
IBM Linear Tape File System (LTFS)
Tape like USB-Stick
Self-describing tape format for archiving data to
tape
Improves efficiency, simplifies direct access and
management of files on tape
Four offerings:
• Standalone Drive Edition (SDE) - free
• Library Edition (LE) -free
• Spectrum Archive / Enterprise Edition (EE)
Supports LTO 5/6/7 and TS1140/TS1150 tape
drives
| 15
FilesCartridge1
Library
Cartridge2
16. IBM SystemsIBM Systems
IBM Spectrum Archive: Tape Tier
Spectrum Scale plus Spectrum Archive - Changing the economics of
storage with low cost file system based storage
Seamlessly incorporates tape storage to keep data online at much
lower costs
Data still listed in directories
Once data is accessed it is moved to disk
Other than longer access times, users have no idea data is stored on
tape
Featuring the award winning Linear Tape File System (LTFS)
technology
Up to 10TB in single tape cartridge now, 220 TB tape cartridge in
future (Research Demo in 2015)
Flash
Gold Pool
Disk
Silver Pool
Tier 1 Tier 2
Single name space
Spectrum Scale
CIO Finance Engineering
Tape
LTFS
Tier 3
Spectrum Archive
17. IBM SystemsIBM Systems
IBM Spectrum Archive: Policy-based Cost Optimization
•Powerful policy engine
–Example: File Heat measures how often the file is accessed.
–As the file gets “cold” move it automatically to a lower cost
storage pool
–Information Lifecycle Management
–Fast metadata ‘scanning’ and data movement
–Automated data migration to based on threshold
•Users not affected by data migration
–Single namespace
–Persistent view of the data
•Tape as the external pool of Spectrum Scale
Small files last
accessed > 30
days
last accessed
> 60days
Silver pool is
>60% full Drain it
to 20%
accessed
today and
file size is
<1G
Send it back to
Silver pool when
accessed
System pool
(Flash)
Gold pool
(SSD)
Silver pool
( NL SAS)
TS4500
Spectrum Archive
Automation
18. IBM SystemsIBM Systems
1. Redundancy: Supports Multiple Tape Library Attachment per GPFS cluster
Data replication to the pools in separate libraries for additional data resiliency
Total capacity expansion beyond single library limit
2. Resiliency and Compliance: Data recording on WORM Tape Cartridge
WORM cartridge for long-term records retention applications
Supports IBM 3592 WORM Cartridges (3592-JY and 3592-JZ Tapes)
3. Capacity Scalability: Expands the Storage Capacity with LTO-7 technology
2.5x-4x capacity in the same data center floor space (comparison with LTO-6 & LTO-5)
Supports the migration of file larger than 2.xTB (up to 6TB)
4. Interoperability: New Platform Support
Redhat Enterprise Linux Version 7.1
Spectrum Scale 4.1.1
What’s New in Version 1.2 (GA in Dec/2015)
19. IBM SystemsIBM Systems
• Allows to connect to two tape libraries to a
GPFS cluster
• The second library can be the same library
model as the first one, or different model
• Example 1: 1st Library = TS4500, 2nd
Library = TS3310
• Example 2: 1st Library = Logical Library
in TS3500 with LTO drives, 2nd Library =
Another Logical Library in same TS3500
with TS1150 drives
• An EE node will be connected to one of
two libraries
• The libraries can be connected to GPFS
cluster in single site, or can be placed in
the metro distance locations via GPFS
synchronous mirroring (stretched cluster)
Multiple Tape Library Attachment
Configuration of 4 node EE system with 2 libraries
Library 1
Pool 2S
Pool 1P
EE Node 1
F F
D D
T1 T2 T3
T4 T5
Ethernet
Library 2
Pool 2P
Pool 1S
D D
Ta Tb Tc
Td Te
D D
EE Node 2
F F
EE Node 3
F F
EE Node 4
F F
D D
GPFS Native ClientGPFS Native ClientGPFS Native Client
GPFS Native ClientGPFS Native ClientCIFS/SMB Client
GPFS Native ClientGPFS Native ClientNFS Client
GPFS Native ClientGPFS Native ClientFTP Client
SAN
Redundant Pool Pair
Redundant Pool Pair
EE Node Group 1 EE Node Group 2
GPFS
EE Cluster
20. IBM Systems
Potential Use Cases
Archiving
Archive large volumes of data
Retain data for long periods of
time
Leverage simplicity, tape TCO
and standardized format
Backup
Simple backup solution
Easy data access and restore
Leverage simplicity and tape
TCO and standardized format
Long-term Retention
Simple, policy based data
migration
Easy data access
Leverage simplicity and tape
TCO and standardized format
Scale-out NAS
Simple file system usage
Retain data for long periods of
time
Leverage simplicity, tape TCO
and standardized format
Cloud
Long term data retention
Easy data access
Leverage simplicity and tape
TCO and standardized format
Tiered Storage
Simple policy based data migration
Easy data access
Leverage simplicity and tape
TCO and standardized format
Simplicity * TCO * Standard Format
21. IBM SystemsIBM Systems
Customers looking for new ways to store cold data are looking at tape
21
“The spinning disk cartel will cringe at the premise that tape is faster/cheaper/better. They’ll make faces and tell
you why this is nonsense. They will tell you tape is not cool and you are not cool if you use tape. Don’t just trust
their skepticism. Do the research yourself and come to your own conclusions. You may find that you can drive
significant value for your organisation.”
David Vellante, Wikibon
22. IBM SystemsIBM Systems
Data Security / Protection with Tape
Tape offers:
• Read-After-Write Verification
• 2 independent ECC (=Raid6)
• IBM Jaguar Drives can recover a Media failure of about 11 mm
• Lifetime Warranty for IBM Tape Media
• Returns > 0,02% (10 000 Cartridges -> 2 broken)
• In addition Tape is a “offline” Media!
• No Virus, no software failure, no sabotage,
no human Errors…
SAS HDDs Bit Error Rate 10^16
• NL SAS 10^15
• http://www.enterprisestorageforum.com/storage-technology/sas-vs.-sata-1.html
With NL SAS you will get a Data loss after 110 TB!!!!!
Example: 50 TB/daily; 2,5 PB capacity
• Raid5 with 8 TB (8+1) => 470 HDDs -> 0,12 TB per HDDs / daily ->
• Failure=Data Loss after ~950 days = 2,5 years
With Tape: Data loss after 62 years!
| 22
23. IBM SystemsIBM Systems
FLAPE… to optimize your storage-
related costs
Adapt your storage to data value with the 2 complementary storages: Flash + Tape !
IBM FlashSystem for its outstanding performance
gains, microlatency, macro efficiency and
enterprise reliability.
IBM Tapes for unequalled cost-effectiveness, high
capacity, scalability and low power consumption &
footprint.
« HOT »
data
« COLD »
data
A cloud storage for
unstructured data
24. IBM Systems
Sample “Flape” Usage Configuration
LTFS EE
GPFS Global Name Space
Policy or Manual
Based Migration
GPFS
Library
N Scale-out
Nodes
GPFS
Flash Node
GPFS
Flash Node
Primary
LTFS EE
GPFS
Library
LTFS EE
GPFS
Impact Potential
•Costs 75% Less to deploy
•Requires 75% Less Power &
Cooling
•Requires 75% Less Footprint
•Doesn’t need to be rolled every 3-4
years with tech refreshes
•May cost 90% less over 10 years
25. IBM SystemsIBM Systems
D2F2T = FLAPE
TSM B/A Client
TSM for …
Storage Area Network
Local Area Network
Migration
TSM
Client
TSM
Client
Flash
Tape
5 PB Disk (with Scale-Out Disk/NAS)
–27,5 KW
Flape
–57 TB Flash
–5,5 PB TS4500 with 12 Drives
–1,4 KW
CAPEX=> Disk vs Flape = 4,7 : 1 $
with DeDup 3:1 (Tape compression 1,7:1)
–~ 10 PB
–3 PB Disk
•16,5 KW
–Flape
•1,4 KW
CAPEX => Disk vs Flape = 2,9 : 1 $
26. IBM SystemsIBM Systems
Press:
".... combination of flash and tape is better than tape alone or disk and tape for storing archival data. The
argument is based on tape being not only cheaper than disk, but actually faster than disk for streaming large
files."
"A core finding is that disk capacity prices are not decreasing as fast as those of flash or tape."
This tells us that the ability to get data off tape is getting faster, relatively, given the tape growth rates. And the
areal density is growing at approximately 30 per cent versus disk, which is growing at only 9.6 per cent. Data
can be extracted 4x faster from tape than it can from HDD.
.... tape can be faster than disk for large file
“…, tape is getting faster and is inexpensive and HDDs are not getting that much faster and,.”
http://www.theregister.co.uk/2014/07/16/flape/
Will the “Tape is Dead” Folks Please Sit Down?
….For them, tape's density and economies of scale remain an excellent backup and/or archival choice….. Tape is
proving more reliable than disk, especially lower cost disk. The National Energy Research Scientific
Computing Center (NERSC) reported that tape cartridges are up to four orders of magnitude more reliable
than SATA…. After showing signs of bottoming out a few years ago, 2013 sales stopped declining and 2014 is
seeing sales rise….. tape performance is generally superior with sequential access, which is why tape is
particularly useful with backup, archive and big data sets.
http://www.enterprisestorageforum.com/backup-recovery/will-the-tape-is-dead-folks-please-sit-down-1.html
| 26
27. IBM SystemsIBM Systems
Whitepaper
Tape Reaches New Markets as Innovations Accelerate
Abstract
The tape industry continues to gain significant momentum as tape has firmly established its long-
term role for effectively managing extreme data growth with new use cases. Both LTO (Linear
Tape Open) and enterprise tape products continue to deliver unprecedented storage capacities
per cartridge with the lowest total cost of ownership compared with all other existing storage
solutions. Steady developments have made tape technology the most reliable storage medium
available, now surpassing HDDs by three orders of magnitude in data reliability. As a result, tape
is well positioned to effectively address many data intensive industries including cloud,
entertainment, the internet, and high performance computing along with data intensive
applications such as big data, backup, recovery, archive, disaster recovery and compliance. Disk
technology has been advancing, but tape’s progress over the last ten years has been even
greater. Tape is now the most dynamic and sexy storage product on the market. If you don’t
believe it - just read on.
http://tapestorage.org/wp-content/uploads/2015/12/State-of-the-Tape-Market-2015-121915-Final-FM.pdf
| 27
28. IBM SystemsIBM Systems
The future of Tape begins now
TAPE $AVES: COST • ENERGY • DATA • COMPANY
The era of big data is creating demand for cost effective storage
solutions
IBM Resarch Rüschlikon/Zürich has proven the future of Tape with
220 TB Capacity.
• Tape remains the most cost-efficient and greenest technology for archival
storage and active archive applications
• Tape has a sustainable roadmap for at least another decade
• 123 Gbit/in2 areal density demo shows feasibility of multiple future tape generations
• Potential exists for the continued of scaling of tape beyond 123 Gbit/in2
• The cost advantage of tape over HDD and optical disk will continue to grow
Tape is safe & secure; ideal for Backup
„Tape is the last Line of Defense!“
| 28
29. IBM Systems
Josef (Sepp) Weingand
Business Development Leader DACH – Data Protection & Retention
Infos / Find me on: weingand@de.ibm.com, +49 171 5526783
http://sepp4backup.blogspot.de/
http://www.linkedin.com/pub/josef-weingand/2/788/300
http://www.facebook.com/josef.weingand
http://de.slideshare.net/JosefWeingand
https://www.xing.com/profile/Josef_Weingand
https://www.xing.com/net/ibmdataprotection
32. IBM SystemsIBM Systems
Trademarks
32
The following terms are trademarks or registered trademarks of the IBM Corporation in either the United States, other countries or both.
– IBM, GDPS, Spectrum Storage, Spectrum Archive, Spectrum Scale, System Storage, System z, Virtualization Engine
Linear Tape File System, Linear Tape-Open, LTO, the LTO Logo, Ultrium, and the Ultrium logo are trademarks of HP, IBM Corp. and Quantum in the U.S. and other countries.
Other company, product or service names may be trademarks or service marks of others
Notes de l'éditeur
A capacity of 100 terabytes of data is sufficient to store the text from 100 million books which would fill a book shelf stretching from Las Vegas to Phoenix and back again and would require about 5 million trees to be cut down and made into paper to print all of the books.
GPFS + LTFS LE+ (GLUES)
Global name space is used for file storage on disk.
GPFS migration to tape can be automated by Policy or manually kicked off.
User does not see the migration or where the files are only the global name space.
TSm migration is not required, tape data is stored in the LTFS format