Avere Systems optimizes seismic data processing workflows by flexibly scaling performance and reducing costs. Its solution improves throughput by 50% while halving the storage footprint using flash storage and auto-tiering, and it simplifies workflows by eliminating unnecessary data copies between specialty storage silos in favor of a unified storage system. The result is faster time to results, lower costs, and easier management compared to existing solutions from NetApp, EMC Isilon, Panasas, and Lustre/DDN.
Optimizing the Upstreaming Workflow: Flexibly Scale Storage for Seismic Processing Demands
1. Optimizing the Upstream Workflow
Flexibly Scaling Performance to Meet Seismic Processing Demands
AVERE SYSTEMS, INC
5000 McKnight Road, Suite 404
Pittsburgh, PA 15237
(412) 635-7170
averesystems.com
2. Seismic Processing Use Case – World’s Largest Exploration & Production Company
Proprietary & Confidential

Challenges
– Lower performance
– Larger footprint
– 600TB of unnecessary SATA disks
– No Flash/SSD
– Multiple manual copies between Panasas and NetApp required
– Proprietary client code complicates compute farm management

Avere Benefits
– 50% higher throughput
– 50% smaller footprint
– Lower cost:
  – 33% lower CAPEX ($/op)
  – 50% lower OPEX (space, power, cooling)
– Auto-tiering of hot data blocks to/from the Avere cluster
– Smooth transitions between workflow stages

(Diagram. Before: Compute Farm backed by a NetApp FAS 3270 and a Panasas cluster, with manual copies between them. After: Compute Farm backed by an Avere FXT 4200 cluster with 16TB of Flash/SSD and 22GB/s throughput, auto-tiering to/from the NetApp FAS 3270.)
3. Upstream Workflow – Technology Needs

Five workflow stages, with the technology needs listed under each:

Seismic Acquisition
– Cost-effective tape replacement
– Dense multi-PB file system, large flat files
– Data protection, replication

Seismic Processing
– High IOPS, multi-threaded IO
– Many data types
– Portable 100TB+ “scratch areas”
– Scalable to 10s of thousands of CPUs

Seismic Interpretation
– High B/W, single-thread IO
– Multi-threaded IO
– Workstation interactivity

Reservoir Simulation
– Read, write, metadata performance
– Small files & relational DBs
– Multi-TB “scratch areas”
– Small CPU clusters

Reservoir Engineering
– General file system workload
– Flat files & relational DBs
– De-dupe & compression
4. How We Got Here
• Seismic processing challenges
– Provide 10s of GB/sec of throughput
– Cost-effectively store 100s of TBs or even PBs of data
– Specialty, proprietary storage silos complicate the workflow

(Diagram: the tension between “Need High Performance” and “Need Cheap Capacity.”)
5. What We Are Doing About It
• Scale performance
– Tiering places active data on fast media (e.g. RAM, Flash)
– Linearly scale performance through clustering
• Reduce cost
– Support existing NAS environments
– Store everything on near-line disks (7.2k SATA/SAS)
• Simplify workflow
– Seamless transitions between upstream workflow stages
– Avoid storage silos and inefficient data copy steps
6. How We Do It – Scale Performance
• Auto-tiering active data to RAM and Flash
• Automatic replication and striping of hot blocks on multiple FXT nodes
• Deliver 2+ GB/sec per FXT node
• Scale to 50 FXT nodes → 100+ GB/sec

(Diagram: Compute Farm backed by an Avere FXT 4200 cluster with 16TB of Flash/SSD and 22GB/s throughput, auto-tiering to/from a NetApp FAS 3270.)
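The auto-tiering idea in the bullets above can be illustrated with a minimal sketch. This is not Avere's actual algorithm (the slides do not describe it); it is a generic least-recently-used promotion/demotion cache, with hypothetical names, showing how a small fast tier in front of a large slow tier absorbs most reads when the workload has a hot set:

```python
from collections import OrderedDict

class TierCache:
    """Illustrative sketch only (not Avere's algorithm): serve hot
    blocks from a small fast tier (RAM/Flash), fall back to a large
    slow tier (near-line disk), and evict least-recently-used blocks
    from the fast tier when it fills."""

    def __init__(self, fast_capacity, slow_tier):
        self.fast_capacity = fast_capacity
        self.fast = OrderedDict()   # block_id -> data, LRU order
        self.slow = slow_tier       # block_id -> data (core filer)
        self.hits = self.misses = 0

    def read(self, block_id):
        if block_id in self.fast:            # hot: serve from fast tier
            self.fast.move_to_end(block_id)
            self.hits += 1
            return self.fast[block_id]
        self.misses += 1                     # cold: promote from slow tier
        data = self.slow[block_id]
        self.fast[block_id] = data
        if len(self.fast) > self.fast_capacity:
            evicted, value = self.fast.popitem(last=False)
            self.slow[evicted] = value       # demote coldest block
        return data

# Skewed workload: most reads hit a small hot set, as in a seismic run.
core = {b: f"block-{b}" for b in range(1000)}
cache = TierCache(fast_capacity=100, slow_tier=core)
for i in range(10_000):
    cache.read(i % 100 if i % 10 else i % 1000)
print(cache.hits, cache.misses)
```

With a hot set that fits in the fast tier, hits far outnumber misses, which is the effect the clustering bullets scale up: more FXT nodes mean more fast-tier capacity and more aggregate bandwidth.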
7. How We Do It – Save Cost
• Support existing NAS environments
– Avoid costly upgrade
– Support heterogeneous NAS vendors
• Store primary data on near-line disks (e.g. 7.2k SATA)
– CAPEX savings (avoid using lots of 15k disks)
– OPEX savings (due to reduced space, power, and cooling)
• Reduce cost by 50% or more
– Proven in customer environments and benchmark testing
– See next slide for example…
8. Comparing 1,000,000 IOPS Solutions*

EMC Isilon: $10.7 / IOPS
NetApp: $5.1 / IOPS
Avere: $2.3 / IOPS

| System | Throughput (IOPS) | Latency/ORT (ms) | List Price | $/IOPS | Disk Quantity | Rack Units | Cabinets | Product Config |
|---|---|---|---|---|---|---|---|---|
| Avere FXT 3800 | 1,592,334 | 1.24 | $3,637,500 | $2.3 | 549 | 76 | 1.8 | 32-node cluster, cloud storage config |
| NetApp FAS 6240 | 1,512,784 | 1.53 | $7,666,000 | $5.1 | 1728 | 436 | 12 | 24-node cluster |
| EMC Isilon S200 | 1,112,705 | 2.54 | $11,903,540 | $10.7 | 3360 | 288 | 7 | 140-node cluster |

*Comparing the top SPEC SFS results for a single NFS file system/namespace (as of 02 Apr 2013). See www.spec.org/sfs2008 for more information.
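The $/IOPS column in the comparison above follows directly from the list price and throughput figures on the slide, as this small check shows:

```python
# Recompute the $/IOPS column from the slide's list price and
# SPEC SFS throughput figures: price divided by ops/sec.
systems = {
    "Avere FXT 3800":   (3_637_500,  1_592_334),
    "NetApp FAS 6240":  (7_666_000,  1_512_784),
    "EMC Isilon S200":  (11_903_540, 1_112_705),
}
for name, (list_price, iops) in systems.items():
    print(f"{name}: ${list_price / iops:.1f} / IOPS")
# Avere FXT 3800: $2.3 / IOPS
# NetApp FAS 6240: $5.1 / IOPS
# EMC Isilon S200: $10.7 / IOPS
```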
9. Upstream Workflow – Challenges

Challenges across the workflow stages (Seismic Acquisition, Seismic Processing, Seismic Interpretation, Reservoir Simulation, Reservoir Engineering):
– Expense and risk due to multiple infrastructure silos
– Expensive, specialty storage required for high-IO steps
– Management complexity and data downtime due to copying data
– Longer time to final results

(Diagram: data is copied at each stage transition, back and forth between NetApp or EMC Isilon silos and Panasas or Lustre/DDN silos.)
10. Avere Optimizes Upstream Workflow

Avere Benefits across the workflow stages (Seismic Acquisition, Seismic Processing, Seismic Interpretation, Reservoir Simulation, Reservoir Engineering):
– Integrated and unified workflow
– Faster time and lower risk to final results
– Improved application performance/spend
– Better enable remote access & WAN efficiency

(Diagram: a single Avere Edge Filer cluster serves all stages, backed by a 3rd-Party Core Filer.)
14. Thank You!
AVERE SYSTEMS, INC
5000 McKnight Road, Suite 404
Pittsburgh, PA 15237
(412) 635-7170
averesystems.com
Editor's notes
Our initial success within upstream came in seismic processing environments. In this example, the customer had chosen Panasas for their HPC needs but had NetApp for interpretation. They were buying duplicate capacity and had to manage both arrays independently. What we found here was that the way Avere's Edge filer tiers data lent itself very well to parallel IO requests, so much so that we were able to help this customer realize ~50% more throughput, up from 15 GB/sec to ~22 GB/sec, in half the footprint, using only 20U of rack space as opposed to 40U with the Panasas.

The 50% more throughput is based on getting 22 GB/s from Avere vs. 15 GB/s from Panasas. For CAPEX I used a $/op comparison, assuming the cost of 10 nodes of Avere to be roughly equal to 10 nodes of Panasas (~$1M for both at list). Avere is $1M / 22 GB/s = $45/MBps; Panasas is $1M / 15 GB/s = $67/MBps, so we are 33% less. We are 20U total and they are 40U total, which is where the 50% lower OPEX (space, power, cooling) comes in.
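The CAPEX arithmetic in the editor's note above can be reproduced directly (the ~$1M list price for both 10-node systems is the note's own assumption, not a published figure):

```python
# Cost per MB/s of throughput, using the note's assumed ~$1M list
# price for both systems and the measured throughput numbers.
avere_cost_per_mbps   = 1_000_000 / (22 * 1000)   # 22 GB/s -> 22,000 MB/s
panasas_cost_per_mbps = 1_000_000 / (15 * 1000)   # 15 GB/s -> 15,000 MB/s
savings = 1 - avere_cost_per_mbps / panasas_cost_per_mbps

print(f"Avere:   ${avere_cost_per_mbps:.0f}/MBps")    # ~$45/MBps
print(f"Panasas: ${panasas_cost_per_mbps:.0f}/MBps")  # ~$67/MBps
print(f"Savings: {savings:.0%}")                      # ~32%, which the note rounds to 33%
```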
Any issue with 3 challenges and 2 hard things? Could dangle anvil over his head.
Avere has delivered performance acceleration and scaling to many different customers in many different industries and applications, including VMware, Oracle, rendering, transcoding, software build, chip design and verification, seismic processing, financial simulations, genomic sequencing, and more. This is a place in the presentation where you may want to insert a customer case study from our library that is relevant to your audience. In the standard presentation we used the SPEC SFS benchmark since it is the most relevant workload across the broad range of customers we sell to.

In the world of file systems there is a well-known benchmark called SPEC SFS that is used to compare the performance of NAS systems. All the NAS vendors use the benchmark and post their results on the website shown at the bottom of this slide. SPEC does a great job of providing a detailed, apples-to-apples comparison of NAS products running in a “typical” enterprise-class NAS environment.

This slide compares the three top performance results on the SPEC site. Avere is the current record holder with almost 1.6 million ops/sec achieved on a 44-node FXT cluster. Note that this is not a max cluster from Avere; we used just enough nodes to achieve the top spot. Today we can go to 50 nodes per cluster and will go beyond this in the future. In second place is NetApp with a max 24-node cluster-mode system. In third place is EMC/Isilon with a max 140-node S-Series cluster.

While achieving the highest performance was an important point for Avere, our primary point was the efficiency of our solution. Just look at the sizes of the systems. We are faster than NetApp and Isilon in just a fraction of the space: 2.5 racks and 6 feet wide for us, 14 feet wide for Isilon, 24 feet wide for NetApp. If you scan across the orange row you can see the details of our performance advantage and our higher efficiency.
Avere has the highest performance, the lowest latency, the lowest cost, and uses the least space and power.
This slide shows four scenarios where Avere can be used to implement a cloud infrastructure. Some enterprises may use just one scenario while others may use multiple.

The slide starts with the scenario where Avere is used to accelerate data access at remote offices. Avere has many customers doing this type of thing. One example is a company that is headquartered in San Jose but has software engineers in offices all over the world. In their Boulder office they replaced their NetApp storage with an Avere cluster. The software engineers use the Avere cluster to access their homedirs, run their software builds, etc. Meanwhile, their data is actually stored, managed, and protected back in the San Jose data center. The Avere cluster automatically caches the active data at the Boulder site and guarantees the data is pushed back to the San Jose site for long-term data retention. While this “data center to remote office” may be the most common cloud scenario today, there are other needs across the enterprise.

(click) A second example is when there are two data centers that the enterprise wants to run as a single “mega” data center. These exist for disaster recovery reasons, due to acquisitions, due to partnerships, or due to growing out of space in the primary data center, which requires leasing space at a co-lo facility. In these cases, the goal is to run the two data centers as one large data center where each individual data center can borrow resources from the other. Avere can help make this happen. An example of this is Pixar and Disney. Pixar is in Northern California and Disney is in Southern California. Both are Avere customers, and we allow them to run these two data centers as a single “mega” data center. For example, if Pixar is working on a movie with a near-term release date and they need to crank up the volume on the renders, rather than going out and buying new render nodes, they can borrow them from Disney.
They can do this because they have placed Avere nodes by Disney’s render farm. When Pixar fires up some renders on the Disney farm, the Avere nodes are automatically populated with the data for the renders. Hence, the renders see the WAN latency once on the first read, but subsequent reads are very low latency. Until recently, when studios like this would borrow render nodes they would literally pull them out of the rack, put them in a truck, and drive them to the other facility, because the render nodes need to be near the data. With Avere, these studios can quickly and easily move the data so it is near the render farm.

(click) The next example is cloud computing. This comes in two forms, public and private. Most of the demand we have seen to date is for private compute cloud, but technically both are similar. In the private cloud case, a customer moves their compute infrastructure somewhere else because it’s cheaper or they are out of space in their data center. Avere makes this possible since an Avere cluster can be co-located with the compute gear to hold the data that is actively being processed. We have a number of customers doing this today. In Las Vegas, there is a co-lo facility called the SuperNAP. This facility was originally built by ENRON, the failed energy company. You may recall that ENRON was getting into all sorts of strange businesses, and one of them was “bandwidth trading,” so they built a giant co-lo facility in Las Vegas with tons of bandwidth coming in and out. After ENRON failed, a company called Switch Communications bought the facility and turned it into an outsource co-lo facility. We have a number of customers who are using the SuperNAP. Digital Domain is one of these customers, using the SuperNAP because: 1) it’s cheaper than using their own California-based data centers, and 2) it’s centrally located between the three sites (LA, SF, Vancouver) that use the compute resources at the SuperNAP.
Today, when D2 fires up their renders (whether from LA, SF, or Vancouver), they fire them up on the Las Vegas farm and the Avere nodes there are automatically populated with the data needed for the renders. Another nice element of this architecture is security: D2 likes that they store their data in their own data centers.

(click) The fourth example is cloud storage. Today, cloud storage is used mainly for backup and not primary data due to the latency of the WAN. However, Avere makes it possible to use cloud storage for primary apps. Whether the users of the cloud storage are in the primary data center, a remote site, or a cloud computing facility, an Avere cluster can hold the active data locally and hide the latency of the cloud.

(click) Over time, much of the storage in data centers today will move to the cloud. At that point a solution like Avere is critical to avoid the latency of the WAN/public cloud.