SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
SUPERCOMPUTING 2013 PRESS DECK

Sumit Gupta | General Manager, Tesla Accelerated Computing
SC13
News

1

IBM Taps GPU Accelerators

2

New Product Announcements

3

New Supercomputer Announcements
Accelerated Computing Growing Fast
2x Growth in One Year
50%

Percent of HPC Systems
With Accelerators

44%

Hundreds of GPU
Accelerated Apps
300

242

250

40%

200

30%
22%

24%

150

20%

NVIDIA GPU is
Accelerator of Choice
INTEL PHI

4%

OTHERS

11%

182
113

100

10%

50

0%

0
2010

2011

2012

Intersect360 Research
HPC User Site Census: Systems, July 2013

NVIDIA GPUs

85%

2011

2012

2013
Intersect360 Research
HPC User Site Census: Systems, July 2013
IBM Using GPUs to Accelerate
Enterprise & Data Analytics Applications
Application
Infrastructure

Business Intelligence

Predictive Analytics
Risk Analytics
IBM Partners with NVIDIA to Build NextGeneration Supercomputers

+
Tesla

GPU

POWER8

CPU

GPU-Accelerated POWER-Based Systems Available in 2014
GPU Computing in Data Centers

Power
ARM64
x86

x86

2007

2008

2009

2010

2011

2012

2013

2014
Linux GCC Compiler to Support GPU Accelerators
Open Source
OpenACC in GCC by Mentor Graphics & Samsung

Pervasive Impact
Free to all Linux users

Mainstream
Most Widely Used HPC Compiler

“ Incorporating OpenACC into GCC is an excellent example of open source and
open standards working together to make accelerated computing broadly
accessible to all Linux developers.

”

7

OpenACC-standard.org confidential

Oscar Hernandez
Oak Ridge National Laboratory
SC13
News

1

IBM Taps GPU Accelerators

2

New Product Announcements

3

New Supercomputer Announcements
Tesla K40

World’s Fastest Accelerator
for Supercomputing and
Big Data Analytics

CUDA 6

Dramatically Simplifies
Parallel Programming with
Unified Memory
Tesla K40

World’s Fastest Accelerator
FASTER

1.4 TF| 2880 Cores | 288 GB/s
ns/day

5

LARGER

2x Memory Enables More Apps

AMBER Benchmark

4

SMARTER

Unlock Extra Performance
Using Power Headroom

6GB

3
2

Fluid
Rendering
Dynamics
Seismic
Analysis

1
0
CPU

K20X

K40

GPU Boost

12GB
AMBER Benchmark: SPFP-Nucleosome
CPU: Dual E5-2687W @ 3.10GHz, 64GB System Memory, CentOS 6.2, GPU systems: Single Tesla K20X or Single Tesla K40
GPU Boost

Up to 25% Extra Performance on Applications
Use Power Headroom to Run at Higher Clocks
1.40

25%

Faster
1.20

20%

Faster

14%

Faster

17%

Faster

1.00
0.80

13%

Faster

0.60
0.40
0.20

11%

Faster

0.00
AMBER SPFP-TRPCage

Tesla K40 (base)

LAMMPS-EAM

NAMD 2.9-APOA1

Tesla K40 with GPU Boost
ANNOUNCING

Unified Memory

CUDA 6
Unified Memory

Dramatically Lower Developer Effort
Developer View Today

System
Memory

GPU Memory

Developer View With
Unified Memory

Unified Memory
Super Simplified Memory Management Code
CPU Code
void sortfile(FILE *fp, int N) {
char *data;
data = (char *)malloc(N);

CUDA 6 Code with Unified Memory
void sortfile(FILE *fp, int N) {
char *data;
cudaMallocManaged(&data, N);

fread(data, 1, N, fp);
qsort(data, N, 1, compare);

qsort<<<...>>>(data,N,1,compare);
cudaDeviceSynchronize();

use_data(data);

use_data(data);

free(data);

}

fread(data, 1, N, fp);

cudaFree(data);

}
SC13
News

1

IBM Taps GPU Accelerators

2

New Product Announcements

3

New Supercomputer Announcements
Fastest Supercomputer In Europe
6.27 PetaFLOPS (80% Linpack Efficiency)
Piz Daint

Greenest Petascale System
3110 MFLOPS/W
#2: JUQUEEN: 2176 MFLOPS/W

Production-Grade
Weather Forecasts: COSMO
7 National Weather Agencies
Germany | Greece | Italy | Poland | Russia |
Romania | Switzerland
Greenest Supercomputer in the World
Tokyo Tech KFC System

4000+ MFLOPS per Watt
25% Higher than #1 Green500 System
160 Tesla K20X GPUs

Oil Immersion Technology
Current Green500 #1: CINECA Eurora System, Italy, 3208 MF/W
ANSYS Fluent Doubles Performance with GPUs
Automobile Drag Simulation Throughput
30

Number of Jobs per Day

25

90%
Faster

20
15

2x

10

Better Insight for Low Drag Design

5

2%

0
CPU

K40

2 x E5-2680 CPUs 8 cores used; 2 Tesla K40s
Sedan Geometry, 3.6M mixed cells
Steady, turbulent, external aerodynamics- Coupled PBNS, DP Solver

1.5B

Less Drag

Gal. of Fuel Saved/Year
SUPERCOMPUTING 2013 PRESS DECK

Sumit Gupta | General Manager, Tesla Accelerated Computing
Additional Information
Tesla K40

20-40% Faster than K20X on Applications
1.5

1.4x

K20X

1.3x

1.2x

1.3x

K40 @ base

1.3x

1.3x

K40 @ boost

1.3x

1.0

0.5

0.0
ANSYS 14

LAMMPS

NAMD 2.9

AMBER

LSMS

QMCPACK

SMP-V14sp-4

EAM

APOA1

SPFP-Nucleosome

Fe32

3x3x1

CUBLAS
First Tesla K40 Customers

CSC Finland

Texas Advanced
Computing Center

CEA France

Swinburne
Australia
Tesla K40 OEM Partners
K20X

K40

Peak Single Precision
Peak SGEMM

3.93 TF
2.95 TF

4.29 TF
3.22 TF

Peak Double Precision
Peak DGEMM

1.31 TF
1.22 TF

1.43 TF
1.33 TF

Memory size

6 GB

12 GB

Memory BW (ECC off)

250 GB/s

288 GB/s

Memory Clock

2.6 GHz

3.0 GHz

PCIe Gen

Gen 2

Gen 3

# of Cores

2688

2880

Core Clock

732 MHz

Base: 745 MHz
Boost Clocks: 810 & 875 Mhz

Total Board Power

235W

235W

Form Factor

PCIe Passive

PCIe Passive, Active
9

Contenu connexe

En vedette

Robotics by sai m ani krishna
Robotics  by sai m ani krishnaRobotics  by sai m ani krishna
Robotics by sai m ani krishnadrmahendra2002
 
load balancing ant algo in MANET by navish jindal
load balancing ant algo in MANET by navish jindalload balancing ant algo in MANET by navish jindal
load balancing ant algo in MANET by navish jindalNavish Jindal
 
Introduction to heterogeneous_computing_for_hpc
Introduction to heterogeneous_computing_for_hpcIntroduction to heterogeneous_computing_for_hpc
Introduction to heterogeneous_computing_for_hpcSupasit Kajkamhaeng
 
Ictericia neonatal actualización NICE 2012
Ictericia neonatal actualización NICE 2012Ictericia neonatal actualización NICE 2012
Ictericia neonatal actualización NICE 2012Cristobal Buñuel
 
A NonInvasive Tool for the Screening of Neonatal Jaundice
A NonInvasive Tool for the Screening of Neonatal JaundiceA NonInvasive Tool for the Screening of Neonatal Jaundice
A NonInvasive Tool for the Screening of Neonatal Jaundiceavantika
 
IOT based Intelligence for Fire Emergency Response
IOT based Intelligence for Fire Emergency ResponseIOT based Intelligence for Fire Emergency Response
IOT based Intelligence for Fire Emergency Responseiramvaseem
 
An efficient approach for load balancing using dynamic ab algorithm in cloud ...
An efficient approach for load balancing using dynamic ab algorithm in cloud ...An efficient approach for load balancing using dynamic ab algorithm in cloud ...
An efficient approach for load balancing using dynamic ab algorithm in cloud ...bhavikpooja
 
Tesla personal super computer
Tesla personal super computerTesla personal super computer
Tesla personal super computerPriya Manik
 
approach to the diagnosis of Neonatal jaundice
approach to the diagnosis of Neonatal jaundiceapproach to the diagnosis of Neonatal jaundice
approach to the diagnosis of Neonatal jaundicegelaye mandefro
 
Neonatal jaundice
Neonatal jaundiceNeonatal jaundice
Neonatal jaundiceAlya Imad
 
LOAD BALANCING ALGORITHMS
LOAD BALANCING ALGORITHMSLOAD BALANCING ALGORITHMS
LOAD BALANCING ALGORITHMStanmayshah95
 
Physiological Neonatal Jaundice
Physiological Neonatal JaundicePhysiological Neonatal Jaundice
Physiological Neonatal JaundiceGyaltsen Gurung
 
Accelerated Computing: The Path Forward
Accelerated Computing: The Path ForwardAccelerated Computing: The Path Forward
Accelerated Computing: The Path ForwardNVIDIA
 
Building New Realities in AEC with NVIDIA Quadro VR Webinar
Building New Realities in AEC with NVIDIA Quadro VR WebinarBuilding New Realities in AEC with NVIDIA Quadro VR Webinar
Building New Realities in AEC with NVIDIA Quadro VR WebinarNVIDIA
 
GRAPHICS PROCESSING UNIT (GPU)
GRAPHICS PROCESSING UNIT (GPU)GRAPHICS PROCESSING UNIT (GPU)
GRAPHICS PROCESSING UNIT (GPU)self employed
 
Load Balancing
Load BalancingLoad Balancing
Load Balancingnashniv
 
Graphic Processing Unit (GPU)
Graphic Processing Unit (GPU)Graphic Processing Unit (GPU)
Graphic Processing Unit (GPU)Jafar Khan
 
Neonatal Jaundice 1
Neonatal Jaundice 1Neonatal Jaundice 1
Neonatal Jaundice 1DRALFAQAWI
 

En vedette (20)

Robotics by sai m ani krishna
Robotics  by sai m ani krishnaRobotics  by sai m ani krishna
Robotics by sai m ani krishna
 
load balancing ant algo in MANET by navish jindal
load balancing ant algo in MANET by navish jindalload balancing ant algo in MANET by navish jindal
load balancing ant algo in MANET by navish jindal
 
Introduction to heterogeneous_computing_for_hpc
Introduction to heterogeneous_computing_for_hpcIntroduction to heterogeneous_computing_for_hpc
Introduction to heterogeneous_computing_for_hpc
 
Supercomputer final
Supercomputer finalSupercomputer final
Supercomputer final
 
Ictericia neonatal actualización NICE 2012
Ictericia neonatal actualización NICE 2012Ictericia neonatal actualización NICE 2012
Ictericia neonatal actualización NICE 2012
 
A NonInvasive Tool for the Screening of Neonatal Jaundice
A NonInvasive Tool for the Screening of Neonatal JaundiceA NonInvasive Tool for the Screening of Neonatal Jaundice
A NonInvasive Tool for the Screening of Neonatal Jaundice
 
IOT based Intelligence for Fire Emergency Response
IOT based Intelligence for Fire Emergency ResponseIOT based Intelligence for Fire Emergency Response
IOT based Intelligence for Fire Emergency Response
 
An efficient approach for load balancing using dynamic ab algorithm in cloud ...
An efficient approach for load balancing using dynamic ab algorithm in cloud ...An efficient approach for load balancing using dynamic ab algorithm in cloud ...
An efficient approach for load balancing using dynamic ab algorithm in cloud ...
 
Tesla personal super computer
Tesla personal super computerTesla personal super computer
Tesla personal super computer
 
approach to the diagnosis of Neonatal jaundice
approach to the diagnosis of Neonatal jaundiceapproach to the diagnosis of Neonatal jaundice
approach to the diagnosis of Neonatal jaundice
 
Neonatal jaundice
Neonatal jaundiceNeonatal jaundice
Neonatal jaundice
 
LOAD BALANCING ALGORITHMS
LOAD BALANCING ALGORITHMSLOAD BALANCING ALGORITHMS
LOAD BALANCING ALGORITHMS
 
Physiological Neonatal Jaundice
Physiological Neonatal JaundicePhysiological Neonatal Jaundice
Physiological Neonatal Jaundice
 
Accelerated Computing: The Path Forward
Accelerated Computing: The Path ForwardAccelerated Computing: The Path Forward
Accelerated Computing: The Path Forward
 
Building New Realities in AEC with NVIDIA Quadro VR Webinar
Building New Realities in AEC with NVIDIA Quadro VR WebinarBuilding New Realities in AEC with NVIDIA Quadro VR Webinar
Building New Realities in AEC with NVIDIA Quadro VR Webinar
 
GRAPHICS PROCESSING UNIT (GPU)
GRAPHICS PROCESSING UNIT (GPU)GRAPHICS PROCESSING UNIT (GPU)
GRAPHICS PROCESSING UNIT (GPU)
 
Load Balancing
Load BalancingLoad Balancing
Load Balancing
 
Graphic Processing Unit (GPU)
Graphic Processing Unit (GPU)Graphic Processing Unit (GPU)
Graphic Processing Unit (GPU)
 
HPC Computing Trends
HPC Computing TrendsHPC Computing Trends
HPC Computing Trends
 
Neonatal Jaundice 1
Neonatal Jaundice 1Neonatal Jaundice 1
Neonatal Jaundice 1
 

Similaire à Nvidia SC13 Podcast

組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステムShinnosuke Furuya
 
Introduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIntroduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIndrajit Poddar
 
Accelerating Data Science With GPUs
Accelerating Data Science With GPUsAccelerating Data Science With GPUs
Accelerating Data Science With GPUsiguazio
 
Application Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systemsApplication Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systemsGanesan Narayanasamy
 
20201006_PGconf_Online_Large_Data_Processing
20201006_PGconf_Online_Large_Data_Processing20201006_PGconf_Online_Large_Data_Processing
20201006_PGconf_Online_Large_Data_ProcessingKohei KaiGai
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsIgor José F. Freitas
 
20170602_OSSummit_an_intelligent_storage
20170602_OSSummit_an_intelligent_storage20170602_OSSummit_an_intelligent_storage
20170602_OSSummit_an_intelligent_storageKohei KaiGai
 
TiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDA
TiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDATiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDA
TiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDAShanker Trivedi
 
Kindratenko hpc day 2011 Kiev
Kindratenko hpc day 2011 KievKindratenko hpc day 2011 Kiev
Kindratenko hpc day 2011 KievVolodymyr Saviak
 
Hybrid CPU GPU MATLAB Image Processing Benchmarking
Hybrid CPU GPU MATLAB Image Processing BenchmarkingHybrid CPU GPU MATLAB Image Processing Benchmarking
Hybrid CPU GPU MATLAB Image Processing BenchmarkingDimitris Vayenas
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Intel® Software
 
PGI Compilers & Tools Update- March 2018
PGI Compilers & Tools Update- March 2018PGI Compilers & Tools Update- March 2018
PGI Compilers & Tools Update- March 2018NVIDIA
 
GPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersGPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersRommel Garcia
 
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mãoWebinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mãoEmbarcados
 
BladeCenter GPU Expansion Blade (BGE) - Client Presentation
BladeCenter GPU Expansion Blade (BGE) - Client PresentationBladeCenter GPU Expansion Blade (BGE) - Client Presentation
BladeCenter GPU Expansion Blade (BGE) - Client PresentationCliff Kinard
 
GTC 2018 で発表された自動運転最新情報
GTC 2018 で発表された自動運転最新情報GTC 2018 で発表された自動運転最新情報
GTC 2018 で発表された自動運転最新情報NVIDIA Japan
 
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataBig Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataMatt Stubbs
 
HPC DAY 2017 | FlyElephant Solutions for Data Science and HPC
HPC DAY 2017 | FlyElephant Solutions for Data Science and HPCHPC DAY 2017 | FlyElephant Solutions for Data Science and HPC
HPC DAY 2017 | FlyElephant Solutions for Data Science and HPCHPC DAY
 

Similaire à Nvidia SC13 Podcast (20)

組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム
 
Introduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIntroduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI Platform
 
Accelerating Data Science With GPUs
Accelerating Data Science With GPUsAccelerating Data Science With GPUs
Accelerating Data Science With GPUs
 
Application Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systemsApplication Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systems
 
RAPIDS Overview
RAPIDS OverviewRAPIDS Overview
RAPIDS Overview
 
20201006_PGconf_Online_Large_Data_Processing
20201006_PGconf_Online_Large_Data_Processing20201006_PGconf_Online_Large_Data_Processing
20201006_PGconf_Online_Large_Data_Processing
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systems
 
20170602_OSSummit_an_intelligent_storage
20170602_OSSummit_an_intelligent_storage20170602_OSSummit_an_intelligent_storage
20170602_OSSummit_an_intelligent_storage
 
TiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDA
TiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDATiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDA
TiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDA
 
Kindratenko hpc day 2011 Kiev
Kindratenko hpc day 2011 KievKindratenko hpc day 2011 Kiev
Kindratenko hpc day 2011 Kiev
 
Hybrid CPU GPU MATLAB Image Processing Benchmarking
Hybrid CPU GPU MATLAB Image Processing BenchmarkingHybrid CPU GPU MATLAB Image Processing Benchmarking
Hybrid CPU GPU MATLAB Image Processing Benchmarking
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture
 
PGI Compilers & Tools Update- March 2018
PGI Compilers & Tools Update- March 2018PGI Compilers & Tools Update- March 2018
PGI Compilers & Tools Update- March 2018
 
GPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersGPU 101: The Beast In Data Centers
GPU 101: The Beast In Data Centers
 
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mãoWebinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
 
BladeCenter GPU Expansion Blade (BGE) - Client Presentation
BladeCenter GPU Expansion Blade (BGE) - Client PresentationBladeCenter GPU Expansion Blade (BGE) - Client Presentation
BladeCenter GPU Expansion Blade (BGE) - Client Presentation
 
GTC 2018 で発表された自動運転最新情報
GTC 2018 で発表された自動運転最新情報GTC 2018 で発表された自動運転最新情報
GTC 2018 で発表された自動運転最新情報
 
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataBig Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
 
GTC 2022 Keynote
GTC 2022 KeynoteGTC 2022 Keynote
GTC 2022 Keynote
 
HPC DAY 2017 | FlyElephant Solutions for Data Science and HPC
HPC DAY 2017 | FlyElephant Solutions for Data Science and HPCHPC DAY 2017 | FlyElephant Solutions for Data Science and HPC
HPC DAY 2017 | FlyElephant Solutions for Data Science and HPC
 

Plus de inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networksinside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networksinside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoringinside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Updateinside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODinside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Accelerationinside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficientlyinside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Erainside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computinginside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 

Plus de inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Dernier

TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 

Dernier (20)

TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 

Nvidia SC13 Podcast

  • 1. SUPERCOMPUTING 2013 PRESS DECK Sumit Gupta | General Manager, Tesla Accelerated Computing
  • 2. SC13 News 1 IBM Taps GPU Accelerators 2 New Product Announcements 3 New Supercomputer Announcements
  • 3. Accelerated Computing Growing Fast 2x Growth in One Year 50% Percent of HPC Systems With Accelerators 44% Hundreds of GPU Accelerated Apps 300 242 250 40% 200 30% 22% 24% 150 20% NVIDIA GPU is Accelerator of Choice INTEL PHI 4% OTHERS 11% 182 113 100 10% 50 0% 0 2010 2011 2012 Intersect360 Research HPC User Site Census: Systems, July 2013 NVIDIA GPUs 85% 2011 2012 2013 Intersect360 Research HPC User Site Census: Systems, July 2013
  • 4. IBM Using GPUs to Accelerate Enterprise & Data Analytics Applications Application Infrastructure Business Intelligence Predictive Analytics Risk Analytics
  • 5. IBM Partners with NVIDIA to Build NextGeneration Supercomputers + Tesla GPU POWER8 CPU GPU-Accelerated POWER-Based Systems Available in 2014
  • 6. GPU Computing in Data Centers Power ARM64 x86 x86 2007 2008 2009 2010 2011 2012 2013 2014
  • 7. Linux GCC Compiler to Support GPU Accelerators Open Source OpenACC in GCC by Mentor Graphics & Samsung Pervasive Impact Free to all Linux users Mainstream Most Widely Used HPC Compiler “ Incorporating OpenACC into GCC is an excellent example of open source and open standards working together to make accelerated computing broadly accessible to all Linux developers. ” 7 OpenACC-standard.org confidential Oscar Hernandez Oak Ridge National Laboratory
  • 8. SC13 News 1 IBM Taps GPU Accelerators 2 New Product Announcements 3 New Supercomputer Announcements
  • 9. Tesla K40 World’s Fastest Accelerator for Supercomputing and Big Data Analytics CUDA 6 Dramatically Simplifies Parallel Programming with Unified Memory
  • 10. Tesla K40 World’s Fastest Accelerator FASTER 1.4 TF| 2880 Cores | 288 GB/s ns/day 5 LARGER 2x Memory Enables More Apps AMBER Benchmark 4 SMARTER Unlock Extra Performance Using Power Headroom 6GB 3 2 Fluid Rendering Dynamics Seismic Analysis 1 0 CPU K20X K40 GPU Boost 12GB AMBER Benchmark: SPFP-Nucleosome CPU: Dual E5-2687W @ 3.10GHz, 64GB System Memory, CentOS 6.2, GPU systems: Single Tesla K20X or Single Tesla K40
  • 11. GPU Boost Up to 25% Extra Performance on Applications Use Power Headroom to Run at Higher Clocks 1.40 25% Faster 1.20 20% Faster 14% Faster 17% Faster 1.00 0.80 13% Faster 0.60 0.40 0.20 11% Faster 0.00 AMBER SPFP-TRPCage Tesla K40 (base) LAMMPS-EAM NAMD 2.9-APOA1 Tesla K40 with GPU Boost
  • 13. Unified Memory Dramatically Lower Developer Effort Developer View Today System Memory GPU Memory Developer View With Unified Memory Unified Memory
  • 14. Super Simplified Memory Management Code CPU Code void sortfile(FILE *fp, int N) { char *data; data = (char *)malloc(N); CUDA 6 Code with Unified Memory void sortfile(FILE *fp, int N) { char *data; cudaMallocManaged(&data, N); fread(data, 1, N, fp); qsort(data, N, 1, compare); qsort<<<...>>>(data,N,1,compare); cudaDeviceSynchronize(); use_data(data); use_data(data); free(data); } fread(data, 1, N, fp); cudaFree(data); }
  • 15. SC13 News 1 IBM Taps GPU Accelerators 2 New Product Announcements 3 New Supercomputer Announcements
  • 16. Fastest Supercomputer In Europe 6.27 PetaFLOPS (80% Linpack Efficiency) Piz Daint Greenest Petascale System 3110 MFLOPS/W #2: JUQUEEN: 2176 MFLOPS/W Production-Grade Weather Forecasts: COSMO 7 National Weather Agencies Germany | Greece | Italy | Poland | Russia | Romania | Switzerland
  • 17. Greenest Supercomputer in the World Tokyo Tech KFC System 4000+ MFLOPS per Watt 25% Higher than #1 Green500 System 160 Tesla K20X GPUs Oil Immersion Technology Current Green500 #1: CINECA Eurora System, Italy, 3208 MF/W
  • 18. ANSYS Fluent Doubles Performance with GPUs Automobile Drag Simulation Throughput 30 Number of Jobs per Day 25 90% Faster 20 15 2x 10 Better Insight for Low Drag Design 5 2% 0 CPU K40 2 x E5-2680 CPUs 8 cores used; 2 Tesla K40s Sedan Geometry, 3.6M mixed cells Steady, turbulent, external aerodynamics- Coupled PBNS, DP Solver 1.5B Less Drag Gal. of Fuel Saved/Year
  • 19. SUPERCOMPUTING 2013 PRESS DECK Sumit Gupta | General Manager, Tesla Accelerated Computing
  • 21. Tesla K40 20-40% Faster than K20X on Applications 1.5 1.4x K20X 1.3x 1.2x 1.3x K40 @ base 1.3x 1.3x K40 @ boost 1.3x 1.0 0.5 0.0 ANSYS 14 LAMMPS NAMD 2.9 AMBER LSMS QMCPACK SMP-V14sp-4 EAM APOA1 SPFP-Nucleosome Fe32 3x3x1 CUBLAS
  • 22. First Tesla K40 Customers CSC Finland Texas Advanced Computing Center CEA France Swinburne Australia
  • 23. Tesla K40 OEM Partners
  • 24. K20X K40 Peak Single Precision Peak SGEMM 3.93 TF 2.95 TF 4.29 TF 3.22 TF Peak Double Precision Peak DGEMM 1.31 TF 1.22 TF 1.43 TF 1.33 TF Memory size 6 GB 12 GB Memory BW (ECC off) 250 GB/s 288 GB/s Memory Clock 2.6 GHz 3.0 GHz PCIe Gen Gen 2 Gen 3 # of Cores 2688 2880 Core Clock 732 MHz Base: 745 MHz Boost Clocks: 810 & 875 Mhz Total Board Power 235W 235W Form Factor PCIe Passive PCIe Passive, Active 9