Alex Sabatier from Nvidia talks about the future of Deep Learning from a chipmaker's perspective
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
6. PERFORMANCE GAP CONTINUES TO GROW
[Chart: Peak Memory Bandwidth (GB/s), 2008 to 2016, NVIDIA GPU (M1060, M2090, K20, K80, Pascal) vs x86 CPU]
[Chart: Peak Double Precision FLOPS, 2008 to 2016, NVIDIA GPU (M1060, M2090, K20, K80, Pascal) vs x86 CPU]
7. CREDENTIALS BUILT OVER TIME
300K CUDA Developers, 4x Growth in 4 Years
Majority of HPC Applications are GPU-Accelerated, 410 and Growing
100% of Deep Learning Frameworks are Accelerated
[Chart: # of GPU-accelerated applications per year: 113 (2011), 206 (2012), 242 (2013), 287 (2014), 370 (2015), 410 (2016)]
Industries: Academia, Games, Finance, Manufacturing, Internet, Oil & Gas, National Labs, Automotive, Defense, M&E
Frameworks: TORCH, THEANO, CAFFE, MATCONVNET, PURINE, MOCHA.JL, MINERVA, MXNET*, BIG SUR, TENSORFLOW, WATSON, CNTK
9. DEEP LEARNING: A NEW COMPUTING MODEL
[Diagram: Training, where a learning algorithm consumes "millions of trillions of FLOPS"; Inference, where the trained network runs on a device, e.g. captioning an image as "little girl is eating piece of cake"]
10. IMPACT OF AI IS HUGE FOR ENTERPRISES
Google: Hundreds of millions of dollars in power savings with AI-operated data center
Netflix: $1 billion savings per year with AI-assisted recommendation engine
AdTheorent: 300% higher user engagement for mobile advertisers during shopping season
13. NVIDIA DGX-1
170 TFLOPS FP16
Accelerates Major AI Frameworks
Dual 10GbE, Quad IB 100Gb
3RU, 3200W
14. DGX-1 — A LEAGUE OF ITS OWN
[Chart: Relative training performance (0X to 16X) on ResNet, Inception v3, AlexNet, VGG, and MSR workloads for GeForce GTX TITAN X, GeForce GTX 1080, Tesla P100, DIGITS DevBox (4X GeForce GTX TITAN X), Quadro VCA (8X Quadro M6000), and DGX-1 (8X Tesla P100)]
Caffe on DeepMark. GeForce TITAN X and GTX 1080 system: Intel Core i7-5930K @ 3.5 GHz, 64 GB system memory | Tesla P100 (SXM2) system: dual-CPU server, Intel E5-2698 v4 @ 2.2 GHz, 256 GB system memory
15. DGX STACK
Complete Analytics and Deep Learning Platform
Instant productivity: plug-and-play, supports every AI framework and accelerated analytics software application
Performance optimized across the entire stack
Always up to date via the cloud
Mixed framework environments: bare-metal and containerized
Direct access to NVIDIA experts
16. NVIDIA EXPERTISE AT EVERY STEP
Solution Architects: 1:1 support, network training setup, network optimization
Deep Learning Institute: certified expert instructors, worldwide workshops, online courses
GTC Conferences: epicenter of industry leaders, onsite training, global reach
Global Network of Partners: NVIDIA Partner Network, OEMs, startups
17. NVIDIA DEEP LEARNING PARTNERS
Graph and Data Analytics | Enterprises | Data Management | DL Frameworks | Enterprise DL Services | Core Analytics Tech
18. NVIDIA DEEP LEARNING EVERYWHERE, EVERY PLATFORM
TITAN X: available via e-tail in 200+ countries
DGX-1: the AI appliance for instant productivity
TESLA: servers in every shape and size
CLOUD: everywhere
19. VISIT THE DEEP LEARNING WEBPAGE
http://www.nvidia.com/object/deep-learning.html
Presenter's notes
First, I am going to provide some insight into NVIDIA and the GPU, which is at the center of its strategy.
Then we will discover deep learning, this new field of AI that uses massive datasets to create software solving amazing problems across the whole industry.
I also want to mention how NVIDIA GPUs are making their way into the data center, not only for HPC and the nascent field of deep learning, but also for accelerated data analytics, now that the amount of available data grows exponentially.
You will discover what makes DGX-1 a unique platform, powered by the five miracles of the latest GPU generation, named Pascal. Its unique hardware configuration explains the unprecedented power of this appliance.
To make it a tool in its own right, NVIDIA takes great pride in the dedicated software stack, which lets users take advantage of the raw power without spending hours configuring and loading the various software components.
What we wanted to achieve is to give data scientists and deep learning practitioners a ready-to-use platform powerful enough that they can directly express their creativity and solve problems they were not able to address until now.
Ready? So let's start.
We could say that NVIDIA makes computers that are loved by the most demanding users in the world — gamers, designers, and scientists.
NVIDIA pioneered GPU computing, a supercharged form of computing, 10 years ago, making CUDA, the GPU programming language, accessible on all of our platforms, including the gaming ones. Yes, a gamer can take a break from his favorite game and spend some time programming the GPU he was just using for gaming.
The GPU evolved from a 3D graphics chip into a computing platform that gives humans the power to simulate virtual worlds: the fabulous and rich worlds created by game developers, of course, but also the creations of industrial designers and architects. 90% of workstations use NVIDIA graphics solutions to render and visualize those creations. The movie industry also uses our graphics solutions for CGI, special effects, and animated movies. NVIDIA works very closely with application developers and their users to make sure the whole chain benefits from the power of the GPU.
That's not all: biologists and medical imaging specialists also use GPUs to simulate a drug's effect on a specific molecule. GPUs are also used to simulate weather and make forecasts, to run financial simulations, and by oil and gas companies to explore underground resources.
That’s computing human imagination.
But GPUs are also amazingly powerful at understanding the world around them.
Computer vision and deep learning algorithms allow computers to see and understand the world around them: recognizing road lanes, traffic signs, trucks, cars, bikes, and pedestrians in a self-driving car; helping a robot identify the objects it can grab; recognizing valuable objects among garbage on a recycling line at a waste processing plant.
The impact of visual recognition is almost infinite.
But that is not all: machines can now listen, understand, and translate what they hear. Have you tried OK Google on an Android device? It works pretty well; it even works for me, with my thick French accent.
With the ability to see and hear, those computers are simulating human intelligence. GPUs trained the Google DeepMind system that beat the world champion at the game of Go.
So why do GPUs excel at those tasks?
It all comes down to an architectural difference versus traditional CPUs.
GPU computing is a type of heterogeneous computing, that is, parallel computing with multiple processor architectures involving both the CPU and the GPU.
The CPU is an older architecture known for being sequential and featuring a complex instruction set. CPUs now have a handful of cores: 2, 4, 8, up to 20 or 24 for the most powerful ones.
A GPU contains up to several thousand cores able to execute similar instructions in the same clock cycle. This is highly efficient for today's computing applications, which have data-intensive requirements and a large amount of parallelism.
GPUs are best at accelerating the parallel parts of any application by spreading the computation over those thousands of cores.
The result is a significant speedup in application execution time.
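To make that concrete, here is a minimal CUDA sketch (my illustration, not part of the original deck) of the programming model being described: the loop over array elements is replaced by one lightweight thread per element, and the GPU executes thousands of them concurrently.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each thread handles exactly one element of the arrays.
__global__ void vecAdd(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

int main() {
    const int n = 1 << 20;                        // one million elements
    const size_t bytes = n * sizeof(float);
    float *a, *b, *c;
    cudaMallocManaged(&a, bytes);                 // unified memory keeps the sketch short
    cudaMallocManaged(&b, bytes);
    cudaMallocManaged(&c, bytes);
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;   // enough blocks to cover every element
    vecAdd<<<blocks, threads>>>(a, b, c, n);          // launch ~1M threads across the GPU cores
    cudaDeviceSynchronize();

    printf("c[0] = %.1f\n", c[0]);                // expect 3.0
    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}
```

Compiled with nvcc, the same source runs unchanged on anything from a GeForce card to a Tesla P100; only the number of cores available to absorb those threads changes.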
Andrew Ng, head of Baidu's AI research, views the GPU as a high-throughput parallel engine.
The deep learning algorithm used by Andrew's team in Deep Speech 2, the name of his speech recognition system, requires a lot of parallel throughput to train the model.
It would not be possible to use a CPU for that. Thanks to the GPU, the model is so capable that it can understand Mandarin spoken by a 5-year-old.
Let me show you some numbers.
These two diagrams show the evolution of compute power and memory bandwidth for both the GPU (green curve) and the CPU (blue curve) over the last 8 years.
It is obvious that the trends are completely different. While the CPU's compute power, expressed here in floating-point operations per second, follows Moore's law, the GPU increases its compute power significantly generation after generation. The latest Pascal P100 Tesla GPU is close to 5 TFLOPS of double precision, while the latest x86 CPUs barely reach half a TFLOPS.
The performance increase comes from parallelizing the computation across an increasing number of cores. GPUs have a powerful compute capability, but that alone is not enough.
To be truly efficient, data needs to arrive quickly at the cores to be processed. The curve on the right-hand side shows the evolution of memory bandwidth in GB/s; once again, the diagram speaks for itself.
NVIDIA engineered its GPUs so that the compute power and memory bandwidth address the latest computing needs of the industry.
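As a small aside (not part of the original notes), you can read these figures for whatever GPU you are running on directly from the CUDA runtime; the bandwidth number below uses the usual rough estimate of 2 x memory clock x bus width for double-data-rate memory.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);            // properties of GPU 0

    // Rough peak bandwidth: 2 (DDR) * memory clock (Hz) * bus width (bytes), in GB/s.
    double peakGBs = 2.0 * (prop.memoryClockRate * 1e3)
                     * (prop.memoryBusWidth / 8.0) / 1e9;

    printf("%s: %d SMs, %d-bit memory bus, ~%.0f GB/s peak bandwidth\n",
           prop.name, prop.multiProcessorCount, prop.memoryBusWidth, peakGBs);
    return 0;
}
```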
Raw power and memory bandwidth are a great starting point, but it is only when a great ecosystem develops solutions on the platform that it truly serves its purpose.
NVIDIA built a platform that HPC can build on. It started 10 years ago, when CUDA became accessible to developers; accelerated computing is now the most important platform for HPC.
No other company in the world comes close.
There is a wide ecosystem of application developers, researchers, students, and users that NVIDIA feeds with the latest updates, training, and conferences, creating a strong link that allows us to understand actual usage and future needs and provide the platform of tomorrow.
The majority of HPC apps are now accelerated. Every HPC center can enjoy the benefits of GPUs.
410 apps now (9 of the top 10 – 35 of the top 50)
It is driven by the world's largest ecosystem of HPC developers: 4x growth. Why? Because CUDA and OpenACC uniquely provide a simple way to get performance on parallel architectures.
DL: All frameworks optimized for GPUs
Training neural networks to perform these tasks is a huge computational challenge. It can take weeks or even months to design and train a neural network to perform a specific task at near-human levels of accuracy. Fortunately, much of this work can be performed in parallel on modern GPUs.
We created dedicated GPU-accelerated libraries and SDKs to allow those frameworks to be GPU accelerated.
Name some of them.
Today these GPU-accelerated DL frameworks are being used by researchers and scientists worldwide to solve problems in computer vision, speech translation, and natural language understanding that were previously considered impossible to solve.
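To give a concrete feel for what "GPU accelerated" means underneath those frameworks (my illustration, not a slide from the deck), most of the work in a fully connected layer reduces to large matrix multiplies, which libraries such as cuBLAS execute on the GPU. A minimal sketch, assuming the CUDA toolkit is installed and the file is linked with -lcublas:

```cuda
#include <cstdio>
#include <cuda_runtime.h>
#include <cublas_v2.h>

int main() {
    const int n = 1024;                           // square matrices for simplicity
    const size_t bytes = (size_t)n * n * sizeof(float);
    float *A, *B, *C;
    cudaMallocManaged(&A, bytes);
    cudaMallocManaged(&B, bytes);
    cudaMallocManaged(&C, bytes);
    for (int i = 0; i < n * n; ++i) { A[i] = 1.0f; B[i] = 1.0f; C[i] = 0.0f; }

    cublasHandle_t handle;
    cublasCreate(&handle);
    const float alpha = 1.0f, beta = 0.0f;
    // C = alpha * A * B + beta * C (column-major): the core operation of a dense layer.
    cublasSgemm(handle, CUBLAS_OP_N, CUBLAS_OP_N,
                n, n, n, &alpha, A, n, B, n, &beta, C, n);
    cudaDeviceSynchronize();

    printf("C[0] = %.0f\n", C[0]);                // expect 1024
    cublasDestroy(handle);
    cudaFree(A); cudaFree(B); cudaFree(C);
    return 0;
}
```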
You may have heard of the rise of GPUs in the data center.
Microsoft Azure and Amazon AWS raced for the general availability of K80 clusters to enable GPU cloud computing services.
Google is a massive user of NVIDIA GPUs. Jeff Dean, the head of Google Brain, came to our annual GPU conference, GTC, and claimed during his keynote that deep learning would not be possible without GPUs; CPUs are too slow.
That was a massive endorsement.
IBM's cognitive computing system Watson uses NVIDIA GPUs.
The most powerful US supercomputers also rely on NVIDIA GPUs, at Oak Ridge and Livermore National Labs.
Because the massively parallel GPU was widely available, more and more people started to pick it up.
4-5 years ago, AI researchers discovered it.
The combination of deep neural networks, big data and powerful GPU platforms reignited AI.
Deep learning — a new computing model where networks are “trained” to extract features from massive amounts of data — has proven to be remarkably effective at solving some of the most complex problems in computer science.
In this example, a network is trained with millions of images; the learning phase requires trillions and trillions of operations to create a neural network.
This neural net is now able to recognize and describe an image it has never seen before.
When the neural net struggles to understand a picture, the image is analyzed by data scientists and added to the initial data set to train an updated model, which will succeed when a similar picture is presented.
It reminds me of this quote from Mandela: "I never lose; either I win or I learn something."
Deep learning allows neural networks to learn from their mistakes.
Deep learning applications are highly valuable; let me give you some examples.
Netflix uses a deep learning powered recommendation algorithm that submits personalized suggestions to the user.
As Netflix explained, strong recommendations increase the amount of time viewers watch content on Netflix, keeping subscriber churn as low as possible. According to a paper published by Netflix executives, the on-demand video streaming service claims its AI-assisted recommendation system saves the company $1 billion per year.
Google was a pioneer in deep learning, using GPUs with Search. Google CEO Sundar Pichai recently praised deep learning for new applications like the Assistant or the new messaging app Allo, which provides pre-formatted contextual answers to text messages.
Google nailed the value of deep learning; there are now 1,600 applications using deep learning.
They even use it to manage the power of their numerous data centers. The neural networks control "about 120 variables in the data centers," including "the fans and the cooling systems, the windows and other things." The AI worked out the most efficient methods of cooling by analyzing data from sensors among the server racks, including information on things like temperatures and pump speeds.
http://www.theverge.com/2016/7/21/12246258/google-deepmind-ai-data-center-cooling
"One last example that might help add context to the use of deep learning and the type of results it can deliver is AdTheorent,
This company built a DL system to deliver real-time bidding assistance to advertisers bidding on ads for mobile devices.
It helped advertisers reach an engagement level that was 200 to 300 per cent higher than the industry average for the last holiday shopping season.
This is an example of a system that needs a very fast response time and carries an incredible value.
Because we wanted the whole community of data scientists and deep learning practitioners, across all industries, to be able to create those fantastic use cases, we decided to build the ultimate appliance for it.
We know GPUs and how to accelerate the DL frameworks best, so we created DGX-1 to unleash deep learning's enormous promise.
DGX1 is an AI supercomputer-in-a-box.
DGX-1 is a plug-and-play appliance with the computing power of a 250-node HPC cluster: as I told you, a shrunk version of a data center, delivered with a built-in software stack supporting the main DL frameworks as well as GPU-accelerated database and graph applications.
A dream come true for users, who can simply focus on their task and not spend cycles preparing and maintaining their software tools.
Let’s have a look inside:
DGX-1 delivers 170 TFLOPS of FP16 performance thanks to eight P100 SXM2 GPUs, each with 16 GB of HBM2 providing 732 GB/s of memory bandwidth, for a total of 28,672 CUDA cores (8 x 3,584)!
Those eight GPUs are connected with NVLink, NVIDIA's new interconnect that lets data be shared between GPUs 5 times faster than the latest PCIe Gen3 bus.
The GPUs are connected in a hybrid cube mesh topology, in which the farthest GPU is only two hops away. This is ideal when you want a large job to be spread across several GPUs, and it is what you want for a highly scalable system.
Two 20-core Xeon CPUs with a total of 256 GB of system memory handle scheduling and feed data to the GPUs.
This 3RU box requires a 3200 W power supply and connects to other DGX-1 boxes, or to the rest of your data center, through dual 10 Gb Ethernet and a quad InfiniBand 100 Gb/s EDR interface.
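From the software side, that NVLink topology simply shows up as GPUs that can read and write each other's memory directly. A minimal sketch of how an application discovers and enables that (my illustration, using the standard CUDA peer-access API, not code from the deck):

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int ndev = 0;
    cudaGetDeviceCount(&ndev);

    // For every GPU pair, check whether direct peer-to-peer access is possible
    // (over NVLink on DGX-1, or PCIe elsewhere) and enable it so that copies
    // and kernel accesses bypass host memory.
    for (int src = 0; src < ndev; ++src) {
        cudaSetDevice(src);
        for (int dst = 0; dst < ndev; ++dst) {
            if (src == dst) continue;
            int canAccess = 0;
            cudaDeviceCanAccessPeer(&canAccess, src, dst);
            if (canAccess) {
                cudaDeviceEnablePeerAccess(dst, 0);
                printf("GPU %d -> GPU %d: peer access enabled\n", src, dst);
            }
        }
    }
    return 0;
}
```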
P100 SXM2 is a new generation of GPU with a dedicated form factor specifically designed for the data center.
With the DGX-1 platform, you now have all of the tools needed for realizing the benefits of GPU-accelerated deep learning. NVIDIA already provides a number of libraries for accelerating computational performance through GPUs. Some of these libraries, like NCCL, have actually been optimized specifically for the 8-GPU architecture on DGX-1.
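As an aside (my sketch, not NVIDIA's code), this is roughly what using NCCL looks like for the gradient all-reduce at the heart of data-parallel multi-GPU training; it assumes NCCL 2's group API and a single process driving all visible GPUs:

```cuda
#include <cstdio>
#include <cuda_runtime.h>
#include <nccl.h>

int main() {
    int ndev = 0;
    cudaGetDeviceCount(&ndev);

    // One NCCL communicator per GPU, all managed by this single process.
    int* devs = new int[ndev];
    for (int i = 0; i < ndev; ++i) devs[i] = i;
    ncclComm_t* comms = new ncclComm_t[ndev];
    ncclCommInitAll(comms, ndev, devs);

    const size_t count = 1 << 20;                 // pretend these are gradient buffers
    float** grad = new float*[ndev];
    cudaStream_t* streams = new cudaStream_t[ndev];
    for (int i = 0; i < ndev; ++i) {
        cudaSetDevice(i);
        cudaMalloc(&grad[i], count * sizeof(float));
        cudaMemset(grad[i], 0, count * sizeof(float));
        cudaStreamCreate(&streams[i]);
    }

    // Sum the buffers across all GPUs in place, as a training step would do
    // after each backward pass; NCCL routes the traffic over NVLink on DGX-1.
    ncclGroupStart();
    for (int i = 0; i < ndev; ++i)
        ncclAllReduce(grad[i], grad[i], count, ncclFloat, ncclSum, comms[i], streams[i]);
    ncclGroupEnd();

    for (int i = 0; i < ndev; ++i) { cudaSetDevice(i); cudaStreamSynchronize(streams[i]); }
    for (int i = 0; i < ndev; ++i) ncclCommDestroy(comms[i]);
    printf("all-reduce across %d GPUs done\n", ndev);
    return 0;
}
```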
NVIDIA Docker containers provide packaged applications such as DL frameworks that are multi-GPU aware, AND you can schedule and run these from the NVIDIA cloud service. All these tools also make it easy for you to build, package, and run your own containerized applications on DGX-1. In other words, with NVIDIA libraries, containers, and DGX-1, you’ve got everything you need for developing and running multi-GPU accelerated applications.
Container Based Applications – Easily run accelerated computing and deep learning frameworks through GPU aware containers. Build your own containers and host them in a private repository through the cloud service.
NVIDIA Cloud Management – Manage your node or cluster and run containers from the NVIDIA cloud service. Connecting to the cloud is easy; just plug in power and internet and you’re ready to go.
Deep learning is a fundamentally new software model that needs a new computing platform.
GPU computing is an ideal approach and the GPU is the ideal processor.
A combination of factors is essential to create a new computing platform — performance, programming productivity, and open accessibility.
Performance. NVIDIA GPUs are naturally great at parallel workloads and speed up DNNs by 10-20x, reducing each of the many training iterations from weeks to days.
Programmability. AI innovation is on a breakneck pace.
Ease of programming and developer productivity is paramount.
The programmability and richness of NVIDIA’s CUDA platform allow researchers to innovate quickly — building new configurations of CNNs, DNNs, deep inception networks, RNNs, LSTMs, and reinforcement learning networks.
Accessibility. Developers want to create anywhere and deploy everywhere.
NVIDIA GPUs are available all over the world, from every PC OEM; in desktops, notebooks, servers, or supercomputers; and in the cloud from Alibaba, Amazon, Baidu, Google, IBM and Microsoft.