HOW DO I UNDERSTAND DEEP LEARNING PERFORMANCE?
THERE’S A LOT OF CONFUSION AROUND DEEP LEARNING PERFORMANCE.
HOW DO YOU MEASURE IT?
WHAT SHOULD YOU MEASURE?
JUST REMEMBER “PLASTER”
A FRAMEWORK FOR DEEP LEARNING PERFORMANCE
WHAT DOES PLASTER STAND FOR?
Programmability
Latency
Accuracy
Size of Model
Throughput
Energy Efficiency
Rate of Learning
PROGRAMMABILITY
Machine learning is experiencing explosive growth,
not only in the size and complexity of models but
also in the diversity of neural network
architectures. Even experts find it difficult to
understand the model choices and then pick the
appropriate model to solve their AI business
problems.
After a deep learning model is coded and trained,
it is optimized for a specific runtime inference
environment. NVIDIA addresses training and
inference challenges with two key tools: CUDA and
TensorRT, NVIDIA’s programmable inference
accelerator. In addition, NVIDIA’s deep learning
platform accelerates all deep learning
frameworks, for both training and inference.
LATENCY
Humans and machines need a response to make
decisions and take action. Latency is the time
between requesting something and receiving a
response.
With most human-facing software systems, not just
AI, the time is often measured in milliseconds. Voice
recognition is a commonly understood application,
thanks to Siri, Alexa, and similar voice interfaces.
There is wide demand for digital assistants in both
consumer and customer service applications. But when
people interact with a digital assistant, a lag
of even a few seconds starts to feel unnatural.
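The definition above (time between requesting something and receiving a response) can be turned into a small measurement. This is a minimal sketch, not a benchmark method from the slides: `fake_inference` is a hypothetical stand-in for a real model call, and reporting the median and 95th percentile is an assumption about what is worth tracking.

```python
import statistics
import time


def fake_inference(request):
    """Hypothetical stand-in for a real model call; not part of the slides."""
    time.sleep(0.002)  # pretend the model takes about 2 ms
    return request.upper()


def measure_latency_ms(handler, requests):
    """Return one latency sample (in milliseconds) per request."""
    samples = []
    for request in requests:
        start = time.perf_counter()
        handler(request)
        samples.append((time.perf_counter() - start) * 1000.0)
    return samples


def summarize(samples):
    """Median and 95th-percentile latency from a list of samples."""
    ordered = sorted(samples)
    p95_index = max(0, int(round(0.95 * len(ordered))) - 1)
    return {"p50_ms": statistics.median(ordered), "p95_ms": ordered[p95_index]}


if __name__ == "__main__":
    print(summarize(measure_latency_ms(fake_inference, ["hello"] * 20)))
```

Tail percentiles are used here because a user notices the slow responses, not the average one.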
ACCURACY
While accuracy is important in every industry,
healthcare needs especially high accuracy. Medical
imaging has advanced significantly in the last couple
of decades, increasing usage and requiring more
analysis to identify medical issues. Medical imaging
advancements and usage also mean that large
volumes of data must be transmitted from medical
machines to medical specialists to analyze.
Options to address the data volume issue have been
either to transmit the full information with long
delays or to sample the data and reconstruct it using
techniques that can lead to inaccurate reconstruction
and diagnostics.
An advantage of deep learning is that a model can be
trained at high numerical precision and then deployed
for inference at lower precision.
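One common way to run a trained model at lower precision is to quantize its weights to 8-bit integers. The slides do not say which method is meant, so the sketch below is only an assumption: it uses symmetric linear quantization with a single shared scale, and the weight values are made up for illustration.

```python
def quantize_int8(weights):
    """Map float weights to integers in [-127, 127] using one shared scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    return [round(w / scale) for w in weights], scale


def dequantize(q_weights, scale):
    """Recover approximate float weights from the quantized integers."""
    return [q * scale for q in q_weights]


if __name__ == "__main__":
    trained = [0.82, -1.27, 0.003, 0.5]  # illustrative values only
    q, scale = quantize_int8(trained)
    restored = dequantize(q, scale)
    worst_error = max(abs(a - b) for a, b in zip(trained, restored))
    print(q, scale, worst_error)
```

The rounding error per weight is at most half the scale, which is why accuracy should be re-checked after reducing precision rather than assumed.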
SIZE OF MODEL
The size of a deep learning model and the capacity of
the physical network between processors both affect
performance, especially the latency and throughput
aspects of PLASTER. The number of deep learning
models is growing rapidly.
Their size and complexity are also increasing, enabling
far more detailed analysis and driving the need for
more powerful systems for training. In a deep learning
model, the factors that drive demand for compute power
and physical network capacity are:
• Number of layers
• Number of nodes (neurons) per layer
• Complexity of computation per layer
• Number of connections between a node at one layer
and the nodes of neighboring layers
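The drivers listed above can be made concrete for the simplest case. This is a rough sketch that assumes a fully connected network, where every node connects to every node in the next layer; the function name and the layer sizes in the example are illustrative, not from the slides.

```python
def dense_network_cost(layer_sizes):
    """Count parameters and multiply-accumulates for a fully connected net.

    layer_sizes lists the node count of every layer, input layer first.
    """
    params = 0
    macs = 0
    for fan_in, fan_out in zip(layer_sizes, layer_sizes[1:]):
        params += fan_in * fan_out + fan_out  # weights plus one bias per node
        macs += fan_in * fan_out              # one multiply-add per connection
    return params, macs


if __name__ == "__main__":
    print(dense_network_cost([784, 128, 10]))
```

Adding layers or widening a layer increases both counts, which is how model size feeds into memory, compute, and the data moved between processors.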
THROUGHPUT
Developers are increasingly optimizing inference within a
specified latency threshold. While the latency limit
ensures good customer experience, maximizing
throughput within that limit is critical to maximizing
data center efficiency as well as revenue.
There has been a tendency to use throughput as the only
performance metric, as more computations per second
generally lead to better performance across other areas.
However, if a system cannot deliver adequate throughput
within a specified latency requirement, power budget, or
server node count, then the system will not ultimately
serve an application’s inference needs well. Without the
appropriate balance of throughput and latency, the
result can be poor customer service, missed service level
agreements (SLAs), and potentially a failed service.
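Maximizing throughput within a latency limit can be sketched as a small selection problem. The example assumes that batch size is the tuning knob and that latency has already been measured per batch size; both the function name and the numbers are hypothetical.

```python
def best_batch_within_latency(profile, latency_limit_ms):
    """profile maps batch size -> measured latency in ms for that whole batch.

    Returns (batch_size, throughput_per_second) for the highest-throughput
    batch size that still meets the latency limit, or None if none do.
    """
    best = None
    for batch_size, latency_ms in profile.items():
        if latency_ms > latency_limit_ms:
            continue  # violates the latency threshold, so it is not eligible
        throughput = batch_size / (latency_ms / 1000.0)
        if best is None or throughput > best[1]:
            best = (batch_size, throughput)
    return best


if __name__ == "__main__":
    # Made-up numbers for illustration only, not measurements.
    profile = {1: 2.0, 8: 5.0, 32: 12.0, 128: 40.0}
    print(best_batch_within_latency(profile, latency_limit_ms=15.0))
```

In this made-up profile the largest batch has the highest raw throughput but misses the 15 ms limit, so a smaller batch is chosen.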
ENERGY EFFICIENCY
As deep learning (DL) accelerator performance improves, accelerator
power consumption escalates. Providing ROI for deep
learning solutions involves more than looking at just the
inference performance of a system. Power consumption
can quickly increase costs of delivering a service, driving a
need to focus on energy efficiency in both devices and
systems.
Speech processing is a good example of a solution that
needs heavy processing to provide an intelligent response
in a natural voice. Data center inference providing real-
time processing for speech can easily involve large racks
of machines that can impact a company’s total cost of
ownership (TCO). Therefore, the industry measures
operational success in inferences-per-watt (higher is
better). Hyperscale data centers seek to maximize energy
efficiency for as many inferences as they can deliver with
a fixed power budget.
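The inferences-per-watt metric can be written as simple arithmetic. This sketch assumes the metric means sustained inferences per second divided by average power draw in watts; the slides do not define it more precisely, and the function names are illustrative.

```python
def inferences_per_watt(inferences_per_second, average_power_watts):
    """Inferences per second per watt (inferences per joule); higher is better."""
    if average_power_watts <= 0:
        raise ValueError("power must be positive")
    return inferences_per_second / average_power_watts


def max_inferences_under_budget(efficiency, power_budget_watts):
    """Inference rate that a fixed power budget can sustain at a given efficiency."""
    return efficiency * power_budget_watts


if __name__ == "__main__":
    efficiency = inferences_per_watt(3000, 250)  # illustrative numbers only
    print(efficiency, max_inferences_under_budget(efficiency, 10_000))
```

With a fixed power budget, the only way to serve more inferences is to raise the efficiency figure.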
RATE OF LEARNING
Because “intelligence” is one of the two words in “AI”,
users will expect neural networks to learn and adapt within
a reasonable timeframe. For complex DL systems to gain
traction in business, software tool developers must
support the DevOps movement.
As organizations continue to experiment with deep
learning and neural networks, they are learning how to
more effectively build and implement DL systems. DL
models must be retrained periodically as inferencing
services gather new data and as services grow and
change. Therefore, IT organizations and software
developers must increase the rate at which they can
retrain models as new data arrives.
HOW CAN PLASTER IMPACT YOUR BUSINESS?

How Do I Understand Deep Learning Performance?

  • 1. HOW DO I UNDERSTAND DEEP LEARNING PERFORMANCE?
  • 2. THERE’S A LOT OF CONFUSION AROUND DEEP LEARNING PERFORMANCE.
  • 3. HOW DO YOU MEASURE IT? WHAT SHOULD YOU MEASURE?
  • 4. JUST REMEMBER “PLASTER”: A FRAMEWORK FOR DEEP LEARNING PERFORMANCE
  • 5. WHAT DOES PLASTER STAND FOR? Programmability, Latency, Accuracy, Size of Model, Throughput, Energy Efficiency, Rate of Learning
  • 6. PROGRAMMABILITY Machine learning is experiencing explosive growth, not only in the size and complexity of models but also in the diversity of neural network architectures. Even experts find it difficult to understand the model choices and then choose the appropriate model to solve their AI business problems. After a deep learning model is coded and trained, it is optimized for a specific runtime inference environment. NVIDIA addresses training and inference challenges with two key tools: CUDA and TensorRT, NVIDIA’s programmable inference accelerator. In addition, NVIDIA’s deep learning platform accelerates all deep learning frameworks, for both training and inference.
  • 7. LATENCY Humans and machines need a response to make decisions and take action. Latency is the time between requesting something and receiving a response. With most human-facing software systems, not just AI, the time is often measured in milliseconds. Voice recognition is a commonly understood application, thanks to Siri, Alexa, and similar voice interfaces. There is wide demand for digital assistants in both consumer and customer service applications. But when humans try to interface with digital assistants, a lag of even a few seconds starts to feel unnatural.
  • 8. ACCURACY While accuracy is important in every industry, healthcare needs especially high accuracy. Medical imaging has advanced significantly in the last couple of decades, increasing its usage and requiring more analysis to identify medical issues. These advancements and the growth in usage also mean that large volumes of data must be transmitted from medical machines to medical specialists for analysis. The options for addressing the data volume issue have been either to transmit the full information with long delays, or to sample the data and reconstruct it using techniques that can lead to inaccurate reconstruction and diagnostics. An advantage of deep learning is that it can be trained at high precision and implemented at lower precision.
  • 9. SIZE OF MODEL The size of a deep learning model and the capacity of the physical network between processors both affect performance, especially the latency and throughput aspects of PLASTER. Deep learning network models are exploding in number. Their size and complexity are also increasing, enabling far more detailed analysis and driving the need for more powerful systems for training. In a deep learning model, the drivers of compute power and physical network expansion are:
      • Number of layers
      • Number of nodes (neurons) per layer
      • Complexity of computation per layer
      • Number of connections between a node at one layer and the nodes of neighboring layers
  • 10. THROUGHPUT Developers are increasingly optimizing inference within a specified latency threshold. While the latency limit ensures a good customer experience, maximizing throughput within that limit is critical to maximizing data center efficiency as well as revenue. There has been a tendency to use throughput as the only performance metric, as more computations per second generally lead to better performance in other areas. However, if a system cannot deliver adequate throughput within a specified latency requirement, power budget, or server node count, then it will not ultimately serve an application’s inference needs well. Without the appropriate balance of throughput and latency, the result can be poor customer service, missed service level agreements (SLAs), and potentially a failed service.
  • 11. ENERGY EFFICIENCY As DL accelerator performance improves, DL accelerator power consumption escalates. Providing ROI for deep learning solutions involves more than looking at just the inference performance of a system. Power consumption can quickly increase the cost of delivering a service, driving a need to focus on energy efficiency in both devices and systems. Speech processing is a good example of a solution that needs heavy processing to provide an intelligent response in a natural voice. Data center inference providing real-time processing for speech can easily involve large racks of machines that can impact a company’s total cost of ownership (TCO). Therefore, the industry measures operational success in inferences-per-watt (higher is better). Hyperscale data centers seek to maximize energy efficiency for as many inferences as they can deliver with a fixed power budget.
  • 12. RATE OF LEARNING Because one of the two words in “AI” is “intelligence,” users will want neural networks to learn and adapt in a reasonable timeframe. For complex DL systems to gain traction in business, software tool developers must support the DevOps movement. As organizations continue to experiment with deep learning and neural networks, they are learning how to build and implement DL systems more effectively. DL models must be retrained periodically as inferencing services gather new data and as services grow and change. Therefore, IT organizations and software developers must increase the rate at which they can retrain models as new data arrives.
  • 13. HOW CAN PLASTER IMPACT YOUR BUSINESS? DOWNLOAD WHITEPAPER
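
The latency, throughput, and energy efficiency slides describe quantities that can be computed from a handful of measurements. The sketch below is one possible way to summarize them and is not taken from the slides or the whitepaper. The function name `summarize_inference_run`, its parameters, and the choice of a nearest-rank 99th percentile are all assumptions made for illustration. Because a watt is already a per-second unit, "inferences-per-watt" is interpreted here as inferences per second per watt.

```python
import math


def summarize_inference_run(latencies_ms, wall_seconds, avg_power_watts, latency_limit_ms):
    """Summarize one measured inference run (hypothetical helper, not an NVIDIA API).

    latencies_ms:     per-request latencies in milliseconds
    wall_seconds:     wall-clock duration of the whole run
    avg_power_watts:  average power drawn during the run
    latency_limit_ms: the latency threshold the service must stay within
    """
    ordered = sorted(latencies_ms)
    # Nearest-rank 99th percentile: smallest value covering at least 99% of requests.
    rank = max(1, math.ceil(0.99 * len(ordered)))
    p99_ms = ordered[rank - 1]

    # Throughput: completed inferences per second of wall-clock time.
    throughput = len(ordered) / wall_seconds

    return {
        "p99_latency_ms": p99_ms,
        "throughput_per_s": throughput,
        # Assumed reading of "inferences-per-watt": throughput divided by average power.
        "inferences_per_s_per_watt": throughput / avg_power_watts,
        # Throughput only counts if the latency limit is also met.
        "meets_latency_limit": p99_ms <= latency_limit_ms,
    }


if __name__ == "__main__":
    # Made-up numbers purely to show the call shape.
    print(summarize_inference_run([5, 6, 7, 8, 50], 1.0, 250.0, 10.0))
```

Reporting throughput together with a tail latency check reflects the point made on the throughput slide: a high inferences-per-second figure is of little use if the latency limit is exceeded.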