NVIDIA
Accelerated Computing
Full Stack, 3 Chips, Data Center Scale
30 Million CUDA Downloads
150 SDKs
$100 Trillion Industry Served
Gaming
Data Science
Robotics
Broadcast
CAD
Physical
Sciences
Life
Sciences
Quantum
Physics
Digital
Twins
Genomics
5G
Quantum
Computing
Cybersecurity
AI
NLU
Machine
Learning
AI
Recsys
AI
Speech
AI
Computer
Vision
Medical
Imaging
Autonomous
Vehicles
EDA
COMPUTING FOR THE DA VINCIS AND EINSTEINS OF OUR TIME
8 of 10
World’s Top 500
Supercomputers
9 of 10
World’s Top 500 Green
Supercomputers
#1
MLPerf
Training and Inference
2021 Nobel Prize in Physics
Studying Earth’s Climate
25,000
Companies Run
on NVIDIA AI
Oak Ridge National Laboratory and the oak leaf symbol are registered trademarks of the U.S. Department of Energy. Use of this mark does not constitute or imply its
endorsement, recommendation, or favoring by the United States Government or any agency thereof or its contractors or subcontractors.
World Record Accuracy
2.96% Gap on
Gehring and Homberger
Scalable to 1,000s
of Locations
3 Seconds
vs
5 Minutes
to Route 1,000 Packages
ANNOUNCING
NVIDIA REOPT
Re-Optimize Logistics and Supply Chain in Real-Time
Accelerated Solver for Vehicle Route, Warehouse Picking,
Fleet-Mix Optimization
Massively Parallel Algorithm Generates Thousands of
Solution Candidates and Refinements
Dynamic Rerouting Reduces Travel Time – Save
Billions for a $10 Trillion Logistics Industry
Available Now
nvidia.com/reopt
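The "generate many solution candidates, then refine them" idea can be illustrated with a small CPU-only sketch. This is not NVIDIA ReOpt's algorithm or API; it is a classic random-restart search with 2-opt refinement on a single-vehicle tour, and the names `route_length`, `two_opt`, and `solve` are hypothetical helpers introduced only for this example.

```python
import math
import random

def route_length(route, points):
    """Total length of a closed tour that visits points in the given order."""
    return sum(
        math.dist(points[route[i]], points[route[(i + 1) % len(route)]])
        for i in range(len(route))
    )

def two_opt(route, points):
    """Refine a tour by reversing segments for as long as that shortens it."""
    best = route[:]
    improved = True
    while improved:
        improved = False
        for i in range(1, len(best) - 1):
            for j in range(i + 1, len(best)):
                candidate = best[:i] + best[i:j + 1][::-1] + best[j + 1:]
                if route_length(candidate, points) < route_length(best, points) - 1e-12:
                    best = candidate
                    improved = True
    return best

def solve(points, n_candidates=20, seed=0):
    """Generate random candidate tours from the depot (index 0), refine each, keep the shortest."""
    rng = random.Random(seed)
    stops = list(range(1, len(points)))
    best = None
    for _ in range(n_candidates):
        rng.shuffle(stops)
        candidate = two_opt([0] + stops, points)
        if best is None or route_length(candidate, points) < route_length(best, points):
            best = candidate
    return best

# Hypothetical data: a depot and three stops on the corners of a unit square.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
best_route = solve(square)
print(best_route, route_length(best_route, square))
```

Each candidate is independent of the others, which is the property a massively parallel solver could exploit; here the candidates are simply evaluated one after another.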
LEADING QUANTUM SIMULATORS
INDUSTRY PARTNERS
RESEARCH COMMUNITY PARTNERS
ANNOUNCING
NVIDIA CUQUANTUM DGX
APPLIANCE
Research the Computer of Tomorrow on the Most
Powerful Computer Today
Appliance Available Q1 2022
cuQuantum Available Now for Download
developer.nvidia.com/cuquantum
Out-of-the-Box Optimized Stack for Cirq
Other Simulators in Development
cuQuantum SDK in Open Beta;
Accelerate Popular Quantum
Simulators from Google, IBM
cuQuantum
GOOGLE
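DGX cuQuantum Appliance vs. State Vector Simulator on Dual AMD CPU
Sycamore Supremacy Circuit: 29 Minutes (Dual AMD CPU) vs. 19 Seconds (DGX cuQuantum Appliance)
Quantum Fourier Transform: 8 Minutes (Dual AMD CPU) vs. 7 Seconds (DGX cuQuantum Appliance)
Shor's Algorithm: 22 Minutes (Dual AMD CPU) vs. 26 Seconds (DGX cuQuantum Appliance)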
ANNOUNCING
WORLD RECORD QUANTUM
SIMULATION OF MAXCUT
Record Qubit Scale
MaxCut Algorithm with cuQuantum Tensor
Network Simulation
1,688 Qubits on 896 GPUs
Advance Quantum Algorithm Research in
Drug Discovery, Climate Research, Cybersecurity,
and Finance
Tensor Network
Simulator on Theta
cuQuantum
on Selene
210
3,375 VERTICES
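A rough sense of why a tensor network method is used at this qubit count: a full state vector holds 2^n complex amplitudes, so its memory doubles with every added qubit. The sketch below assumes 16 bytes per amplitude (double-precision complex); the function name `state_vector_bytes` is hypothetical and not part of cuQuantum.

```python
def state_vector_bytes(n_qubits, bytes_per_amplitude=16):
    """Memory needed to store all 2**n amplitudes of an n-qubit state vector."""
    return (2 ** n_qubits) * bytes_per_amplitude

GIB = 2 ** 30

for n in (30, 40, 50):
    print(f"{n} qubits: {state_vector_bytes(n) / GIB:,.0f} GiB")

# At 1,688 qubits the byte count is a number with hundreds of decimal digits,
# which is why a full state vector is out of reach at that scale.
digits = len(str(state_vector_bytes(1688)))
print(f"1,688 qubits: a byte count with {digits} decimal digits")
```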
ANNOUNCING
NVIDIA CUNUMERIC
Accelerated Computing At-Scale for PyData
and NumPy Ecosystem
Python Used by 20 Million Data Scientists,
Researchers, and Scientists
NumPy Downloaded 122,000,000 Times Since 2017
NumPy Used by 790,000 Projects in GitHub
NumPy is the Foundation of Pandas, SciPy,
and Scikit-Learn
ANNOUNCING
NVIDIA CUNUMERIC
Accelerated Computing At-Scale for PyData
and NumPy Ecosystem
Transparently Accelerates and Scales
NumPy Workflows
Zero Code Changes
Automatic Parallelism and Acceleration for
Multi-GPU, Multi-Node Systems
Scales to 1,000s of GPUs
Available Now on GitHub and Conda
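As an illustration of the "zero code changes" claim, the sketch below keeps the body of a NumPy program unchanged and swaps only the import. It assumes the package is importable as `cunumeric` (the module name used at the time of this keynote) and falls back to plain NumPy when it is not installed; `jacobi_step` is a hypothetical example workload, not something from the slides.

```python
try:
    import cunumeric as np  # assumed drop-in replacement for NumPy
except ImportError:
    import numpy as np      # fallback so the sketch runs anywhere

def jacobi_step(grid):
    """One Jacobi relaxation sweep: each interior cell becomes the mean of its 4 neighbors."""
    new = grid.copy()
    new[1:-1, 1:-1] = 0.25 * (
        grid[:-2, 1:-1] + grid[2:, 1:-1] + grid[1:-1, :-2] + grid[1:-1, 2:]
    )
    return new

grid = np.zeros((64, 64))
grid[0, :] = 1.0  # fixed boundary condition along the top edge
for _ in range(100):
    grid = jacobi_step(grid)

interior_mean = float(grid[1:-1, 1:-1].mean())
print(interior_mean)
```

Everything below the import uses only standard array slicing and arithmetic, which is the kind of code a drop-in library can parallelize without edits.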
cuNumeric
NVIDIA Python Data Science and Machine Learning Ecosystem
cuDF
Pandas Scikit-Learn NetworkX NumPy
cuML cuGraph
DATA-CENTER-SCALE COMPUTE ENGINE
Implicitly Extract
Instruction-Level Parallelism
Overlap Memory Latency and
Computation
Manage Coherence of Memory
Hierarchy
Dynamic Scheduling
Retire
Reorder Buffer
Int Int FP FP L/S L/S
Instruction Buffer
In-Order
Out-Of-Order
In-Order
Branch Fetch
Instruction
Decode/Rename
Dispatch
DATA-CENTER-SCALE COMPUTE ENGINE
Implicitly Extract
Task-Level Parallelism
Overlap Memory Latency and
Computation
Manage Coherence of Memory
Hierarchy
Dynamic Scheduling
[Chart: CFD Python (Weak Scaling); y-axis: time in seconds, 0 to 150; x-axis: relative dataset size | # of GPUs, 1 to 1024]
Retire
Reorder Buffer
Instruction Buffer
In-Order
Out-Of-Order
Branch Fetch
Task
Decode/Rename
Dispatch
CPUs GPUs NICs
In-Order
Omniverse
Million-X
Science
AI
Data-Center-Scale
Computing
Robotics &
Self-Driving Cars
Avatars
SPEECH
NLU
RECOMMENDER
DIALOG MANAGER
NV AVATAR
COMPUTER VISION
CONVERSATIONAL AVATAR
Camera In
Mic In/Out
Graphics /
Video Out
400G NDR InfiniBand
Cloud-Native Supercomputing
ANNOUNCING NVIDIA QUANTUM-2
Multi-Tenant
Bare-Metal Secure
Performance
Isolation
Congestion
Control
SHARP Gen 3
In-Network Computing
Precision
Timing
400G NDR InfiniBand
Cloud-Native Supercomputing
ANNOUNCING NVIDIA QUANTUM-2
QUANTUM-2 SWITCH
57 Billion Transistors TSMC 7N
Optimized Multi-Tenant In-Network Computing
64-Ports of 400 Gbps or 128-Ports of 200 Gbps
3X Higher Switching Throughput | 32X More AI Acceleration Engines
Sampling Now
400G NDR InfiniBand
Cloud-Native Supercomputing
ANNOUNCING NVIDIA QUANTUM-2
CONNECTX-7 INFINIBAND
8 Billion Transistors TSMC 7N
16 Core / 256 Threads Datapath Accelerator | 400 Gbps Crypto Accelerations
4X In-Network Computing Performance | 2X GPUDirect Throughput
Sampling Jan ‘22
400G NDR InfiniBand
Cloud-Native Supercomputing
ANNOUNCING NVIDIA QUANTUM-2
BLUEFIELD-3 INFINIBAND
22 Billion Transistors TSMC 7N
16 Arm 64-Bit Cores
16 Core / 256 Threads Datapath Accelerator | 400 Gbps Crypto Accelerations
4X In-Network Computing Performance | 2X GPUDirect Throughput
Sampling May ‘22
400G NDR InfiniBand
Cloud-Native Supercomputing
ANNOUNCING NVIDIA QUANTUM-2
2X
Data Throughput
400 Gbps
4X
MPI Performance
All-2-All
Acceleration
5X
Switch Capacity
>1.6 Petabps
2048 Ports
6.5X
Higher Scalability
>1M Nodes
in 3 Hops
32X
AI Accelerators
SHARP v3
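The port and capacity figures on these slides can be cross-checked with simple arithmetic. The sketch below assumes the ">1.6 Petabps" switch capacity counts both directions of all 2048 ports at 400 Gbps; that reading is an assumption, not something the slide states. The helper `aggregate_tbps` is hypothetical.

```python
def aggregate_tbps(ports, gbps_per_port, bidirectional=False):
    """Aggregate bandwidth in terabits per second for a given port configuration."""
    total_gbps = ports * gbps_per_port * (2 if bidirectional else 1)
    return total_gbps / 1000

# The two single-switch port configurations carry the same aggregate bandwidth.
print(aggregate_tbps(64, 400))   # 64 ports of 400 Gbps
print(aggregate_tbps(128, 200))  # 128 ports of 200 Gbps

# 2048 ports at 400 Gbps, counted in both directions (assumption).
print(aggregate_tbps(2048, 400, bidirectional=True) / 1000, "Pb/s")
```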
SOFTWARE
DEFINED
CLOUD-NATIVE
DISAGGREGATED
COMPUTING
SCALE UP &
SCALE OUT
ZERO-TRUST
DOCA 1.0
Accelerated Secure Bare-Metal Cloud
Crypto Storage Acceleration
Software-Defined Networking
De/Compression Congestion Control
RegEx
NVIDIA DOCA 1.0
Accelerated Data Center Infrastructure
Offload, Accelerate, and Isolate Data Center
Infrastructure with Accelerated Networking, Security,
Storage, and Management Applications
NVIDIA BlueField DPU
1400
DOCA Developers
108
New DOCA APIs
Deep Packet Inspection Intrusion Detection
Load Balancers
Telemetry Security Groups
Firewall
DOCA 1.2
Zero-Trust Security Framework
Service
Containers
Crypto Storage Acceleration
Software-Defined Networking
De/Compression Congestion Control
RegEx
ANNOUNCING
NVIDIA DOCA 1.2
Accelerated Data Center Infrastructure
NEW Zero-Trust Security Framework
Extend Threat Protection to Every Touch Point
Hardware & Software Authentication, Line-Rate Data
Encryption, Distributed Firewall, and Smart Telemetry
Security Groups and Virtual Private Cloud Isolation
NVIDIA BlueField DPU
ANNOUNCING
CYBERSECURITY LEADERS EXTEND
ZERO-TRUST WITH NVIDIA
BLUEFIELD
Deploy Security-as-a-Service with DOCA 1.2
Extend Security from Perimeter to Edge
Security Processing on BlueField Offloads CPU Burden
EXPANDING BLUEFIELD ECOSYSTEM
CLOUD CYBERSECURITY STORAGE
EDGE
PLATFORM
Anomaly Detection
Machine
Human
Machine
Machine
Machine
Machine
Human
Humans and Machines
Across the Enterprise
Post-Processing
Pre-Processing
Inference Requests
Apache Kafka
TRITON
Log Data
NVIDIA MORPHEUS
ANNOUNCING
NVIDIA MORPHEUS
Accelerated AI Platform for Next Gen SIEM
Built on NVIDIA RAPIDS and NVIDIA AI
600X Faster Data Processing – Monitor Every User and
Machine-Generated Data for Anomalous Behavior
Detect Anomalies with 10s of Millions of AI Models
in Real-Time
Pre-Trained Models for User Activity Fingerprinting and
Phishing Detection
Early Access 2 Available Now
nvidia.com/morpheus
Million-X
Science
Accelerated Computing Data Center Scale
AI
MILLION-X LEAP
[Chart: 1980 to 2020, log scale from 10^1 to 10^9; single-threaded performance curve annotated 1.5X per year and 1.1X per year]
MACHINE
LEARNING
SCALE
UP & OUT
ACCELERATED
COMPUTING
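To put the two single-threaded growth rates in perspective, the sketch below computes how long compound growth at 1.5X or 1.1X per year would take to reach a million-fold gain. It only illustrates the arithmetic of compounding; the function name `years_to_reach` is hypothetical.

```python
import math

def years_to_reach(target_gain, yearly_factor):
    """Years of compound growth at yearly_factor needed to reach target_gain."""
    return math.log(target_gain) / math.log(yearly_factor)

for factor in (1.5, 1.1):
    years = years_to_reach(1_000_000, factor)
    print(f"{factor}X per year: about {years:.0f} years to reach 1,000,000X")
```

At 1.5X per year a million-fold gain takes roughly three and a half decades; at 1.1X per year it takes well over a century, which is the gap the slide attributes to accelerated computing, scale-up and scale-out, and machine learning.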
PHYSICS-ML TURBOCHARGES SCIENCE
EXPLOSION IN HPC + AI RESEARCH
[Chart: # ML+Science Papers in ArXiv per year, 2015 to 2020; y-axis 0 to 6000]
MILLION-X DRUG DISCOVERY
Refinement in Simulation Docking & Virtual Screening Physics-Based Simulation
Imaging & Crystallography
Exhaustive
Search
BINDING
FREE ENERGY
CHEMICAL COMPOUNDS
PROTEIN STRUCTURE OF
DISEASE TARGET
MILLION-X DRUG DISCOVERY
SMNPPPPETSNPNKPKRQTNQLQYL
LRVVLKTLWKHQFAWPFQQPVDAV
KLNLPDYYKIIKTPMDMGTIKKRLEN
NYYWNAQECIQDFNTMFTNCYIYNK
PGDDIVZRS
HQFAWPFQQPVDAVKLNL
QTNQLQYLLRVVLKTLWR
Structure Prediction Hybrid Docking Machine-Learned Simulation
Accelerated Sequencing
Generative Search &
Synthesis
BINDING
FREE ENERGY
CHEMICAL COMPOUNDS
PROTEIN STRUCTURE OF
DISEASE TARGET
MILLION-X DRUG DISCOVERY
[Chart: Number of Available Structures (log scale, 1.00E+03 to 1.00E+13) by year, 1995 to 2025; labels: Known Chemicals, Structured Proteins, AlphaFold, Generative Models]
ENTOS TRANSCENDS SIMULATION
WITH MACHINE LEARNED
POTENTIALS
Quantum Accuracy
1,000X Faster than DFT
Reactive
MILLION-X CLIMATE SCIENCE
[Chart: model resolution (1000km down to 1m) by year, 1980 to 2060, marking IPCC assessment reports AR1 to AR6 and the convection-resolving, storm-resolving, and stratocumulus-resolving regimes]
1km at 1min (1X COMPUTE)
100m at 1s (10,000X COMPUTE)
1m at 0.01s (100 BILLION X COMPUTE)
Figure adapted from: Schneider, T., Teixeira, J., Bretherton, C. et al. Climate goals and computing the future of clouds. Nature Clim Change 7, 3–5 (2017). https://doi.org/10.1038/nclimate3190
ANNOUNCING
NVIDIA MODULUS
Physics-ML Neural Simulation Framework
Framework for Developing Physics-ML Models
Train Physics-ML Models Using Governing Physics,
Simulation, and Observed Data
Multi-GPU, Multi-Node Training
1,000-100,000X Speed Models – Ideal for Digital Twins
SymPy Equation
Model Library
(SiREN, PINO, PINN, MESHFREE)
Multi-Node Multi-GPU Training Engine
Numerical
Optimization
Plans
Geometry
ICs & BCs
Observations
Computational Graph Compiler
Available Now
developer.nvidia.com/modulus
EARTH DIGITAL TWIN IN OMNIVERSE
ERA5 ECMWF
Atmospheric Winds & Geopotential
10 TB | 30km | 5 Atmos Layers
RAPIDS
100,000X Speed-Up
0.25 Seconds for 7-Day Forecast
Training: 4 Hours on 128 A100 GPUs
Modulus Omniverse
Fourier Neural Operator
AI
EARTH
EMULATOR
Satellite
Ocean
Ecosystem
Atmosphere
Extreme
Weather
Prediction
Wind Energy
Forecasting
Disaster
Mitigation
Omniverse
OMNIVERSE
Internet
Physical
World
OMNIVERSE
Omniverse
Internet
Physical
World
Path Tracing
MDL
Physics
AI
OMNIVERSE
Omniverse
Worlds
Omniverse
Worlds
Omniverse
Worlds
Omniverse
Internet
Physical
World
Path Tracing
MDL
Physics
AI
OMNIVERSE FOR DESIGN COLLABORATION
Designer #1’s World
Designer #3’s World
Internet
Physical World
Designer #2’s World
Shared World
Path Tracing
MDL
Physics
AI
OMNIVERSE FOR DIGITAL TWIN
Factory Designer
Robot Gym
Internet
Physical World
Robot Gym
Factory Digital Twin
Path Tracing
MDL
Physics
AI
ANNOUNCING NEW OMNIVERSE FEATURES
SHOWROOM
Available in Beta
FARM
Available in Beta
AR
Available in Beta
VR
Coming Soon
500
Companies
40M
3D Designers
70K
Downloads
DIGITAL
TWINS
HPC
MEDIA &
ENTERTAINMENT
MANUFACTURING
AEC
CONNECTING VIRTUAL WORLDS
ANNOUNCING
EARLY ACCESS BENTLEY ITWIN FOR
NVIDIA OMNIVERSE
Physically-Accurate 4D Visualization of Infrastructure
Digital Twins
Supports 4D Design Review and Construction Simulation
Early Access Now Available
bentley.com/4DVisualization
SIEMENS ENERGY BUILDS HRSG DIGITAL TWIN IN OMNIVERSE
Boiler Design
Flow Simulation
Internet
Physical World
Training Data
Plant Digital Twin
Path Tracing
MDL
Physics
AI
BMW GROUP BUILDS FACTORY DIGITAL TWINS IN OMNIVERSE
Factory Planning
Robot Gym
Internet
Physical World
Digital Human Training
Factory Digital Twin
Path Tracing
MDL
Physics
AI
ERICSSON BUILDS CITY DIGITAL TWIN IN OMNIVERSE
City Planning
Simulation
Internet
Physical World
Materials
5G Network Digital Twin
Path Tracing
MDL
Physics
AI
AI
GRAPH NEURAL NETWORKS CAPTURE INSIGHTS FROM INTERCONNECTED DATA
90B Relationships in a Social Network
1.1B Transactions a Day
40B Molecules in Chemical Databases
DRUG DISCOVERY SOCIAL CONNECTIONS | FRAUD DETECTION
ANNOUNCING
DGL ACCELERATION WITH CUDA-X
GPU-Accelerated GNN Workflow
GNN for Molecule Reaction Prediction,
Node Classification, Knowledge Graphs,
and Model Explainability
CUDA-Optimized Reference Examples for
SE3-Transformer, R-GCN, and GraphSage
Early Access in December
ngc.nvidia.com
cuDF cuGraph DGL with CUDA-X
Text
Images
Relationships
Sub-Graph
Construction Graph to GNN
Graph
Construction
PROCESSING THE LARGEST GRAPH NEURAL NETWORKS
300B pins in graphs with billions of nodes
and billions of edges
5X faster training on graphs with over
10M nodes and over 100M edges from 24hrs to 5hrs
Improving fraud detection over
billions of transactions
AlexNet
VGG-19 Seq2Seq
Resnet
InceptionV3
Xception
ResNeXt
DenseNet201
Transformer
ELMo
GPT-1
BERT Large
Megatron
Microsoft T-NLG
GPT-3
Megatron-Turing NLG 530B
MoCo ResNet50
XLNet
Wav2Vec 2.0
[Chart: Training Compute (PetaFLOPS), log scale from 100 to 10,000,000,000, by year, 2012 to 2022]
Transformer: 275x / 2yrs
All AI Models: 25x / 2yrs
Moore's Law: 2x / 2yrs
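The three growth rates above are easier to compare as doubling times. The sketch below converts each "N times per 2 years" figure into the number of months it takes to double; the function name `doubling_time_months` is hypothetical.

```python
import math

def doubling_time_months(growth_factor, period_months=24):
    """Months needed to double, given growth_factor over period_months."""
    return period_months * math.log(2) / math.log(growth_factor)

trends = {"Transformer": 275, "All AI Models": 25, "Moore's Law": 2}
for name, factor in trends.items():
    print(f"{name}: doubles every {doubling_time_months(factor):.1f} months")
```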
SELF-SUPERVISED LEARNING ARRIVES WITH TRANSFORMERS
ANNOUNCING NVIDIA NEMO MEGATRON
Megatron 530B
NEMO MEGATRON
Automated
Data Curation
Distributed
Training
Customer’s Data
Custom
Megatron 530B
ANNOUNCING NVIDIA TRITON MULTI-GPU MULTI-NODE INFERENCE
Triton Inference Server
Magnum IO
(Multi-GPU, Multi-Node)
Query Response
Application
Response Time
1/2 Second
2 DGX A100
Response Time
> 1 Minute
Dual Socket CPU Server
JAPANESE
Global Ecommerce
ENTERPRISE DIGITAL
WORKFLOWS
SWEDISH
Finance, Healthcare,
and Manufacturing
PORTUGUESE
AI Assistant
CHINESE
Customer Service
VIETNAMESE
Radiologists and
Telehealth AI Service
INTENT RECOGNITION,
TRANSLATION, AND CHAT
KOREAN
Chatbots and Call Centers
LARGE LANGUAGE MODELS TAKING HPC MAINSTREAM
THANK YOU TO THOSE PRESENTING AT GTC!
RETAIL CLOUD ENTERPRISE SaaS
TELECOMMUNICATIONS
INDUSTRIAL HEALTHCARE CONSUMER INTERNET
AUTO
LEADING COMPANIES RUNNING NVIDIA AI
RETAIL CLOUD ENTERPRISE SaaS
TELECOMMUNICATIONS
INDUSTRIAL HEALTHCARE CONSUMER INTERNET
AUTO
ANNOUNCING
MICROSOFT TEAMS ADOPTS
NVIDIA AI WITH AZURE
COGNITIVE SERVICES
Nearly 250 Million Monthly Active Microsoft
Teams Users
Live Transcription and Captions in 28 Languages
Triton Inference Server in Azure Cognitive Services
AI INFERENCE IS HARD
PROCESSORS
AI Inference
DEPLOYMENT
PLATFORMS
Cloud On-Prem Edge Embedded
T4 GPU Arm CPU
A100 GPU
V100 GPU x86 CPU
FRAMEWORKS APP CONSTRAINTS
Real Time Batch Streaming
MODELS
CNN
GNN Decision Trees
RNN Transformers
ANNOUNCING
TENSORRT INTEGRATED WITH
PYTORCH AND TENSORFLOW
Accelerate In-Framework Inference with TensorRT with
Just 1 Line of Code
3X Faster
Supports Every Workload
FP32, TF32, FP16, INT8
Available for Download Today
ngc.nvidia.com
ANNOUNCING NVIDIA TRITON
WITH FOREST INFERENCING
ML and DL in One Application
Tree Models (XGBoost, Random Forest, LightGBM) Are
Ubiquitous
Large Tree Ensembles Push CPUs Beyond
Response-Time Limits
Ensembling with ML, DL, and More Complex Models is the
Future with Fraud Detection
0%
3.5 ms 0 ms
1.5 ms
Detection Rate
Max Response Time
80%
Transaction > $50?
Time after midnight?
XGBOOST MODEL
7K
Transaction > $50?
Time after midnight?
XGBOOST MODEL
7K
7K 7K
GOAL
Increase Detection Rate Within 1.5ms
3.5 ms 0 ms
80%
0%
1.5 ms
Detection Rate
Max Response Time
ANNOUNCING NVIDIA TRITON
WITH FOREST INFERENCING
ML and DL in One Application
Unified Deployment Engine for DL and ML
Inference Random Forests, GBDTs, and
Decision Trees
Deploy on CPU and GPU
Process Million+ Node Tree Models with Low-Latency
Fraud Detection, Recommender Systems, Risk Assessment,
and Predictive Maintenance
Available for Download Today
ngc.nvidia.com
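What forest inference computes can be sketched in a few lines: each tree is walked from root to leaf, and the leaf scores are summed and squashed into a probability. This is a plain-Python illustration, not Triton's implementation or API. The two questions mirror the ones on the slides, but every threshold and leaf value below is made up for the example.

```python
import math

def predict_tree(node, features):
    """Walk one decision tree from the root down to a leaf and return the leaf score."""
    while "leaf" not in node:
        branch = "yes" if features[node["feature"]] > node["threshold"] else "no"
        node = node[branch]
    return node["leaf"]

def predict_forest(trees, features):
    """Sum the leaf scores of all trees, then map the sum to a probability (logistic)."""
    margin = sum(predict_tree(tree, features) for tree in trees)
    return 1.0 / (1.0 + math.exp(-margin))

# Hypothetical two-tree ensemble; all numbers are illustrative only.
trees = [
    {"feature": "amount_usd", "threshold": 50.0,          # "Transaction > $50?"
     "yes": {"leaf": 0.8}, "no": {"leaf": -1.2}},
    {"feature": "hours_after_midnight", "threshold": 5.0,  # "Time after midnight?"
     "yes": {"leaf": -0.5}, "no": {"leaf": 0.9}},
]

p_high = predict_forest(trees, {"amount_usd": 120.0, "hours_after_midnight": 2.0})
p_low = predict_forest(trees, {"amount_usd": 10.0, "hours_after_midnight": 14.0})
print(p_high, p_low)
```

A production ensemble has thousands of such trees, so the per-request cost grows with tree count and depth; that growth is what pushes large models past a fixed response-time budget.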
UNACCEPTABLE
RESPONSE TIME
0 ms
3.5 ms
1M
Nodes
7K
GIANT XGBOOST MODEL
7K 7K
7K
Detection Rate
80%
0%
Max Response Time
1.5 ms
1M
Nodes
ANNOUNCING
TRITON INFERENCE SERVER 2.15
NEW Integration into AWS SageMaker and AliCloud –
Now All Major Frameworks, Major Clouds, and
AI Platforms
NEW Support for Arm – Now Inference on Every
Generation of GPUs, x86 CPUs, and Arm
NEW Model Analyzer Optimizes for App
QoS Requirements
NEW Forest Inference | NEW Distributed Multi-GPU,
Multi-Node Inference
Optimal
Model Config
TensorFlow PyTorch
TensorRT
RAPIDS
OpenVINO ONNX RT
Triton Inference Server
Application
Ampere x86 CPU
Volta Arm CPU
Turing
QoS Requirements
Triton
Model Analyzer
BOOST THROUGHPUT OF MODERN DATA CENTERS WITH NVIDIA TRITON
TensorRT 1 TensorRT 4 TensorRT 8.2
ACCELERATES EVERY WORKLOAD WORLD-CLASS RESPONSE TIME AND THROUGHPUT
12X
Recommenders
< 1 sec
10X
Reinforcement
Learning
583X
Speech Recognition
< 100ms
36X
Computer Vision
< 7ms
178X
Text-to-Speech
< 100ms
21X
NLP
< 50ms
CLASSIFICATION CLASSIFICATION
DETECTION
SEGMENTATION
RECOMMENDERS
CLASSIFICATION
DETECTION
SEGMENTATION
RECOMMENDERS
RL
FRAUD DETECTION
TEXT-TO-SPEECH
SPEECH RECOGNITION
SENTIMENT ANALYSIS
TRANSLATION
SENTENCE COMPLETION
Q & A
Software-Defined Real-Time Secure Hybrid Cloud
Deploy &
Orchestrate Fleet
High-Speed IO
Networking
Storage
Data
Processing
Signal &
Image
Processing
DNN
PINN
Graphics Streaming
RETAIL HEALTHCARE MANUFACTURING
EDGE AI AUTOMATING EVERY INDUSTRY
15M Stores
(CV, SpeechAI, NLU, RecSys)
160K Hospitals
(CV, SpeechAI, NLU, Robotics)
10M Factories
(CV, SpeechAI, NLU, Robotics)
FAST FOOD
7M Restaurants
(CV, SpeechAI, NLP)
NVIDIA Unified Compute Framework
NVIDIA Fleet Command
NVIDIA UNIFIED COMPUTE FRAMEWORK FOR REAL-TIME EDGE APPLICATIONS
Fleet Command
Aerial 5G
NVIDIA AI
NVIDIA Metropolis
With Unified Computing Framework
NVIDIA METROPOLIS AI EDGE
3rd Party
L2+
3rd Party
5G CORE
NVIDIA
L1
DATA CENTER COMMAND CENTER
Video In
DeepStream
on EGX
Triton
on EGX
Triton
on EGX
DeepStream
on EGX
Metadata
Display
DeepStream
on EGX
DeepStream
on EGX
MEDIA
HANDLING
DETECTION CLASSIFICATION SMOOTHING
VISUALIZATION
TRACKING
Fleet Command
Aerial 5G
NVIDIA AI
NVIDIA Metropolis
With Unified Computing Framework
MAVENIR
L2+
MAVENIR
5G Core
NVIDIA
L1
Powered by NVIDIA Metropolis AI-on-5G Edge Platform
ANNOUNCING MAVENIR MAVEDGE-AI
Mavenir MAVedge-AI Application
Maxine
Clara
Metropolis
Isaac
Merlin
Riva
NVIDIA AI ECOSYSTEM
NVIDIA AI
NVIDIA BASE COMMAND NVIDIA FLEET COMMAND
Morpheus
1,000+ Partners
Cloud to Core to Edge
NVIDIA PARTNER NETWORK
MLOps | CLOUD ML PaaS
ORCHESTRATORS
5G EDGE
OEM & CLOUDS
Silicon Valley
Singapore
Paris
Washington, D.C.
Dallas
Amsterdam
Frankfurt
London
Tokyo
ANNOUNCING
NVIDIA LAUNCHPAD ACROSS THE
GLOBE
GTC 2021
Avatars
OMNIVERSE AVATAR
Live Customer
Support
Web Customer
Support
Video
Conference
& Telepresence
Games Robots
OMNIVERSE AVATAR
Realistic Imaginary
Autonomous Teleoperated
PROJECT MAXINE WITH OMNIVERSE AVATAR
RIVA
MEGATRON 530B
MERLIN
DIALOG MANAGER
NV AVATAR
NV CV
Camera In
Mic In/Out
Graphics /
Video Out
MEGATRON 530B
MERLIN
DIALOG MANAGER
RIVA
Mic In/Out
ANNOUNCING
NVIDIA RIVA SPEECH AI
World Class Quality and Response Time
SDK to Customize for Use Case and Unique Voice for
Brand Virtual Assistant
Train New Voice with Only 30 Mins of Speech Data
Human-Like Expressivity and Fine-Grained Control
Deploy in Cloud, On-Prem, Edge, and Embedded
Enterprise Support Available Q1 ‘22 Globally
developer.nvidia.com/riva
NVIDIA ADVANCES SPEECH AI
Tacotron2 + WaveGlow
Fastpitch + HiFiGAN
[Chart: text-to-speech throughput, V100 vs. A100; y-axis 0 to 600]
TEXT-TO-SPEECH
12X HIGHER PERFORMANCE
DeepSpeech2
Jasper Quartznet
Citrinet-1024
[Chart: speech recognition error rate by year, 2018 to 2021; y-axis 0 to 50]
SPEECH RECOGNITION
4X HIGHER ACCURACY
WIDELY ADOPTED
UCAAS | FINANCE | TELECOM | CONSUMER
70K Developers | 250K Downloads
PROJECT TOKKIO WITH OMNIVERSE AVATAR
Video Out
Video In
Audio In
Audio Out
Megatron 530B
Triton MGMN
on DGX
ASR
Triton on EGX
DL FACE
TRACKER
DeepStream
on EGX
AUD2FACE
OMNIVERSE
Zero-Shot DM
Triton on EGX
Merlin
Triton on EGX
TTS
Triton on EGX
RIVA
AVATAR
SIMULATION
OMNIVERSE
PROJECT MAXINE WITH OMNIVERSE AVATAR
Video Out
Video In
Audio In
Audio Out
with Translation
AUDIO
DENOISE
Triton on EGX
GAZE REPOSE
DeepStream
on EGX
VID2FACE
Triton on EGX
RIVA
SPEECH AI
Triton on EGX
AUD2FACE
OMNIVERSE
FACE TRACKER
DeepStream
on EGX
POSE & MESH
Triton on EGX
Robotics &
Self-Driving Cars
STRYKER AIRO TruCT
Intraoperative CT
JOHNSON & JOHNSON AURIS
Robotic Endoscopy
MEDTRONIC HUGO
Robotic-Assisted Surgery
INTUITIVE SURGICAL ION
Robotic-Assisted Lung Biopsy
2M Devices | 16K Companies | 10K Modalities
MEDICAL INSTRUMENTS INTEGRATE AI AND ROBOTICS
RENDERING
OV on RTX
DATA
PROCESSING
cuCIM on EGX
ZERO-SHOT NLU
DIALOG
MANAGER
Triton on EGX
RIVA ASR
Triton on EGX
Audio
Sensor
SENSOR
PROCESSING
CUDA on EGX
IMAGE
PROCESSING
Triton on EGX
STREAM
DISPLAY
CloudXR on RTX
PHYSICS
PROCESSING
CUDA on EGX
ANNOUNCING
NVIDIA CLARA HOLOSCAN
AI COMPUTING INSTRUMENTS
PLATFORM
Stream-Computing Platform for High-Throughput
Signal, Data, AI, and Graphics Processing
Run in Data Center, Embedded Instruments,
or Hybrid
Remotely Update, Orchestrate, and Monitor Fleet
Platform for SaaS Model
Available November 15
developer.nvidia.com/clara-holoscan-sdk
ANNOUNCING NVIDIA AGX ORIN
Computational Sensing Instrument Platform
NVIDIA A6000
NVIDIA ConnectX-7
NVIDIA Orin
700+ Companies
NVIDIA ISAAC POWERING THE ROBOTICS REVOLUTION
NVIDIA ISAAC
Synthetic
Data
Data
DIGITAL TWIN
FACTORY
PHYSICAL
FACTORY
HD Map
TRAIN AI MODEL
ANNOUNCING
NVIDIA ISAAC ROS
NEW Isaac ROS GEM Brings NVIDIA Robotics AI
to ROS Community
Accelerate ROS-Native Packages up to 10X
Isaac Sim Out-of-Box ROS Support
NEW Isaac Sim Replicator for Synthetic Data Generation
Isaac Sim Replicator
Omniverse Farm
TAO
ISAAC GEMS
AUTO-LABELED SYNTHETIC DATA
ISAAC SIM
ISAAC ROS APPLICATION
STEREO DEPTH
SGM
SEGMENTATION
DNN
HUMAN POSE
ESTIMATION
Download Isaac ROS GEMS
developer.nvidia.com/isaac-ros-gems
ANNOUNCING
OMNIVERSE REPLICATOR FOR ISAAC
SIM
NEW Isaac ROS GEM Brings NVIDIA Robotics AI
to ROS Community
Accelerate ROS-Native Packages up to 10X
Isaac Sim Out-of-Box ROS Support
NEW Isaac Sim Replicator for Synthetic Data Generation
GROWING FLEETS POWERED BY NVIDIA DRIVE
R Auto
NVIDIA DRIVE AV
Synthetic
Data
World
Data
VIRTUAL
WORLD
PHYSICAL
WORLD
HD Map
TRAIN AI MODEL
TRANSFORM SURROUND 2D TO 4D WORLD MODEL
Functionally Safe Production Ready AV Platform
ANNOUNCING DRIVE HYPERION 8 GA
Software Defined Vehicle
Dual Orin X Standard Form Factor
Production Sensor Set
DriveWorks Acceleration Libraries
DRIVE AV Software
Tools for OEM Adaptation
Available Now
nvidia.com/drive-hyperion
2 x NVIDIA DRIVE Orin
Systems-on-a-Chip (SoCs)
ObstacleNet
OpenRoadNet
RadarNet
SignNet
MapNet
LidarNet
OMNIVERSE REPLICATOR FOR DRIVE SIM
DRIVE AV MAP
Fleet & Survey Mapping Auto Map Generation Localization & Planning,
Simulation, Digital Twin
ANNOUNCING DRIVE CONCIERGE
NVIDIA DRIVE CHAUFFEUR NVIDIA DRIVE CONCIERGE
NVIDIA
Accelerated Computing
Full Stack, 3 Chips, Data Center Scale
30 Million CUDA Downloads
150 SDKs
$100 Trillion Industry Served
Nemo
Megatron
Triton
ReOpt
cuQuantum
cuNumeric
Morpheus
Metropolis
Clara
Holoscan
Isaac
Maxine DRIVE
RIVA
DGL Modulus Omniverse
Launchpad AGX Orin Hyperion 8
Quantum-2
NVIDIA Keynote #GTC21

Contenu connexe

Tendances

How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML PlatformHow to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
Databricks
 
NVIDIA GTC 2020 October Summary
NVIDIA GTC 2020 October SummaryNVIDIA GTC 2020 October Summary
NVIDIA GTC 2020 October Summary
NVIDIA
 

Tendances (20)

𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
 
Kubernetes architecture
Kubernetes architectureKubernetes architecture
Kubernetes architecture
 
GTC 2019 Keynote in Silicon Valley
GTC 2019 Keynote in Silicon ValleyGTC 2019 Keynote in Silicon Valley
GTC 2019 Keynote in Silicon Valley
 
GitOps with ArgoCD
GitOps with ArgoCDGitOps with ArgoCD
GitOps with ArgoCD
 
Omniverse for the Metaverse
Omniverse for the MetaverseOmniverse for the Metaverse
Omniverse for the Metaverse
 
Nvidia Corporate Presentation
Nvidia Corporate PresentationNvidia Corporate Presentation
Nvidia Corporate Presentation
 
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML PlatformHow to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
 
NVIDIA GTC 2020 October Summary
NVIDIA GTC 2020 October SummaryNVIDIA GTC 2020 October Summary
NVIDIA GTC 2020 October Summary
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowMLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
 
FLiP Into Trino
FLiP Into TrinoFLiP Into Trino
FLiP Into Trino
 
Generative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdf
 
Metaverse System Architectures
Metaverse System ArchitecturesMetaverse System Architectures
Metaverse System Architectures
 
QNX Software Systems
QNX Software SystemsQNX Software Systems
QNX Software Systems
 
fpgax #13.pptx
fpgax #13.pptxfpgax #13.pptx
fpgax #13.pptx
 
Introduction to Kubernetes
Introduction to KubernetesIntroduction to Kubernetes
Introduction to Kubernetes
 
GitHub Copilot.pptx
GitHub Copilot.pptxGitHub Copilot.pptx
GitHub Copilot.pptx
 
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
 
Kubernetes 101
Kubernetes 101Kubernetes 101
Kubernetes 101
 
Best Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI ServiceBest Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI Service
 
Kubeflow Pipelines (with Tekton)
Kubeflow Pipelines (with Tekton)Kubeflow Pipelines (with Tekton)
Kubeflow Pipelines (with Tekton)
 

Similaire à NVIDIA Keynote #GTC21

“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
Edge AI and Vision Alliance
 

Similaire à NVIDIA Keynote #GTC21 (20)

組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム
 
AI talk at CogX 2018
AI talk at CogX 2018AI talk at CogX 2018
AI talk at CogX 2018
 
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
 
Harnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceHarnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligence
 
Nvidia at SEMICon, Munich
Nvidia at SEMICon, MunichNvidia at SEMICon, Munich
Nvidia at SEMICon, Munich
 
NVIDIA DataArt IT
NVIDIA DataArt ITNVIDIA DataArt IT
NVIDIA DataArt IT
 
Talk on commercialising space data
Talk on commercialising space data Talk on commercialising space data
Talk on commercialising space data
 
Hardware in Space
Hardware in SpaceHardware in Space
Hardware in Space
 
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdf
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdfNVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdf
NVIDIA DGX User Group 1st Meet Up_30 Apr 2021.pdf
 
Accelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudAccelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to Cloud
 
EPSRC CDT Conference
EPSRC CDT ConferenceEPSRC CDT Conference
EPSRC CDT Conference
 
Nvidia tesla-k80-overview
Nvidia tesla-k80-overviewNvidia tesla-k80-overview
Nvidia tesla-k80-overview
 
Breaking RSA & the internet
Breaking RSA & the internetBreaking RSA & the internet
Breaking RSA & the internet
 
GTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era DawnsGTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era Dawns
 
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
“Accelerate Tomorrow’s Models with Lattice FPGAs,” a Presentation from Lattic...
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
 
Application Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systemsApplication Optimisation using OpenPOWER and Power 9 systems
Application Optimisation using OpenPOWER and Power 9 systems
 
Cuda meetup presentation 5
Cuda meetup presentation 5Cuda meetup presentation 5
Cuda meetup presentation 5
 
Decision-ready climate data
Decision-ready climate dataDecision-ready climate data
Decision-ready climate data
 
2 Sessione - Macchine virtuali per la scalabilità di calcolo per velocizzare ...
2 Sessione - Macchine virtuali per la scalabilità di calcolo per velocizzare ...2 Sessione - Macchine virtuali per la scalabilità di calcolo per velocizzare ...
2 Sessione - Macchine virtuali per la scalabilità di calcolo per velocizzare ...
 

More from Alison B. Lowndes (18)

Exploring solutions for humanity's greatest challenges
Future of Skills
MAXSS & NVIDIA
DataArt
From gaming to the metaverse
Tales of AI agents saving the human race!
Harnessing the virtual realm
AI + E-commerce
Talk on using AI to address some of humanities problems
GTC Fall 2020 Keynote
Innovation Roundtable
Fuelling the AI Revolution with Gaming
Possibilities of generative models
Harnessing AI for the Benefit of All.
Phi Week 2019
AI in the Financial Services Industry
Deep learning customer stories
NVIDIA @ Infinite Conference, London


NVIDIA Keynote #GTC21

  • 1.
  • 2. NVIDIA Accelerated Computing Full Stack, 3 Chips, Data Center Scale 30 Million CUDA Downloads 150 SDKs $100 Trillion Industry Served Gaming Data Science Robotics Broadcast CAD Physical Sciences Life Sciences Quantum Physics Digital Twins Genomics 5G Quantum Computing Cybersecurity AI NLU Machine Learning AI Recsys AI Speech AI Computer Vision Medical Imaging Autonomous Vehicles EDA
  • 3. COMPUTING FOR THE DA VINCIS AND EINSTEINS OF OUR TIME 8 of 10 World’s Top 500 Supercomputers 9 of 10 World’s Top 500 Green Supercomputers #1 MLPerf Training and Inference 2021 Nobel Prize in Physics Studying Earth’s Climate 25,000 Companies Run on NVIDIA AI Oak Ridge National Laboratory and the oak leaf symbol are registered trademarks of the U.S. Department of Energy. Use of this mark does not constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof or its contractors or subcontractors.
  • 4. World Record Accuracy 2.96% Gap on Gehring and Homberger Scalable to 1,000s of Locations 3 Seconds vs 5 Minutes to Route 1,000 Packages ANNOUNCING NVIDIA REOPT Re-Optimize Logistics and Supply Chain in Real-Time Accelerated Solver for Vehicle Route, Warehouse Picking, Fleet-Mix Optimization Massively Parallel Algorithm Generates Thousands of Solution Candidates and Refinements Dynamic Rerouting Reduces Travel Time – Save Billions for a $10 Trillion Logistics Industry Available Now nvidia.com/reopt
  • 5.
  • 6. LEADING QUANTUM SIMULATORS INDUSTRY PARTNERS RESEARCH COMMUNITY PARTNERS Oak Ridge National Laboratory and the oak leaf symbol are registered trademarks of the U.S. Department of Energy. Use of this mark does not constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof or its contractors or subcontractors. ANNOUNCING NVIDIA CUQUANTUM DGX APPLIANCE Research the Computer of Tomorrow on the Most Powerful Computer Today Appliance Available Q1 2022 cuQuantum Available Now for Download developer.nvidia.com/cuquantum Out-of-the-Box Optimized Stack for Cirq Other Simulators in Development cuQuantum SDK in Open Beta; Accelerate Popular Quantum Simulators from Google, IBM cuQuantum GOOGLE
  • 7. ANNOUNCING NVIDIA CUQUANTUM DGX APPLIANCE Research the Computer of Tomorrow on the Most Powerful Computer Today Out-of-the-Box Optimized Stack for Cirq Other Simulators in Development cuQuantum SDK in Open Beta; Accelerate Popular Quantum Simulators from Google, IBM cuQuantum Appliance Available Q1 2022 cuQuantum Available Now for Download developer.nvidia.com/cuquantum DGX cuQuantum Appliance State Vector Simulator on Dual AMD CPU Sycamore Supremacy Circuit Quantum Fourier Transform Shor's Algorithm 29 Minutes 19 Seconds 8 Minutes 7 Seconds 22 Minutes 26 Seconds
  • 8. ANNOUNCING WORLD RECORD QUANTUM SIMULATION OF MAXCUT Record Qubit Scale MaxCut Algorithm with cuQuantum Tensor Network Simulation 1,688 Qubits on 896 GPUs Advance Quantum Algorithm Research in Drug Discovery, Climate Research, Cybersecurity, and Finance [Chart, vertices: Tensor Network Simulator on Theta, 210; cuQuantum on Selene, 3,375]
  • 9. ANNOUNCING NVIDIA CUNUMERIC Accelerated Computing At-Scale for PyData and NumPy Ecosystem Python Used by 20 Million Data Scientists, Researchers, and Scientists NumPy Downloaded 122,000,000 Times Since 2017 NumPy Used by 790,000 Projects in GitHub NumPy is the Foundation of Pandas, SciPy, and Scikit-Learn [Chart: timeline, 2008 to 2021]
  • 10. ANNOUNCING NVIDIA CUNUMERIC Accelerated Computing At-Scale for PyData and NumPy Ecosystem Transparently Accelerates and Scales NumPy Workflows Zero Code Changes Automatic Parallelism and Acceleration for Multi-GPU, Multi-Node Systems Scales to 1,000s of GPUs Available Now on GitHub and Conda cuNumeric NVIDIA Python Data Science and Machine Learning Ecosystem cuDF Pandas Scikit-Learn NetworkX NumPy cuML cuGraph
  • 11. DATA-CENTER-SCALE COMPUTE ENGINE Implicitly Extract Instruction-Level Parallelism Overlap Memory Latency and Computation Manage Coherence of Memory Hierarchy Dynamic Scheduling Retire Reorder Buffer Int Int FP FP L/S L/S Instruction Buffer In-Order Out-Of-Order In-Order Branch Fetch Instruction Decode/Rename Dispatch
  • 12. DATA-CENTER-SCALE COMPUTE ENGINE Implicitly Extract Task-Level Parallelism Overlap Memory Latency and Computation Manage Coherence of Memory Hierarchy Dynamic Scheduling [Chart: CFD Python (Weak Scaling); y-axis: Time (seconds), 0 to 150; x-axis: Relative Dataset Size | # of GPUs, 1 to 1024] Retire Reorder Buffer Instruction Buffer In-Order Out-Of-Order Branch Fetch Task Decode/Rename Dispatch CPUs GPUs NICs In-Order
  • 14.
  • 15. SPEECH NLU RECOMMENDER DIALOG MANAGER NV AVATAR COMPUTER VISION CONVERSATIONAL AVATAR Camera In Mic In/Out Graphics / Video Out
  • 16. 400G NDR InfiniBand Cloud-Native Supercomputing ANNOUNCING NVIDIA QUANTUM-2 Multi-Tenant Bare-Metal Secure Performance Isolation Congestion Control SHARP Gen 3 In-Network Computing Precision Timing
  • 17. 400G NDR InfiniBand Cloud-Native Supercomputing ANNOUNCING NVIDIA QUANTUM-2 QUANTUM-2 SWITCH 57 Billion Transistors TSMC 7N Optimized Multi-Tenant In-Network Computing 64-Ports of 400 Gbps or 128-Ports of 200 Gbps 3X Higher Switching Throughput | 32X More AI Acceleration Engines Sampling Now
  • 18. 400G NDR InfiniBand Cloud-Native Supercomputing ANNOUNCING NVIDIA QUANTUM-2 CONNECTX-7 INFINIBAND 8 Billion Transistors TSMC 7N 16 Core / 256 Threads Datapath Accelerator | 400 Gbps Crypto Accelerations 4X In-Network Computing Performance | 2X GPUDirect Throughput Sampling Jan ‘22
  • 19. 400G NDR InfiniBand Cloud-Native Supercomputing ANNOUNCING NVIDIA QUANTUM-2 BLUEFIELD-3 INFINIBAND 22 Billion Transistors TSMC 7N 16 Arm 64-Bit Cores 16 Core / 256 Threads Datapath Accelerator | 400 Gbps Crypto Accelerations 4X In-Network Computing Performance | 2X GPUDirect Throughput Sampling May ‘22
  • 20. 400G NDR InfiniBand Cloud-Native Supercomputing ANNOUNCING NVIDIA QUANTUM-2 2X Data Throughput 400 Gbps 4X MPI Performance All-2-All Acceleration 5X Switch Capacity >1.6 Petabps 2048 Ports 6.5X Higher Scalability >1M Nodes in 3 Hops 32X AI Accelerators SHARP v3
  • 21. 400G NDR InfiniBand Cloud-Native Supercomputing ANNOUNCING NVIDIA QUANTUM-2 2X Data Throughput 400 Gbps 4X MPI Performance All-2-All Acceleration 5X Switch Capacity >1.6 Petabps 2048 Ports 6.5X Higher Scalability >1M Nodes in 3 Hops 32X AI Accelerators SHARP v3
  • 22. SOFTWARE DEFINED CLOUD-NATIVE DISAGGREGATED COMPUTING SCALE UP & SCALE OUT ZERO-TRUST DOCA 1.0 Accelerated Secure Bare-Metal Cloud Crypto Storage Acceleration Software-Defined Networking De/Compression Congestion Control RegEx NVIDIA DOCA 1.0 Accelerated Data Center Infrastructure Offload, Accelerate, and Isolate Data Center Infrastructure with Accelerated Networking, Security, Storage, and Management Applications NVIDIA BlueField DPU
  • 23. 1400 DOCA Developers 108 New DOCA APIs Deep Packet Inspection Intrusion Detection Load Balancers Telemetry Security Groups Firewall DOCA 1.2 Zero-Trust Security Framework Service Containers Crypto Storage Acceleration Software-Defined Networking De/Compression Congestion Control RegEx ANNOUNCING NVIDIA DOCA 1.2 Accelerated Data Center Infrastructure NEW Zero-Trust Security Framework Extend Threat Protection to Every Touch Point Hardware & Software Authentication, Line-Rate Data Encryption, Distributed Firewall, and Smart Telemetry Security Groups and Virtual Private Cloud Isolation NVIDIA BlueField DPU
  • 24. Deep Packet Inspection Intrusion Detection Load Balancers Telemetry Security Groups Firewall DOCA 1.2 Zero-Trust Security Framework Service Containers Crypto Storage Acceleration Software-Defined Networking De/Compression Congestion Control RegEx NVIDIA BlueField DPU ANNOUNCING CYBERSECURITY LEADERS EXTEND ZERO-TRUST WITH NVIDIA BLUEFIELD Deploy Security-as-a-Service with DOCA 1.2 Extend Security from Perimeter to Edge Security Processing on BlueField Offload CPU Burden
  • 25. EXPANDING BLUEFIELD ECOSYSTEM CLOUD CYBERSECURITY STORAGE EDGE PLATFORM
  • 26. Anomaly Detection Machine Human Machine Machine Machine Machine Human Humans and Machines Across the Enterprise Post-Processing Pre-Processing Inference Requests Apache Kafka TRITON Log Data NVIDIA MORPHEUS ANNOUNCING NVIDIA MORPHEUS Accelerated AI Platform for Next Gen SIEM Built on NVIDIA RAPIDS and NVIDIA AI 600X Faster Data Processing – Monitor Every User and Machine-Generated Data for Anomalous Behavior Detect Anomalies with 10s of Millions of AI Models in Real-Time Pre-Trained Models for User Activity Fingerprinting and Phishing Detection Early Access 2 Available Now nvidia.com/morpheus
  • 27.
  • 29. Accelerated Computing Data Center Scale AI MILLION-X LEAP [Chart: 1980 to 2020, log scale 10^1 to 10^9; Single-threaded perf: 1.5X per year, 1.1X per year] MACHINE LEARNING SCALE UP & OUT ACCELERATED COMPUTING
  • 30. PHYSICS-ML TURBOCHARGES SCIENCE EXPLOSION IN HPC + AI RESEARCH [Chart: # ML+Science Papers in ArXiv, 0 to 6000, 2015 to 2020]
  • 31. MILLION-X DRUG DISCOVERY Refinement in Simulation Docking & Virtual Screening Physics-Based Simulation Imaging & Crystallography Exhaustive Search BINDING FREE ENERGY CHEMICAL COMPOUNDS PROTEIN STRUCTURE OF DISEASE TARGET
  • 32. MILLION-X DRUG DISCOVERY SMNPPPPETSNPNKPKRQTNQLQYL LRVVLKTLWKHQFAWPFQQPVDAV KLNLPDYYKIIKTPMDMGTIKKRLEN NYYWNAQECIQDFNTMFTNCYIYNK PGDDIVZRS HQFAWPFQQPVDAVKLNL QTNQLQYLLRVVLKTLWR Structure Prediction Hybrid Docking Machine-Learned Simulation Accelerated Sequencing Generative Search & Synthesis BINDING FREE ENERGY CHEMICAL COMPOUNDS PROTEIN STRUCTURE OF DISEASE TARGET
  • 33. MILLION-X DRUG DISCOVERY [Chart: Number of Available Structures, 1.00E+03 to 1.00E+13, 1995 to 2025; labels: Generative Models, AlphaFold, Known Chemicals, Structured Proteins]
  • 34. ENTOS TRANSCENDS SIMULATION WITH MACHINE LEARNED POTENTIALS Quantum Accuracy 1,000X Faster than DFT Reactive
  • 35. MILLION-X CLIMATE SCIENCE [Chart: RESOLUTION, 1000km to 1m, 1980 to 2060; IPCC reports AR1 to AR6; bands: CONVECTION RESOLVING, STORM RESOLVING, STRATOCUMULUS RESOLVING] 1km at 1min (1X COMPUTE) 100m at 1s (10,000X COMPUTE) 1m at 0.01s (100 BILLION X COMPUTE) Figure adapted from: Schneider, T., Teixeira, J., Bretherton, C. et al. Climate goals and computing the future of clouds. Nature Clim Change 7, 3–5 (2017). https://doi.org/10.1038/nclimate3190
  • 36. ANNOUNCING NVIDIA MODULUS Physics-ML Neural Simulation Framework Framework for Developing Physics-ML Models Train Physics-ML Models Using Governing Physics, Simulation, and Observed Data Multi-GPU, Multi-Node Training 1,000-100,000X Speed Models – Ideal for Digital Twins SymPy Equation Model Library (SiREN, PINO, PINN, MESHFREE) Multi-Node Multi-GPU Training Engine Numerical Optimization Plans Geometry ICs & BCs Observations Computational Graph Compiler Available Now developer.nvidia.com/modulus
  • 37. EARTH DIGITAL TWIN IN OMNIVERSE ERA5 ECMWF Atmospheric Winds & Geopotential 10 TB | 30km | 5 Atmos Layers RAPIDS 100,000X Speed-Up 0.25 Seconds for 7-Day Forecast Training: 4 Hours on 128 A100 GPUs Modulus Omniverse Fourier Neural Operator AI EARTH EMULATOR Satellite Ocean Ecosystem Atmosphere Extreme Weather Prediction Wind Energy Forecasting Disaster Mitigation
  • 38.
  • 43. OMNIVERSE FOR DESIGN COLLABORATION Designer #1’s World Designer #3’s World Internet Physical World Designer #2’s World Shared World Path Tracing MDL Physics AI
  • 44. OMNIVERSE FOR DIGITAL TWIN Factory Designer Robot Gym Internet Physical World Robot Gym Factory Digital Twin Path Tracing MDL Physics AI
  • 45.
  • 46. ANNOUNCING NEW OMNIVERSE FEATURES SHOWROOM Available in Beta FARM Available in Beta AR Available in Beta VR Coming Soon
  • 48. ANNOUNCING EARLY ACCESS BENTLEY ITWIN FOR NVIDIA OMNIVERSE Physically-Accurate 4D Visualization of Infrastructure Digital Twins Supports 4D Design Review and Construction Simulation Early Access Now Available bentley.com/4DVisualization
  • 49. SIEMENS ENERGY BUILDS HRSG DIGITAL TWIN IN OMNIVERSE Boiler Design Flow Simulation Internet Physical World Training Data Plant Digital Twin Path Tracing MDL Physics AI
  • 50.
  • 51. BMW GROUP BUILDS FACTORY DIGITAL TWINS IN OMNIVERSE Factory Planning Robot Gym Internet Physical World Digital Human Training Factory Digital Twin Path Tracing MDL Physics AI
  • 52.
  • 53. ERICSSON BUILDS CITY DIGITAL TWIN IN OMNIVERSE City Planning Simulation Internet Physical World Materials 5G Network Digital Twin Path Tracing MDL Physics AI
  • 54.
  • 55. AI
  • 56. GRAPH NEURAL NETWORKS CAPTURE INSIGHTS FROM INTERCONNECTED DATA 90B Relationships in a Social Network 1.1B Transactions a Day 40B Molecules in Chemical Databases DRUG DISCOVERY SOCIAL CONNECTIONS | FRAUD DETECTION
  • 57. ANNOUNCING DGL ACCELERATION WITH CUDA-X GPU-Accelerated GNN Workflow GNN for Molecule Reaction Prediction, Node Classification, Knowledge Graphs, and Model Explainability CUDA-Optimized Reference Examples for SE3-Transformer, R-GCN, and GraphSage Early Access in December ngc.nvidia.com cuDF cuGraph DGL with CUDA-X Text Images Relationships Sub-Graph Construction Graph to GNN Graph Construction
  • 58. PROCESSING THE LARGEST GRAPH NEURAL NETWORKS 300B pins in graphs with billions of nodes and billions of edges 5X faster training on graphs with over 10M nodes and over 100M edges from 24hrs to 5hrs Improving fraud detection over billions of transactions
  • 59. SELF-SUPERVISED LEARNING ARRIVES WITH TRANSFORMERS [Chart: Training Compute (PetaFLOPS), log scale 100 to 10,000,000,000, 2012 to 2022; models: AlexNet, VGG-19, Seq2Seq, Resnet, InceptionV3, Xception, ResNeXt, DenseNet201, Transformer, ELMo, GPT-1, BERT Large, Megatron, Microsoft T-NLG, GPT-3, Megatron-Turing NLG 530B, MoCo ResNet50, XLNet, Wav2Vec 2.0; trends: Transformer: 275x / 2yrs, All AI Models: 25x / 2yrs, Moore's Law: 2x / 2yrs]
  • 60. ANNOUNCING NVIDIA NEMO MEGATRON Megatron 530B NEMO MEGATRON Automated Data Curation Distributed Training Customer’s Data Custom Megatron 530B
  • 61. ANNOUNCING NVIDIA TRITON MULTI-GPU MULTI-NODE INFERENCE Triton Inference Server Magnum IO (Multi-GPU, Multi-Node) Query Response Application Response Time 1/2 Second 2 DGX A100 Response Time > 1 Minute Dual Socket CPU Server
  • 62. JAPANESE Global Ecommerce ENTERPRISE DIGITAL WORKFLOWS SWEDISH Finance, Healthcare, and Manufacturing PORTUGUESE AI Assistant CHINESE Customer Service VIETNAMESE Radiologists and Telehealth AI Service INTENT RECOGNITION, TRANSLATION, AND CHAT KOREAN Chatbots and Call Centers LARGE LANGUAGE MODELS TAKING HPC MAINSTREAM
  • 63. THANK YOU TO THOSE PRESENTING AT GTC! RETAIL CLOUD ENTERPRISE SaaS TELECOMMUNICATIONS INDUSTRIAL HEALTHCARE CONSUMER INTERNET AUTO
  • 64. LEADING COMPANIES RUNNING NVIDIA AI RETAIL CLOUD ENTERPRISE SaaS TELECOMMUNICATIONS INDUSTRIAL HEALTHCARE CONSUMER INTERNET AUTO
  • 65. ANNOUNCING MICROSOFT TEAMS ADOPTS NVIDIA AI WITH AZURE COGNITIVE SERVICES Nearly 250 Million Monthly Active Microsoft Teams Users Live Transcription and Captions in 28 Languages Triton Inference Server in Azure Cognitive Services
  • 66. AI INFERENCE IS HARD PROCESSORS AI Inference DEPLOYMENT PLATFORMS Cloud On-Prem Edge Embedded T4 GPU Arm CPU A100 GPU V100 GPU x86 CPU FRAMEWORKS APP CONSTRAINTS Real Time Batch Streaming MODELS CNN GNN Decision Trees RNN Transformers
  • 67. ANNOUNCING TENSORRT INTEGRATED WITH PYTORCH AND TENSORFLOW Accelerate In-Framework Inference with TensorRT with Just 1 Line of Code 3X Faster Supports Every Workload FP32, TF32, FP16, INT8 Available for Download Today ngc.nvidia.com
  • 68. ANNOUNCING NVIDIA TRITON WITH FOREST INFERENCING ML and DL in One Application Tree Models (XGBoost, Random Forest, LightGBM) Are Ubiquitous Large Tree Ensembles Push CPUs Beyond Response-Time Limits Ensembling with ML, DL, and More Complex Models is the Future with Fraud Detection 0% 3.5 ms 0 ms 1.5 ms Detection Rate Max Response Time 80% Transaction > $50? Time after midnight? XGBOOST MODEL
  • 69. ANNOUNCING NVIDIA TRITON WITH FOREST INFERENCING ML and DL in One Application Tree Models (XGBoost, Random Forest, LightGBM) Are Ubiquitous Large Tree Ensembles Push CPUs Beyond Response-Time Limits Ensembling with ML, DL, and More Complex Models is the Future with Fraud Detection 7K Transaction > $50? Time after midnight? XGBOOST MODEL 7K 7K 7K GOAL Increase Detection Rate Within 1.5ms 3.5 ms 0 ms 80% 0% 1.5 ms Detection Rate Max Response Time
  • 70. ANNOUNCING NVIDIA TRITON WITH FOREST INFERENCING ML and DL in One Application Unified Deployment Engine for DL and ML Inference Random Forests, GBDTs, and Decision Trees Deploy on CPU and GPU Process Million+ Node Tree Models with Low-Latency Fraud Detection, Recommender Systems, Risk Assessment, and Predictive Maintenance Available for Download Today ngc.nvidia.com UNACCEPTABLE RESPONSE TIME 0 ms 3.5 ms 1M Nodes 7K GIANT XGBOOST MODEL 7K 7K 7K Detection Rate 80% 0% Max Response Time 1.5 ms 1M Nodes
  • 71. ANNOUNCING TRITON INFERENCE SERVER 2.15 NEW Integration into AWS SageMaker and AliCloud – Now All Major Frameworks, Major Clouds, and AI Platforms NEW Support for Arm – Now Inference on Every Generation of GPUs, x86 CPUs, and Arm NEW Model Analyzer Optimizes for App QoS Requirements NEW Forest Inference | NEW Distributed Multi-GPU, Multi-Node Inference Optimal Model Config TensorFlow PyTorch TensorRT RAPIDS OpenVINO ONNX RT Triton Inference Server Application Ampere x86 CPU Volta Arm CPU Turing QoS Requirements Triton Model Analyzer
  • 72. BOOST THROUGHPUT OF MODERN DATA CENTERS WITH NVIDIA TRITON TensorRT 1 TensorRT 4 TensorRT 8.2 ACCELERATES EVERY WORKLOAD WORLD-CLASS RESPONSE TIME AND THROUGHPUT 12X Recommenders < 1 sec 10X Reinforcement Learning 583X Speech Recognition < 100ms 36X Computer Vision < 7ms 178X Text-to-Speech < 100ms 21X NLP < 50ms CLASSIFICATION CLASSIFICATION DETECTION SEGMENTATION RECOMMENDERS CLASSIFICATION DETECTION SEGMENTATION RECOMMENDERS RL FRAUD DETECTION TEXT-TO-SPEECH SPEECH RECOGNITION SENTIMENT ANALYSIS TRANSLATION SENTENCE COMPLETION Q & A
  • 73. Software-Defined Real-Time Secure Hybrid Cloud Deploy & Orchestrate Fleet High-Speed IO Networking Storage Data Processing Signal & Image Processing DNN PINN Graphics Streaming RETAIL HEALTHCARE MANUFACTURING EDGE AI AUTOMATING EVERY INDUSTRY 15M Stores (CV, SpeechAI, NLU, RecSys) 160K Hospitals (CV, SpeechAI, NLU, Robotics) 10M Factories (CV, SpeechAI, NLU, Robotics) FAST FOOD 7M Restaurants (CV, SpeechAI, NLP)
  • 74. NVIDIA Unified Compute Framework NVIDIA Fleet Command NVIDIA UNIFIED COMPUTE FRAMEWORK FOR REAL-TIME EDGE APPLICATIONS RETAIL HEALTHCARE MANUFACTURING 15M Stores (CV, SpeechAI, NLU, RecSys) 160K Hospitals (CV, SpeechAI, NLU, Robotics) 10M Factories (CV, SpeechAI, NLU, Robotics) FAST FOOD 7M Restaurants (CV, SpeechAI, NLP)
  • 75. Fleet Command Aerial 5G NVIDIA AI NVIDIA Metropolis With Unified Computing Framework NVIDIA METROPOLIS AI EDGE 3rd Party L2+ 3rd Party 5G CORE NVIDIA L1 DATA CENTER COMMAND CENTER Video In DeepStream on EGX Triton on EGX Triton on EGX DeepStream on EGX Metadata Display DeepStream on EGX DeepStream on EGX MEDIA HANDLING DETECTION CLASSIFICATION SMOOTHING VISUALIZATION TRACKING
  • 76. Fleet Command Aerial 5G NVIDIA AI NVIDIA Metropolis With Unified Computing Framework MAVENIR L2+ MAVENIR 5G Core NVIDIA L1 Powered by NVIDIA Metropolis AI-on-5G Edge Platform ANNOUNCING MAVENIR MAVEDGE-AI Mavenir MAVedge-AI Application DATA CENTER COMMAND CENTER Video In DeepStream on EGX Triton on EGX Triton on EGX DeepStream on EGX Metadata Display DeepStream on EGX DeepStream on EGX MEDIA HANDLING DETECTION CLASSIFICATION SMOOTHING VISUALIZATION TRACKING
  • 77.
  • 78. Maxine Clara Metropolis Isaac Merlin Riva NVIDIA AI ECOSYSTEM NVIDIA AI NVIDIA BASE COMMAND NVIDIA FLEET COMMAND Morpheus
  • 79. Maxine Clara Metropolis Isaac Merlin Riva 1,000+ Partners Cloud to Core to Edge NVIDIA AI ECOSYSTEM NVIDIA AI NVIDIA BASE COMMAND NVIDIA FLEET COMMAND Morpheus NVIDIA PARTNER NETWORK MLOps | CLOUD ML PaaS ORCHESTRATORS 5G EDGE OEM & CLOUDS
  • 80.
  • 81.
  • 84. OMNIVERSE AVATAR Live Customer Support Web Customer Support Video Conference & Telepresence Games Robots OMNIVERSE AVATAR Realistic Imaginary Autonomous Teleoperated
  • 85. PROJECT MAXINE WITH OMNIVERSE AVATAR RIVA MEGATRON 530B MERLIN DIALOG MANAGER NV AVATAR NV CV Camera In Mic In/Out Graphics / Video Out
  • 86. MEGATRON 530B MERLIN DIALOG MANAGER RIVA Mic In/Out ANNOUNCING NVIDIA RIVA SPEECH AI World Class Quality and Response Time SDK to Customize for Use Case and Unique Voice for Brand Virtual Assistant Train New Voice with Only 30 Mins of Speech Data Human-Like Expressivity and Fine-Grained Control Deploy in Cloud, On-Prem, Edge, and Embedded Enterprise Support Available Q1 ‘22 Globally developer.nvidia.com/riva
  • 87. NVIDIA ADVANCES SPEECH AI TEXT-TO-SPEECH 12X HIGHER PERFORMANCE [Chart: Throughput, 0 to 600; V100, A100; Tacotron2 + WaveGlow, Fastpitch + HiFiGAN] SPEECH RECOGNITION 4X HIGHER ACCURACY [Chart: Error Rate, 0 to 50; DeepSpeech2, Jasper, Quartznet, Citrinet-1024; 2018 to 2021] WIDELY ADOPTED UCAAS | FINANCE | TELECOM | CONSUMER 70K Developers | 250K Downloads
  • 88.
  • 89. PROJECT TOKKIO WITH OMNIVERSE AVATAR Video Out Video In Audio In Audio Out Megatron 530B Triton MGMN on DGX ASR Triton on EGX DL FACE TRACKER DeepStream on EGX AUD2FACE OMNIVERSE Zero-Shot DM Triton on EGX Merlin Triton on EGX TTS Triton on EGX RIVA AVATAR SIMULATION OMNIVERSE
  • 90.
  • 91. PROJECT MAXINE WITH OMNIVERSE AVATAR Video Out Video In Audio In Audio Out with Translation AUDIO DENOISE Triton on EGX GAZE REPOSE DeepStream on EGX VID2FACE Triton on EGX RIVA SPEECH AI Triton on EGX AUD2FACE OMNIVERSE FACE TRACKER DeepStream on EGX POSE & MESH Triton on EGX
  • 92.
  • 94. STRYKER AIRO TruCT Intraoperative CT JOHNSON & JOHNSON AURIS Robotic Endoscopy MEDTRONIC HUGO Robotic-Assisted Surgery INTUITIVE SURGICAL ION Robotic-Assisted Lung Biopsy 2M Devices | 16K Companies | 10K Modalities MEDICAL INSTRUMENTS INTEGRATE AI AND ROBOTICS
  • 95. RENDERING OV on RTX DATA PROCESSING cuCIM on EGX ZERO-SHOT NLU DIALOG MANAGER Triton on EGX RIVA ASR Triton on EGX Audio Sensor SENSOR PROCESSING CUDA on EGX IMAGE PROCESSING Triton on EGX STREAM DISPLAY CloudXR on RTX PHYSICS PROCESSING CUDA on EGX ANNOUNCING NVIDIA CLARA HOLOSCAN AI COMPUTING INSTRUMENTS PLATFORM Stream-Computing Platform for High-Throughput Signal, Data, AI, and Graphics Processing Run in Data Center, Embedded Instruments, or Hybrid Remotely Update, Orchestrate, and Monitor Fleet Platform for SaaS Model Available November 15 developer.nvidia.com/clara-holoscan-sdk
  • 96. ANNOUNCING NVIDIA AGX ORIN Computational Sensing Instrument Platform NVIDIA A6000 NVIDIA ConnectX-7 NVIDIA Orin
  • 97. ANNOUNCING NVIDIA AGX ORIN Computational Sensing Instrument Platform NVIDIA A6000 NVIDIA ConnectX-7 NVIDIA Orin
  • 98.
  • 99. 700+ Companies NVIDIA ISAAC POWERING THE ROBOTICS REVOLUTION
  • 101. ANNOUNCING NVIDIA ISAAC ROS NEW Isaac ROS GEM Brings NVIDIA Robotics AI to ROS Community Accelerate ROS-Native Packages up to 10X Isaac Sim Out-of-Box ROS Support NEW Isaac Sim Replicator for Synthetic Data Generation Isaac Sim Replicator Omniverse Farm TAO ISAAC GEMS AUTO-LABELED SYNTHETIC DATA ISAAC SIM ISAAC ROS APPLICATION STEREO DEPTH SGM SEGMENTATION DNN HUMAN POSE ESTIMATION Download Isaac ROS GEMS developer.nvidia.com/isaac-ros-gems
  • 102. ANNOUNCING OMNIVERSE REPLICATOR FOR ISAAC SIM NEW Isaac ROS GEM Brings NVIDIA Robotics AI to ROS Community Accelerate ROS-Native Packages up to 10X Isaac Sim Out-of-Box ROS Support NEW Isaac Sim Replicator for Synthetic Data Generation
  • 103. GROWING FLEETS POWERED BY NVIDIA DRIVE R Auto
  • 105. TRANSFORM SURROUND 2D TO 4D WORLD MODEL
  • 106. Functionally Safe Production Ready AV Platform ANNOUNCING DRIVE HYPERION 8 GA Software Defined Vehicle Dual Orin X Standard Form Factor Production Sensor Set DriveWorks Acceleration Libraries DRIVE AV Software Tools for OEM Adaptation Available Now nvidia.com/drive-hyperion 2 x NVIDIA DRIVE Orin Systems-on-a-Chip (SoCs)
  • 109. DRIVE AV MAP Fleet & Survey Mapping Auto Map Generation Localization & Planning, Simulation, Digital Twin
  • 110.
  • 112. NVIDIA DRIVE CHAUFFEUR NVIDIA DRIVE CONCIERGE
  • 113. NVIDIA Accelerated Computing Full Stack, 3 Chips, Data Center Scale 30 Million CUDA Downloads 150 SDKs $100 Trillion Industry Served Nemo Megatron Triton ReOpt cuQuantum cuNumeric Morpheus Metropolis Clara Holoscan Isaac Maxine DRIVE RIVA DGL Modulus Omniverse Launchpad AGX Orin Hyperion 8 Quantum-2
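
Slides 9 to 12 describe cuNumeric as transparently accelerating NumPy workflows with "Zero Code Changes" and show a "CFD Python" weak-scaling chart. A minimal sketch of what that could look like in practice follows. It assumes cuNumeric is installed and importable as `cunumeric` with a NumPy-compatible interface, and falls back to plain NumPy otherwise. The Jacobi stencil, the function names `jacobi_step` and `solve`, and the grid sizes are illustrative choices, not taken from the deck.

```python
# Assumption: cuNumeric exposes a NumPy-compatible module named `cunumeric`.
# If it is not installed, plain NumPy is used so the sketch still runs.
try:
    import cunumeric as np
except ImportError:
    import numpy as np


def jacobi_step(grid):
    """One Jacobi sweep: each interior cell becomes the mean of its four neighbours."""
    new = grid.copy()  # boundary cells are carried over unchanged
    new[1:-1, 1:-1] = 0.25 * (
        grid[:-2, 1:-1] + grid[2:, 1:-1] + grid[1:-1, :-2] + grid[1:-1, 2:]
    )
    return new


def solve(n=32, iterations=200):
    """Relax an n x n grid with the top edge held at 1.0 and the other edges at 0.0."""
    grid = np.zeros((n, n))
    grid[0, :] = 1.0
    for _ in range(iterations):
        grid = jacobi_step(grid)
    return grid


if __name__ == "__main__":
    result = solve()
    # Mean interior value after relaxation (hypothetical diagnostic).
    print(float(result[1:-1, 1:-1].mean()))
```

The only line that differs from an ordinary NumPy script is the import. The array code itself is unchanged, which is the property the slides advertise. Whether a given workload actually speeds up depends on the installed runtime and hardware.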