Marv Wexler - Transform Your with AI.pdf

SOLTUIONSpeople, THINKubators, THINKathons
SOLTUIONSpeople, THINKubators, THINKathonsThinkubator 💡Innovation Experience Designer 💡 Linkedin's #1 Most Connected Innovator in The World à SOLTUIONSpeople, THINKubators, THINKathons
Confidential
Transform Your
Business With AI
Transform Your
Business With AI
AI Summit
Marv Wexler
GM Technical Services
September 21, 2023
AI Summit
Marv Wexler
GM Technical Services
September 21, 2023
Better Faster Greener™ © 2023 Supermicro
Confidential
Where are we on the AI journey ?
9/20/2023 Better Faster Greener™ © 2023 Supermicro
2
“Once a new technology rolls over you, if you're not part of the steamroller, you're
part of the road.” - Stewart Brand
Confidential
9/20/2023 Better Faster Greener™ © 2023 Supermicro
3
Current AI Trends
• Democratization of AI will continue
• AI is a fundamental differentiator for businesses
• Find deeper insights in data, real-time and at scale
-Else your competitors surely will
• Generative AI is becoming commercialized
• AI ethics a top priority
• Biased algorithms, Deep fakes, “Hallucinations” as a
feature
• Generative AI applications reign : Microsoft (Designer),
Adobe (Firefly), Meta (Ad creation)
• New regulations for safe and responsible practices
• EU AI Act: Set of new rules that establish obligations for risks
from artificial intelligence
Confidential
AI Applications
9/20/2023 Better Faster Greener™ © 2023 Supermicro
4
Deep Learning
Solving complex
problems
Computer model taught to
learn actions using images,
texts and sounds
Machine Learning
Machines making
decisions
Building Machines with
predictive algorithm and
create predictive models
Artificial Intelligence
Simulate intelligence
Building Smart Machines
capable of performing
intelligent tasks
Confidential
9/20/2023 Better Faster Greener™ © 2023 Supermicro
5
Text
Image
Audio
Video
Games
Text/ Voice prompt
Generative AI models
(also Large Language
LLM, or Foundational
Models)
User Input
What is Generative AI?
Generative AI models are models that, when receiving a text prompt, give an output related to
that input. The output can be text, image, audio, video, code etc.
The ability for generative AI to produce useful, impressively synthesized text, images, and other types of content
almost effortlessly based on a few text cues has already become an important business capability worthy of
providing immense value to most knowledge workers
Confidential
The far-reaching impacts of Generative AI
9/20/2023 Better Faster Greener™ © 2023 Supermicro
6
Around 75% of the technology's value will be seen across four areas:
• customer operations
• marketing and sales
• software engineering
• research and development
automating conversations with customers
creating personalized messages for customers
generating code
generative design
Confidential
Customizable AI infrastructure for Generative AI
9/20/2023 Better Faster Greener™ © 2023 Supermicro
7
Training
•compute intensive
•massive datasets
involved
Fine-Tuning
•Requires relatively less
computational power
Inferencing
•Accelerators may be
needed depending on
type of application
(batch/real-time)
Various stages in building a
Generative AI Application
At Supermicro, We have you covered all the way with affordable, customizable
and scalable solutions
Confidential
Application Details
9/20/2023 Better Faster Greener™ © 2023 Supermicro
8
LangChain Instructor Embeddings WizardLM / LLAMA
• Ask questions to your documents AND learn from your documents using the
power of LLMs.
• 100% private, no data leaves your execution environment at any point.
• You can ingest documents and ask questions without an internet connection!
localGPT
BUILT WITH
• Text pre processed
into chunks
• Embedded in a
vector space
• Query search for
similar chunks
An instruction-finetuned text
embedding model that can generate
text embeddings tailored to any
task by simply providing the task
instruction, without any finetuning.
Instructor achieves SOTA on 70
diverse embedding tasks!
(e.g., classification, retrieval,
clustering, text evaluation, etc.) and
domains (e.g., science, finance, etc.)
• WizardLM is a Llama variant
trained with
complex instructions
• Evol-Instruct which
leverages AI to
"evolve" instructions
Confidential
Application Details
9/20/2023 Better Faster Greener™ © 2023 Supermicro
9
Ingest.py
• uses LangChain tools to parse the document and create
embeddings locally using Instructor Embeddings
Chroma
vector store
• local vector database that stores the created
embeddings
Run_localGPT • uses local LLM to understand questions and create
answers.
Similarity
Search
• used to extract right piece of context
from the local vector store
Confidential
10
©2023 Supermicro
Large Scale AI Training
• Key Technologies
• NVIDIA HGX H100 SXM 8-GPU/4-GPU with 900GB/s NVLink interconnect
• Dedicated, lots of high performance, high bandwidth GPU memory - HBM3, HBM2e
• 400GbE networking (Ethernet or InfiniBand), PCIe 5.0 storage for fast AI data pipe
• NVIDIA GPUDirect RDMA and Storage to keep feeding data to GPUs with minimum latency
• Liquid cooling for GPUs and CPUs
• All-flash storage and file systems to support petabytes of hot-tier data cache
• NVIDIA HGX H100 SXM5
board with 4- GPU or 8-
GPU
• NVLink and NVSwitch
• 80GB HBM3 per GPU
• Up to 700W TDP
• NVIDIA ConnectX-7
• Up to 400GbE or 400G NDR InfiniBand
• x16/x32 PCIe 5.0
Confidential
Supermicro AI
Experience
Supermicro AI
Experience
Marv Wexler
August 2023
Marv Wexler
August 2023
Better Faster Greener™ © 2023 Supermicro
Confidential
9/20/2023 Better Faster Greener™ © 2023 Supermicro
12
Confidential
Evolving to an AI / Total IT Solutions Partner
9/20/2023 Better Faster Greener™ © 2022 Supermicro
13
 5S: Software, Services,
Switch, Storage, Security
and more
 Total Solutions: Enterprise,
OEM- Appliance / Cloud
 Complete Systems
 Sub-systems and
Components
~5X+ Faster growth rate than the
industry avg rate over the past 12+
months (~50% YoY)
~5X+ Faster growth rate than the
industry avg rate over the past 12+
months (~50% YoY)
Our Momentum:
SMCI 1.0
Components &
Subsystems
SMCI 2.0
Servers &
Storage Systems
SMCI 3.0
Total IT
Solutions
Today
1993
$5B
$10B
Confidential
SMCI AI Strategy
9/20/2023 Better Faster Greener™ © 2023 Supermicro
14
• Partner with the Leaders
• Provide the best picks and shovels for the gold miners (Apps, YOU)
• Do not be religious with Products Offerings (multi-vendor, multi-platform)
Confidential
SMCI AI Business Results
9/20/2023 Better Faster Greener™ © 2023 Supermicro
15
• Bring up platform partner for virtually all AI Solutions / GPU offerings
• Lead supplier for virtually all Large Language Model Cloud Deployments
(ChatGPT, BARD, Bing, etc.)
The Next Platform, August 16, 2023
Confidential
16
©2023 Supermicro
GPU Optimized Systems by Workloads
• Large Scale AI Training • HPC/AI Workloads
H100 PCIe
Grace Hopper Superchip (Grace
CPU + H100 GPU)
H100 NVL
HGX H100 SXM
8-GPU or 4-GPU
4U 4-GPU System (HGX H100 SXM)
(codenamed: Redstone-Next)
SYS-421GU-TNXR, SYS-521GU-TNXR
8U 8-GPU System (HGX H100 SXM)
(codenamed: Delta-Next)
SYS-821GE-TNHR, AS -8125GS-TNHR
4U 4-GPU System (HGX H100 SXM)
SYS-421GU-TNXR
4U/5U 8-10 GPU System
SYS-521GE-TNRT, SYS-421GE-TNRT/TNRT3
AS -4125GS-TNRT/TNRT1/TNRT2
1U Grace Hopper MGX System
SYS-421GU-TNXR / SYS-521GU-TNXR
8U SuperBlade (Up to 20 nodes)
SBI-411E-1G / SBI-411E-5G
Petabyte Scale All-Flash Storage
SSG-121E-NE316R, ASG-1115S-NE316R
Confidential
Scales to thousands of nodes in 32-node increments
(SRS-42UHPC-32SU-01)
Accelerate AI Development by Supermicro
Supermicro 8U Delta-Next (SYS-821GE-TNHR)
A Proven Platform, Purpose Built for AI
H100 SXM5 GPU ConnectX-7 SmartNICs
H100 Rack Scale SuperPod Scalable Unit
8x NVIDIA H100 SXM5 GPUs | 640GB HBM3 GPU Memory 2TB
System Memory | 3.2Tbps Network B/W | Superior I/O
32x HGX H100 | 1+ EFLOPS AI | 20TB HBM3 GPU Memory 102.4Tbps
Network B/W Non-blocking | InfiniBand NDR
Software: NVIDIA BCM | NGC | NVAIE | SLURM | Kubernetes
Full Turnkey AI Supercomputer for Enterprises
9/20/2023 Better Faster Greener™ © 2023 Supermicro
17
Confidential
Supermicro Rack Integration Services
• Full rack integration up to L11 and L12
• Broad portfolio of compute, power, cooling
and networking options
• Liquid cooling integration
• Cooling Distribution Unit (CDU)
• Direct to Chip cold plate
• Manifold and tubing
• Design, assembly, configuration, testing
and deployment
• Start running applications from Day 1
Confidential
Supermicro CDU
80kW to 120kW, 45°C Warm Water
Liquid Cooling Option for Rack Scale H100 SuperPods
9/20/2023 Better Faster Greener™ © 2023 Supermicro
19
Confidential
Onsite Rack Services
9/20/2023 Better Faster Greener™ © 2023 Supermicro
20
Simplifying Your Solution Deployment Needs
• White glove custom service from beginning to end
• Onsite rack & stack of the custom solution
• Onsite integration ensuring proper installation and
connectivity, providing for reliable operation and reduced
downtime
• Onsite software installation with application configurations
• Onsite benchmark testing ensuring solution meets the
requirements of the customer
• Delivery of a customized rack solution that meets all
requirements
• SMC Cooling tower product line is available to enable
facility level water connections for CDU/CDM/RDHX
Reliable – Repeatable – Reproducible
Confidential
DISCLAIMER
Super Micro Computer, Inc. may make changes to specifications and product descriptions at any time, without notice. The
information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions
and typographical errors. Any performance tests and ratings are measured using systems that reflect the approximate
performance of Super Micro Computer, Inc. products as measured by those tests. Any differences in software or hardware
configuration may affect actual performance, and Super Micro Computer, Inc. does not control the design or implementation of
third party benchmarks or websites referenced in this document. The information contained herein is subject to change and may
be rendered inaccurate for many reasons, including but not limited to any changes in product and/or roadmap, component and
hardware revision changes, new model and/or product releases, software changes, firmware changes, or the like. Super Micro
Computer, Inc. assumes no obligation to update or otherwise correct or revise this information.
SUPER MICRO COMPUTER, INC. MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE
CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT
MAY APPEAR IN THIS INFORMATION.
SUPER MICRO COMPUTER, INC. SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR
FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL SUPER MICRO COMPUTER, INC. BE LIABLE TO ANY
PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF
ANY INFORMATION CONTAINED HEREIN, EVEN IF SUPER MICRO COMPUTER, Inc. IS EXPRESSLY ADVISED OF THE
POSSIBILITY OF SUCH DAMAGES.
ATTRIBUTION
© 2023 Super Micro Computer, Inc. All rights reserved.
9/20/2023 Better Faster Greener™ © 2023 Supermicro
21
Confidential
www.supermicro.com
1 sur 22

Recommandé

Steve Cunningham - AI Innovation Summit.pdf par
Steve Cunningham - AI Innovation Summit.pdfSteve Cunningham - AI Innovation Summit.pdf
Steve Cunningham - AI Innovation Summit.pdfSOLTUIONSpeople, THINKubators, THINKathons
243 vues20 diapositives
Theresa Fesinstine - AI Forward.pdf par
Theresa Fesinstine - AI Forward.pdfTheresa Fesinstine - AI Forward.pdf
Theresa Fesinstine - AI Forward.pdfSOLTUIONSpeople, THINKubators, THINKathons
344 vues20 diapositives
Andy Roy - Conversational AI - Why We Must Build.pdf par
Andy Roy - Conversational AI - Why We Must Build.pdfAndy Roy - Conversational AI - Why We Must Build.pdf
Andy Roy - Conversational AI - Why We Must Build.pdfSOLTUIONSpeople, THINKubators, THINKathons
225 vues20 diapositives
James Feldman - AII Powered Business Tools.pdf par
James Feldman - AII Powered Business Tools.pdfJames Feldman - AII Powered Business Tools.pdf
James Feldman - AII Powered Business Tools.pdfSOLTUIONSpeople, THINKubators, THINKathons
251 vues19 diapositives
Nils Vesk - Building an Innovative, Productive, AI empowered Culture.pdf par
Nils Vesk - Building an Innovative, Productive, AI empowered Culture.pdfNils Vesk - Building an Innovative, Productive, AI empowered Culture.pdf
Nils Vesk - Building an Innovative, Productive, AI empowered Culture.pdfSOLTUIONSpeople, THINKubators, THINKathons
404 vues20 diapositives
Amir Feizpour - Knowledge-Ops and LLMs.pdf par
Amir Feizpour - Knowledge-Ops and LLMs.pdfAmir Feizpour - Knowledge-Ops and LLMs.pdf
Amir Feizpour - Knowledge-Ops and LLMs.pdfSOLTUIONSpeople, THINKubators, THINKathons
298 vues20 diapositives

Contenu connexe

Tendances

Matt Lewis - The Hardest Thing-Final to Host.pdf par
Matt Lewis - The Hardest Thing-Final to Host.pdfMatt Lewis - The Hardest Thing-Final to Host.pdf
Matt Lewis - The Hardest Thing-Final to Host.pdfSOLTUIONSpeople, THINKubators, THINKathons
396 vues20 diapositives
Dr. Nassim Belbaly - Decision Markin pai summit 3v2.pdf par
Dr. Nassim Belbaly - Decision Markin pai summit 3v2.pdfDr. Nassim Belbaly - Decision Markin pai summit 3v2.pdf
Dr. Nassim Belbaly - Decision Markin pai summit 3v2.pdfSOLTUIONSpeople, THINKubators, THINKathons
336 vues19 diapositives
Terry Proto - AI Accelerates XR.pdf par
Terry Proto - AI Accelerates XR.pdfTerry Proto - AI Accelerates XR.pdf
Terry Proto - AI Accelerates XR.pdfSOLTUIONSpeople, THINKubators, THINKathons
285 vues19 diapositives
Ashen Bhatti - How I Build Companies with LLM.pdf par
Ashen Bhatti - How I Build Companies with LLM.pdfAshen Bhatti - How I Build Companies with LLM.pdf
Ashen Bhatti - How I Build Companies with LLM.pdfSOLTUIONSpeople, THINKubators, THINKathons
211 vues20 diapositives
Neha Shukla - Future of the AI Revolution - Building Ethical and Equitable T... par
 Neha Shukla - Future of the AI Revolution - Building Ethical and Equitable T... Neha Shukla - Future of the AI Revolution - Building Ethical and Equitable T...
Neha Shukla - Future of the AI Revolution - Building Ethical and Equitable T...SOLTUIONSpeople, THINKubators, THINKathons
338 vues16 diapositives
Ben Bressington - Buy Back Your Time and Increase Profits - 1 AI Strategy You... par
Ben Bressington - Buy Back Your Time and Increase Profits - 1 AI Strategy You...Ben Bressington - Buy Back Your Time and Increase Profits - 1 AI Strategy You...
Ben Bressington - Buy Back Your Time and Increase Profits - 1 AI Strategy You...SOLTUIONSpeople, THINKubators, THINKathons
298 vues20 diapositives

Tendances(20)

Leveraging Generative AI & Best practices par DianaGray10
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray101.8K vues
How Does Generative AI Actually Work? (a quick semi-technical introduction to... par ssuser4edc93
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
ssuser4edc93982 vues
An Introduction to Generative AI - May 18, 2023 par CoriFaklaris1
An Introduction  to Generative AI - May 18, 2023An Introduction  to Generative AI - May 18, 2023
An Introduction to Generative AI - May 18, 2023
CoriFaklaris1958 vues
The Future is in Responsible Generative AI par Saeed Al Dhaheri
The Future is in Responsible Generative AIThe Future is in Responsible Generative AI
The Future is in Responsible Generative AI
Saeed Al Dhaheri681 vues
Generative-AI-in-enterprise-20230615.pdf par Liming Zhu
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdf
Liming Zhu866 vues
Unlocking the Power of Generative AI An Executive's Guide.pdf par PremNaraindas1
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas12.2K vues
Using the power of Generative AI at scale par Maxim Salnikov
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
Maxim Salnikov923 vues

Similaire à Marv Wexler - Transform Your with AI.pdf

Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable par
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableSupermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableRebekah Rodriguez
185 vues61 diapositives
Design - Changing Perceptions of Infrastructure as a Service par
Design - Changing Perceptions of Infrastructure as a ServiceDesign - Changing Perceptions of Infrastructure as a Service
Design - Changing Perceptions of Infrastructure as a ServiceLaurenWendler
111 vues13 diapositives
Accelerating Innovation from Edge to Cloud par
Accelerating Innovation from Edge to CloudAccelerating Innovation from Edge to Cloud
Accelerating Innovation from Edge to CloudRebekah Rodriguez
288 vues31 diapositives
SUPERMICRO Innovative Computing Architecture par
SUPERMICRO Innovative Computing ArchitectureSUPERMICRO Innovative Computing Architecture
SUPERMICRO Innovative Computing ArchitectureIntel IT Center
1K vues17 diapositives
How Cloud Providers are Playing with Traditional Data Center par
How Cloud Providers are Playing with Traditional Data CenterHow Cloud Providers are Playing with Traditional Data Center
How Cloud Providers are Playing with Traditional Data CenterHostway|HOSTING
442 vues34 diapositives
Cimteq CableBuilder Go par
Cimteq CableBuilder GoCimteq CableBuilder Go
Cimteq CableBuilder GoCimteq
144 vues16 diapositives

Similaire à Marv Wexler - Transform Your with AI.pdf(20)

Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable par Rebekah Rodriguez
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super AffordableSupermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Supermicro AI Pod that’s Super Simple, Super Scalable, and Super Affordable
Design - Changing Perceptions of Infrastructure as a Service par LaurenWendler
Design - Changing Perceptions of Infrastructure as a ServiceDesign - Changing Perceptions of Infrastructure as a Service
Design - Changing Perceptions of Infrastructure as a Service
LaurenWendler111 vues
SUPERMICRO Innovative Computing Architecture par Intel IT Center
SUPERMICRO Innovative Computing ArchitectureSUPERMICRO Innovative Computing Architecture
SUPERMICRO Innovative Computing Architecture
Intel IT Center1K vues
How Cloud Providers are Playing with Traditional Data Center par Hostway|HOSTING
How Cloud Providers are Playing with Traditional Data CenterHow Cloud Providers are Playing with Traditional Data Center
How Cloud Providers are Playing with Traditional Data Center
Hostway|HOSTING442 vues
Cimteq CableBuilder Go par Cimteq
Cimteq CableBuilder GoCimteq CableBuilder Go
Cimteq CableBuilder Go
Cimteq144 vues
Webinar: Microprocessadores 32 bits, suas principais aplicações no mercado br... par Embarcados
Webinar: Microprocessadores 32 bits, suas principais aplicações no mercado br...Webinar: Microprocessadores 32 bits, suas principais aplicações no mercado br...
Webinar: Microprocessadores 32 bits, suas principais aplicações no mercado br...
Embarcados115 vues
Cisco connect montreal 2018 compute v final par Cisco Canada
Cisco connect montreal 2018   compute v finalCisco connect montreal 2018   compute v final
Cisco connect montreal 2018 compute v final
Cisco Canada1.6K vues
Building Efficient Edge Nodes for Content Delivery Networks par Rebekah Rodriguez
Building Efficient Edge Nodes for Content Delivery NetworksBuilding Efficient Edge Nodes for Content Delivery Networks
Building Efficient Edge Nodes for Content Delivery Networks
New high-density storage server - IBM System x3650 M4 HD par Cliff Kinard
New high-density storage server - IBM System x3650 M4 HDNew high-density storage server - IBM System x3650 M4 HD
New high-density storage server - IBM System x3650 M4 HD
Cliff Kinard4.3K vues
IBM SoftLayer - overview of Cloud Infrastructure par Avinaba Basu
IBM SoftLayer - overview of Cloud Infrastructure IBM SoftLayer - overview of Cloud Infrastructure
IBM SoftLayer - overview of Cloud Infrastructure
Avinaba Basu2.7K vues
What is ThousandEyes Webinar par ThousandEyes
What is ThousandEyes WebinarWhat is ThousandEyes Webinar
What is ThousandEyes Webinar
ThousandEyes62 vues
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck par IBM Events
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM Events4.3K vues
Adding Recurring Revenue with Cloud Computing ProfitBricks par ProfitBricks
Adding Recurring Revenue with Cloud Computing ProfitBricksAdding Recurring Revenue with Cloud Computing ProfitBricks
Adding Recurring Revenue with Cloud Computing ProfitBricks
ProfitBricks529 vues
Cloud computing case studies with ProfitBricks IaaS par ProfitBricks
Cloud computing case studies with ProfitBricks IaaSCloud computing case studies with ProfitBricks IaaS
Cloud computing case studies with ProfitBricks IaaS
ProfitBricks1.1K vues
ProfitBricks Cloud Computing IaaS An Introduction par ProfitBricks
ProfitBricks Cloud Computing IaaS An IntroductionProfitBricks Cloud Computing IaaS An Introduction
ProfitBricks Cloud Computing IaaS An Introduction
ProfitBricks1.1K vues
TechWiseTV Workshop: ASR 9000 par Robb Boyd
TechWiseTV Workshop: ASR 9000 TechWiseTV Workshop: ASR 9000
TechWiseTV Workshop: ASR 9000
Robb Boyd736 vues
Softlayer an IBM Compay . Connaissez vous le cloud de l'avenir par Patrick Bouillaud
Softlayer an IBM Compay . Connaissez vous le cloud de l'avenir Softlayer an IBM Compay . Connaissez vous le cloud de l'avenir
Softlayer an IBM Compay . Connaissez vous le cloud de l'avenir
Patrick Bouillaud4.1K vues

Plus de SOLTUIONSpeople, THINKubators, THINKathons

George Boretos & FutureUP-AI the big picture.pdf par
George Boretos & FutureUP-AI the big picture.pdfGeorge Boretos & FutureUP-AI the big picture.pdf
George Boretos & FutureUP-AI the big picture.pdfSOLTUIONSpeople, THINKubators, THINKathons
401 vues20 diapositives
Audrey Chia - Supercharge Your Growth.pdf par
Audrey Chia - Supercharge Your Growth.pdfAudrey Chia - Supercharge Your Growth.pdf
Audrey Chia - Supercharge Your Growth.pdfSOLTUIONSpeople, THINKubators, THINKathons
331 vues22 diapositives
Garima Gupta - How AI can Change your online learning experience.pdf par
Garima Gupta - How AI can Change your online learning experience.pdfGarima Gupta - How AI can Change your online learning experience.pdf
Garima Gupta - How AI can Change your online learning experience.pdfSOLTUIONSpeople, THINKubators, THINKathons
247 vues15 diapositives
Kai Wang - AI for Innovation1.1r.pdf par
Kai Wang - AI for Innovation1.1r.pdfKai Wang - AI for Innovation1.1r.pdf
Kai Wang - AI for Innovation1.1r.pdfSOLTUIONSpeople, THINKubators, THINKathons
287 vues20 diapositives
Lars Tvede - How We Boosted Total Productivity in Our Company by 20x w.pdf par
Lars Tvede - How We Boosted Total Productivity in Our Company by 20x w.pdfLars Tvede - How We Boosted Total Productivity in Our Company by 20x w.pdf
Lars Tvede - How We Boosted Total Productivity in Our Company by 20x w.pdfSOLTUIONSpeople, THINKubators, THINKathons
229 vues28 diapositives
Kelly Dowd - Leading Digital Transformation with AI and Human-Centered Design... par
Kelly Dowd - Leading Digital Transformation with AI and Human-Centered Design...Kelly Dowd - Leading Digital Transformation with AI and Human-Centered Design...
Kelly Dowd - Leading Digital Transformation with AI and Human-Centered Design...SOLTUIONSpeople, THINKubators, THINKathons
206 vues10 diapositives

Plus de SOLTUIONSpeople, THINKubators, THINKathons(11)

Dernier

auto dialer aegis.pdf par
auto dialer aegis.pdfauto dialer aegis.pdf
auto dialer aegis.pdfVoiceLogger1
17 vues1 diapositive
[1Slide] Event Report AWS ReInvent 2023 - Q stands out par
[1Slide] Event Report AWS ReInvent 2023 - Q stands out[1Slide] Event Report AWS ReInvent 2023 - Q stands out
[1Slide] Event Report AWS ReInvent 2023 - Q stands outHolger Mueller
15 vues1 diapositive
Navigating the Complexity of Derivatives Valuation 📈 par
Navigating the Complexity of Derivatives Valuation 📈Navigating the Complexity of Derivatives Valuation 📈
Navigating the Complexity of Derivatives Valuation 📈ValAdvisor
18 vues6 diapositives
AGAM COLLECTIONS.docx par
AGAM COLLECTIONS.docxAGAM COLLECTIONS.docx
AGAM COLLECTIONS.docxSimarpreetKaur198008
11 vues1 diapositive
VCOSA - VIETNAM COTTON - YARN MARKET REPORT - 11/2023 ISSUE par
VCOSA - VIETNAM COTTON - YARN MARKET REPORT - 11/2023 ISSUEVCOSA - VIETNAM COTTON - YARN MARKET REPORT - 11/2023 ISSUE
VCOSA - VIETNAM COTTON - YARN MARKET REPORT - 11/2023 ISSUEVietnam Cotton & Spinning Association
37 vues26 diapositives
Netflix Inc. par
Netflix Inc.Netflix Inc.
Netflix Inc.125071027
14 vues11 diapositives

Dernier(20)

[1Slide] Event Report AWS ReInvent 2023 - Q stands out par Holger Mueller
[1Slide] Event Report AWS ReInvent 2023 - Q stands out[1Slide] Event Report AWS ReInvent 2023 - Q stands out
[1Slide] Event Report AWS ReInvent 2023 - Q stands out
Holger Mueller15 vues
Navigating the Complexity of Derivatives Valuation 📈 par ValAdvisor
Navigating the Complexity of Derivatives Valuation 📈Navigating the Complexity of Derivatives Valuation 📈
Navigating the Complexity of Derivatives Valuation 📈
ValAdvisor18 vues
Better Appeals and Solicitations - Bloomerang.pdf par Bloomerang
Better Appeals and Solicitations - Bloomerang.pdfBetter Appeals and Solicitations - Bloomerang.pdf
Better Appeals and Solicitations - Bloomerang.pdf
Bloomerang121 vues
Irigoyen_231129 - Around the world in 5 questions.pdf par bradgallagher6
Irigoyen_231129 - Around the world in 5 questions.pdfIrigoyen_231129 - Around the world in 5 questions.pdf
Irigoyen_231129 - Around the world in 5 questions.pdf
bradgallagher617 vues
Learning from Failure_ Lessons from Failed Startups.pptx par Codeventures
Learning from Failure_ Lessons from Failed Startups.pptxLearning from Failure_ Lessons from Failed Startups.pptx
Learning from Failure_ Lessons from Failed Startups.pptx
Codeventures19 vues
Hoole_Summit 2023 - Opening Remarks.pptx par bradgallagher6
Hoole_Summit 2023 - Opening Remarks.pptxHoole_Summit 2023 - Opening Remarks.pptx
Hoole_Summit 2023 - Opening Remarks.pptx
bradgallagher614 vues
The Talent Management Navigator Performance Management par Seta Wicaksana
The Talent Management Navigator Performance ManagementThe Talent Management Navigator Performance Management
The Talent Management Navigator Performance Management
Seta Wicaksana40 vues
Navigating EUDR Compliance within the Coffee Industry par Peter Horsten
Navigating EUDR Compliance within the Coffee IndustryNavigating EUDR Compliance within the Coffee Industry
Navigating EUDR Compliance within the Coffee Industry
Peter Horsten82 vues
DEUTSER-03188 Salt Lake Nov 30 Talk with speaker notes[13].pptx par bradgallagher6
DEUTSER-03188 Salt Lake Nov 30 Talk with speaker notes[13].pptxDEUTSER-03188 Salt Lake Nov 30 Talk with speaker notes[13].pptx
DEUTSER-03188 Salt Lake Nov 30 Talk with speaker notes[13].pptx
bradgallagher631 vues

Marv Wexler - Transform Your with AI.pdf

  • 1. Confidential Transform Your Business With AI Transform Your Business With AI AI Summit Marv Wexler GM Technical Services September 21, 2023 AI Summit Marv Wexler GM Technical Services September 21, 2023 Better Faster Greener™ © 2023 Supermicro
  • 2. Confidential Where are we on the AI journey ? 9/20/2023 Better Faster Greener™ © 2023 Supermicro 2 “Once a new technology rolls over you, if you're not part of the steamroller, you're part of the road.” - Stewart Brand
  • 3. Confidential 9/20/2023 Better Faster Greener™ © 2023 Supermicro 3 Current AI Trends • Democratization of AI will continue • AI is a fundamental differentiator for businesses • Find deeper insights in data, real-time and at scale -Else your competitors surely will • Generative AI is becoming commercialized • AI ethics a top priority • Biased algorithms, Deep fakes, “Hallucinations” as a feature • Generative AI applications reign : Microsoft (Designer), Adobe (Firefly), Meta (Ad creation) • New regulations for safe and responsible practices • EU AI Act: Set of new rules that establish obligations for risks from artificial intelligence
  • 4. Confidential AI Applications 9/20/2023 Better Faster Greener™ © 2023 Supermicro 4 Deep Learning Solving complex problems Computer model taught to learn actions using images, texts and sounds Machine Learning Machines making decisions Building Machines with predictive algorithm and create predictive models Artificial Intelligence Simulate intelligence Building Smart Machines capable of performing intelligent tasks
  • 5. Confidential 9/20/2023 Better Faster Greener™ © 2023 Supermicro 5 Text Image Audio Video Games Text/ Voice prompt Generative AI models (also Large Language LLM, or Foundational Models) User Input What is Generative AI? Generative AI models are models that, when receiving a text prompt, give an output related to that input. The output can be text, image, audio, video, code etc. The ability for generative AI to produce useful, impressively synthesized text, images, and other types of content almost effortlessly based on a few text cues has already become an important business capability worthy of providing immense value to most knowledge workers
  • 6. Confidential The far-reaching impacts of Generative AI 9/20/2023 Better Faster Greener™ © 2023 Supermicro 6 Around 75% of the technology's value will be seen across four areas: • customer operations • marketing and sales • software engineering • research and development automating conversations with customers creating personalized messages for customers generating code generative design
  • 7. Confidential Customizable AI infrastructure for Generative AI 9/20/2023 Better Faster Greener™ © 2023 Supermicro 7 Training •compute intensive •massive datasets involved Fine-Tuning •Requires relatively less computational power Inferencing •Accelerators may be needed depending on type of application (batch/real-time) Various stages in building a Generative AI Application At Supermicro, We have you covered all the way with affordable, customizable and scalable solutions
  • 8. Confidential Application Details 9/20/2023 Better Faster Greener™ © 2023 Supermicro 8 LangChain Instructor Embeddings WizardLM / LLAMA • Ask questions to your documents AND learn from your documents using the power of LLMs. • 100% private, no data leaves your execution environment at any point. • You can ingest documents and ask questions without an internet connection! localGPT BUILT WITH • Text pre processed into chunks • Embedded in a vector space • Query search for similar chunks An instruction-finetuned text embedding model that can generate text embeddings tailored to any task by simply providing the task instruction, without any finetuning. Instructor achieves SOTA on 70 diverse embedding tasks! (e.g., classification, retrieval, clustering, text evaluation, etc.) and domains (e.g., science, finance, etc.) • WizardLM is a Llama variant trained with complex instructions • Evol-Instruct which leverages AI to "evolve" instructions
  • 9. Confidential Application Details 9/20/2023 Better Faster Greener™ © 2023 Supermicro 9 Ingest.py • uses LangChain tools to parse the document and create embeddings locally using Instructor Embeddings Chroma vector store • local vector database that stores the created embeddings Run_localGPT • uses local LLM to understand questions and create answers. Similarity Search • used to extract right piece of context from the local vector store
  • 10. Confidential 10 ©2023 Supermicro Large Scale AI Training • Key Technologies • NVIDIA HGX H100 SXM 8-GPU/4-GPU with 900GB/s NVLink interconnect • Dedicated, lots of high performance, high bandwidth GPU memory - HBM3, HBM2e • 400GbE networking (Ethernet or InfiniBand), PCIe 5.0 storage for fast AI data pipe • NVIDIA GPUDirect RDMA and Storage to keep feeding data to GPUs with minimum latency • Liquid cooling for GPUs and CPUs • All-flash storage and file systems to support petabytes of hot-tier data cache • NVIDIA HGX H100 SXM5 board with 4- GPU or 8- GPU • NVLink and NVSwitch • 80GB HBM3 per GPU • Up to 700W TDP • NVIDIA ConnectX-7 • Up to 400GbE or 400G NDR InfiniBand • x16/x32 PCIe 5.0
  • 11. Confidential Supermicro AI Experience Supermicro AI Experience Marv Wexler August 2023 Marv Wexler August 2023 Better Faster Greener™ © 2023 Supermicro
  • 12. Confidential 9/20/2023 Better Faster Greener™ © 2023 Supermicro 12
  • 13. Confidential Evolving to an AI / Total IT Solutions Partner 9/20/2023 Better Faster Greener™ © 2022 Supermicro 13  5S: Software, Services, Switch, Storage, Security and more  Total Solutions: Enterprise, OEM- Appliance / Cloud  Complete Systems  Sub-systems and Components ~5X+ Faster growth rate than the industry avg rate over the past 12+ months (~50% YoY) ~5X+ Faster growth rate than the industry avg rate over the past 12+ months (~50% YoY) Our Momentum: SMCI 1.0 Components & Subsystems SMCI 2.0 Servers & Storage Systems SMCI 3.0 Total IT Solutions Today 1993 $5B $10B
  • 14. Confidential SMCI AI Strategy 9/20/2023 Better Faster Greener™ © 2023 Supermicro 14 • Partner with the Leaders • Provide the best picks and shovels for the gold miners (Apps, YOU) • Do not be religious with Products Offerings (multi-vendor, multi-platform)
  • 15. Confidential SMCI AI Business Results 9/20/2023 Better Faster Greener™ © 2023 Supermicro 15 • Bring up platform partner for virtually all AI Solutions / GPU offerings • Lead supplier for virtually all Large Language Model Cloud Deployments (ChatGPT, BARD, Bing, etc.) The Next Platform, August 16, 2023
  • 16. Confidential 16 ©2023 Supermicro GPU Optimized Systems by Workloads • Large Scale AI Training • HPC/AI Workloads H100 PCIe Grace Hopper Superchip (Grace CPU + H100 GPU) H100 NVL HGX H100 SXM 8-GPU or 4-GPU 4U 4-GPU System (HGX H100 SXM) (codenamed: Redstone-Next) SYS-421GU-TNXR, SYS-521GU-TNXR 8U 8-GPU System (HGX H100 SXM) (codenamed: Delta-Next) SYS-821GE-TNHR, AS -8125GS-TNHR 4U 4-GPU System (HGX H100 SXM) SYS-421GU-TNXR 4U/5U 8-10 GPU System SYS-521GE-TNRT, SYS-421GE-TNRT/TNRT3 AS -4125GS-TNRT/TNRT1/TNRT2 1U Grace Hopper MGX System SYS-421GU-TNXR / SYS-521GU-TNXR 8U SuperBlade (Up to 20 nodes) SBI-411E-1G / SBI-411E-5G Petabyte Scale All-Flash Storage SSG-121E-NE316R, ASG-1115S-NE316R
  • 17. Confidential Scales to thousands of nodes in 32-node increments (SRS-42UHPC-32SU-01) Accelerate AI Development by Supermicro Supermicro 8U Delta-Next (SYS-821GE-TNHR) A Proven Platform, Purpose Built for AI H100 SXM5 GPU ConnectX-7 SmartNICs H100 Rack Scale SuperPod Scalable Unit 8x NVIDIA H100 SXM5 GPUs | 640GB HBM3 GPU Memory 2TB System Memory | 3.2Tbps Network B/W | Superior I/O 32x HGX H100 | 1+ EFLOPS AI | 20TB HBM3 GPU Memory 102.4Tbps Network B/W Non-blocking | InfiniBand NDR Software: NVIDIA BCM | NGC | NVAIE | SLURM | Kubernetes Full Turnkey AI Supercomputer for Enterprises 9/20/2023 Better Faster Greener™ © 2023 Supermicro 17
  • 18. Confidential Supermicro Rack Integration Services • Full rack integration up to L11 and L12 • Broad portfolio of compute, power, cooling and networking options • Liquid cooling integration • Cooling Distribution Unit (CDU) • Direct to Chip cold plate • Manifold and tubing • Design, assembly, configuration, testing and deployment • Start running applications from Day 1
  • 19. Confidential Supermicro CDU 80kW to 120kW, 45°C Warm Water Liquid Cooling Option for Rack Scale H100 SuperPods 9/20/2023 Better Faster Greener™ © 2023 Supermicro 19
  • 20. Confidential Onsite Rack Services 9/20/2023 Better Faster Greener™ © 2023 Supermicro 20 Simplifying Your Solution Deployment Needs • White glove custom service from beginning to end • Onsite rack & stack of the custom solution • Onsite integration ensuring proper installation and connectivity, providing for reliable operation and reduced downtime • Onsite software installation with application configurations • Onsite benchmark testing ensuring solution meets the requirements of the customer • Delivery of a customized rack solution that meets all requirements • SMC Cooling tower product line is available to enable facility level water connections for CDU/CDM/RDHX Reliable – Repeatable – Reproducible
  • 21. Confidential DISCLAIMER Super Micro Computer, Inc. may make changes to specifications and product descriptions at any time, without notice. The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. Any performance tests and ratings are measured using systems that reflect the approximate performance of Super Micro Computer, Inc. products as measured by those tests. Any differences in software or hardware configuration may affect actual performance, and Super Micro Computer, Inc. does not control the design or implementation of third party benchmarks or websites referenced in this document. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to any changes in product and/or roadmap, component and hardware revision changes, new model and/or product releases, software changes, firmware changes, or the like. Super Micro Computer, Inc. assumes no obligation to update or otherwise correct or revise this information. SUPER MICRO COMPUTER, INC. MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. SUPER MICRO COMPUTER, INC. SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL SUPER MICRO COMPUTER, INC. BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF SUPER MICRO COMPUTER, Inc. IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. ATTRIBUTION © 2023 Super Micro Computer, Inc. All rights reserved. 9/20/2023 Better Faster Greener™ © 2023 Supermicro 21