HKG18-300K2 - Keynote: Tomas Evensen - All Programmable SoCs? – Platforms to enable the future of Embedded Machine Learning

All Programmable SoCs?
Platforms to enable the future of Embedded Machine Learning
Linaro Connect – Hong Kong March 2018
Tomas Evensen
CTO Embedded Software, Xilinx
© Copyright 2018 Xilinx
CPU Architectures not Scaling with Workloads
Processor frequency scaling ended in 2007
Multicore architecture scaling has flattened
Workloads require higher performance, lower latency
– Cloud: video, big data, AI…
– Edge: auto, surveillance, AI…
Source: Andrew Danowitz, Kyle Kelley, James Mao, John P. Stevenson, Mark Horowitz, Communications of the ACM, Vol. 55 No. 4
Zynq UltraScale+ MPSoC
– Application Processor: 64-bit Dual/Quad-Core
– Real-Time Processors: 32-bit Dual-Core
– Platform & Power Management: Granular Power Control, Functional Safety
– Configuration & Security Unit: Anti-Tamper & Trust, Industry Standards
– Fabric Acceleration: Customizable Engines, High Speed Connectivity
– Video Codec: 8K4K (15fps), 4K2K (60fps)
– High Speed Peripherals: Key Interfaces
– Graphics Processor: ARM Mali-400MP2
– Memory Subsystem: High Bandwidth, Low Latency
FPGA: The “Chameleon” Chip
What is FPGA/Fabric/Programmable Logic:
Is it glue logic?
Is it a powerful parallel DSP engine?
Is it an RTL simulator?
Yes!!! And more…
FPGA includes:
Programmable logic (LUTs)
Hardened DSP blocks
Hardened memory (BRAM, URAM)
FPGAs are great to implement:
Parallel compute (e.g. MAC)
– With variable precision
Parallel, flexible dataflows
– Build your own buses
Flexible, multiport memory hierarchies
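
The "parallel compute (e.g. MAC) with variable precision" point can be illustrated with a small sketch. This is plain C++ written in an HLS-friendly style, not code from the talk: the function name `mac`, the helper `mac8_example`, and the choice of 8-bit inputs with a 32-bit accumulator are assumptions made for illustration. The functions are marked `constexpr` only so the result can be checked at compile time.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Multiply-accumulate over N element pairs.
// In and Acc are template parameters, so operand and accumulator widths can
// be chosen per use (the "variable precision" idea from the slide).
template <typename In, typename Acc, std::size_t N>
constexpr Acc mac(const std::array<In, N>& a, const std::array<In, N>& b) {
    Acc acc = 0;
    // The trip count is fixed at compile time. An HLS tool could unroll a
    // loop like this into N multipliers feeding an adder tree; tool-specific
    // directives are deliberately left out of this sketch.
    for (std::size_t i = 0; i < N; ++i) {
        acc += static_cast<Acc>(a[i]) * static_cast<Acc>(b[i]);
    }
    return acc;
}

// Hypothetical usage: 8-bit operands, 32-bit accumulator.
constexpr std::int32_t mac8_example() {
    const std::array<std::int8_t, 4> a{1, -2, 3, 4};
    const std::array<std::int8_t, 4> b{5, 6, -7, 8};
    return mac<std::int8_t, std::int32_t>(a, b);  // 5 - 12 - 21 + 32
}
```

Narrowing `In` while keeping `Acc` wide is the usual trade-off: smaller multipliers in the fabric, with an accumulator that still has headroom for the sum.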
Breakout in Programming Model
[Figure: Development Productivity versus SW Programmability]
– Traditional HW design: HDMI, video proc., video enc.
– 15x productivity with HLS, IPI: Ethernet (IP), Video decode (C++), Video process (C++), Video encode (C++), HDMI (IP)
SDSoC Example: Matrix Multiply + Add
main(){
  malloc(A,B,C);
  mmult(A,B,D);
  madd(C,D,E);
  printf(E);
}
HLS C/C++:
mmult(inA,inB,out){
}
HLS C/C++:
madd(inA,inB,out){
}
[Figure: generated system across PS and PL. Labels: Platform, Application, Driver, Generated, A,B datamovers, AXI Bus, mmult, madd, A, B, C, D, E]
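
The slide's pseudocode can be fleshed out as a sketch in plain C++. The slide leaves the bodies of `mmult` and `madd` empty, so the loop bodies, the 4x4 size, the `Matrix` struct, and the helper `mmult_add_example` below are assumptions for illustration, not the code shown in the talk. `constexpr` is used only so the result can be checked at compile time; a hardware flow would not need it.

```cpp
#include <cstddef>

constexpr std::size_t N = 4;  // assumed dimension; matrices are N x N, row-major

struct Matrix {
    float v[N * N];
};

// out = inA * inB. On the slide this is one of the two functions written in
// HLS C/C++ and placed in programmable logic.
constexpr void mmult(const Matrix& inA, const Matrix& inB, Matrix& out) {
    for (std::size_t r = 0; r < N; ++r) {
        for (std::size_t c = 0; c < N; ++c) {
            float sum = 0.0f;
            for (std::size_t k = 0; k < N; ++k) {
                sum += inA.v[r * N + k] * inB.v[k * N + c];
            }
            out.v[r * N + c] = sum;
        }
    }
}

// out = inA + inB, element by element.
constexpr void madd(const Matrix& inA, const Matrix& inB, Matrix& out) {
    for (std::size_t i = 0; i < N * N; ++i) {
        out.v[i] = inA.v[i] + inB.v[i];
    }
}

// Mirrors the slide's main(): D = A * B, then E = C + D.
// With A = identity, B = all 2s and C = all 1s, every entry of E is 3.
constexpr bool mmult_add_example() {
    Matrix A{}, B{}, C{}, D{}, E{};
    for (std::size_t i = 0; i < N * N; ++i) {
        B.v[i] = 2.0f;
        C.v[i] = 1.0f;
    }
    for (std::size_t i = 0; i < N; ++i) {
        A.v[i * N + i] = 1.0f;
    }
    mmult(A, B, D);
    madd(C, D, E);
    for (std::size_t i = 0; i < N * N; ++i) {
        if (E.v[i] != 3.0f) return false;
    }
    return true;
}
```

In the flow the slide describes, the caller stays ordinary software on the PS, while the tool generates the data movers that carry A, B, C, D and E between the processor and the two functions in the PL.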
Supporting the Whole Stack
– Accelerated Open Frameworks
– Accelerated Libraries
– Development Environment
– Boards w/ HLx-based platform
[Figure labels: Machine learning, Database, Analytics, Platform, Development Stack, VCU1525 Acceleration card]
Example of Embedded Vision Application at the Edge
Deep learning-based multi-object recognition for smart-city
Live video object detection
SSD @ 480x360 on Zynq MPSoC
End-customer obtained:
– 5x NVIDIA TX2 performance
– Better accuracy
Object detection
5X Perf/watt for SSD vs GPU
$249 Ultra96 Board Targeted for Software Designers
Available from Avnet in April
Xilinx and Avnet are partnering to announce the Ultra96 board
– Equipped with Zynq UltraScale+ MPSoC (ZU3EG)
– https://www.96boards.org/product/ultra96/
Ultra96 board makes ARM®-based Xilinx SoCs available to developers at a low price point
– Built to the 96Boards standard, suitable for software prototyping with standardized expansion kits
– Targeting a range of applications including Machine Learning, IoT, and compute
Leverages an open-source software development platform
– 96Boards community: 12K actively contributing software engineers
– Supports both self-hosted and cross development
• Self-hosted: Compile on the board itself
• Cross: Develop on your workstation/laptop
• C to fabric/FPGA: SDSoC tools available later this year
• Unboxing to coding in less than 2 minutes
The Future is Ultra96 Xilinx Contest
Submit your most creative, most out-of-the-box AI or ML application at the Xilinx or Avnet table during Demo Friday (12:00 – 14:00).
The best 30 get a FREE Ultra96 board plus software to help you realize your vision.
The first twenty to submit a working design by May 25th, 2018 get a $25 Amazon Gift Card.
ONE winner will be announced through Xilinx social media channels. If it’s you, you’re invited to present your design to your peers in industry at Xilinx Developer Forum 2018.
Demo
Neural Network Example
German Road Sign Database
– 50,000+ 32x32-pixel images for training
– 44 classes (43 road signs, 1 background)
– Training via Amazon Web Services
• AWS: p2.xlarge instance – 8 hours → $7.78 (€6.5)
Binary Neural Network Characteristics
– 6 convolutional layers
– 2 max pool layers
– 3 fully connected layers
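
The slide does not show how the binary network is computed. A common formulation for binary neural networks replaces each multiply-accumulate with an XNOR followed by a bit count, which maps well onto LUT-based logic. The sketch below assumes that formulation; the function names, the 32-bit packing, and the bit encoding (1 for +1, 0 for -1) are illustrative choices, not details from the talk.

```cpp
#include <cstdint>

// Count the set bits in x (portable; clears the lowest set bit each step).
constexpr int popcount32(std::uint32_t x) {
    int n = 0;
    while (x != 0) {
        x &= x - 1;
        ++n;
    }
    return n;
}

// Dot product of two 32-element vectors whose entries are +1 or -1,
// packed one bit per entry (bit 1 encodes +1, bit 0 encodes -1).
// Equal bits contribute +1 and differing bits contribute -1, so
// dot = matches - mismatches = 2 * matches - 32.
constexpr int binary_dot32(std::uint32_t a, std::uint32_t b) {
    const std::uint32_t same = ~(a ^ b);  // XNOR: 1 where the bits agree
    return 2 * popcount32(same) - 32;
}

// Binarized activation: keep only the sign of the pre-activation.
constexpr bool binary_activation(int pre_activation) {
    return pre_activation >= 0;
}
```

Because weights and activations are single bits, one 32-bit word carries 32 operands, and the per-element work reduces to logic gates and a counter rather than DSP multipliers.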
Neural Network Performance Results
Up to 8,600 times faster when accelerated with programmable logic

Performance Metric     Software Only               Programmable Logic Accelerated
Tiles per second       2.2                         19,000
Scene rate (fps)       0.011 (92 sec per frame)    94
Overall Acceleration   -                           8,600X
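
The headline figure can be cross-checked from the two rows of the table. The short sketch below only redoes that arithmetic with the numbers from the slide; the names `speedup`, `kTileSpeedup`, `kSceneSpeedup` and `kSoftwareFps` are made up for illustration.

```cpp
// Ratio of an accelerated rate to a baseline rate given in the same units.
constexpr double speedup(double baseline, double accelerated) {
    return accelerated / baseline;
}

// Tiles per second: 19,000 / 2.2, roughly 8,636x.
constexpr double kTileSpeedup = speedup(2.2, 19000.0);

// Scene rate: 94 / 0.011, roughly 8,545x.
constexpr double kSceneSpeedup = speedup(0.011, 94.0);

// 92 seconds per frame corresponds to roughly 0.011 frames per second.
constexpr double kSoftwareFps = 1.0 / 92.0;
```

Both ratios round to about 8,600, which is consistent with the overall acceleration quoted on the slide.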
Learn More About FPGAs and Software Acceleration
Dramatically Accelerate 96Board Software via an FPGA with Integrated Processors
– Wednesday 16:00-16:55
Accelerating Neural Networks for Vision Systems via FPGAs
– Thursday 11:00-11:25
Questions? / Thank You