SlideShare une entreprise Scribd logo
1  sur  40
Télécharger pour lire hors ligne
ReWArDS
Reconfigurable hardWare for Artificial
intelligence and Data Science
Luca Stornaiuolo
luca.stornaiuolo@polimi.it
11/06/2020
NECST Talk – Sala Seminari
Research Context
Context Definition34
Context Definition35
Huge amount of data that need to
be processed to get aggregated
information in real-timeSmart Embedded Systems
Context Definition36
Model
New
Data
Prediction
InferenceBig Data
Training
Context Definition37
Performance
Energy
Efficiency
Flexibility
Ease of
Development
GPU
ASIC
CPU
FPGA
Context Definition38
Performance
Energy
Efficiency
Flexibility
Ease of
Development
GPU
ASIC
CPU
FPGA
Performance
Energy
Efficiency
Flexibility
Ease of
Development
GPU
ASIC
CPU
FPGA
ReWArDS
40
ReWArDS
41
ReWArDS
DS
42
ReWArDS
A
I
DS
43
ReWArDS
A
I
DS
Embedded
44
ReWArDS
A
I
DS
Embedded
HLS
Poly
Mem
HLSPolyMem45
DDRSDRAMmemory
Memory HierarchyFPGA Device
L. Stornaiuolo, M. Rabozzi, D. Sciuto, M. D. Santambrogio,
G. Stramondo, C. Ciobanu, A. L. Varbanescu
HLSPolyMem46
DDRSDRAMmemory
Memory HierarchyFPGA Device
Poly
Mem
L. Stornaiuolo, M. Rabozzi, D. Sciuto, M. D. Santambrogio,
G. Stramondo, C. Ciobanu, A. L. Varbanescu
47
HLSPolyMem
1
4
9
12
14
17
19
22
27
30
32
35
2
5
7
10
15
18
20
23
25
28
33
36
3
6
8
11
13
16
21
24
26
29
31
34
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
1
4
7
10
13
16
19
22
25
28
31
34
2
5
8
11
14
17
20
23
26
29
32
35
3
6
9
12
15
18
21
24
27
30
33
36
HLS Array
Partitioning
Block
HLS Array
Partitioning
Cyclic
PolyMem
RoCo Scheme
1 2 3 4 5 6
7 8 9 10 11 12
13 14 15 16 17 18
19 20 21 22 23 24
25 26 27 28 29 30
31 32 33 34 35 36
Input Matrix
48
ReWArDS
A
I
DS
Embedded
HLS
Poly
Mem
49
ReWArDS
A
I
DS
Embedded
FPGA
Face
Detection
HLS
Poly
Mem
50
FaceDetection on multi-PYNQ
Anna Maria Nestorov, Alberto Scolari,
Enrico Reggiani, Marco D. Santambrogio
51
FaceDetection on multi-PYNQ
Anna Maria Nestorov, Alberto Scolari,
Enrico Reggiani, Marco D. Santambrogio
Xilinx
Pynq
FPG
A
Xilinx
Pynq
FPG
A
Xilinx
Pynq
FPG
A
Xilinx
Pynq
FPG
A
Stage1 Stage2_2 Stage3 Stage4 Stage5
Xilinx
Pynq
FPGA
Xilinx
Pynq
FPGA
Stage2_1
Xilinx
Pynq
FPGA
Xilinx
Pynq
FPGA
Xilinx
Pynq
FPGA
Xilinx
Pynq
FPGA
52
FaceDetection on multi-PYNQ
FPGA Single PYNQ vs ARM Single PYNQ: 14.7×
FPGA Distributed System vs ARM Distributed System: 39.5×
FPGA Distributed System vs GPU: 0,082×
53
FaceDetection on multi-PYNQ
FPGA Single PYNQ vs ARM Single PYNQ: 16.2×
FPGA Distributed System vs ARM Distributed System: 44×
FPGA Distributed System vs GPU: 1.3×
FPGA Single PYNQ vs ARM Single PYNQ: 14.7×
FPGA Distributed System vs ARM Distributed System: 39.5×
FPGA Distributed System vs GPU: 0,082×
54
ReWArDS
A
I
DS
Embedded
FPGA
Face
Detection
HLS
Poly
Mem
55
ReWArDS
A
I
DS
Embedded
FPGA
Face
Detection
PyNOLI
HLS
Poly
Mem
56
Giorgia Fiscaletti, Marco Speziali,
Luca Stornaiuolo , Marco D. Santambrogio
CnvW2A2
• 6 Convolutional Layers
• 3 Fully-Connected Layers
[1] BNN-PYNQ Quantized Neural Network (QNN) on PYNQ
57
Minimize Data
per Image on
the Network
Splitting Strategy
58
59
60
61
ReWArDS
A
I
DS
Embedded
FPGA
Face
Detection
PyNOLI
HLS
Poly
Mem
62
ReWArDS
A
I
DS
Embedded
FPGA
Face
Detection
PyNOLI
HLS
Poly
Mem
NumPYNQ
63
numPYNQ
Overlay
Correlation
Matrix Dot Product
FFT
Transparent Hardware Acceleration on Zynq SoC for Python Data Science Applications
Luca Stornaiuolo, Riccardo Pressiani, Filippo Carloni,
Marco D. Santambrogio, Donatella Sciuto
64
numPYNQ
Runtime
Input Analysis
Target
Implementation
numPYNQ
FFT and Correlation
65
numPYNQ
66
numPYNQ
67
numPYNQ
Transparent Hardware Acceleration on Zynq SoC for Python Data Science Applications
12.4x
5.5x
Luca Stornaiuolo, Riccardo Pressiani, Filippo Carloni,
Marco D. Santambrogio, Donatella Sciuto
68
ReWArDS
A
I
DS
Embedded
FPGA
Face
Detection
PyNOLI
HLS
Poly
Mem
NumPYNQ
69
ReWArDS
A
I
DS
Embedded
HLS
Poly
Mem
NumPYNQ
PyNOLI
FPGA
Face
Detection
EMPhASIS
Plaster
70
ReWArDS
A
I
DS
Embedded
HLS
Poly
Mem
NumPYNQ
PyNOLI
FPGA
Face
Detection
EMPhASIS
Plaster
09/06/2020
16/06/2020
ReWArDS
Reconfigurable hardWare for Artificial
intelligence and Data Science
https://necst.it/
https://www.slideshare.net/necstlab
11/06/2020
NECST Talk – Sala Seminari

Contenu connexe

Similaire à ReWArDS - NECSTTechTalk 11/06/2020

Dsp Based Field Programable Gate Array
Dsp Based Field Programable Gate ArrayDsp Based Field Programable Gate Array
Dsp Based Field Programable Gate Array
decebems
 
Achieve High-Performance with Optimizing Device Specifications in FPGA Design
Achieve High-Performance with Optimizing Device Specifications in FPGA DesignAchieve High-Performance with Optimizing Device Specifications in FPGA Design
Achieve High-Performance with Optimizing Device Specifications in FPGA Design
Logic Fruit Technologies
 

Similaire à ReWArDS - NECSTTechTalk 11/06/2020 (20)

Dsp Based Field Programable Gate Array
Dsp Based Field Programable Gate ArrayDsp Based Field Programable Gate Array
Dsp Based Field Programable Gate Array
 
Possibilities of generative models
Possibilities of generative modelsPossibilities of generative models
Possibilities of generative models
 
20170602_OSSummit_an_intelligent_storage
20170602_OSSummit_an_intelligent_storage20170602_OSSummit_an_intelligent_storage
20170602_OSSummit_an_intelligent_storage
 
Achieve High-Performance with Optimizing Device Specifications in FPGA Design
Achieve High-Performance with Optimizing Device Specifications in FPGA DesignAchieve High-Performance with Optimizing Device Specifications in FPGA Design
Achieve High-Performance with Optimizing Device Specifications in FPGA Design
 
組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム組み込みから HPC まで ARM コアで実現するエコシステム
組み込みから HPC まで ARM コアで実現するエコシステム
 
Enabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. LowndesEnabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. Lowndes
 
RAPIDS Overview
RAPIDS OverviewRAPIDS Overview
RAPIDS Overview
 
Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...
Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...
Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...
 
An open flow for dn ns on ultra low-power RISC-V cores
An open flow for dn ns on ultra low-power RISC-V coresAn open flow for dn ns on ultra low-power RISC-V cores
An open flow for dn ns on ultra low-power RISC-V cores
 
Profiling deep learning network using NVIDIA nsight systems
Profiling deep learning network using NVIDIA nsight systemsProfiling deep learning network using NVIDIA nsight systems
Profiling deep learning network using NVIDIA nsight systems
 
Machine Learning with New Hardware Challegens
Machine Learning with New Hardware ChallegensMachine Learning with New Hardware Challegens
Machine Learning with New Hardware Challegens
 
DPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet ProcessingDPDK: Multi Architecture High Performance Packet Processing
DPDK: Multi Architecture High Performance Packet Processing
 
FPGAs : An Overview
FPGAs : An OverviewFPGAs : An Overview
FPGAs : An Overview
 
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mãoWebinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
Webinar: NVIDIA JETSON – A Inteligência Artificial na palma de sua mão
 
Harnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceHarnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligence
 
GPU Architecture NVIDIA (GTX GeForce 480)
GPU Architecture NVIDIA (GTX GeForce 480)GPU Architecture NVIDIA (GTX GeForce 480)
GPU Architecture NVIDIA (GTX GeForce 480)
 
Building the World's Largest GPU
Building the World's Largest GPUBuilding the World's Largest GPU
Building the World's Largest GPU
 
Advancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
Advancing GPU Analytics with RAPIDS Accelerator for Spark and AlluxioAdvancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
Advancing GPU Analytics with RAPIDS Accelerator for Spark and Alluxio
 
Fixed-point Multi-Core DSP Application Examples
Fixed-point Multi-Core DSP Application ExamplesFixed-point Multi-Core DSP Application Examples
Fixed-point Multi-Core DSP Application Examples
 
RCW@DEI - Reconf Comp
RCW@DEI - Reconf CompRCW@DEI - Reconf Comp
RCW@DEI - Reconf Comp
 

Plus de NECST Lab @ Politecnico di Milano

Embedding based knowledge graph link prediction for drug repurposing
Embedding based knowledge graph link prediction for drug repurposingEmbedding based knowledge graph link prediction for drug repurposing
Embedding based knowledge graph link prediction for drug repurposing
NECST Lab @ Politecnico di Milano
 

Plus de NECST Lab @ Politecnico di Milano (20)

Mesticheria Team - WiiReflex
Mesticheria Team - WiiReflexMesticheria Team - WiiReflex
Mesticheria Team - WiiReflex
 
Punto e virgola Team - Stressometro
Punto e virgola Team - StressometroPunto e virgola Team - Stressometro
Punto e virgola Team - Stressometro
 
BitIt Team - Stay.straight
BitIt Team - Stay.straight BitIt Team - Stay.straight
BitIt Team - Stay.straight
 
BabYodini Team - Talking Gloves
BabYodini Team - Talking GlovesBabYodini Team - Talking Gloves
BabYodini Team - Talking Gloves
 
printf("Nome Squadra"); Team - NeoTon
printf("Nome Squadra"); Team - NeoTonprintf("Nome Squadra"); Team - NeoTon
printf("Nome Squadra"); Team - NeoTon
 
BlackBoard Team - Motion Tracking Platform
BlackBoard Team - Motion Tracking PlatformBlackBoard Team - Motion Tracking Platform
BlackBoard Team - Motion Tracking Platform
 
#include<brain.h> Team - HomeBeatHome
#include<brain.h> Team - HomeBeatHome#include<brain.h> Team - HomeBeatHome
#include<brain.h> Team - HomeBeatHome
 
Flipflops Team - Wave U
Flipflops Team - Wave UFlipflops Team - Wave U
Flipflops Team - Wave U
 
Bug(atta) Team - Little Brother
Bug(atta) Team - Little BrotherBug(atta) Team - Little Brother
Bug(atta) Team - Little Brother
 
#NECSTCamp: come partecipare
#NECSTCamp: come partecipare#NECSTCamp: come partecipare
#NECSTCamp: come partecipare
 
NECSTCamp101@2020.10.1
NECSTCamp101@2020.10.1NECSTCamp101@2020.10.1
NECSTCamp101@2020.10.1
 
NECSTLab101 2020.2021
NECSTLab101 2020.2021NECSTLab101 2020.2021
NECSTLab101 2020.2021
 
TreeHouse, nourish your community
TreeHouse, nourish your communityTreeHouse, nourish your community
TreeHouse, nourish your community
 
TiReX: Tiled Regular eXpressionsmatching architecture
TiReX: Tiled Regular eXpressionsmatching architectureTiReX: Tiled Regular eXpressionsmatching architecture
TiReX: Tiled Regular eXpressionsmatching architecture
 
Embedding based knowledge graph link prediction for drug repurposing
Embedding based knowledge graph link prediction for drug repurposingEmbedding based knowledge graph link prediction for drug repurposing
Embedding based knowledge graph link prediction for drug repurposing
 
PLASTER - PYNQ-based abandoned object detection using a map-reduce approach o...
PLASTER - PYNQ-based abandoned object detection using a map-reduce approach o...PLASTER - PYNQ-based abandoned object detection using a map-reduce approach o...
PLASTER - PYNQ-based abandoned object detection using a map-reduce approach o...
 
EMPhASIS - An EMbedded Public Attention Stress Identification System
 EMPhASIS - An EMbedded Public Attention Stress Identification System EMPhASIS - An EMbedded Public Attention Stress Identification System
EMPhASIS - An EMbedded Public Attention Stress Identification System
 
Luns - Automatic lungs segmentation through neural network
Luns - Automatic lungs segmentation through neural networkLuns - Automatic lungs segmentation through neural network
Luns - Automatic lungs segmentation through neural network
 
BlastFunction: How to combine Serverless and FPGAs
BlastFunction: How to combine Serverless and FPGAsBlastFunction: How to combine Serverless and FPGAs
BlastFunction: How to combine Serverless and FPGAs
 
Maeve - Fast genome analysis leveraging exact string matching
Maeve - Fast genome analysis leveraging exact string matchingMaeve - Fast genome analysis leveraging exact string matching
Maeve - Fast genome analysis leveraging exact string matching
 

Dernier

Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
MsecMca
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 

Dernier (20)

Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
 

ReWArDS - NECSTTechTalk 11/06/2020