SlideShare une entreprise Scribd logo
1  sur  28
Télécharger pour lire hors ligne
SmartCore system
for Dependable Many-core Processor
with Multifunction Routers
Shinya Takamaeda†, Shimpei Sato†,
Takefumi Miyoshi‡, Kenji Kise†
†Tokyo Institute of Technology, Japan
‡The University of Electro-communications, Japan
10-11-18 ICNC’10 @Hiroshima
Regular Paper
Hardware Design and Implementation
14:50-15:20
Contents
 Motivation
 Proposal: SmartCore system
 Preliminary Evaluation
 Hardware Implementation on FPGAs
 Related Work
 Conclusion
10-11-18 ICNC'102
Contents
 Motivation
 Proposal: SmartCore system
 Preliminary Evaluation
 Hardware Implementation on FPGAs
 Related Work
 Conclusion
10-11-18 ICNC'103
Many-core Processors appear!
10-11-18
Intel Single Chip Cloud Computer
48 cores (x86)
TILERA TILE-Gx100
100 cores (MIPS)
ICNC'104
Inter-connection for Many-core processors
 NoC (Network on Chip)
 Data transmission via on-chip-routers
10-11-18 ICNC'105
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
Low Dependability on Many-core
 Process technology scaling for more transistors
 But it increases …
 Soft errors (e.g. bit inversion)
•  since cosmic radiations
 Timing errors
•  since variations in transistor characteristic or wire delay
10-11-18 ICNC'10
How to create
a reliable Many-core processor?
6
Circuit
Micro-architecture
Architecture
Software
Assurance of the reliability on each layer
10-11-18 ICNC'107
Razor-FF
Lock-step
Check-pointing / Re-execution
Inter-connection
SmartCore system
Canary-FF
ECC in DRAM Memory
Architectural Core Salvaging
Slip Stream Processor
Contents
 Motivation
 Proposal: SmartCore system
 Preliminary Evaluation
 Hardware Implementation on FPGAs
 Related Work
 Conclusion
10-11-18 ICNC'108
We propose the SmartCore system
 SmartCore system
= Smart many-core system
with redundant cores and multifunction routers
 Key: NoC-based DMR
 To detect a error,
compare the output
packets from the pair
 On-chip router has
3 special functions
•  Copy a packet
•  Change the destination
•  Wait and Compare
2 packets
10-11-18 ICNC'10
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
Handling the
same packets
by packet
coping
Running the same
thread (DMR)
Running the
single thread
(DMR)
sharing a
packet /
comparing 2
packets
9
Base many-core architecture: M-Core [1]
 2D mesh network
connects Nodes
 Each Node memory
is independent
 Inter-Node
communication
 DMA via packets
using ID
 A packet is a series of
flits (Flow Control Unit)
•  Only the head flit of
a packet contains
the destination
10-11-18 ICNC'10
Node (2, 1)
INCC
Node
memory
Core
Comp.
Node
(1,1)
Comp.
Node
(1,2)
Comp.
Node
(1,8)
Comp.
Node
(2,1)
Comp.
Node
(2,2)
Comp.
Node
(2,8)
Comp.
Node
(3,1)
Comp.
Node
(3,2)
Comp.
Node
(3,8)
Comp.
Node
(8,1)
Comp.
Node
(8,1)
Comp.
Node
(8,8)
Operation
Node
(0,0)
Memory
Node
(1,0)
Off chip memory modules and switch
Conventional I/O
Many-core
processor chip
Memory
Node
(2,0)
Memory
Node
(3,0)
Memory
Node
(8,0)
Node (1, 1)
INCC
Node
memory
Core
Router Router
10
DMR on two nodes by using SmartCore
  Executing a same program binary on the pair
 Master Node and Mirror Node
 If generated packets are different, they are faulty
  Packet coping on the Router of the Master
 for the Mirror to use the same data as Master
  Packet comparison on the Router of Master
 If these two differ, then the Router detects a error
10-11-18 ICNC'1011
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
PE
R
Master Node Mirror Node
Node (1,1) Node (2,1) Node (3,1) Node (4,1)
Node (1,2) Node (2,2) Node (3,2) Node (4,2)
Logically Node (1,1)
1. Coping a packet to the Mirror Node
 Router on Master Node copies a coming packet
to the Mirror Node
 The destination is changed to the Mirror Node’s ID
 Original program has a several DMA communications
 To certainly continue executing the same program in
the two Node
10-11-18 ICNC'10
INCCINCC
R R
Master Mirror
P P
12
2. Wait for a packet from the Mirror Node
3. Compare the contents of two packets
 Router on Master Node waits a packet from
Master Node and a packet from Mirror Node
 When Router on Master receives the head flits
from both Nodes, then it starts to compare the 2
flits in order
 If the contents of flits differ, a error exists in either
Master Node or Mirror Node
10-11-18 ICNC'10
INCCINCC
R R
Master MirrorP P
13
Base router
 5 inputs with input buffers / 5 outputs
 X-Y Dimension-order routing
 Wormhole switching, Xon/Xoff flow control
 1hop/1cycle, single cycle, no virtual channels
10-11-18 ICNC'10
Router
XBAR
Switch
Output port X+
Output port X-
Output port Y+
Output port Y-
Output port
DMAC
Input port X+
Input port X-
Input port Y+
Input port Y-
Input port
DMAC
Arbiter
14
  Additional buffer for coping for Mirror Node (a)
  ID translator to change the destination (b)
  Flit comparator to verify (c)
  Node type, Master/Mirror Node ID
  Configured by system software
Multifunction router for SmartCore system
10-11-18 ICNC'10
Output port INCCInput port INCC
Router
XBAR
Switch
Output port X+
Output port X-
Output port Y+
Output port Y-
Input port X+
Input port X-
Input port Y+
Input port Y-
Arbiter
node type master / mirror ID
V
Verify
ID translation
(a)(b)
(c)
15
Advantages of SmartCore system
 Adaptable to any kind of hardware modules
generating a packet
 ex) Cache, DSP, Processor core
 Because of …
 Error detection mechanism is independent to
Node structure
•  Core-granularity redundant execution /
Packet level error detection
10-11-18 ICNC'1016
Contents
 Motivation
 Proposal: SmartCore system
 Preliminary Evaluation
 Hardware Implementation on FPGAs
 Related Work
 Conclusion
10-11-18 ICNC'1017
Preliminary Evaluation of SmartCore system
 2 evaluations
 Performance overhead on DMR
 Packet rendezvous time
 Environment: SimMc 1.0
 64 (8×8) threads on 128 (16×8) Nodes
 Core
•  MIPS32 single issue / single cycle processor
 Router
•  1 hop / 1 cycle, no virtual channels, flit size: 4 bytes
 INCC (Network Interface)
•  up to 1 flit / cycle receive/send from/to router
 Benchmark: 4 apps from NAS Parallel Benchmarks
•  cg, ft, is, lu, Size: S
10-11-18 ICNC'1018
Node (X, Y)
INCC
Node
memory
Core
Router
3 configurations of thread mapping
10-11-18 ICNC'1019
1,1
1,2
1,8
2,1
2,2
2,8
8,1
8,2
8,8
8 Nodes 8 Nodes
8Nodes
1,1
1,2
1,8
2,1
2,2
2,8
8,1
8,2
8,8
8Nodes
16 Nodes
1,1
1,2
1,8
1,1
1,2
1,8
2,1
2,2
2,8
2,1
2,2
2,8
8,1
8,2
8,8
8,1
8,2
8,8
8Nodes
16 Nodes
(a) Base Allocation
(b) Redundant space allocation (Area 2x) (c) Redundant execution with SmartCore system
x,y
Proper thread
(Master Node)
Redundant thread
(Mirror Node)
x,y
Not working
to see the effect on #hops to see the effect on SmartCore
Evaluation: Performance overhead on DMR
 A little slow down
 Redundant space (Area 2x): up to 1% slow down
 Redundant execution (SmartCore): up to 4% slow
down (in cg of NPB)
10-11-18 ICNC'1020
Evaluation: Packet rendezvous time
 Cumulative distribution of # cycles that the router
on Master Node waits for a packet from Mirror
Node
 Almost communications with a little rendezvous
10-11-18 ICNC'10
cg
ft
is lu
21
Contents
 Motivation
 Proposal: SmartCore system
 Preliminary Evaluation
 Hardware Implementation on FPGAs
 Related Work
 Conclusion
10-11-18 ICNC'1022
Hardware Implementation on FPGAs
 Dependable Many-core processor on FPGA-
based prototyping system
 by using ScalableCore system [8]
•  Connected
FPGA boards
•  Variable
# FPGA boards
 2 execution mode
•  Normal Mode
–  Standard M-Core
•  SmartCore Mode
–  The pair executes
same thread
10-11-18 ICNC'1023
SD
Loader
(0,1)
Physical
ID
(1,1)
Path
(0,2)
Physical
ID
(1,2)
Physical
ID
(2,1)
Physical
ID
(2,2)
Physical
ID
(3,1)
Physical
ID
(3,2)
Physical
ID
(4,1)
Physical
ID
(4,2)
Path
(0,3)
Physical
ID
(1,3)
Physical
ID
(2,3)
Physical
ID
(3,3)
Physical
ID
(4,3)
LogicalID (1,1)
LogicalID (1,2)
LogicalID (1,3)
LogicalID (2,1)
LogicalID (2,2)
LogicalID (2,3)
Power
Master Mirror Master Mirror
Overview of 15 Nodes ScalableCore system
with SmartCore system
10-11-18 ICNC'1024
Logical ID (1,1)
Master Mirror
Logical ID (1,2)
Master Mirror
Logical ID (1,3)
Master Mirror
Logical ID (2,1)
Master Mirror
Logical ID (2,2)
Master Mirror
Logical ID (2,3)
Master Mirror
Program
Loader
ID (0,1)
SmartCore system detects a artificial fault
Contents
 Motivation
 Proposal: SmartCore system
 Preliminary Evaluation
 Hardware Implementation on FPGAs
 Related Work
 Conclusion
10-11-18 ICNC'1025
Related work
 Slipstream Processor [9, Karthik, ASPLOS2000]
 Improving ILP and dependability by using tightly
coupled two cores
 2 threads
•  Proper sequence and shorter sequence
 Loose Lock-stepped system [10, Nidhi, ISCA2007]
 Dividing cores, cache, main memory into two groups
 I/O level error detection
 Lockstep [11, IBM]
 Redundant execution on synchronized processors
 I/O level error detection
10-11-18 ICNC'1026
Contents
 Motivation
 Proposal: SmartCore system
 Preliminary Evaluation
 Hardware Implementation on FPGAs
 Related Work
 Conclusion
10-11-18 ICNC'1027
Conclusion
 We propose the SmartCore system
 NoC-based DMR by using multifunction routers
 Multifunction router has 3 special functions
•  Coping a packet
•  Changing the destination of a packet
•  Waiting and comparing the contents of two packets
 Low performance overhead
 Hardware implementation on FPGA-based
prototyping system
 Future works
 Recovery after error detections
 TMR by SmartCore system
10-11-18 ICNC'1028

Contenu connexe

Tendances

DPDK Summit 2015 - RIFT.io - Tim Mortsolf
DPDK Summit 2015 - RIFT.io - Tim MortsolfDPDK Summit 2015 - RIFT.io - Tim Mortsolf
DPDK Summit 2015 - RIFT.io - Tim MortsolfJim St. Leger
 
DPDK summit 2015: It's kind of fun to do the impossible with DPDK
DPDK summit 2015: It's kind of fun  to do the impossible with DPDKDPDK summit 2015: It's kind of fun  to do the impossible with DPDK
DPDK summit 2015: It's kind of fun to do the impossible with DPDKLagopus SDN/OpenFlow switch
 
Xilinx vs Intel (Altera) FPGA performance comparison
Xilinx vs Intel (Altera) FPGA performance comparison Xilinx vs Intel (Altera) FPGA performance comparison
Xilinx vs Intel (Altera) FPGA performance comparison Roy Messinger
 
Linux Bridging: Teaching an old dog new tricks
Linux Bridging: Teaching an old dog new tricksLinux Bridging: Teaching an old dog new tricks
Linux Bridging: Teaching an old dog new tricksStephen Hemminger
 
The Microarchitecure Of FPGA Based Soft Processor
The Microarchitecure Of FPGA Based Soft ProcessorThe Microarchitecure Of FPGA Based Soft Processor
The Microarchitecure Of FPGA Based Soft ProcessorDeepak Tomar
 
1 intro to_dpdk_and_hw
1 intro to_dpdk_and_hw1 intro to_dpdk_and_hw
1 intro to_dpdk_and_hwvideos
 
Netsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvNetsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvIntel
 
Adam_Mcconnell_Revision3
Adam_Mcconnell_Revision3Adam_Mcconnell_Revision3
Adam_Mcconnell_Revision3Adam McConnell
 
Mobile Development For Arduino 201 - ConnectTech
Mobile Development For Arduino 201 - ConnectTechMobile Development For Arduino 201 - ConnectTech
Mobile Development For Arduino 201 - ConnectTechstable|kernel
 
DPDK Summit 2015 - Intro - Tim O'Driscoll
DPDK Summit 2015 - Intro - Tim O'DriscollDPDK Summit 2015 - Intro - Tim O'Driscoll
DPDK Summit 2015 - Intro - Tim O'DriscollJim St. Leger
 
Devconf2017 - Can VMs networking benefit from DPDK
Devconf2017 - Can VMs networking benefit from DPDKDevconf2017 - Can VMs networking benefit from DPDK
Devconf2017 - Can VMs networking benefit from DPDKMaxime Coquelin
 
Unleashing End-to_end TLS Security Leveraging NGINX with Intel(r) QuickAssist...
Unleashing End-to_end TLS Security Leveraging NGINX with Intel(r) QuickAssist...Unleashing End-to_end TLS Security Leveraging NGINX with Intel(r) QuickAssist...
Unleashing End-to_end TLS Security Leveraging NGINX with Intel(r) QuickAssist...Michelle Holley
 
Intel(r) Quick Assist Technology Overview
Intel(r) Quick Assist Technology OverviewIntel(r) Quick Assist Technology Overview
Intel(r) Quick Assist Technology OverviewMichelle Holley
 

Tendances (20)

DPDK Summit 2015 - RIFT.io - Tim Mortsolf
DPDK Summit 2015 - RIFT.io - Tim MortsolfDPDK Summit 2015 - RIFT.io - Tim Mortsolf
DPDK Summit 2015 - RIFT.io - Tim Mortsolf
 
DPDK summit 2015: It's kind of fun to do the impossible with DPDK
DPDK summit 2015: It's kind of fun  to do the impossible with DPDKDPDK summit 2015: It's kind of fun  to do the impossible with DPDK
DPDK summit 2015: It's kind of fun to do the impossible with DPDK
 
DPDK KNI interface
DPDK KNI interfaceDPDK KNI interface
DPDK KNI interface
 
Xilinx vs Intel (Altera) FPGA performance comparison
Xilinx vs Intel (Altera) FPGA performance comparison Xilinx vs Intel (Altera) FPGA performance comparison
Xilinx vs Intel (Altera) FPGA performance comparison
 
Linux Bridging: Teaching an old dog new tricks
Linux Bridging: Teaching an old dog new tricksLinux Bridging: Teaching an old dog new tricks
Linux Bridging: Teaching an old dog new tricks
 
The Microarchitecure Of FPGA Based Soft Processor
The Microarchitecure Of FPGA Based Soft ProcessorThe Microarchitecure Of FPGA Based Soft Processor
The Microarchitecure Of FPGA Based Soft Processor
 
1 intro to_dpdk_and_hw
1 intro to_dpdk_and_hw1 intro to_dpdk_and_hw
1 intro to_dpdk_and_hw
 
Select and poll functions
Select and poll functionsSelect and poll functions
Select and poll functions
 
Intel dpdk Tutorial
Intel dpdk TutorialIntel dpdk Tutorial
Intel dpdk Tutorial
 
Netsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfvNetsft2017 day in_life_of_nfv
Netsft2017 day in_life_of_nfv
 
The Spectre of Meltdowns
The Spectre of MeltdownsThe Spectre of Meltdowns
The Spectre of Meltdowns
 
Adam_Mcconnell_Revision3
Adam_Mcconnell_Revision3Adam_Mcconnell_Revision3
Adam_Mcconnell_Revision3
 
Mobile Development For Arduino 201 - ConnectTech
Mobile Development For Arduino 201 - ConnectTechMobile Development For Arduino 201 - ConnectTech
Mobile Development For Arduino 201 - ConnectTech
 
DPDK Summit 2015 - Intro - Tim O'Driscoll
DPDK Summit 2015 - Intro - Tim O'DriscollDPDK Summit 2015 - Intro - Tim O'Driscoll
DPDK Summit 2015 - Intro - Tim O'Driscoll
 
Devconf2017 - Can VMs networking benefit from DPDK
Devconf2017 - Can VMs networking benefit from DPDKDevconf2017 - Can VMs networking benefit from DPDK
Devconf2017 - Can VMs networking benefit from DPDK
 
ThesisPresentation_Upd
ThesisPresentation_UpdThesisPresentation_Upd
ThesisPresentation_Upd
 
Network sockets
Network socketsNetwork sockets
Network sockets
 
Unleashing End-to_end TLS Security Leveraging NGINX with Intel(r) QuickAssist...
Unleashing End-to_end TLS Security Leveraging NGINX with Intel(r) QuickAssist...Unleashing End-to_end TLS Security Leveraging NGINX with Intel(r) QuickAssist...
Unleashing End-to_end TLS Security Leveraging NGINX with Intel(r) QuickAssist...
 
A04660105
A04660105A04660105
A04660105
 
Intel(r) Quick Assist Technology Overview
Intel(r) Quick Assist Technology OverviewIntel(r) Quick Assist Technology Overview
Intel(r) Quick Assist Technology Overview
 

Similaire à SmartCore System for Dependable Many-core Processor with Multifunction Routers (in ICNC'10 Hiroshima)

Introduction to Programmable Networks by Clarence Anslem, Intel
Introduction to Programmable Networks by Clarence Anslem, IntelIntroduction to Programmable Networks by Clarence Anslem, Intel
Introduction to Programmable Networks by Clarence Anslem, IntelMyNOG
 
FPGA_prototyping proccesing with conclusion
FPGA_prototyping proccesing with conclusionFPGA_prototyping proccesing with conclusion
FPGA_prototyping proccesing with conclusionPersiPersi1
 
Snabbflow: A Scalable IPFIX exporter
Snabbflow: A Scalable IPFIX exporterSnabbflow: A Scalable IPFIX exporter
Snabbflow: A Scalable IPFIX exporterIgalia
 
Performance analysis and implementation of modified sdm based noc for mpsoc o...
Performance analysis and implementation of modified sdm based noc for mpsoc o...Performance analysis and implementation of modified sdm based noc for mpsoc o...
Performance analysis and implementation of modified sdm based noc for mpsoc o...eSAT Journals
 
How to Speak Intel DPDK KNI for Web Services.
How to Speak Intel DPDK KNI for Web Services.How to Speak Intel DPDK KNI for Web Services.
How to Speak Intel DPDK KNI for Web Services.Naoto MATSUMOTO
 
CV-RENJINIK-27062016
CV-RENJINIK-27062016CV-RENJINIK-27062016
CV-RENJINIK-27062016Renjini K
 
Developer's Guide to Knights Landing
Developer's Guide to Knights LandingDeveloper's Guide to Knights Landing
Developer's Guide to Knights LandingAndrey Vladimirov
 
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKSHIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKSijngnjournal
 
Geep networking stack-linuxkernel
Geep networking stack-linuxkernelGeep networking stack-linuxkernel
Geep networking stack-linuxkernelKiran Divekar
 
Final presentation [dissertation project], 20192 esv0002
Final presentation [dissertation project], 20192 esv0002Final presentation [dissertation project], 20192 esv0002
Final presentation [dissertation project], 20192 esv0002MOHAMMED FURQHAN
 
Cisco router basic
Cisco router basicCisco router basic
Cisco router basicTapan Khilar
 
CNF.Chap.5.pptx
CNF.Chap.5.pptxCNF.Chap.5.pptx
CNF.Chap.5.pptxhalosidiq1
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGATO project
 
Steen_Dissertation_March5
Steen_Dissertation_March5Steen_Dissertation_March5
Steen_Dissertation_March5Steen Larsen
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDKKernel TLV
 
OSN days 2019 - Open Networking and Programmable Switch
OSN days 2019 - Open Networking and Programmable SwitchOSN days 2019 - Open Networking and Programmable Switch
OSN days 2019 - Open Networking and Programmable SwitchChun Ming Ou
 

Similaire à SmartCore System for Dependable Many-core Processor with Multifunction Routers (in ICNC'10 Hiroshima) (20)

Introduction to Programmable Networks by Clarence Anslem, Intel
Introduction to Programmable Networks by Clarence Anslem, IntelIntroduction to Programmable Networks by Clarence Anslem, Intel
Introduction to Programmable Networks by Clarence Anslem, Intel
 
FPGA_prototyping proccesing with conclusion
FPGA_prototyping proccesing with conclusionFPGA_prototyping proccesing with conclusion
FPGA_prototyping proccesing with conclusion
 
Snabbflow: A Scalable IPFIX exporter
Snabbflow: A Scalable IPFIX exporterSnabbflow: A Scalable IPFIX exporter
Snabbflow: A Scalable IPFIX exporter
 
Performance analysis and implementation of modified sdm based noc for mpsoc o...
Performance analysis and implementation of modified sdm based noc for mpsoc o...Performance analysis and implementation of modified sdm based noc for mpsoc o...
Performance analysis and implementation of modified sdm based noc for mpsoc o...
 
How to Speak Intel DPDK KNI for Web Services.
How to Speak Intel DPDK KNI for Web Services.How to Speak Intel DPDK KNI for Web Services.
How to Speak Intel DPDK KNI for Web Services.
 
CV-RENJINIK-27062016
CV-RENJINIK-27062016CV-RENJINIK-27062016
CV-RENJINIK-27062016
 
Developer's Guide to Knights Landing
Developer's Guide to Knights LandingDeveloper's Guide to Knights Landing
Developer's Guide to Knights Landing
 
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKSHIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
HIGH PERFORMANCE ETHERNET PACKET PROCESSOR CORE FOR NEXT GENERATION NETWORKS
 
Geep networking stack-linuxkernel
Geep networking stack-linuxkernelGeep networking stack-linuxkernel
Geep networking stack-linuxkernel
 
Final presentation [dissertation project], 20192 esv0002
Final presentation [dissertation project], 20192 esv0002Final presentation [dissertation project], 20192 esv0002
Final presentation [dissertation project], 20192 esv0002
 
ENSA_Module_10.pptx
ENSA_Module_10.pptxENSA_Module_10.pptx
ENSA_Module_10.pptx
 
Cisco router basic
Cisco router basicCisco router basic
Cisco router basic
 
CNF.Chap.5.pptx
CNF.Chap.5.pptxCNF.Chap.5.pptx
CNF.Chap.5.pptx
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack Runtimes
 
Polyraptor
PolyraptorPolyraptor
Polyraptor
 
Steen_Dissertation_March5
Steen_Dissertation_March5Steen_Dissertation_March5
Steen_Dissertation_March5
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDK
 
CCNA Icnd110 s06l01
 CCNA Icnd110 s06l01 CCNA Icnd110 s06l01
CCNA Icnd110 s06l01
 
Eigrp authentication
Eigrp authenticationEigrp authentication
Eigrp authentication
 
OSN days 2019 - Open Networking and Programmable Switch
OSN days 2019 - Open Networking and Programmable SwitchOSN days 2019 - Open Networking and Programmable Switch
OSN days 2019 - Open Networking and Programmable Switch
 

Plus de Shinya Takamaeda-Y

オープンソースコンパイラNNgenでつくるエッジ・ディープラーニングシステム
オープンソースコンパイラNNgenでつくるエッジ・ディープラーニングシステムオープンソースコンパイラNNgenでつくるエッジ・ディープラーニングシステム
オープンソースコンパイラNNgenでつくるエッジ・ディープラーニングシステムShinya Takamaeda-Y
 
DNNのモデル特化ハードウェアを生成するオープンソースコンパイラNNgenのデモ
DNNのモデル特化ハードウェアを生成するオープンソースコンパイラNNgenのデモDNNのモデル特化ハードウェアを生成するオープンソースコンパイラNNgenのデモ
DNNのモデル特化ハードウェアを生成するオープンソースコンパイラNNgenのデモShinya Takamaeda-Y
 
ディープニューラルネットワーク向け拡張可能な高位合成コンパイラの開発
ディープニューラルネットワーク向け拡張可能な高位合成コンパイラの開発ディープニューラルネットワーク向け拡張可能な高位合成コンパイラの開発
ディープニューラルネットワーク向け拡張可能な高位合成コンパイラの開発Shinya Takamaeda-Y
 
Veriloggen.Stream: データフローからハードウェアを作る(2018年3月3日 高位合成友の会 第5回 @東京工業大学)
Veriloggen.Stream: データフローからハードウェアを作る(2018年3月3日 高位合成友の会 第5回 @東京工業大学)Veriloggen.Stream: データフローからハードウェアを作る(2018年3月3日 高位合成友の会 第5回 @東京工業大学)
Veriloggen.Stream: データフローからハードウェアを作る(2018年3月3日 高位合成友の会 第5回 @東京工業大学)Shinya Takamaeda-Y
 
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...Shinya Takamaeda-Y
 
Pythonによるカスタム可能な高位設計技術 (Design Solution Forum 2016@新横浜)
Pythonによるカスタム可能な高位設計技術 (Design Solution Forum 2016@新横浜)Pythonによるカスタム可能な高位設計技術 (Design Solution Forum 2016@新横浜)
Pythonによるカスタム可能な高位設計技術 (Design Solution Forum 2016@新横浜)Shinya Takamaeda-Y
 
ゆるふわコンピュータ (IPSJ-ONE2017)
ゆるふわコンピュータ (IPSJ-ONE2017)ゆるふわコンピュータ (IPSJ-ONE2017)
ゆるふわコンピュータ (IPSJ-ONE2017)Shinya Takamaeda-Y
 
助教が吼える! 各界の若手研究者大集合「ハードウェアはやわらかい」
助教が吼える! 各界の若手研究者大集合「ハードウェアはやわらかい」助教が吼える! 各界の若手研究者大集合「ハードウェアはやわらかい」
助教が吼える! 各界の若手研究者大集合「ハードウェアはやわらかい」Shinya Takamaeda-Y
 
Debian Linux on Zynq (Xilinx ARM-SoC FPGA) Setup Flow (Vivado 2015.4)
Debian Linux on Zynq (Xilinx ARM-SoC FPGA) Setup Flow (Vivado 2015.4)Debian Linux on Zynq (Xilinx ARM-SoC FPGA) Setup Flow (Vivado 2015.4)
Debian Linux on Zynq (Xilinx ARM-SoC FPGA) Setup Flow (Vivado 2015.4)Shinya Takamaeda-Y
 
PythonとVeriloggenを用いたRTL設計メタプログラミング
PythonとVeriloggenを用いたRTL設計メタプログラミングPythonとVeriloggenを用いたRTL設計メタプログラミング
PythonとVeriloggenを用いたRTL設計メタプログラミングShinya Takamaeda-Y
 
マルチパラダイム型高水準ハードウェア設計環境の検討
マルチパラダイム型高水準ハードウェア設計環境の検討マルチパラダイム型高水準ハードウェア設計環境の検討
マルチパラダイム型高水準ハードウェア設計環境の検討Shinya Takamaeda-Y
 
Veriloggen: Pythonによるハードウェアメタプログラミング(第3回 高位合成友の会 @ドワンゴ)
Veriloggen: Pythonによるハードウェアメタプログラミング(第3回 高位合成友の会 @ドワンゴ)Veriloggen: Pythonによるハードウェアメタプログラミング(第3回 高位合成友の会 @ドワンゴ)
Veriloggen: Pythonによるハードウェアメタプログラミング(第3回 高位合成友の会 @ドワンゴ)Shinya Takamaeda-Y
 
PythonとPyCoRAMでお手軽にFPGAシステムを開発してみよう
PythonとPyCoRAMでお手軽にFPGAシステムを開発してみようPythonとPyCoRAMでお手軽にFPGAシステムを開発してみよう
PythonとPyCoRAMでお手軽にFPGAシステムを開発してみようShinya Takamaeda-Y
 
A CGRA-based Approach for Accelerating Convolutional Neural Networks
A CGRA-based Approachfor Accelerating Convolutional Neural NetworksA CGRA-based Approachfor Accelerating Convolutional Neural Networks
A CGRA-based Approach for Accelerating Convolutional Neural NetworksShinya Takamaeda-Y
 
Pythonを用いた高水準ハードウェア設計環境の検討
Pythonを用いた高水準ハードウェア設計環境の検討Pythonを用いた高水準ハードウェア設計環境の検討
Pythonを用いた高水準ハードウェア設計環境の検討Shinya Takamaeda-Y
 
Pythonによる高位設計フレームワークPyCoRAMでFPGAシステムを開発してみよう
Pythonによる高位設計フレームワークPyCoRAMでFPGAシステムを開発してみようPythonによる高位設計フレームワークPyCoRAMでFPGAシステムを開発してみよう
Pythonによる高位設計フレームワークPyCoRAMでFPGAシステムを開発してみようShinya Takamaeda-Y
 
コンピュータアーキテクチャ研究の最新動向〜ISCA2015参加報告〜 @FPGAエクストリーム・コンピューティング 第7回 (#fpgax #7)
コンピュータアーキテクチャ研究の最新動向〜ISCA2015参加報告〜 @FPGAエクストリーム・コンピューティング 第7回 (#fpgax #7)コンピュータアーキテクチャ研究の最新動向〜ISCA2015参加報告〜 @FPGAエクストリーム・コンピューティング 第7回 (#fpgax #7)
コンピュータアーキテクチャ研究の最新動向〜ISCA2015参加報告〜 @FPGAエクストリーム・コンピューティング 第7回 (#fpgax #7)Shinya Takamaeda-Y
 
FPGA・リコンフィギャラブルシステム研究の最新動向
FPGA・リコンフィギャラブルシステム研究の最新動向FPGA・リコンフィギャラブルシステム研究の最新動向
FPGA・リコンフィギャラブルシステム研究の最新動向Shinya Takamaeda-Y
 
PyCoRAMによるPythonを用いたポータブルなFPGAアクセラレータ開発 (チュートリアル@ESS2014)
PyCoRAMによるPythonを用いたポータブルなFPGAアクセラレータ開発 (チュートリアル@ESS2014)PyCoRAMによるPythonを用いたポータブルなFPGAアクセラレータ開発 (チュートリアル@ESS2014)
PyCoRAMによるPythonを用いたポータブルなFPGAアクセラレータ開発 (チュートリアル@ESS2014)Shinya Takamaeda-Y
 

Plus de Shinya Takamaeda-Y (20)

オープンソースコンパイラNNgenでつくるエッジ・ディープラーニングシステム
オープンソースコンパイラNNgenでつくるエッジ・ディープラーニングシステムオープンソースコンパイラNNgenでつくるエッジ・ディープラーニングシステム
オープンソースコンパイラNNgenでつくるエッジ・ディープラーニングシステム
 
DNNのモデル特化ハードウェアを生成するオープンソースコンパイラNNgenのデモ
DNNのモデル特化ハードウェアを生成するオープンソースコンパイラNNgenのデモDNNのモデル特化ハードウェアを生成するオープンソースコンパイラNNgenのデモ
DNNのモデル特化ハードウェアを生成するオープンソースコンパイラNNgenのデモ
 
ディープニューラルネットワーク向け拡張可能な高位合成コンパイラの開発
ディープニューラルネットワーク向け拡張可能な高位合成コンパイラの開発ディープニューラルネットワーク向け拡張可能な高位合成コンパイラの開発
ディープニューラルネットワーク向け拡張可能な高位合成コンパイラの開発
 
Veriloggen.Stream: データフローからハードウェアを作る(2018年3月3日 高位合成友の会 第5回 @東京工業大学)
Veriloggen.Stream: データフローからハードウェアを作る(2018年3月3日 高位合成友の会 第5回 @東京工業大学)Veriloggen.Stream: データフローからハードウェアを作る(2018年3月3日 高位合成友の会 第5回 @東京工業大学)
Veriloggen.Stream: データフローからハードウェアを作る(2018年3月3日 高位合成友の会 第5回 @東京工業大学)
 
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
 
Pythonによるカスタム可能な高位設計技術 (Design Solution Forum 2016@新横浜)
Pythonによるカスタム可能な高位設計技術 (Design Solution Forum 2016@新横浜)Pythonによるカスタム可能な高位設計技術 (Design Solution Forum 2016@新横浜)
Pythonによるカスタム可能な高位設計技術 (Design Solution Forum 2016@新横浜)
 
ゆるふわコンピュータ (IPSJ-ONE2017)
ゆるふわコンピュータ (IPSJ-ONE2017)ゆるふわコンピュータ (IPSJ-ONE2017)
ゆるふわコンピュータ (IPSJ-ONE2017)
 
助教が吼える! 各界の若手研究者大集合「ハードウェアはやわらかい」
助教が吼える! 各界の若手研究者大集合「ハードウェアはやわらかい」助教が吼える! 各界の若手研究者大集合「ハードウェアはやわらかい」
助教が吼える! 各界の若手研究者大集合「ハードウェアはやわらかい」
 
Debian Linux on Zynq (Xilinx ARM-SoC FPGA) Setup Flow (Vivado 2015.4)
Debian Linux on Zynq (Xilinx ARM-SoC FPGA) Setup Flow (Vivado 2015.4)Debian Linux on Zynq (Xilinx ARM-SoC FPGA) Setup Flow (Vivado 2015.4)
Debian Linux on Zynq (Xilinx ARM-SoC FPGA) Setup Flow (Vivado 2015.4)
 
PythonとVeriloggenを用いたRTL設計メタプログラミング
PythonとVeriloggenを用いたRTL設計メタプログラミングPythonとVeriloggenを用いたRTL設計メタプログラミング
PythonとVeriloggenを用いたRTL設計メタプログラミング
 
マルチパラダイム型高水準ハードウェア設計環境の検討
マルチパラダイム型高水準ハードウェア設計環境の検討マルチパラダイム型高水準ハードウェア設計環境の検討
マルチパラダイム型高水準ハードウェア設計環境の検討
 
Veriloggen: Pythonによるハードウェアメタプログラミング(第3回 高位合成友の会 @ドワンゴ)
Veriloggen: Pythonによるハードウェアメタプログラミング(第3回 高位合成友の会 @ドワンゴ)Veriloggen: Pythonによるハードウェアメタプログラミング(第3回 高位合成友の会 @ドワンゴ)
Veriloggen: Pythonによるハードウェアメタプログラミング(第3回 高位合成友の会 @ドワンゴ)
 
PythonとPyCoRAMでお手軽にFPGAシステムを開発してみよう
PythonとPyCoRAMでお手軽にFPGAシステムを開発してみようPythonとPyCoRAMでお手軽にFPGAシステムを開発してみよう
PythonとPyCoRAMでお手軽にFPGAシステムを開発してみよう
 
A CGRA-based Approach for Accelerating Convolutional Neural Networks
A CGRA-based Approachfor Accelerating Convolutional Neural NetworksA CGRA-based Approachfor Accelerating Convolutional Neural Networks
A CGRA-based Approach for Accelerating Convolutional Neural Networks
 
Pythonを用いた高水準ハードウェア設計環境の検討
Pythonを用いた高水準ハードウェア設計環境の検討Pythonを用いた高水準ハードウェア設計環境の検討
Pythonを用いた高水準ハードウェア設計環境の検討
 
Pythonによる高位設計フレームワークPyCoRAMでFPGAシステムを開発してみよう
Pythonによる高位設計フレームワークPyCoRAMでFPGAシステムを開発してみようPythonによる高位設計フレームワークPyCoRAMでFPGAシステムを開発してみよう
Pythonによる高位設計フレームワークPyCoRAMでFPGAシステムを開発してみよう
 
コンピュータアーキテクチャ研究の最新動向〜ISCA2015参加報告〜 @FPGAエクストリーム・コンピューティング 第7回 (#fpgax #7)
コンピュータアーキテクチャ研究の最新動向〜ISCA2015参加報告〜 @FPGAエクストリーム・コンピューティング 第7回 (#fpgax #7)コンピュータアーキテクチャ研究の最新動向〜ISCA2015参加報告〜 @FPGAエクストリーム・コンピューティング 第7回 (#fpgax #7)
コンピュータアーキテクチャ研究の最新動向〜ISCA2015参加報告〜 @FPGAエクストリーム・コンピューティング 第7回 (#fpgax #7)
 
Zynq+PyCoRAM(+Debian)入門
Zynq+PyCoRAM(+Debian)入門Zynq+PyCoRAM(+Debian)入門
Zynq+PyCoRAM(+Debian)入門
 
FPGA・リコンフィギャラブルシステム研究の最新動向
FPGA・リコンフィギャラブルシステム研究の最新動向FPGA・リコンフィギャラブルシステム研究の最新動向
FPGA・リコンフィギャラブルシステム研究の最新動向
 
PyCoRAMによるPythonを用いたポータブルなFPGAアクセラレータ開発 (チュートリアル@ESS2014)
PyCoRAMによるPythonを用いたポータブルなFPGAアクセラレータ開発 (チュートリアル@ESS2014)PyCoRAMによるPythonを用いたポータブルなFPGAアクセラレータ開発 (チュートリアル@ESS2014)
PyCoRAMによるPythonを用いたポータブルなFPGAアクセラレータ開発 (チュートリアル@ESS2014)
 

Dernier

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Dernier (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

SmartCore System for Dependable Many-core Processor with Multifunction Routers (in ICNC'10 Hiroshima)

  • 1. SmartCore system for Dependable Many-core Processor with Multifunction Routers Shinya Takamaeda†, Shimpei Sato†, Takefumi Miyoshi‡, Kenji Kise† †Tokyo Institute of Technology, Japan ‡The University of Electro-communications, Japan 10-11-18 ICNC’10 @Hiroshima Regular Paper Hardware Design and Implementation 14:50-15:20
  • 2. Contents  Motivation  Proposal: SmartCore system  Preliminary Evaluation  Hardware Implementation on FPGAs  Related Work  Conclusion 10-11-18 ICNC'102
  • 3. Contents  Motivation  Proposal: SmartCore system  Preliminary Evaluation  Hardware Implementation on FPGAs  Related Work  Conclusion 10-11-18 ICNC'103
  • 4. Many-core Processors appear! 10-11-18 Intel Single Chip Cloud Computer 48 cores (x86) TILERA TILE-Gx100 100 cores (MIPS) ICNC'104
  • 5. Inter-connection for Many-core processors  NoC (Network on Chip)  Data transmission via on-chip-routers 10-11-18 ICNC'105 R PE R PE R PE R PE R PE R PE R PE R PE R PE R PE R PE R PE R PE R PE R PE R PE
  • 6. Low Dependability on Many-core  Process technology scaling for more transistors  But it increases …  Soft errors (e.g. bit inversion) •  since cosmic radiations  Timing errors •  since variations in transistor characteristic or wire delay 10-11-18 ICNC'10 How to create a reliable Many-core processor? 6
  • 7. Circuit Micro-architecture Architecture Software Assurance of the reliability on each layer 10-11-18 ICNC'107 Razor-FF Lock-step Check-pointing / Re-execution Inter-connection SmartCore system Canary-FF ECC in DRAM Memory Architectural Core Salvaging Slip Stream Processor
  • 8. Contents  Motivation  Proposal: SmartCore system  Preliminary Evaluation  Hardware Implementation on FPGAs  Related Work  Conclusion 10-11-18 ICNC'108
  • 9. We propose the SmartCore system  SmartCore system = Smart many-core system with redundant cores and multifunction routers  Key: NoC-based DMR  To detect a error, compare the output packets from the pair  On-chip router has 3 special functions •  Copy a packet •  Change the destination •  Wait and Compare 2 packets 10-11-18 ICNC'10 PE R PE R PE R PE R PE R PE R Handling the same packets by packet coping Running the same thread (DMR) Running the single thread (DMR) sharing a packet / comparing 2 packets 9
  • 10. Base many-core architecture: M-Core [1]  2D mesh network connects Nodes  Each Node memory is independent  Inter-Node communication  DMA via packets using ID  A packet is a series of flits (Flow Control Unit) •  Only the head flit of a packet contains the destination 10-11-18 ICNC'10 Node (2, 1) INCC Node memory Core Comp. Node (1,1) Comp. Node (1,2) Comp. Node (1,8) Comp. Node (2,1) Comp. Node (2,2) Comp. Node (2,8) Comp. Node (3,1) Comp. Node (3,2) Comp. Node (3,8) Comp. Node (8,1) Comp. Node (8,1) Comp. Node (8,8) Operation Node (0,0) Memory Node (1,0) Off chip memory modules and switch Conventional I/O Many-core processor chip Memory Node (2,0) Memory Node (3,0) Memory Node (8,0) Node (1, 1) INCC Node memory Core Router Router 10
  • 11. DMR on two nodes by using SmartCore   Executing a same program binary on the pair  Master Node and Mirror Node  If generated packets are different, they are faulty   Packet coping on the Router of the Master  for the Mirror to use the same data as Master   Packet comparison on the Router of Master  If these two differ, then the Router detects a error 10-11-18 ICNC'1011 PE R PE R PE R PE R PE R PE R PE R PE R Master Node Mirror Node Node (1,1) Node (2,1) Node (3,1) Node (4,1) Node (1,2) Node (2,2) Node (3,2) Node (4,2) Logically Node (1,1)
  • 12. 1. Coping a packet to the Mirror Node  Router on Master Node copies a coming packet to the Mirror Node  The destination is changed to the Mirror Node’s ID  Original program has a several DMA communications  To certainly continue executing the same program in the two Node 10-11-18 ICNC'10 INCCINCC R R Master Mirror P P 12
  • 13. 2. Wait for a packet from the Mirror Node 3. Compare the contents of two packets  Router on Master Node waits a packet from Master Node and a packet from Mirror Node  When Router on Master receives the head flits from both Nodes, then it starts to compare the 2 flits in order  If the contents of flits differ, a error exists in either Master Node or Mirror Node 10-11-18 ICNC'10 INCCINCC R R Master MirrorP P 13
  • 14. Base router  5 inputs with input buffers / 5 outputs  X-Y Dimension-order routing  Wormhole switching, Xon/Xoff flow control  1hop/1cycle, single cycle, no virtual channels 10-11-18 ICNC'10 Router XBAR Switch Output port X+ Output port X- Output port Y+ Output port Y- Output port DMAC Input port X+ Input port X- Input port Y+ Input port Y- Input port DMAC Arbiter 14
  • 15.   Additional buffer for coping for Mirror Node (a)   ID translator to change the destination (b)   Flit comparator to verify (c)   Node type, Master/Mirror Node ID   Configured by system software Multifunction router for SmartCore system 10-11-18 ICNC'10 Output port INCCInput port INCC Router XBAR Switch Output port X+ Output port X- Output port Y+ Output port Y- Input port X+ Input port X- Input port Y+ Input port Y- Arbiter node type master / mirror ID V Verify ID translation (a)(b) (c) 15
  • 16. Advantages of SmartCore system  Adaptable to any kind of hardware modules generating a packet  ex) Cache, DSP, Processor core  Because of …  Error detection mechanism is independent to Node structure •  Core-granularity redundant execution / Packet level error detection 10-11-18 ICNC'1016
  • 17. Contents  Motivation  Proposal: SmartCore system  Preliminary Evaluation  Hardware Implementation on FPGAs  Related Work  Conclusion 10-11-18 ICNC'1017
  • 18. Preliminary Evaluation of SmartCore system  2 evaluations  Performance overhead on DMR  Packet rendezvous time  Environment: SimMc 1.0  64 (8×8) threads on 128 (16×8) Nodes  Core •  MIPS32 single issue / single cycle processor  Router •  1 hop / 1 cycle, no virtual channels, flit size: 4 bytes  INCC (Network Interface) •  up to 1 flit / cycle receive/send from/to router  Benchmark: 4 apps from NAS Parallel Benchmarks •  cg, ft, is, lu, Size: S 10-11-18 ICNC'1018 Node (X, Y) INCC Node memory Core Router
  • 19. 3 configurations of thread mapping 10-11-18 ICNC'1019 1,1 1,2 1,8 2,1 2,2 2,8 8,1 8,2 8,8 8 Nodes 8 Nodes 8Nodes 1,1 1,2 1,8 2,1 2,2 2,8 8,1 8,2 8,8 8Nodes 16 Nodes 1,1 1,2 1,8 1,1 1,2 1,8 2,1 2,2 2,8 2,1 2,2 2,8 8,1 8,2 8,8 8,1 8,2 8,8 8Nodes 16 Nodes (a) Base Allocation (b) Redundant space allocation (Area 2x) (c) Redundant execution with SmartCore system x,y Proper thread (Master Node) Redundant thread (Mirror Node) x,y Not working to see the effect on #hops to see the effect on SmartCore
  • 20. Evaluation: Performance overhead on DMR  A little slow down  Redundant space (Area 2x): up to 1% slow down  Redundant execution (SmartCore): up to 4% slow down (in cg of NPB) 10-11-18 ICNC'1020
  • 21. Evaluation: Packet rendezvous time  Cumulative distribution of # cycles that the router on Master Node waits for a packet from Mirror Node  Almost communications with a little rendezvous 10-11-18 ICNC'10 cg ft is lu 21
  • 22. Contents  Motivation  Proposal: SmartCore system  Preliminary Evaluation  Hardware Implementation on FPGAs  Related Work  Conclusion 10-11-18 ICNC'1022
  • 23. Hardware Implementation on FPGAs  Dependable Many-core processor on FPGA- based prototyping system  by using ScalableCore system [8] •  Connected FPGA boards •  Variable # FPGA boards  2 execution mode •  Normal Mode –  Standard M-Core •  SmartCore Mode –  The pair executes same thread 10-11-18 ICNC'1023 SD Loader (0,1) Physical ID (1,1) Path (0,2) Physical ID (1,2) Physical ID (2,1) Physical ID (2,2) Physical ID (3,1) Physical ID (3,2) Physical ID (4,1) Physical ID (4,2) Path (0,3) Physical ID (1,3) Physical ID (2,3) Physical ID (3,3) Physical ID (4,3) LogicalID (1,1) LogicalID (1,2) LogicalID (1,3) LogicalID (2,1) LogicalID (2,2) LogicalID (2,3) Power Master Mirror Master Mirror
  • 24. Overview of 15 Nodes ScalableCore system with SmartCore system 10-11-18 ICNC'1024 Logical ID (1,1) Master Mirror Logical ID (1,2) Master Mirror Logical ID (1,3) Master Mirror Logical ID (2,1) Master Mirror Logical ID (2,2) Master Mirror Logical ID (2,3) Master Mirror Program Loader ID (0,1) SmartCore system detects a artificial fault
  • 25. Contents  Motivation  Proposal: SmartCore system  Preliminary Evaluation  Hardware Implementation on FPGAs  Related Work  Conclusion 10-11-18 ICNC'1025
  • 26. Related work  Slipstream Processor [9, Karthik, ASPLOS2000]  Improving ILP and dependability by using tightly coupled two cores  2 threads •  Proper sequence and shorter sequence  Loose Lock-stepped system [10, Nidhi, ISCA2007]  Dividing cores, cache, main memory into two groups  I/O level error detection  Lockstep [11, IBM]  Redundant execution on synchronized processors  I/O level error detection 10-11-18 ICNC'1026
  • 27. Contents  Motivation  Proposal: SmartCore system  Preliminary Evaluation  Hardware Implementation on FPGAs  Related Work  Conclusion 10-11-18 ICNC'1027
  • 28. Conclusion  We propose the SmartCore system  NoC-based DMR by using multifunction routers  Multifunction router has 3 special functions •  Coping a packet •  Changing the destination of a packet •  Waiting and comparing the contents of two packets  Low performance overhead  Hardware implementation on FPGA-based prototyping system  Future works  Recovery after error detections  TMR by SmartCore system 10-11-18 ICNC'1028