SlideShare une entreprise Scribd logo
1  sur  22
Compilation guiding and adjusting to hardware changes in Embedded Reconfigurable Architecture (  )  May 4, 2011 Ayal Zaks IBM Haifa Research Lab E A R
Motivation ,[object Object],[object Object],[object Object]
Challenges of  (EU FP7 STREP) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The adaptive ERA platform will be able to meet these challenges!! E A R
Memory component Network component Processing component Monitoring Hardware scheduler LIBRARIES Applications OS (or software scheduler) C/C++/ Java compiler Power vs. Performance ARM, VEX, DSP, accelerators, etc. Crossbar, bus, NoC, etc. Multi-level caches, controllers, etc. Abstract overview of the  platform E A R
Partners of  Participant no.  Participant organisation name Short name Country 1 (Coordinator) Technische Universiteit Delft TUD NL 2 Industrial Systems Institute ISI GR 3 Universita' degli Studi di Siena UNISI IT 4 Chalmers University CHALMERS SE 5 University of Edinburgh UEDIN UK 6 Evidence EVI IT 7 ST Microelectronics ST IT 8 IBM IBM IL 9 Universidade do Rio Grande do Sul UFRGS BR 10 Uppsala University UPP SE E A R
Key elements of the ERA platform ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Work packages and leaders ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Goals of the ERA project ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
... compilers to deal with dynamically parameterizable hardware ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The architecture:   -VEX ,[object Object],[object Object],[object Object],[object Object]
WP4 1 st  Year Achievements ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],pipe:: c0  shl  $r0.3 = $r0.3,16 c0  shl  $r0.4 = $r0.4,16 c0  shl  $r0.5 = $r0.5,16 c0  shl  $r0.6 = $r0.6,16 ;; ;; ;; c0  shr  $r0.4 = $r0.4,16 c0  shr  $r0.6 = $r0.6,16 c0  shr  $r0.3 = $r0.3,16 c0  shr  $r0.5 = $r0.5,16 ;; ;; ;; c0  mpyll  $r0.3 = $r0.3,$r0.4 c0  mpyll  $r0.5 = $r0.5,$r0.6 ;; ;;;; c0  mpyll  $r0.3 = $r0.3,$r0.5 ;; ;; ;; c0  add  $r0.3 = $r0.3,3 ;;;; ;; c0  sxth  $r0.3 = $r0.3 ;;;; ;; c0  return  $r0.1 = $r0.1,(0x0),$l0.0 ;; ;; ;; ;; __attribute__ ((noinline)) short pipe (short a, short b, short c, short d) { short f, g , t; t = a * b; f = c * d; g = t * f; return g+3; } Source code  -VEX Assembly code generated by GCC
[object Object],[object Object],[object Object],[object Object],[object Object],WP4 1 st  Year Achievements (cont.)
Characterizing ILP of ERA benchmarks ,[object Object],Original program 128 = b + 119  119 = 119 + 4 127= a + 119 129 = MEM[128] 130 = 129 + 1 MEM[127] = 130 [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] 128 = b + 119  119 = 119 + 4 127= a + 119 129 = MEM[128] 130 = 129 + 1 MEM[127] = 130 [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] void foo (unsigned char  *dst ,  unsigned char *src ) { int x ; for( x = 0; x < 100; x+=1 ) dst[x] = ( src [x] + 1 ); } I nitiation  I nterval prologue   epilogue   Source code  -VEX Assembly (transcribed) modulo scheduled by GCC  128 = b + 119  119 = 119 + 4 127= a + 119 129 = MEM[128] 130 = 129 + 1 MEM[127] = 130 [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] 128 = b + 119  119 = 119 + 4 127= a + 119 129 = MEM[128] 130 = 129 + 1 MEM[127] = 130 [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11]
Example - X264 List of hot functions:
Poster at
Recent Developments ,[object Object],[object Object],[object Object],[object Object],Issue-width Slice Registers Slice LUTs BRAMs 2-issue 586 (0%) 6375 (4%) 4  (1%) 4-issue 1046 (0%) 12899 (8%) 16  (4%) 8-issue 1868 (0%) 26252 (17%) 64  (15%)
TLP vs. ILP ,[object Object],[object Object],[object Object],[object Object]
Core vs. Cache GCC EDP Different configurations, same EDP! Copyright © Keramidis & Kaxiras, ERA project
Conclusions ,[object Object],[object Object],[object Object]
Thanks! To you and: ,[object Object],[object Object],[object Object],E A R
Contact information Visit  http://www.era-project.eu  for more information Coordinator: Stephan Wong  (Delft University of Techology) [email_address] http://ce.et.tudelft.nl/~stephan/   IBM representative , Work Package 4 leader: Ayal Zaks  (IBM Haifa Research Lab) [email_address] https://www.research.ibm.com/haifa/dept/svt/code_compiler.html
Strengths of ERA partners ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Contenu connexe

Tendances

Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
NECST Lab @ Politecnico di Milano
 
LLVM-based Communication Optimizations for PGAS Programs
LLVM-based Communication Optimizations for PGAS ProgramsLLVM-based Communication Optimizations for PGAS Programs
LLVM-based Communication Optimizations for PGAS Programs
Akihiro Hayashi
 
Compiler optimization
Compiler optimizationCompiler optimization
Compiler optimization
liu_ming50
 
186 devlin p-poster(2)
186 devlin p-poster(2)186 devlin p-poster(2)
186 devlin p-poster(2)
vaidehi87
 
Compiler Optimization-Space Exploration
Compiler Optimization-Space ExplorationCompiler Optimization-Space Exploration
Compiler Optimization-Space Exploration
tmusabbir
 

Tendances (20)

Ch1
Ch1Ch1
Ch1
 
The CAOS framework: Democratize the acceleration of compute intensive applica...
The CAOS framework: Democratize the acceleration of compute intensive applica...The CAOS framework: Democratize the acceleration of compute intensive applica...
The CAOS framework: Democratize the acceleration of compute intensive applica...
 
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
Implementing AI: High Performance Architectures: Arm SVE and Supercomputer Fu...
 
Improve Vectorization Efficiency
Improve Vectorization EfficiencyImprove Vectorization Efficiency
Improve Vectorization Efficiency
 
Madeo - a CAD Tool for reconfigurable Hardware
Madeo - a CAD Tool for reconfigurable HardwareMadeo - a CAD Tool for reconfigurable Hardware
Madeo - a CAD Tool for reconfigurable Hardware
 
Unleash performance through parallelism - Intel® Math Kernel Library
Unleash performance through parallelism - Intel® Math Kernel LibraryUnleash performance through parallelism - Intel® Math Kernel Library
Unleash performance through parallelism - Intel® Math Kernel Library
 
P4 Introduction
P4 Introduction P4 Introduction
P4 Introduction
 
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
Automated Design Space Exploration and Roofline Analysis for FPGA-based HLS A...
 
LLVM-based Communication Optimizations for PGAS Programs
LLVM-based Communication Optimizations for PGAS ProgramsLLVM-based Communication Optimizations for PGAS Programs
LLVM-based Communication Optimizations for PGAS Programs
 
HPCLib & Excel : An efficient way to compute with Xeon PHI
HPCLib & Excel : An efficient way to compute with Xeon PHIHPCLib & Excel : An efficient way to compute with Xeon PHI
HPCLib & Excel : An efficient way to compute with Xeon PHI
 
Compiler optimization
Compiler optimizationCompiler optimization
Compiler optimization
 
Hadoop Internals (2.3.0 or later)
Hadoop Internals (2.3.0 or later)Hadoop Internals (2.3.0 or later)
Hadoop Internals (2.3.0 or later)
 
Compiler optimizations based on call-graph flattening
Compiler optimizations based on call-graph flatteningCompiler optimizations based on call-graph flattening
Compiler optimizations based on call-graph flattening
 
Compiler optimization
Compiler optimizationCompiler optimization
Compiler optimization
 
Peephole optimization techniques in compiler design
Peephole optimization techniques in compiler designPeephole optimization techniques in compiler design
Peephole optimization techniques in compiler design
 
186 devlin p-poster(2)
186 devlin p-poster(2)186 devlin p-poster(2)
186 devlin p-poster(2)
 
Compiler Optimization-Space Exploration
Compiler Optimization-Space ExplorationCompiler Optimization-Space Exploration
Compiler Optimization-Space Exploration
 
Compiler optimization techniques
Compiler optimization techniquesCompiler optimization techniques
Compiler optimization techniques
 
Peephole Optimization
Peephole OptimizationPeephole Optimization
Peephole Optimization
 
CC-4007, Large-Scale Machine Learning on Graphs, by Yucheng Low, Joseph Gonza...
CC-4007, Large-Scale Machine Learning on Graphs, by Yucheng Low, Joseph Gonza...CC-4007, Large-Scale Machine Learning on Graphs, by Yucheng Low, Joseph Gonza...
CC-4007, Large-Scale Machine Learning on Graphs, by Yucheng Low, Joseph Gonza...
 

Similaire à Track A-Compilation guiding and adjusting - IBM

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Michael_Joshua_Validation
Michael_Joshua_ValidationMichael_Joshua_Validation
Michael_Joshua_Validation
MichaelJoshua
 

Similaire à Track A-Compilation guiding and adjusting - IBM (20)

Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
 
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
 
Performance Verification for ESL Design Methodology from AADL Models
Performance Verification for ESL Design Methodology from AADL ModelsPerformance Verification for ESL Design Methodology from AADL Models
Performance Verification for ESL Design Methodology from AADL Models
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Embedded system
Embedded systemEmbedded system
Embedded system
 
Performance and Power Profiling on Intel Android Devices
Performance and Power Profiling on Intel Android DevicesPerformance and Power Profiling on Intel Android Devices
Performance and Power Profiling on Intel Android Devices
 
Accelerating Spark MLlib and DataFrame with Vector Processor “SX-Aurora TSUBASA”
Accelerating Spark MLlib and DataFrame with Vector Processor “SX-Aurora TSUBASA”Accelerating Spark MLlib and DataFrame with Vector Processor “SX-Aurora TSUBASA”
Accelerating Spark MLlib and DataFrame with Vector Processor “SX-Aurora TSUBASA”
 
openCL Paper
openCL PaperopenCL Paper
openCL Paper
 
Michael_Joshua_Validation
Michael_Joshua_ValidationMichael_Joshua_Validation
Michael_Joshua_Validation
 
The Best Programming Practice for Cell/B.E.
The Best Programming Practice for Cell/B.E.The Best Programming Practice for Cell/B.E.
The Best Programming Practice for Cell/B.E.
 
resume
resumeresume
resume
 
Daya_DBA
Daya_DBADaya_DBA
Daya_DBA
 
DhevendranResume
DhevendranResumeDhevendranResume
DhevendranResume
 
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated SystemsPetapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
 
Software Abstractions for Parallel Hardware
Software Abstractions for Parallel HardwareSoftware Abstractions for Parallel Hardware
Software Abstractions for Parallel Hardware
 
(1) c sharp introduction_basics_dot_net
(1) c sharp introduction_basics_dot_net(1) c sharp introduction_basics_dot_net
(1) c sharp introduction_basics_dot_net
 
Enterprise PL1 - Peter Elderon
Enterprise PL1 - Peter ElderonEnterprise PL1 - Peter Elderon
Enterprise PL1 - Peter Elderon
 
ElixirでFPGAを設計する
ElixirでFPGAを設計するElixirでFPGAを設計する
ElixirでFPGAを設計する
 
NWU and HPC
NWU and HPCNWU and HPC
NWU and HPC
 

Plus de chiportal

Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
chiportal
 

Plus de chiportal (20)

Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
 
Prof. Uri Weiser,Technion
Prof. Uri Weiser,TechnionProf. Uri Weiser,Technion
Prof. Uri Weiser,Technion
 
Ken Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, FaradayKen Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, Faraday
 
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 Prof. Danny Raz, Director, Bell Labs Israel, Nokia  Prof. Danny Raz, Director, Bell Labs Israel, Nokia
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, SynopsysMarco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
 
Dr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzDr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazz
 
Eddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, IntelEddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, Intel
 
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 Dr. John Bainbridge, Principal Application Architect, NetSpeed  Dr. John Bainbridge, Principal Application Architect, NetSpeed
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 
Xavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, ArterisXavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, Arteris
 
Asi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, VtoolAsi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, Vtool
 
Zvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQZvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQ
 
Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC
 
Kunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-SiliconKunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-Silicon
 
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, SynopsysGert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
 
Tuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano RetinaTuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano Retina
 
Sagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-SiliconSagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-Silicon
 
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP SemiconductorRonen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
 
Prof. Emanuel Cohen, Technion
Prof. Emanuel Cohen, TechnionProf. Emanuel Cohen, Technion
Prof. Emanuel Cohen, Technion
 

Dernier

Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
Chris Hunter
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 

Dernier (20)

PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 

Track A-Compilation guiding and adjusting - IBM

  • 1. Compilation guiding and adjusting to hardware changes in Embedded Reconfigurable Architecture ( ) May 4, 2011 Ayal Zaks IBM Haifa Research Lab E A R
  • 2.
  • 3.
  • 4. Memory component Network component Processing component Monitoring Hardware scheduler LIBRARIES Applications OS (or software scheduler) C/C++/ Java compiler Power vs. Performance ARM, VEX, DSP, accelerators, etc. Crossbar, bus, NoC, etc. Multi-level caches, controllers, etc. Abstract overview of the platform E A R
  • 5. Partners of Participant no. Participant organisation name Short name Country 1 (Coordinator) Technische Universiteit Delft TUD NL 2 Industrial Systems Institute ISI GR 3 Universita' degli Studi di Siena UNISI IT 4 Chalmers University CHALMERS SE 5 University of Edinburgh UEDIN UK 6 Evidence EVI IT 7 ST Microelectronics ST IT 8 IBM IBM IL 9 Universidade do Rio Grande do Sul UFRGS BR 10 Uppsala University UPP SE E A R
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14. Example - X264 List of hot functions:
  • 16.
  • 17.
  • 18. Core vs. Cache GCC EDP Different configurations, same EDP! Copyright © Keramidis & Kaxiras, ERA project
  • 19.
  • 20.
  • 21. Contact information Visit http://www.era-project.eu for more information Coordinator: Stephan Wong (Delft University of Techology) [email_address] http://ce.et.tudelft.nl/~stephan/ IBM representative , Work Package 4 leader: Ayal Zaks (IBM Haifa Research Lab) [email_address] https://www.research.ibm.com/haifa/dept/svt/code_compiler.html
  • 22.

Notes de l'éditeur

  1. Proebting was talking about performance; what about power/energy?? How can compilers help improve power? Tell me if you know..
  2. This slide is quite self-explanatory
  3. This slide shows the general overview of the ERA platform. Basically, there are different components for “processing”, “networking”, and “memories” that we can choose from in order to build the platform. On top, we want to be able to adapt to different applications by choosing from libraries of these components – an additional advantage is that we want to do this dynamically. For this, we need a hardware scheduler or an OS/software scheduler that works in tandem with the hardware scheduler. The monitoring block monitors for example the power and performance of the system and this information can be fed into the schedulers. Finally, we need a smarter compiler is better aware of the dynamic behavior of the platform.
  4. This slides shows all the partners within the project.
  5. This slide summarizes the slide with the figure of the ERA platform.
  6. -mcpu: architecture (ISA); -mtune: micro-architecture Several PowerPC versions; code size, flexibility, switch versions at specific places in code
  7. Memory params – static analysis of memory access patterns, temporal and spatial reuse Partition code into sections representing phases of distinct ILP/MEM
  8. In the table, you can highlight the fact that we can parameterize the issue width of the roVEX processor and that different instantiations have different resource utilizations.
  9. On this slide, we can see that with the same resources, we can instantiate different cores. 2 smaller ones to handle TLP or combine it into a big to exploit ILP. The idea in the ERA project is to be able to do this on-the-fly in a dynamic way manner.
  10. This slide shows results on EDP (energy-delay product) measurements by varying the instruction window size (this has a clear relation with the parallelism of an application - ILP) and cache sizes. We see in this slide that when we increase the cache size, the EDP decreases. However, more interesting is the fact that the EDP product is similar (almost the same) with varying configurations – see the arrows pointing to different ILP-cache configurations. This means that we can optimize our design by changing the parameters and still achieve the same EDP. Please note that the information on this slide has not been published yet, so it is copyrighted!!