SlideShare a Scribd company logo
1 of 23
Download to read offline
Run-Time Reconfiguration for
     HyperTransport coupled FPGAs using
                   ACCFS
                    Jochen Strunk, Andreas Heinig, Toni Volkmer,
                           Wolfgang Rehm, Heiko Schick

                         Chemnitz University of Technology
                           Computer Architecture Group
                              Prof. Wolfgang Rehm



WHTRA 2009, Heidelberg / February 12th 2009
Outline


      1 Introduction / Goals

      2 Run-Time Reconfiguration on FPGAs

      3 HyperTransport Cave with Run-Time Reconfiguration Support

      4 The Accelerator File System (ACCFS) as Software Framework

      5 Case Study - two RTRMs

      6 Conclusion



WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   2/23
Outline


      1 Introduction / Goals

      2 Run-Time Reconfiguration on FPGAs

      3 HyperTransport Cave with Run-Time Reconfiguration Support

      4 The Accelerator File System (ACCFS) as Software Framework

      5 Case Study - two RTRMs

      6 Conclusion



WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   3/23
Introduction

             FPGAs are used as accelerators in host coupled systems.
             Hot-plug functionality of plug-in-cards are not supported by
             most motherboards, BIOS’s, operating systems.
             For continuous host link connectivity todays plug-in-cards
             with FPGAs need a second chip which handles host
             communication, most common: a second FPGA.
             Uploading further accelerator modules / compute kernels is
             not possible during run-time although sufficient space would
             be available on the FPGA.

      Run-time reconfiguration (RTR)
      ⇒ On the other side FPGAs offer run-time reconfiguration support
      (DPR capable FPGAs, e.g. Xilinx Virtex, -2, -4, -5, -6).

WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   4/23
Goals



      Provide a solution for host coupled FPGAs
             where only one FPGA is needed in a host coupled system,
             i.e. no additional chip for host communication
             which allows to change the functionality during run-time
             e.g. for uploading further compute kernels
             where a software framework does exist for user applications,
             which allows easy handling without restricting the possibilities
             of run-time reconfigurable FPGAs




WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   5/23
Outline


      1 Introduction / Goals

      2 Run-Time Reconfiguration on FPGAs

      3 HyperTransport Cave with Run-Time Reconfiguration Support

      4 The Accelerator File System (ACCFS) as Software Framework

      5 Case Study - two RTRMs

      6 Conclusion



WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   6/23
Run-Time Reconfiguration on FPGAs

             Dynamic partial reconfiguration (DPR) is available on Xilinx
             Virtex,-2,-4,-5,-6 FPGAs.
             The functionality is divided into static and dynamic parts.
             Dynamic parts are called Run-Time Reconfigurable Modules
             (RTRMs).
             Granularity of partially reconfigurable region (PRR) is directly
             related to configuration frames.
             Three different interfaces are available for reconfiguration
             JTAG, SelectMAP, ICAP.
             A design flow for ”Module based Partial Reconfiguration” is
             applied.


WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   7/23
Outline


      1 Introduction / Goals

      2 Run-Time Reconfiguration on FPGAs

      3 HyperTransport Cave with Run-Time Reconfiguration Support

      4 The Accelerator File System (ACCFS) as Software Framework

      5 Case Study - two RTRMs

      6 Conclusion



WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   8/23
HT-Cave with RTR-Support

             To support RTRMs the standard HT-Cave-IP-Core must be
             enhanced.
             The overall design must comply with module based partial
             design flow.
             To ease porting the infrastructure to other interconnects, e.g.
             PCIe, the functionality is divided into:
                    host interface specific part:
                    HT Cave, HT Packet Engine
                    host interface independent part:
                    RTRM, RTRM Controller, Reconfig Unit, Internal Routing
                    Unit
             To generate a RTRM bit stream file a framework is provided
             to the user.

WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   9/23
HT-Cave with RTR-Support


      Infrastructure of a HT-Cave with RTR-Support:

                                                      static                                             dynamic

                                                                               RTRM
                                P        P                                    Controller
                                N        N
                                                                                                         RTRM
          HTX                   R        R      HT               Internal
                          HT
          host                                Packet             Routing      Reconfig
                         Cave
       connection                             Engine               Unit         Unit
                         Core
                                P        P
                                N        N
                                R        R




                            host interface specific                         host interface independent
                                                                   FPGA




WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk          10/23
Outline


      1 Introduction / Goals

      2 Run-Time Reconfiguration on FPGAs

      3 HyperTransport Cave with Run-Time Reconfiguration Support

      4 The Accelerator File System (ACCFS) as Software Framework

      5 Case Study - two RTRMs

      6 Conclusion



WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   11/23
ACCFS-Introduction

                                        Applications
   Syscall−API

   Process Management Virtual File System          Virtual Memory                                                                                       Socket

                                                                                                                                                                                                                      – ACCFS –
    logical:                         virtual:           ACCFS
                                                                                                                                                                                                                Accelerator File System
                                                                                                                                                                                           Char
     ext2        ext3   ...   vfat    proc      sysfs                                                    accfs                                                                             Dev
                                                        Device Handler


                                                                               Device Handler


                                                                                                          Device Handler


                                                                                                                                     Device Handler


                                                                                                                                                             Device Handler
                                                                                                                                                                              ClearSpeed


                                                                                                                                                                                                  ac97 driver
               Block Devices                                                                                               AMD−Ati
                                                                                                Nvidia




                                                                                                                                                      FPGA
                                                                                                                                                                                                                Open generalized interface
                                                                         SPU




                                                                                                                                                                                           ...
                              Vendor Interface
                                                                                                                                                                                                                for integrating accelerators
     Disk Controller Drivers                                                                             Bus Drivers                                                                                             into Linux based systems
   Hardware

                                                                                                         PCIe Host Bridge

                                                                         SPE                                                 FPGA                                                           ...




WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk                                                                                                                                 12/23
ACCFS-Hardware Integration Concepts
      Virtualization
      ⇒ Optimize hardware usage

      Generic interface
      ⇒ Establish interface based on well known standard: VFS

      Separation of functionalities
      ⇒ Ease integration of new accelerator types in ACCFS

      Host initiated DMA
      ⇒ Avoid page translation issues on the accelerator system

      Asynchronous context execution
      ⇒ No need for threading when running multiple instances in parallel
WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   13/23
ACCFS-FPGA Usage
                                   Application                               Device Handler
                                   Create Context
                                                       sys_acc_create

                                                                               Establish Context
                                                                             − FPGA available?
                                                                             − First initialization
                                  Configure Context                          − Returns: context descriptor
                                                      write (ctx/config)

                                                                                Configure FPGA
                                                                              − Validate bit stream
                                                                              − Space available?
                                   Data Exchange                              − Programm device
                                                      read / write (ctx/*)

                                                                                Validate Request

                                   Execute Design
                                                        sys_acc_run

                                                                                State Transition
                                                                                state goes into running

                                   Wait for Finish
                                                      read (ctx/status)

                                                                                 Wait for ’STOP’


                                   Destroy Context
                                                          close (ctx)
                                                                                 Wait for ’STOP’


                                         ...


WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk    14/23
Outline


      1 Introduction / Goals

      2 Run-Time Reconfiguration on FPGAs

      3 HyperTransport Cave with Run-Time Reconfiguration Support

      4 The Accelerator File System (ACCFS) as Software Framework

      5 Case Study - two RTRMs

      6 Conclusion



WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   15/23
Case Study


             As prove of concept we implemented 2 different compute
             kernels as RTRMs:
                    a pattern matcher, which finds patterns in a byte stream
                    a Mersenne Twister, which is a pseudo random number
                    generator
             A vendor device driver supporting the UoH HTX Virtex-4
             XC4VFX60 FPGA Card was implemented.
             An user application was implemented, which uploads and
             exchanges the RTRMs during run-time with the use of
             ACCFS.



WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   16/23
Case Study
      // Pattern Matcher offload function                    // Mersenne Twister offload function
      int matcher_run (void * search_db_in, int db_size      int run_compute_kernel (double * results_out,
          void * patterns_in, int pattern_count,                 int results_count) {
          void * results_out, int results_size) {              // create context of our FPGA design
        int ret;                                               int fd_ctx = (int)acc_create("example", V_ID,
        char bufstatus[12];                                        D_ID, 0750, NULL);
        // create context of our static FPGA design            // configure the design
        int fd_ctx = (int)acc_create("example", V_ID,          int fd_cfg = openat(fd_ctx, "config", O_WRONLY);
            D_ID, 0750, NULL);                                 configure_fpga(fd_cfg, MERSENNE_RTRM_BITSTREAM);
        // configure the design                                // open memory
        int fd_cfg = openat(fd_ctx, "config", O_WRONLY);       int fd_mem = openat(fd_ctx, "memory/FPGA MEM1",
        configure_fpga(fd_cfg, MATCHER_RTRM_BITSTREAM);            O_RDWR);
        // open memory and status                              // allocating buffer
        int fd_mem = openat(fd_ctx, "memory/FPGA MEM1",        int32_t * buffer = (int32_t *) mmap(NULL,
            O_RDWR);                                               MEM_SIZE, PROT_READ | PROT_WRITE,
        int fd_status = openat(fd_ctx, "status",                   MAP_SHARED, fd_mem, 0);
            O_RDONLY);
        // fill memory with data (DMA bulk transfer)
        pwrite(fd_mem, search_db_in, db_size, DB_OFFSET);        int32_t * mt32_numbers = buffer + NUMBERS_OFFSET;
        pwrite(fd_mem, patterns_in, 4 * pattern_count,
            PATTERN_OFFSET);
        // start the matcher                                     // start the Mersenne twister MT32
        acc_run(fd_ctx, 0);                                      acc_run(fd_ctx, 0);
        // check status
        // (wait until context execution finished)               // Example C function that uses random numbers
        read(fd_status, bufstatus, 12);                          c_kernel_function(results_out, results_count,
                                                                     mt32_numbers);
          // read results of operation (DMA bulk transfer)       // unmap buffer
          ret = pread(fd_mem, results_out,                       munmap((void *) buffer, MEM_SIZE);
              results_size, RESULTS_OFFSET);
          // close files                                         // close files
          close(fd_mem); close(fd_status); close(fd_cfg);        close(fd_mem); close(fd_cfg);
          return ret;                                            return 0;
      }                                                      }
WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk            17/23
Case Study
                                                                      placed and routed
                                                                      HT-Cave with
                                                                      RTR-support and
                                                                      pattern-matcher as
                                                                      RTRM


                                                                      Resource utilization of
                                                                      XC4VFX60:
                                                                             4 clock regions for
                                                                             HT Cave with
                                                                             RTR-support
                                                                             12 clock regions for
                                                                             RTRM
WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   18/23
Outline


      1 Introduction / Goals

      2 Run-Time Reconfiguration on FPGAs

      3 HyperTransport Cave with Run-Time Reconfiguration Support

      4 The Accelerator File System (ACCFS) as Software Framework

      5 Case Study - two RTRMs

      6 Conclusion



WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   19/23
Conclusion

             By using the ability of run-time reconfiguration of FPGAs it is
             possible to build single FPGA chip solutions for host
             coupled accelerators.
             A design of RTR-capable infrastructure was shown which
             allows to manage RTR modules during run-time.
             The implementation was done for FPGA directly coupled to
             the HyperTransport processor bus of the host system.
             The software framework ACCFS provides a generic
             interface to user applications which is able to satisfy the
             demand of RTR computing.
             The concept provided is applicable to other processor and
             peripheral bus coupled FPGAs.

WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   20/23
End



                                                 The End.


                                Thank you for your attention!


                                                Questions?




WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk    21/23
ACCFS-Project Status
      ACCFS 0.5 alpha available
                 http://www.tu-chemnitz.de/cs/ra/projects/accfs

      Features
             Host support for x86 and x86 64 (ppc32/64 available soon!)
             Support for recent Linux kernels
             Fully operational VFS interface
             Device handler support for UoM HTX Virtex-4 FPGA Card

      TODO
             Resource discovery interface (via proc or sysfs)
             Extend vendor interface for better virtualization support
             Other device handlers?: Cell/B.E., Clearspeed, ...
WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   22/23
ACCFS Development Road Map 2009

                                                                                111111111
                                                                                000000000
                                                                                111111111
                                                                                000000000
             Extended support for                                               111111111
                                                                                000000000
                                                                                   Case study ?
                                                                                111111111
                                                                                000000000
                                                                                111111111
                                                                                000000000
             UoH HTX FPGA card                                                  ClearSpeed / GPGPUs
                                                                                111111111
                                                                                000000000
                                                                                111111111
                                                                                000000000

                                                 Support for
                                            Virtex−5 PCIe board


                        IBM QS21 PCIe coupled
                        Cell/B.E. SPE integration



                                           Virtualization facilitating functions



       Q4 2008          Q1 2009           Q2 2009           Q3 2009          Q4 2009           Q1 2010

WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk   23/23

More Related Content

What's hot

Pragmatic optimization in modern programming - modern computer architecture c...
Pragmatic optimization in modern programming - modern computer architecture c...Pragmatic optimization in modern programming - modern computer architecture c...
Pragmatic optimization in modern programming - modern computer architecture c...Marina Kolpakova
 
Iaetsd multioperand redundant adders on fpg as
Iaetsd multioperand redundant adders on fpg asIaetsd multioperand redundant adders on fpg as
Iaetsd multioperand redundant adders on fpg asIaetsd Iaetsd
 
Implementation of Soft-core processor on FPGA (Final Presentation)
Implementation of Soft-core processor on FPGA (Final Presentation)Implementation of Soft-core processor on FPGA (Final Presentation)
Implementation of Soft-core processor on FPGA (Final Presentation)Deepak Kumar
 
Linaro Connect 2016 (BKK16) - Introduction to LISA
Linaro Connect 2016 (BKK16) - Introduction to LISALinaro Connect 2016 (BKK16) - Introduction to LISA
Linaro Connect 2016 (BKK16) - Introduction to LISAPatrick Bellasi
 
Network Programming: Data Plane Development Kit (DPDK)
Network Programming: Data Plane Development Kit (DPDK)Network Programming: Data Plane Development Kit (DPDK)
Network Programming: Data Plane Development Kit (DPDK)Andriy Berestovskyy
 
BKK16-311 EAS Upstream Stategy
BKK16-311 EAS Upstream StategyBKK16-311 EAS Upstream Stategy
BKK16-311 EAS Upstream StategyLinaro
 
A CGRA-based Approach for Accelerating Convolutional Neural Networks
A CGRA-based Approachfor Accelerating Convolutional Neural NetworksA CGRA-based Approachfor Accelerating Convolutional Neural Networks
A CGRA-based Approach for Accelerating Convolutional Neural NetworksShinya Takamaeda-Y
 
MARC ONERA Toulouse2012 Altreonic
MARC ONERA Toulouse2012 AltreonicMARC ONERA Toulouse2012 Altreonic
MARC ONERA Toulouse2012 AltreonicEric Verhulst
 
Exploiting Linux Control Groups for Effective Run-time Resource Management
Exploiting Linux Control Groups for Effective Run-time Resource ManagementExploiting Linux Control Groups for Effective Run-time Resource Management
Exploiting Linux Control Groups for Effective Run-time Resource ManagementPatrick Bellasi
 
Lec14 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech --- Coherence
Lec14 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech --- CoherenceLec14 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech --- Coherence
Lec14 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech --- CoherenceHsien-Hsin Sean Lee, Ph.D.
 
FD.io Vector Packet Processing (VPP)
FD.io Vector Packet Processing (VPP)FD.io Vector Packet Processing (VPP)
FD.io Vector Packet Processing (VPP)Kirill Tsym
 
A High Performance Heterogeneous FPGA-based Accelerator with PyCoRAM (Runner ...
A High Performance Heterogeneous FPGA-based Accelerator with PyCoRAM (Runner ...A High Performance Heterogeneous FPGA-based Accelerator with PyCoRAM (Runner ...
A High Performance Heterogeneous FPGA-based Accelerator with PyCoRAM (Runner ...Shinya Takamaeda-Y
 
BKK16-TR08 How to generate power models for EAS and IPA
BKK16-TR08 How to generate power models for EAS and IPABKK16-TR08 How to generate power models for EAS and IPA
BKK16-TR08 How to generate power models for EAS and IPALinaro
 
Pci express3-device-architecture-optimizations-idf2009-presentation
Pci express3-device-architecture-optimizations-idf2009-presentationPci express3-device-architecture-optimizations-idf2009-presentation
Pci express3-device-architecture-optimizations-idf2009-presentationjkcontee
 
QsNetIII, An HPC Interconnect For Peta Scale Systems
QsNetIII, An HPC Interconnect For Peta Scale SystemsQsNetIII, An HPC Interconnect For Peta Scale Systems
QsNetIII, An HPC Interconnect For Peta Scale SystemsFederica Pisani
 
Advanced pipelining
Advanced pipeliningAdvanced pipelining
Advanced pipelininga_spac
 
Lec13 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Multicore
Lec13 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- MulticoreLec13 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Multicore
Lec13 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- MulticoreHsien-Hsin Sean Lee, Ph.D.
 
LAS16-400: Mini Conference 3 AOSP (Session 1)
LAS16-400: Mini Conference 3 AOSP (Session 1)LAS16-400: Mini Conference 3 AOSP (Session 1)
LAS16-400: Mini Conference 3 AOSP (Session 1)Linaro
 

What's hot (20)

Pragmatic optimization in modern programming - modern computer architecture c...
Pragmatic optimization in modern programming - modern computer architecture c...Pragmatic optimization in modern programming - modern computer architecture c...
Pragmatic optimization in modern programming - modern computer architecture c...
 
Iaetsd multioperand redundant adders on fpg as
Iaetsd multioperand redundant adders on fpg asIaetsd multioperand redundant adders on fpg as
Iaetsd multioperand redundant adders on fpg as
 
Implementation of Soft-core processor on FPGA (Final Presentation)
Implementation of Soft-core processor on FPGA (Final Presentation)Implementation of Soft-core processor on FPGA (Final Presentation)
Implementation of Soft-core processor on FPGA (Final Presentation)
 
Linaro Connect 2016 (BKK16) - Introduction to LISA
Linaro Connect 2016 (BKK16) - Introduction to LISALinaro Connect 2016 (BKK16) - Introduction to LISA
Linaro Connect 2016 (BKK16) - Introduction to LISA
 
Network Programming: Data Plane Development Kit (DPDK)
Network Programming: Data Plane Development Kit (DPDK)Network Programming: Data Plane Development Kit (DPDK)
Network Programming: Data Plane Development Kit (DPDK)
 
BKK16-311 EAS Upstream Stategy
BKK16-311 EAS Upstream StategyBKK16-311 EAS Upstream Stategy
BKK16-311 EAS Upstream Stategy
 
Pipelining1
Pipelining1Pipelining1
Pipelining1
 
A CGRA-based Approach for Accelerating Convolutional Neural Networks
A CGRA-based Approachfor Accelerating Convolutional Neural NetworksA CGRA-based Approachfor Accelerating Convolutional Neural Networks
A CGRA-based Approach for Accelerating Convolutional Neural Networks
 
MARC ONERA Toulouse2012 Altreonic
MARC ONERA Toulouse2012 AltreonicMARC ONERA Toulouse2012 Altreonic
MARC ONERA Toulouse2012 Altreonic
 
Exploiting Linux Control Groups for Effective Run-time Resource Management
Exploiting Linux Control Groups for Effective Run-time Resource ManagementExploiting Linux Control Groups for Effective Run-time Resource Management
Exploiting Linux Control Groups for Effective Run-time Resource Management
 
Lec14 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech --- Coherence
Lec14 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech --- CoherenceLec14 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech --- Coherence
Lec14 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech --- Coherence
 
FD.io Vector Packet Processing (VPP)
FD.io Vector Packet Processing (VPP)FD.io Vector Packet Processing (VPP)
FD.io Vector Packet Processing (VPP)
 
A High Performance Heterogeneous FPGA-based Accelerator with PyCoRAM (Runner ...
A High Performance Heterogeneous FPGA-based Accelerator with PyCoRAM (Runner ...A High Performance Heterogeneous FPGA-based Accelerator with PyCoRAM (Runner ...
A High Performance Heterogeneous FPGA-based Accelerator with PyCoRAM (Runner ...
 
BKK16-TR08 How to generate power models for EAS and IPA
BKK16-TR08 How to generate power models for EAS and IPABKK16-TR08 How to generate power models for EAS and IPA
BKK16-TR08 How to generate power models for EAS and IPA
 
Unit 3
Unit 3Unit 3
Unit 3
 
Pci express3-device-architecture-optimizations-idf2009-presentation
Pci express3-device-architecture-optimizations-idf2009-presentationPci express3-device-architecture-optimizations-idf2009-presentation
Pci express3-device-architecture-optimizations-idf2009-presentation
 
QsNetIII, An HPC Interconnect For Peta Scale Systems
QsNetIII, An HPC Interconnect For Peta Scale SystemsQsNetIII, An HPC Interconnect For Peta Scale Systems
QsNetIII, An HPC Interconnect For Peta Scale Systems
 
Advanced pipelining
Advanced pipeliningAdvanced pipelining
Advanced pipelining
 
Lec13 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Multicore
Lec13 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- MulticoreLec13 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Multicore
Lec13 Computer Architecture by Hsien-Hsin Sean Lee Georgia Tech -- Multicore
 
LAS16-400: Mini Conference 3 AOSP (Session 1)
LAS16-400: Mini Conference 3 AOSP (Session 1)LAS16-400: Mini Conference 3 AOSP (Session 1)
LAS16-400: Mini Conference 3 AOSP (Session 1)
 

Viewers also liked

Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
Experiences in Application Specific Supercomputer Design - Reasons, Challenge...Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
Experiences in Application Specific Supercomputer Design - Reasons, Challenge...Heiko Joerg Schick
 
POLS 3620 Contemporary europe and asia presentation: Chinese & Western Opera
POLS 3620 Contemporary europe and asia presentation: Chinese & Western OperaPOLS 3620 Contemporary europe and asia presentation: Chinese & Western Opera
POLS 3620 Contemporary europe and asia presentation: Chinese & Western OperaShan Shan Hung
 
High Performance Computing - Challenges on the Road to Exascale Computing
High Performance Computing - Challenges on the Road to Exascale ComputingHigh Performance Computing - Challenges on the Road to Exascale Computing
High Performance Computing - Challenges on the Road to Exascale ComputingHeiko Joerg Schick
 
Petascale Analytics - The World of Big Data Requires Big Analytics
Petascale Analytics - The World of Big Data Requires Big AnalyticsPetascale Analytics - The World of Big Data Requires Big Analytics
Petascale Analytics - The World of Big Data Requires Big AnalyticsHeiko Joerg Schick
 
Browser and Management App for Google's Person Finder
Browser and Management App for Google's Person FinderBrowser and Management App for Google's Person Finder
Browser and Management App for Google's Person FinderHeiko Joerg Schick
 
IBM Corporate Service Corps - Helping Create Interactive Flood Maps
IBM Corporate Service Corps - Helping Create Interactive Flood MapsIBM Corporate Service Corps - Helping Create Interactive Flood Maps
IBM Corporate Service Corps - Helping Create Interactive Flood MapsHeiko Joerg Schick
 

Viewers also liked (7)

Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
Experiences in Application Specific Supercomputer Design - Reasons, Challenge...Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
 
POLS 3620 Contemporary europe and asia presentation: Chinese & Western Opera
POLS 3620 Contemporary europe and asia presentation: Chinese & Western OperaPOLS 3620 Contemporary europe and asia presentation: Chinese & Western Opera
POLS 3620 Contemporary europe and asia presentation: Chinese & Western Opera
 
High Performance Computing - Challenges on the Road to Exascale Computing
High Performance Computing - Challenges on the Road to Exascale ComputingHigh Performance Computing - Challenges on the Road to Exascale Computing
High Performance Computing - Challenges on the Road to Exascale Computing
 
Petascale Analytics - The World of Big Data Requires Big Analytics
Petascale Analytics - The World of Big Data Requires Big AnalyticsPetascale Analytics - The World of Big Data Requires Big Analytics
Petascale Analytics - The World of Big Data Requires Big Analytics
 
Blue Gene Active Storage
Blue Gene Active StorageBlue Gene Active Storage
Blue Gene Active Storage
 
Browser and Management App for Google's Person Finder
Browser and Management App for Google's Person FinderBrowser and Management App for Google's Person Finder
Browser and Management App for Google's Person Finder
 
IBM Corporate Service Corps - Helping Create Interactive Flood Maps
IBM Corporate Service Corps - Helping Create Interactive Flood MapsIBM Corporate Service Corps - Helping Create Interactive Flood Maps
IBM Corporate Service Corps - Helping Create Interactive Flood Maps
 

Similar to Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS

CAPI and OpenCAPI Hardware acceleration enablement
CAPI and OpenCAPI Hardware acceleration enablementCAPI and OpenCAPI Hardware acceleration enablement
CAPI and OpenCAPI Hardware acceleration enablementGanesan Narayanasamy
 
On the Capability and Achievable Performance of FPGAs for HPC Applications
On the Capability and Achievable Performance of FPGAs for HPC ApplicationsOn the Capability and Achievable Performance of FPGAs for HPC Applications
On the Capability and Achievable Performance of FPGAs for HPC ApplicationsWim Vanderbauwhede
 
And Then There Were None
And Then There Were NoneAnd Then There Were None
And Then There Were NoneSimar Neasy
 
Building a QT based solution on a i.MX7 processor running Linux and FreeRTOS
Building a QT based solution on a i.MX7 processor running Linux and FreeRTOSBuilding a QT based solution on a i.MX7 processor running Linux and FreeRTOS
Building a QT based solution on a i.MX7 processor running Linux and FreeRTOSFernando Luiz Cola
 
Using a Field Programmable Gate Array to Accelerate Application Performance
Using a Field Programmable Gate Array to Accelerate Application PerformanceUsing a Field Programmable Gate Array to Accelerate Application Performance
Using a Field Programmable Gate Array to Accelerate Application PerformanceOdinot Stanislas
 
FPGAs for Supercomputing: The Why and How
FPGAs for Supercomputing: The Why and HowFPGAs for Supercomputing: The Why and How
FPGAs for Supercomputing: The Why and HowDESMOND YUEN
 
Install FD.IO VPP On Intel(r) Architecture & Test with Trex*
Install FD.IO VPP On Intel(r) Architecture & Test with Trex*Install FD.IO VPP On Intel(r) Architecture & Test with Trex*
Install FD.IO VPP On Intel(r) Architecture & Test with Trex*Michelle Holley
 
00 opencapi acceleration framework yonglu_ver2
00 opencapi acceleration framework yonglu_ver200 opencapi acceleration framework yonglu_ver2
00 opencapi acceleration framework yonglu_ver2Yutaka Kawai
 
Do Theoretical Flo Ps Matter For Real Application’S Performance Kaust 2012
Do Theoretical Flo Ps Matter For Real Application’S Performance Kaust 2012Do Theoretical Flo Ps Matter For Real Application’S Performance Kaust 2012
Do Theoretical Flo Ps Matter For Real Application’S Performance Kaust 2012Joshua Mora
 
6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_finalYutaka Kawai
 
GPGPU Accelerates PostgreSQL (English)
GPGPU Accelerates PostgreSQL (English)GPGPU Accelerates PostgreSQL (English)
GPGPU Accelerates PostgreSQL (English)Kohei KaiGai
 
Exploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudExploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudRyousei Takano
 
SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)
SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)
SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)Yuuki Takano
 
LEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGATO project
 

Similar to Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS (20)

1
11
1
 
CAPI and OpenCAPI Hardware acceleration enablement
CAPI and OpenCAPI Hardware acceleration enablementCAPI and OpenCAPI Hardware acceleration enablement
CAPI and OpenCAPI Hardware acceleration enablement
 
On the Capability and Achievable Performance of FPGAs for HPC Applications
On the Capability and Achievable Performance of FPGAs for HPC ApplicationsOn the Capability and Achievable Performance of FPGAs for HPC Applications
On the Capability and Achievable Performance of FPGAs for HPC Applications
 
And Then There Were None
And Then There Were NoneAnd Then There Were None
And Then There Were None
 
Building a QT based solution on a i.MX7 processor running Linux and FreeRTOS
Building a QT based solution on a i.MX7 processor running Linux and FreeRTOSBuilding a QT based solution on a i.MX7 processor running Linux and FreeRTOS
Building a QT based solution on a i.MX7 processor running Linux and FreeRTOS
 
Fpga
FpgaFpga
Fpga
 
Using a Field Programmable Gate Array to Accelerate Application Performance
Using a Field Programmable Gate Array to Accelerate Application PerformanceUsing a Field Programmable Gate Array to Accelerate Application Performance
Using a Field Programmable Gate Array to Accelerate Application Performance
 
FPGAs for Supercomputing: The Why and How
FPGAs for Supercomputing: The Why and HowFPGAs for Supercomputing: The Why and How
FPGAs for Supercomputing: The Why and How
 
Install FD.IO VPP On Intel(r) Architecture & Test with Trex*
Install FD.IO VPP On Intel(r) Architecture & Test with Trex*Install FD.IO VPP On Intel(r) Architecture & Test with Trex*
Install FD.IO VPP On Intel(r) Architecture & Test with Trex*
 
00 opencapi acceleration framework yonglu_ver2
00 opencapi acceleration framework yonglu_ver200 opencapi acceleration framework yonglu_ver2
00 opencapi acceleration framework yonglu_ver2
 
Cyclone IV FPGA Device
Cyclone IV FPGA DeviceCyclone IV FPGA Device
Cyclone IV FPGA Device
 
Do Theoretical Flo Ps Matter For Real Application’S Performance Kaust 2012
Do Theoretical Flo Ps Matter For Real Application’S Performance Kaust 2012Do Theoretical Flo Ps Matter For Real Application’S Performance Kaust 2012
Do Theoretical Flo Ps Matter For Real Application’S Performance Kaust 2012
 
6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final6 open capi_meetup_in_japan_final
6 open capi_meetup_in_japan_final
 
24
2424
24
 
GPGPU Accelerates PostgreSQL (English)
GPGPU Accelerates PostgreSQL (English)GPGPU Accelerates PostgreSQL (English)
GPGPU Accelerates PostgreSQL (English)
 
Exploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudExploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC Cloud
 
0507036
05070360507036
0507036
 
SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)
SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)
SF-TAP: Scalable and Flexible Traffic Analysis Platform (USENIX LISA 2015)
 
LEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming Models
 
Userspace networking
Userspace networkingUserspace networking
Userspace networking
 

More from Heiko Joerg Schick

Da Vinci - A scaleable architecture for neural network computing (updated v4)
Da Vinci - A scaleable architecture for neural network computing (updated v4)Da Vinci - A scaleable architecture for neural network computing (updated v4)
Da Vinci - A scaleable architecture for neural network computing (updated v4)Heiko Joerg Schick
 
Huawei empowers healthcare industry with AI technology
Huawei empowers healthcare industry with AI technologyHuawei empowers healthcare industry with AI technology
Huawei empowers healthcare industry with AI technologyHeiko Joerg Schick
 
The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...
The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...
The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...Heiko Joerg Schick
 
The Smarter Car for Autonomous Driving
 The Smarter Car for Autonomous Driving The Smarter Car for Autonomous Driving
The Smarter Car for Autonomous DrivingHeiko Joerg Schick
 
From edge computing to in-car computing
From edge computing to in-car computingFrom edge computing to in-car computing
From edge computing to in-car computingHeiko Joerg Schick
 
Need and value for various levels of autonomous driving
Need and value for various levels of autonomous drivingNeed and value for various levels of autonomous driving
Need and value for various levels of autonomous drivingHeiko Joerg Schick
 
Real time Flood Simulation for Metro Manila and the Philippines
Real time Flood Simulation for Metro Manila and the PhilippinesReal time Flood Simulation for Metro Manila and the Philippines
Real time Flood Simulation for Metro Manila and the PhilippinesHeiko Joerg Schick
 
directCell - Cell/B.E. tightly coupled via PCI Express
directCell - Cell/B.E. tightly coupled via PCI ExpressdirectCell - Cell/B.E. tightly coupled via PCI Express
directCell - Cell/B.E. tightly coupled via PCI ExpressHeiko Joerg Schick
 
QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)Heiko Joerg Schick
 
QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)Heiko Joerg Schick
 

More from Heiko Joerg Schick (13)

Da Vinci - A scaleable architecture for neural network computing (updated v4)
Da Vinci - A scaleable architecture for neural network computing (updated v4)Da Vinci - A scaleable architecture for neural network computing (updated v4)
Da Vinci - A scaleable architecture for neural network computing (updated v4)
 
Huawei empowers healthcare industry with AI technology
Huawei empowers healthcare industry with AI technologyHuawei empowers healthcare industry with AI technology
Huawei empowers healthcare industry with AI technology
 
The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...
The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...
The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...
 
The Smarter Car for Autonomous Driving
 The Smarter Car for Autonomous Driving The Smarter Car for Autonomous Driving
The Smarter Car for Autonomous Driving
 
From edge computing to in-car computing
From edge computing to in-car computingFrom edge computing to in-car computing
From edge computing to in-car computing
 
Need and value for various levels of autonomous driving
Need and value for various levels of autonomous drivingNeed and value for various levels of autonomous driving
Need and value for various levels of autonomous driving
 
Real time Flood Simulation for Metro Manila and the Philippines
Real time Flood Simulation for Metro Manila and the PhilippinesReal time Flood Simulation for Metro Manila and the Philippines
Real time Flood Simulation for Metro Manila and the Philippines
 
Slimline Open Firmware
Slimline Open FirmwareSlimline Open Firmware
Slimline Open Firmware
 
Agnostic Device Drivers
Agnostic Device DriversAgnostic Device Drivers
Agnostic Device Drivers
 
The Cell Processor
The Cell ProcessorThe Cell Processor
The Cell Processor
 
directCell - Cell/B.E. tightly coupled via PCI Express
directCell - Cell/B.E. tightly coupled via PCI ExpressdirectCell - Cell/B.E. tightly coupled via PCI Express
directCell - Cell/B.E. tightly coupled via PCI Express
 
QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
 
QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
 

Recently uploaded

Kalyan callg Girls, { 07738631006 } || Call Girl In Kalyan Women Seeking Men ...
Kalyan callg Girls, { 07738631006 } || Call Girl In Kalyan Women Seeking Men ...Kalyan callg Girls, { 07738631006 } || Call Girl In Kalyan Women Seeking Men ...
Kalyan callg Girls, { 07738631006 } || Call Girl In Kalyan Women Seeking Men ...Pooja Nehwal
 
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...motiram463
 
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...amitlee9823
 
Lubrication and it's types and properties of the libricabt
Lubrication and it's types and properties of the libricabtLubrication and it's types and properties of the libricabt
Lubrication and it's types and properties of the libricabtdineshkumar430venkat
 
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...ranjana rawat
 
Low Rate Call Girls Nashik Vedika 7001305949 Independent Escort Service Nashik
Low Rate Call Girls Nashik Vedika 7001305949 Independent Escort Service NashikLow Rate Call Girls Nashik Vedika 7001305949 Independent Escort Service Nashik
Low Rate Call Girls Nashik Vedika 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
High Profile Call Girls In Andheri 7738631006 Call girls in mumbai Mumbai ...
High Profile Call Girls In Andheri 7738631006 Call girls in mumbai  Mumbai ...High Profile Call Girls In Andheri 7738631006 Call girls in mumbai  Mumbai ...
High Profile Call Girls In Andheri 7738631006 Call girls in mumbai Mumbai ...Pooja Nehwal
 
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Call Girls in Nagpur High Profile
 
9004554577, Get Adorable Call Girls service. Book call girls & escort service...
9004554577, Get Adorable Call Girls service. Book call girls & escort service...9004554577, Get Adorable Call Girls service. Book call girls & escort service...
9004554577, Get Adorable Call Girls service. Book call girls & escort service...Pooja Nehwal
 
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...Pooja Nehwal
 
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...Amil baba
 
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcRCALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcRdollysharma2066
 
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Call Girls in Nagpur High Profile
 
VVIP Pune Call Girls Karve Nagar (7001035870) Pune Escorts Nearby with Comple...
VVIP Pune Call Girls Karve Nagar (7001035870) Pune Escorts Nearby with Comple...VVIP Pune Call Girls Karve Nagar (7001035870) Pune Escorts Nearby with Comple...
VVIP Pune Call Girls Karve Nagar (7001035870) Pune Escorts Nearby with Comple...Call Girls in Nagpur High Profile
 
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...Call Girls in Nagpur High Profile
 
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)kojalkojal131
 
Call Girls Dubai Slut Wife O525547819 Call Girls Dubai Gaped
Call Girls Dubai Slut Wife O525547819 Call Girls Dubai GapedCall Girls Dubai Slut Wife O525547819 Call Girls Dubai Gaped
Call Girls Dubai Slut Wife O525547819 Call Girls Dubai Gapedkojalkojal131
 
Call Girls in Vashi Escorts Services - 7738631006
Call Girls in Vashi Escorts Services - 7738631006Call Girls in Vashi Escorts Services - 7738631006
Call Girls in Vashi Escorts Services - 7738631006Pooja Nehwal
 

Recently uploaded (20)

Kalyan callg Girls, { 07738631006 } || Call Girl In Kalyan Women Seeking Men ...
Kalyan callg Girls, { 07738631006 } || Call Girl In Kalyan Women Seeking Men ...Kalyan callg Girls, { 07738631006 } || Call Girl In Kalyan Women Seeking Men ...
Kalyan callg Girls, { 07738631006 } || Call Girl In Kalyan Women Seeking Men ...
 
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
 
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Banashankari Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
 
Lubrication and it's types and properties of the libricabt
Lubrication and it's types and properties of the libricabtLubrication and it's types and properties of the libricabt
Lubrication and it's types and properties of the libricabt
 
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
 
Low Rate Call Girls Nashik Vedika 7001305949 Independent Escort Service Nashik
Low Rate Call Girls Nashik Vedika 7001305949 Independent Escort Service NashikLow Rate Call Girls Nashik Vedika 7001305949 Independent Escort Service Nashik
Low Rate Call Girls Nashik Vedika 7001305949 Independent Escort Service Nashik
 
High Profile Call Girls In Andheri 7738631006 Call girls in mumbai Mumbai ...
High Profile Call Girls In Andheri 7738631006 Call girls in mumbai  Mumbai ...High Profile Call Girls In Andheri 7738631006 Call girls in mumbai  Mumbai ...
High Profile Call Girls In Andheri 7738631006 Call girls in mumbai Mumbai ...
 
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Chakan ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
 
9004554577, Get Adorable Call Girls service. Book call girls & escort service...
9004554577, Get Adorable Call Girls service. Book call girls & escort service...9004554577, Get Adorable Call Girls service. Book call girls & escort service...
9004554577, Get Adorable Call Girls service. Book call girls & escort service...
 
@Delhi ! CAll GIRLS IN Defence Colony 🦋 9999965857 🤩 Dwarka Call Girls
@Delhi ! CAll GIRLS IN Defence Colony 🦋 9999965857 🤩 Dwarka Call Girls@Delhi ! CAll GIRLS IN Defence Colony 🦋 9999965857 🤩 Dwarka Call Girls
@Delhi ! CAll GIRLS IN Defence Colony 🦋 9999965857 🤩 Dwarka Call Girls
 
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
Call Girls in Thane 9892124323, Vashi cAll girls Serivces Juhu Escorts, powai...
 
CHEAP Call Girls in Hauz Quazi (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Hauz Quazi  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Hauz Quazi  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Hauz Quazi (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
NO1 Verified Amil Baba In Karachi Kala Jadu In Karachi Amil baba In Karachi A...
 
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcRCALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
 
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
 
VVIP Pune Call Girls Karve Nagar (7001035870) Pune Escorts Nearby with Comple...
VVIP Pune Call Girls Karve Nagar (7001035870) Pune Escorts Nearby with Comple...VVIP Pune Call Girls Karve Nagar (7001035870) Pune Escorts Nearby with Comple...
VVIP Pune Call Girls Karve Nagar (7001035870) Pune Escorts Nearby with Comple...
 
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...
 
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
 
Call Girls Dubai Slut Wife O525547819 Call Girls Dubai Gaped
Call Girls Dubai Slut Wife O525547819 Call Girls Dubai GapedCall Girls Dubai Slut Wife O525547819 Call Girls Dubai Gaped
Call Girls Dubai Slut Wife O525547819 Call Girls Dubai Gaped
 
Call Girls in Vashi Escorts Services - 7738631006
Call Girls in Vashi Escorts Services - 7738631006Call Girls in Vashi Escorts Services - 7738631006
Call Girls in Vashi Escorts Services - 7738631006
 

Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS

  • 1. Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk, Andreas Heinig, Toni Volkmer, Wolfgang Rehm, Heiko Schick Chemnitz University of Technology Computer Architecture Group Prof. Wolfgang Rehm WHTRA 2009, Heidelberg / February 12th 2009
  • 2. Outline 1 Introduction / Goals 2 Run-Time Reconfiguration on FPGAs 3 HyperTransport Cave with Run-Time Reconfiguration Support 4 The Accelerator File System (ACCFS) as Software Framework 5 Case Study - two RTRMs 6 Conclusion WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 2/23
  • 3. Outline 1 Introduction / Goals 2 Run-Time Reconfiguration on FPGAs 3 HyperTransport Cave with Run-Time Reconfiguration Support 4 The Accelerator File System (ACCFS) as Software Framework 5 Case Study - two RTRMs 6 Conclusion WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 3/23
  • 4. Introduction FPGAs are used as accelerators in host coupled systems. Hot-plug functionality of plug-in-cards are not supported by most motherboards, BIOS’s, operating systems. For continuous host link connectivity todays plug-in-cards with FPGAs need a second chip which handles host communication, most common: a second FPGA. Uploading further accelerator modules / compute kernels is not possible during run-time although sufficient space would be available on the FPGA. Run-time reconfiguration (RTR) ⇒ On the other side FPGAs offer run-time reconfiguration support (DPR capable FPGAs, e.g. Xilinx Virtex, -2, -4, -5, -6). WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 4/23
  • 5. Goals Provide a solution for host coupled FPGAs where only one FPGA is needed in a host coupled system, i.e. no additional chip for host communication which allows to change the functionality during run-time e.g. for uploading further compute kernels where a software framework does exist for user applications, which allows easy handling without restricting the possibilities of run-time reconfigurable FPGAs WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 5/23
  • 6. Outline 1 Introduction / Goals 2 Run-Time Reconfiguration on FPGAs 3 HyperTransport Cave with Run-Time Reconfiguration Support 4 The Accelerator File System (ACCFS) as Software Framework 5 Case Study - two RTRMs 6 Conclusion WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 6/23
  • 7. Run-Time Reconfiguration on FPGAs Dynamic partial reconfiguration (DPR) is available on Xilinx Virtex,-2,-4,-5,-6 FPGAs. The functionality is divided into static and dynamic parts. Dynamic parts are called Run-Time Reconfigurable Modules (RTRMs). Granularity of partially reconfigurable region (PRR) is directly related to configuration frames. Three different interfaces are available for reconfiguration JTAG, SelectMAP, ICAP. A design flow for ”Module based Partial Reconfiguration” is applied. WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 7/23
  • 8. Outline 1 Introduction / Goals 2 Run-Time Reconfiguration on FPGAs 3 HyperTransport Cave with Run-Time Reconfiguration Support 4 The Accelerator File System (ACCFS) as Software Framework 5 Case Study - two RTRMs 6 Conclusion WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 8/23
  • 9. HT-Cave with RTR-Support To support RTRMs the standard HT-Cave-IP-Core must be enhanced. The overall design must comply with module based partial design flow. To ease porting the infrastructure to other interconnects, e.g. PCIe, the functionality is divided into: host interface specific part: HT Cave, HT Packet Engine host interface independent part: RTRM, RTRM Controller, Reconfig Unit, Internal Routing Unit To generate a RTRM bit stream file a framework is provided to the user. WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 9/23
  • 10. HT-Cave with RTR-Support Infrastructure of a HT-Cave with RTR-Support: static dynamic RTRM P P Controller N N RTRM HTX R R HT Internal HT host Packet Routing Reconfig Cave connection Engine Unit Unit Core P P N N R R host interface specific host interface independent FPGA WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 10/23
  • 11. Outline 1 Introduction / Goals 2 Run-Time Reconfiguration on FPGAs 3 HyperTransport Cave with Run-Time Reconfiguration Support 4 The Accelerator File System (ACCFS) as Software Framework 5 Case Study - two RTRMs 6 Conclusion WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 11/23
  • 12. ACCFS-Introduction Applications Syscall−API Process Management Virtual File System Virtual Memory Socket – ACCFS – logical: virtual: ACCFS Accelerator File System Char ext2 ext3 ... vfat proc sysfs accfs Dev Device Handler Device Handler Device Handler Device Handler Device Handler ClearSpeed ac97 driver Block Devices AMD−Ati Nvidia FPGA Open generalized interface SPU ... Vendor Interface for integrating accelerators Disk Controller Drivers Bus Drivers into Linux based systems Hardware PCIe Host Bridge SPE FPGA ... WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 12/23
  • 13. ACCFS-Hardware Integration Concepts Virtualization ⇒ Optimize hardware usage Generic interface ⇒ Establish interface based on well known standard: VFS Separation of functionalities ⇒ Ease integration of new accelerator types in ACCFS Host initiated DMA ⇒ Avoid page translation issues on the accelerator system Asynchronous context execution ⇒ No need for threading when running multiple instances in parallel WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 13/23
  • 14. ACCFS-FPGA Usage Application Device Handler Create Context sys_acc_create Establish Context − FPGA available? − First initialization Configure Context − Returns: context descriptor write (ctx/config) Configure FPGA − Validate bit stream − Space available? Data Exchange − Programm device read / write (ctx/*) Validate Request Execute Design sys_acc_run State Transition state goes into running Wait for Finish read (ctx/status) Wait for ’STOP’ Destroy Context close (ctx) Wait for ’STOP’ ... WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 14/23
  • 15. Outline 1 Introduction / Goals 2 Run-Time Reconfiguration on FPGAs 3 HyperTransport Cave with Run-Time Reconfiguration Support 4 The Accelerator File System (ACCFS) as Software Framework 5 Case Study - two RTRMs 6 Conclusion WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 15/23
  • 16. Case Study As prove of concept we implemented 2 different compute kernels as RTRMs: a pattern matcher, which finds patterns in a byte stream a Mersenne Twister, which is a pseudo random number generator A vendor device driver supporting the UoH HTX Virtex-4 XC4VFX60 FPGA Card was implemented. An user application was implemented, which uploads and exchanges the RTRMs during run-time with the use of ACCFS. WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 16/23
  • 17. Case Study // Pattern Matcher offload function // Mersenne Twister offload function int matcher_run (void * search_db_in, int db_size int run_compute_kernel (double * results_out, void * patterns_in, int pattern_count, int results_count) { void * results_out, int results_size) { // create context of our FPGA design int ret; int fd_ctx = (int)acc_create("example", V_ID, char bufstatus[12]; D_ID, 0750, NULL); // create context of our static FPGA design // configure the design int fd_ctx = (int)acc_create("example", V_ID, int fd_cfg = openat(fd_ctx, "config", O_WRONLY); D_ID, 0750, NULL); configure_fpga(fd_cfg, MERSENNE_RTRM_BITSTREAM); // configure the design // open memory int fd_cfg = openat(fd_ctx, "config", O_WRONLY); int fd_mem = openat(fd_ctx, "memory/FPGA MEM1", configure_fpga(fd_cfg, MATCHER_RTRM_BITSTREAM); O_RDWR); // open memory and status // allocating buffer int fd_mem = openat(fd_ctx, "memory/FPGA MEM1", int32_t * buffer = (int32_t *) mmap(NULL, O_RDWR); MEM_SIZE, PROT_READ | PROT_WRITE, int fd_status = openat(fd_ctx, "status", MAP_SHARED, fd_mem, 0); O_RDONLY); // fill memory with data (DMA bulk transfer) pwrite(fd_mem, search_db_in, db_size, DB_OFFSET); int32_t * mt32_numbers = buffer + NUMBERS_OFFSET; pwrite(fd_mem, patterns_in, 4 * pattern_count, PATTERN_OFFSET); // start the matcher // start the Mersenne twister MT32 acc_run(fd_ctx, 0); acc_run(fd_ctx, 0); // check status // (wait until context execution finished) // Example C function that uses random numbers read(fd_status, bufstatus, 12); c_kernel_function(results_out, results_count, mt32_numbers); // read results of operation (DMA bulk transfer) // unmap buffer ret = pread(fd_mem, results_out, munmap((void *) buffer, MEM_SIZE); results_size, RESULTS_OFFSET); // close files // close files close(fd_mem); close(fd_status); close(fd_cfg); close(fd_mem); close(fd_cfg); return ret; return 0; } } WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 17/23
  • 18. Case Study placed and routed HT-Cave with RTR-support and pattern-matcher as RTRM Resource utilization of XC4VFX60: 4 clock regions for HT Cave with RTR-support 12 clock regions for RTRM WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 18/23
  • 19. Outline 1 Introduction / Goals 2 Run-Time Reconfiguration on FPGAs 3 HyperTransport Cave with Run-Time Reconfiguration Support 4 The Accelerator File System (ACCFS) as Software Framework 5 Case Study - two RTRMs 6 Conclusion WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 19/23
  • 20. Conclusion By using the ability of run-time reconfiguration of FPGAs it is possible to build single FPGA chip solutions for host coupled accelerators. A design of RTR-capable infrastructure was shown which allows to manage RTR modules during run-time. The implementation was done for FPGA directly coupled to the HyperTransport processor bus of the host system. The software framework ACCFS provides a generic interface to user applications which is able to satisfy the demand of RTR computing. The concept provided is applicable to other processor and peripheral bus coupled FPGAs. WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 20/23
  • 21. End The End. Thank you for your attention! Questions? WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 21/23
  • 22. ACCFS-Project Status ACCFS 0.5 alpha available http://www.tu-chemnitz.de/cs/ra/projects/accfs Features Host support for x86 and x86 64 (ppc32/64 available soon!) Support for recent Linux kernels Fully operational VFS interface Device handler support for UoM HTX Virtex-4 FPGA Card TODO Resource discovery interface (via proc or sysfs) Extend vendor interface for better virtualization support Other device handlers?: Cell/B.E., Clearspeed, ... WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 22/23
  • 23. ACCFS Development Road Map 2009 111111111 000000000 111111111 000000000 Extended support for 111111111 000000000 Case study ? 111111111 000000000 111111111 000000000 UoH HTX FPGA card ClearSpeed / GPGPUs 111111111 000000000 111111111 000000000 Support for Virtex−5 PCIe board IBM QS21 PCIe coupled Cell/B.E. SPE integration Virtualization facilitating functions Q4 2008 Q1 2009 Q2 2009 Q3 2009 Q4 2009 Q1 2010 WHTRA 2009, Heidelberg Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS Jochen Strunk 23/23