SlideShare une entreprise Scribd logo
1  sur  30
Télécharger pour lire hors ligne
Cooperative VM Migration
 for a virtualized HPC Cluster
 with VMM-bypass I/O devices

                 Ryousei Takano, Hidemoto Nakada, Takahiro Hirofuchi,
                                  Yoshio Tanaka, and Tomohiro Kudoh

                                     Information Technology Research Institute,
National Institute of Advanced Industrial Science and Technology (AIST), Japan

               IEEE eScience 2012, Oct. 11 2012, Chicago
Background
•  HPC cloud is a promising e-Science platform.
   –  HPC users begin to take an interest in Cloud computing,
      e.g., Amazon EC2 Cluster Compute Instances.
•  Virtualization is a key technology.
   –  Pro: It makes migration of computing elements easy.
      •  VM migration is useful for achieving fault tolerance, server
         consolidation, etc.
   –  Con: It introduces a large overhead, spoiling I/O
      performance.
      •  VMM-bypass I/O technologies, e.g., PCI passthrough and
         SR-IOV, can significantly mitigate the overhead.

  VMM-bypass I/O makes it impossible to migrate a VM.
                                                                        2
Contribution
•  Goal:
   –  To realize VM migration and checkpoint/restart on a
      virtualized cluster with VMM-bypass I/O devices.
      •  E.g., VM migration on an Infiniband cluster
•  Contributions:
   –  We propose cooperative VM migration based on the
      Symbiotic Virtualization (SymVirt) mechanism.
   –  We demonstrate reactive/proactive fault tolerant (FT)
      systems.
   –  We show postcopy migration helps to reduce the
      service downtime in the proactive FT system.

                                                              3
Agenda
•    Background and Motivation
•    SymVirt: Symbiotic Virtualization Mechanism
•    Experiment
•    Related Work
•    Conclusion and Future Work




                                                   4
Motivating Observation
 •  Performance evaluation of HPC cloud
                                   –  (Para-)virtualized I/O incurs a large overhead.
                                   –  PCI passthrough significantly mitigate the overhead.
                                                                               KVM (IB)    KVM (virtio)
                             300
                                               BMM (IB)        BMM (10GbE)                 VM1
                                                                              VM1
                                               KVM (IB)        KVM (virtio)
                             250
                                                                               Guest OS     Guest OS
  Execution time [seconds]




                             200                                               Physical       Guest
                                                                                driver        driver
                             150

                             100                                              VMM          VMM

                              50                                                             Physical
                                                                                              driver
                               0
                                     BT   CG     EP       FT          LU
The overhead of I/O virtualization on the NAS                                 IB QDR HCA   10GbE NIC
Parallel Benchmarks 3.3.1 class C, 64 processes.
                                                                                            BMM: Bare Metal Machine
                                                                                                                      5
Para-virtualized                            VMM-Bypass I/O
 device (virtio_net)                PCI passthrough          SR-IOV
VM1              VM2               VM1           VM2           VM1                VM2
 Guest OS                           Guest OS                    Guest OS
                       …                               …                                …
  Guest                             Physical                        Physical
  driver                             driver                          driver


VMM                                VMM                         VMM
            vSwitch

            Physical
             driver

NIC                                NIC                         NIC

                                                                        Switch (VEB)

                       Para-virt             PCI           SR-IOV
                        device           passthrough                           We address
  Performance                                                                  this issue!
  Device sharing
  VM migration
                                                                                            6
Problem
VMM-bypass I/O technologies make VM migration
and checkpoint/restart impossible.
  1.  VMM does not know the time when VMM-bypass I/O
      devices are detached safely.
     •  To perform such migration without losing in-flight data,
        packet transmission to/from the VM should be stopped prior
        to detaching.
     •  With a VMM, it is hard to know the communication status of
        an application inside the VM, especially if VMM-bypass I/O
        devices are used.
  2.  VMM cannot migrate the state of VMM-bypass I/O
      devices from the source to the destination.
     •  With Infiniband, Local ID, QueuePair Numbers, etc.
                                                                     7
Goal
                                migration

                                Ethernet
Cluster 1                                                Cluster 2



 VM1        VM2     VM3   VM4


               detach                        re-attach
            Infiniband                      Infiniband




   We need a mechanism of combining VM migration and
   PCI device hot-plugging.

                                                                     8
SymVirt: Symbiotic Virtualization
Cooperative VM Migration
Existing VM Migration          Cooperative VM Migration
 (Black-box approach)            (Gray-box approach)
     Pro: portability              Pro: performance

  Guest OS                         Guest OS
                                                     Global
                                                  coordination
        Application                      Application
                                              Device setup

                                 Cooperation
               Global
  VMM        coordination          VMM                   VMM-bypass
                Device setup
                                                            I/O
 Migration                        Migration


  NIC                              NIC


                                                                  10
SymVirt: Symbiotic Virtualization
•  We focus on MPI programs.
•  We design and implement
   a symbiotic virtualization         Guest OS
                                                        Global
   (SymVirt) mechanism.                              coordination
                                            Application
  –  It is a cross-layer mechanism
                                                 Device setup
     between a VMM and an MPI
     runtime system.                             SymVirt
                                     Cooperation

                                      VMM                   VMM-bypass
                                                               I/O
                                     Migration


                                      NIC


                                                                    11
SymVirt: Overview
                                                               node #0                                         node #n
                                                                                                 ...
                                       Guest OS                                                              Guest OS


                                                  MPI app.                 MPI app.                             MPI app.
Cloud scheduler




                                                  MPI lib.                 MPI lib.                             MPI lib.
                  1) trigger events




                                                SymVirt                    SymVirt         2) coordination      SymVirt
                                              coordinator                coordinator                          coordinator


                                      3) SymVirt wait
                                                             5) SymVirt signal
                                       VMM                                                                   VMM
                                                                       4) do something, e.g.,
                                          SymVirt        SymVirt      migration/checkpointing                  SymVirt
                                         controller       agent                                                 agent




                                                                                                                            12
SymVirt wait and signal calls
•  SymVirt provides a simple Guest OS-to-VMM
   communication mechanism.
•  SymVirt coordinator issues a SymVirt wait call, the guest
   OS is blocked until a SymVirt signal call is issued.
•  In the meantime, SymVirt agent controls the VM via a
   VMM monitor interface.
    Application
                           confirm                           confirm linkup
                                      SymVirt coordinator

     SymVirt
      wait   SymVirt                                                Guest OS mode
              signal                                                     VMM mode
                  detach             migration         re-attach

                  SymVirt controller/agent
                                                                                    13
SymVirt: Implementation
•  We implemented SymVirt on top of QEMU/KVM
   and the Open MPI system.
•  User application and the MPI runtime system
   can work without any modifications.
•  QEMU/KVM is slightly modified for supporting
   SymVirt wait and signal calls.
  –  A SymVirt wait call is implemented by            SymVirt
                                                    coordinator
     using a VMCALL Intel VT-x instruction.
                                               SymVirt        SymVirt
  –  A SymVirt signal call is implemented as    wait           signal
     a new QEMU/KVM monitor command.
                                                         SymVirt
                                                          agent

                                                                    14
SymVirt: Implementation (cont’d)
•  SymVirt coordinator is heavily relied on the Open
   MPI checkpoint/restart (C/R) framework.
  –  Global coordination of SymVirt is the same as a
     coordination protocol for MPI programs.
  –  SymVirt executes VM-level migration or C/R instead
     of process-level C/R using the BLCR system.
  –  SymVirt does not need to take care of changing LIDs
     and QPNs after a migration, because Open MPI’s
     BTL modules are re-constructed and connections are
     re-established at continue or restart phases.


                            BTL: Point-to-Point Byte Transfer Layer
                                                                      15
SymVirt: Implementation (cont’d)
•  SymVirt controller and agent are written in Python.
import symvirt!                            !
agent_list = [migrate_from]!               # device attach!
ctl = symvirt.Controller(agent_list)!      ctl.append_agent(migrate_to)!
!                                          ctl.wait_all()!
# device detach!                           kwargs = {'pci_id':'04:00.0',
ctl.wait_all()!                            'tag':'vf0'}!
kwargs = {'tag':'vf0'}!                    ctl.device_attach(**kwargs)!
ctl.device_detach(**kwargs)!               ctl.signal()!
ctl.signal()!                              !
!                                          ctl.close()
# vm migration!
ctl.wait_all()!
kwargs = {'postcopy':True, 'uri':'tcp:%s:%d' !
          % (migrate_to[0], migrate_port)}!
ctl.migrate(**kwargs)!
ctl.remove_agent(migrate_from)!

                                                                           16
SymPFT: Proactive FT system
•  A VM-level fault tolerant (FT) system is a use
   case of SymVirt.


      Cloud&scheduler
           allocation

                           User requires a virtualized cluster consists
                           of 4 nodes (16 CPUs).




         global&storage&
          (VM&images)
                                                                          17
SymPFT: Proactive FT system
•  A VM-level fault tolerant (FT) system is a use
   case of SymVirt.
•  A VM is migrated from a “unhealthy” node to a
   “healthy” node before the node crashes.
      Cloud&scheduler                                Cloud&scheduler
           allocation                                     re-allocation
                                         Failure
                                 re!!
                           Failu        prediction



                                                     VM migration

         global&storage&                                global&storage&
          (VM&images)                                    (VM&images)
                                                                          18
Experiment
Experiment
•  The overhead of SymPFT
  –  We used 8 VMs on an Infiniband cluster.
  –  We migrated a VM once during a benchmark
     execution.
•  Two benchmark programs written in MPI
  –  memtest: a simple memory intensive benchmark
  –  NAS Parallel Benchmarks (NPB) version 3.3.1


•  Overhead reduction using postcopy migration


                                                    20
Experimental setting
      We used a 16 node Infiniband cluster, which is
      a part of the AIST Green Cloud.
      Blade server Dell PowerEdge M610              Host machine environment
CPU        Intel quad-core Xeon E5540/2.53GHz x2   OS             Debian 7.0

Chipset    Intel 5520                              Linux kernel   3.2.18

Memory     48 GB DDR3                              QEMU/KVM       1.1-rc3

InfiniBand Mellanox ConnectX (MT26428)             MPI            Open MPI 1.6
                                                   OFED           1.5.4.1
                 Blade switch
                                                   Compiler       gcc/gfortran 4.4.6
InfiniBand Mellanox M3601Q (QDR 16 ports)                   VM environment
                                                   VCPU       8
 Only 1 VM runs on 1 host, and an IB HCA is !
 assigned to the VM by using PCI passthrough.      Memory     20 GB
                                                                                   21
Result: memtest
•  The migration time is dependent on the memory footprint.
   –  The migration throughput is less than 3 Gbps.
•  Both hotplug and link-up times are approximately constant.
   –  The link-up time is not a negligible overhead. c.f., Ethernet
                     100
                           migration   hotplug   linkup
                      80
    Execution Time
      [Seconds]




                                                          44.2     53.7
                      60   35.9           38.7

                      40
                           14.6           13.5            12.5     11.3
                      20
                           28.5           28.5            28.5     28.6
                       0
                           2GB            4GB             8GB     16GB
                                         memory footprint
                                                 This result does not include our proceeding.
                                                                                           22
Result: NAS Parallel Benchmarks
                           1400
                                                                                 linkup                        postcopy migration
                           1200                                                  hotplug                       precopy migration
                                  +105 s
Execution time [seconds]




                                                                                 application
                           1000
                                                   There is no overhead                                         +103 s

                            800                    during normal operations
                                                             +97 s
                                                                                       +299 s
                            600

                            400
                                                                                                       The overhead is proportional
                            200                                                                        to the memory footprint.
                              0
                                  baseline precopy postcopy baseline precopy postcopy baseline precopy postcopy baseline precopy postcopy
                                             BT                       CG                         FT                       LU

                                                                             Transferred Memory Size during VM Migration [MB]
                                                                                  BT             CG              FT             LU
                                                                                4417            3394           15678           2348

                                                                                                                                            23
Integration with postcopy migration
 •  In contrast to precopy migration, postcopy migration
    transfers memory pages on demand after the execution
    node is switched to the destination.
 •  Postcopy migration can hide the overhead of the hot-add
    and link-up times by overlapping them and migration.
 •  We used our postcopy migration implementation for
    QEMU/KVM, Yabusame.

   SymPFT      a) hot-del   b) migration   c) hot-add d) link-up
  (precopy)


   SymPFT                                      overhead mitigation
  (postcopy)


                                                                     24
Result: Effect of postcopy migration
                           1400
                                                                                 linkup                         postcopy migration
                                                     -15 %
                           1200                                                  hotplug                        precopy migration
Execution time [seconds]




                                                                                 application                                      -13 s
                           1000

                            800                                                -14 %
                                                                                                        -53 %
                            600

                            400
                                                                                        Postcopy migration can hide the
                            200                                                         overhead of hotplug and link-up by
                              0                                                         overlapping them and migration.
                                  baseline precopy postcopy baseline precopy postcopy baseline precopy postcopy baseline precopy postcopy
                                             BT                       CG                         FT                       LU

                                                                             Transferred Memory Size during VM Migration [MB]
                                                                                  BT             CG              FT              LU
                                                                                 4417           3394            15678           2348

                                                                                                                                            25
Related Work
•  Some VM-level reactive and proactive FT systems have
   been proposed for HPC systems.
   –  E.g., VNsnap: a distributed snapshots of VMs
      •  The coordination is executed by snooping the traffic of a software
         switch outside the VMs.
   –  They do not support VMM-bypass I/O devices.
•  Mercury: a self-virtualization technique
   –  An OS can turn virtualization on and off on demand.
   –  It lacks a coordination mechanism among distributed VMM.
•  SymCall: an upcall mechanism from a VMM to the guest
   OS, using a nested VM Exit call
   –  SymVirt is a simple hypercall mechanism from a guest OS to the
      VMM, assuming it works in cooperation with a cloud scheduler.
                                                                              26
Conclusion and Future Work
Conclustion
•  We have proposed a cooperative VM migration
   mechanism that enables us to migrate VMs with VMM-
   bypass I/O devices, using a simple Guest OS-to-VMM
   communication mechanism, called SymVirt.
•  Using the proposed mechanism, we demonstrated a
   proactive FT system in a virtualized Infiniband cluster.
•  We also confirmed that postcopy migration helps to
   reduce the downtime in the proactive FT system.

•  SymVirt can be useful for not only fault tolerant but also
   load balancing and server consolidation.

                                                                28
Future Work
•  Interconnect-transparent migration, called
   “Ninja migration”
  –  We have submitted another conference paper.
•  Overhead mitigation of SymVirt
  –  Very long link-up time problem
  –  Better integration with postcopy migration
•  A generic communication layer supporting
   cooperative VM migration
  –  It is independent on an MPI runtime system.


                                                   29
Thanks for your attention!




 This work was partly supported by JSPS KAKENHI 24700040
 and ARGO GRAPHICS, Inc.

                                                           30

Contenu connexe

Tendances

IBM PowerVM Virtualization Introduction and Configuration
IBM PowerVM Virtualization Introduction and ConfigurationIBM PowerVM Virtualization Introduction and Configuration
IBM PowerVM Virtualization Introduction and ConfigurationIBM India Smarter Computing
 
Mobile Virtualization using the Xen Technologies
Mobile Virtualization using the Xen TechnologiesMobile Virtualization using the Xen Technologies
Mobile Virtualization using the Xen TechnologiesThe Linux Foundation
 
Keynote Speech: Xen ARM Virtualization
Keynote Speech: Xen ARM VirtualizationKeynote Speech: Xen ARM Virtualization
Keynote Speech: Xen ARM VirtualizationThe Linux Foundation
 
Hyper V R2 Deep Dive
Hyper V R2 Deep DiveHyper V R2 Deep Dive
Hyper V R2 Deep DiveAidan Finn
 
Presentation power vm virtualization without limits
Presentation   power vm virtualization without limitsPresentation   power vm virtualization without limits
Presentation power vm virtualization without limitssolarisyougood
 
z/VM 6.2: Increasing the Endless Possibilities of Virtualization
z/VM 6.2: Increasing the Endless Possibilities of Virtualizationz/VM 6.2: Increasing the Endless Possibilities of Virtualization
z/VM 6.2: Increasing the Endless Possibilities of VirtualizationIBM India Smarter Computing
 
Introduction to the Linux on System z Terminal Server using z/VM IUCV
Introduction to the Linux on System z Terminal Server using z/VM IUCVIntroduction to the Linux on System z Terminal Server using z/VM IUCV
Introduction to the Linux on System z Terminal Server using z/VM IUCVIBM India Smarter Computing
 
Track A-Shmuel Panijel, Windriver
Track A-Shmuel Panijel, WindriverTrack A-Shmuel Panijel, Windriver
Track A-Shmuel Panijel, Windriverchiportal
 
Windows server 8 hyper v networking (aidan finn)
Windows server 8 hyper v networking (aidan finn)Windows server 8 hyper v networking (aidan finn)
Windows server 8 hyper v networking (aidan finn)hypervnu
 
Windows Server 8 Hyper V Networking
Windows Server 8 Hyper V NetworkingWindows Server 8 Hyper V Networking
Windows Server 8 Hyper V NetworkingAidan Finn
 
i//:squared Business Continuity Event
i//:squared Business Continuity Eventi//:squared Business Continuity Event
i//:squared Business Continuity EventJonathan Allmayer
 

Tendances (20)

IBM PowerVM Best Practices
IBM PowerVM Best PracticesIBM PowerVM Best Practices
IBM PowerVM Best Practices
 
VPM in action
VPM in actionVPM in action
VPM in action
 
IBM PowerVM Virtualization Introduction and Configuration
IBM PowerVM Virtualization Introduction and ConfigurationIBM PowerVM Virtualization Introduction and Configuration
IBM PowerVM Virtualization Introduction and Configuration
 
Mobile Virtualization using the Xen Technologies
Mobile Virtualization using the Xen TechnologiesMobile Virtualization using the Xen Technologies
Mobile Virtualization using the Xen Technologies
 
Link Virtualization based on Xen
Link Virtualization based on XenLink Virtualization based on Xen
Link Virtualization based on Xen
 
Graphics virtualization
Graphics virtualizationGraphics virtualization
Graphics virtualization
 
XS Boston 2008 Fault Tolerance
XS Boston 2008 Fault ToleranceXS Boston 2008 Fault Tolerance
XS Boston 2008 Fault Tolerance
 
Williams xen summit 2010
Williams   xen summit 2010Williams   xen summit 2010
Williams xen summit 2010
 
Keynote Speech: Xen ARM Virtualization
Keynote Speech: Xen ARM VirtualizationKeynote Speech: Xen ARM Virtualization
Keynote Speech: Xen ARM Virtualization
 
Minimizing I/O Latency in Xen-ARM
Minimizing I/O Latency in Xen-ARMMinimizing I/O Latency in Xen-ARM
Minimizing I/O Latency in Xen-ARM
 
Hyper V R2 Deep Dive
Hyper V R2 Deep DiveHyper V R2 Deep Dive
Hyper V R2 Deep Dive
 
Presentation power vm virtualization without limits
Presentation   power vm virtualization without limitsPresentation   power vm virtualization without limits
Presentation power vm virtualization without limits
 
z/VM 6.2: Increasing the Endless Possibilities of Virtualization
z/VM 6.2: Increasing the Endless Possibilities of Virtualizationz/VM 6.2: Increasing the Endless Possibilities of Virtualization
z/VM 6.2: Increasing the Endless Possibilities of Virtualization
 
Introduction to the Linux on System z Terminal Server using z/VM IUCV
Introduction to the Linux on System z Terminal Server using z/VM IUCVIntroduction to the Linux on System z Terminal Server using z/VM IUCV
Introduction to the Linux on System z Terminal Server using z/VM IUCV
 
Track A-Shmuel Panijel, Windriver
Track A-Shmuel Panijel, WindriverTrack A-Shmuel Panijel, Windriver
Track A-Shmuel Panijel, Windriver
 
Windows server 8 hyper v networking (aidan finn)
Windows server 8 hyper v networking (aidan finn)Windows server 8 hyper v networking (aidan finn)
Windows server 8 hyper v networking (aidan finn)
 
Linux PV on HVM
Linux PV on HVMLinux PV on HVM
Linux PV on HVM
 
Windows Server 8 Hyper V Networking
Windows Server 8 Hyper V NetworkingWindows Server 8 Hyper V Networking
Windows Server 8 Hyper V Networking
 
Ian Pratt Usenix 08 Keynote
Ian Pratt Usenix 08 KeynoteIan Pratt Usenix 08 Keynote
Ian Pratt Usenix 08 Keynote
 
i//:squared Business Continuity Event
i//:squared Business Continuity Eventi//:squared Business Continuity Event
i//:squared Business Continuity Event
 

En vedette

クラウドを利用した電力可視化システムの構築
クラウドを利用した電力可視化システムの構築クラウドを利用した電力可視化システムの構築
クラウドを利用した電力可視化システムの構築Ryousei Takano
 
インタークラウドにおける仮想インフラ構築システムの提案
インタークラウドにおける仮想インフラ構築システムの提案インタークラウドにおける仮想インフラ構築システムの提案
インタークラウドにおける仮想インフラ構築システムの提案Ryousei Takano
 
SoNIC: Precise Realtime Software Access and Control of Wired Networks
SoNIC: Precise Realtime Software Access and Control of Wired NetworksSoNIC: Precise Realtime Software Access and Control of Wired Networks
SoNIC: Precise Realtime Software Access and Control of Wired NetworksRyousei Takano
 
マッコスXで始めるPlan 9
マッコスXで始めるPlan 9マッコスXで始めるPlan 9
マッコスXで始めるPlan 9Ryousei Takano
 
Expectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software researchExpectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software researchRyousei Takano
 
InfiniBandをPCIパススルーで用いるHPC仮想化クラスタの性能評価
InfiniBandをPCIパススルーで用いるHPC仮想化クラスタの性能評価InfiniBandをPCIパススルーで用いるHPC仮想化クラスタの性能評価
InfiniBandをPCIパススルーで用いるHPC仮想化クラスタの性能評価Ryousei Takano
 

En vedette (6)

クラウドを利用した電力可視化システムの構築
クラウドを利用した電力可視化システムの構築クラウドを利用した電力可視化システムの構築
クラウドを利用した電力可視化システムの構築
 
インタークラウドにおける仮想インフラ構築システムの提案
インタークラウドにおける仮想インフラ構築システムの提案インタークラウドにおける仮想インフラ構築システムの提案
インタークラウドにおける仮想インフラ構築システムの提案
 
SoNIC: Precise Realtime Software Access and Control of Wired Networks
SoNIC: Precise Realtime Software Access and Control of Wired NetworksSoNIC: Precise Realtime Software Access and Control of Wired Networks
SoNIC: Precise Realtime Software Access and Control of Wired Networks
 
マッコスXで始めるPlan 9
マッコスXで始めるPlan 9マッコスXで始めるPlan 9
マッコスXで始めるPlan 9
 
Expectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software researchExpectations for optical network from the viewpoint of system software research
Expectations for optical network from the viewpoint of system software research
 
InfiniBandをPCIパススルーで用いるHPC仮想化クラスタの性能評価
InfiniBandをPCIパススルーで用いるHPC仮想化クラスタの性能評価InfiniBandをPCIパススルーで用いるHPC仮想化クラスタの性能評価
InfiniBandをPCIパススルーで用いるHPC仮想化クラスタの性能評価
 

Similaire à Cooperative VM Migration for a virtualized HPC Cluster with VMM-bypass I/O devices

Hardware supports for Virtualization
Hardware supports for VirtualizationHardware supports for Virtualization
Hardware supports for VirtualizationYoonje Choi
 
Virtualization Technology Overview
Virtualization Technology OverviewVirtualization Technology Overview
Virtualization Technology OverviewOpenCity Community
 
12th Japan CloudStack User Group Meetup MidoNet with scalable virtual router
12th Japan CloudStack User Group Meetup   MidoNet with scalable virtual router12th Japan CloudStack User Group Meetup   MidoNet with scalable virtual router
12th Japan CloudStack User Group Meetup MidoNet with scalable virtual routerTakeshi Nakajima
 
12th Japan CloudStack User Group Meetup
12th Japan CloudStack User Group Meetup12th Japan CloudStack User Group Meetup
12th Japan CloudStack User Group MeetupMidokura
 
HPC Cloud: Clouds on supercomputers for HPC
HPC Cloud: Clouds on supercomputers for HPCHPC Cloud: Clouds on supercomputers for HPC
HPC Cloud: Clouds on supercomputers for HPCRyousei Takano
 
Virtualization Primer for Java Developers
Virtualization Primer for Java DevelopersVirtualization Primer for Java Developers
Virtualization Primer for Java DevelopersRichard McDougall
 
I/O仮想化最前線〜ネットワークI/Oを中心に〜
I/O仮想化最前線〜ネットワークI/Oを中心に〜I/O仮想化最前線〜ネットワークI/Oを中心に〜
I/O仮想化最前線〜ネットワークI/Oを中心に〜Ryousei Takano
 
Security Best Practices For Hyper V And Server Virtualization
Security Best Practices For Hyper V And Server VirtualizationSecurity Best Practices For Hyper V And Server Virtualization
Security Best Practices For Hyper V And Server Virtualizationrsnarayanan
 
virtualization tutorial at ACM bangalore Compute 2009
virtualization tutorial at ACM bangalore Compute 2009virtualization tutorial at ACM bangalore Compute 2009
virtualization tutorial at ACM bangalore Compute 2009ACMBangalore
 
Nova for Physicalization and Virtualization compute models
Nova for Physicalization and Virtualization compute modelsNova for Physicalization and Virtualization compute models
Nova for Physicalization and Virtualization compute modelsopenstackindia
 
Presentation from physical to virtual to cloud emc
Presentation   from physical to virtual to cloud emcPresentation   from physical to virtual to cloud emc
Presentation from physical to virtual to cloud emcxKinAnx
 
2014.08.30 Virtual Machine Threat 세미나
2014.08.30 Virtual Machine Threat 세미나2014.08.30 Virtual Machine Threat 세미나
2014.08.30 Virtual Machine Threat 세미나용환 노
 
VMWorld 2009 Presentation
VMWorld 2009 PresentationVMWorld 2009 Presentation
VMWorld 2009 PresentationAhmed Sallam
 
Virtualization securityv2
Virtualization securityv2Virtualization securityv2
Virtualization securityv2vivekbhat
 
Presentation cloud computing workshop - virtualization
Presentation   cloud computing workshop - virtualizationPresentation   cloud computing workshop - virtualization
Presentation cloud computing workshop - virtualizationxKinAnx
 
CSA Presentation 26th May Virtualization securityv2
CSA Presentation 26th May Virtualization securityv2CSA Presentation 26th May Virtualization securityv2
CSA Presentation 26th May Virtualization securityv2vivekbhat
 
Visão geral sobre Citrix XenServer 6 - Ferramentas e Licenciamento
Visão geral sobre Citrix XenServer 6 - Ferramentas e LicenciamentoVisão geral sobre Citrix XenServer 6 - Ferramentas e Licenciamento
Visão geral sobre Citrix XenServer 6 - Ferramentas e LicenciamentoLorscheider Santiago
 

Similaire à Cooperative VM Migration for a virtualized HPC Cluster with VMM-bypass I/O devices (20)

Hardware supports for Virtualization
Hardware supports for VirtualizationHardware supports for Virtualization
Hardware supports for Virtualization
 
Virtualization Technology Overview
Virtualization Technology OverviewVirtualization Technology Overview
Virtualization Technology Overview
 
12th Japan CloudStack User Group Meetup MidoNet with scalable virtual router
12th Japan CloudStack User Group Meetup   MidoNet with scalable virtual router12th Japan CloudStack User Group Meetup   MidoNet with scalable virtual router
12th Japan CloudStack User Group Meetup MidoNet with scalable virtual router
 
12th Japan CloudStack User Group Meetup
12th Japan CloudStack User Group Meetup12th Japan CloudStack User Group Meetup
12th Japan CloudStack User Group Meetup
 
Windows Server 2012 Hyper-V Networking Evolved
Windows Server 2012 Hyper-V Networking Evolved Windows Server 2012 Hyper-V Networking Evolved
Windows Server 2012 Hyper-V Networking Evolved
 
HPC Cloud: Clouds on supercomputers for HPC
HPC Cloud: Clouds on supercomputers for HPCHPC Cloud: Clouds on supercomputers for HPC
HPC Cloud: Clouds on supercomputers for HPC
 
XS Japan 2008 BitVisor English
XS Japan 2008 BitVisor EnglishXS Japan 2008 BitVisor English
XS Japan 2008 BitVisor English
 
Virtualization Primer for Java Developers
Virtualization Primer for Java DevelopersVirtualization Primer for Java Developers
Virtualization Primer for Java Developers
 
I/O仮想化最前線〜ネットワークI/Oを中心に〜
I/O仮想化最前線〜ネットワークI/Oを中心に〜I/O仮想化最前線〜ネットワークI/Oを中心に〜
I/O仮想化最前線〜ネットワークI/Oを中心に〜
 
Security Best Practices For Hyper V And Server Virtualization
Security Best Practices For Hyper V And Server VirtualizationSecurity Best Practices For Hyper V And Server Virtualization
Security Best Practices For Hyper V And Server Virtualization
 
virtualization tutorial at ACM bangalore Compute 2009
virtualization tutorial at ACM bangalore Compute 2009virtualization tutorial at ACM bangalore Compute 2009
virtualization tutorial at ACM bangalore Compute 2009
 
What is a virtual tap?
What is a virtual tap?What is a virtual tap?
What is a virtual tap?
 
Nova for Physicalization and Virtualization compute models
Nova for Physicalization and Virtualization compute modelsNova for Physicalization and Virtualization compute models
Nova for Physicalization and Virtualization compute models
 
Presentation from physical to virtual to cloud emc
Presentation   from physical to virtual to cloud emcPresentation   from physical to virtual to cloud emc
Presentation from physical to virtual to cloud emc
 
2014.08.30 Virtual Machine Threat 세미나
2014.08.30 Virtual Machine Threat 세미나2014.08.30 Virtual Machine Threat 세미나
2014.08.30 Virtual Machine Threat 세미나
 
VMWorld 2009 Presentation
VMWorld 2009 PresentationVMWorld 2009 Presentation
VMWorld 2009 Presentation
 
Virtualization securityv2
Virtualization securityv2Virtualization securityv2
Virtualization securityv2
 
Presentation cloud computing workshop - virtualization
Presentation   cloud computing workshop - virtualizationPresentation   cloud computing workshop - virtualization
Presentation cloud computing workshop - virtualization
 
CSA Presentation 26th May Virtualization securityv2
CSA Presentation 26th May Virtualization securityv2CSA Presentation 26th May Virtualization securityv2
CSA Presentation 26th May Virtualization securityv2
 
Visão geral sobre Citrix XenServer 6 - Ferramentas e Licenciamento
Visão geral sobre Citrix XenServer 6 - Ferramentas e LicenciamentoVisão geral sobre Citrix XenServer 6 - Ferramentas e Licenciamento
Visão geral sobre Citrix XenServer 6 - Ferramentas e Licenciamento
 

Plus de Ryousei Takano

Error Permissive Computing
Error Permissive ComputingError Permissive Computing
Error Permissive ComputingRyousei Takano
 
Opportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIOpportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIRyousei Takano
 
ABCI: An Open Innovation Platform for Advancing AI Research and Deployment
ABCI: An Open Innovation Platform for Advancing AI Research and DeploymentABCI: An Open Innovation Platform for Advancing AI Research and Deployment
ABCI: An Open Innovation Platform for Advancing AI Research and DeploymentRyousei Takano
 
クラウド環境におけるキャッシュメモリQoS制御の評価
クラウド環境におけるキャッシュメモリQoS制御の評価クラウド環境におけるキャッシュメモリQoS制御の評価
クラウド環境におけるキャッシュメモリQoS制御の評価Ryousei Takano
 
USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)Ryousei Takano
 
User-space Network Processing
User-space Network ProcessingUser-space Network Processing
User-space Network ProcessingRyousei Takano
 
Flow-centric Computing - A Datacenter Architecture in the Post Moore Era
Flow-centric Computing - A Datacenter Architecture in the Post Moore EraFlow-centric Computing - A Datacenter Architecture in the Post Moore Era
Flow-centric Computing - A Datacenter Architecture in the Post Moore EraRyousei Takano
 
A Look Inside Google’s Data Center Networks
A Look Inside Google’s Data Center NetworksA Look Inside Google’s Data Center Networks
A Look Inside Google’s Data Center NetworksRyousei Takano
 
クラウド時代の半導体メモリー技術
クラウド時代の半導体メモリー技術クラウド時代の半導体メモリー技術
クラウド時代の半導体メモリー技術Ryousei Takano
 
AIST Super Green Cloud: lessons learned from the operation and the performanc...
AIST Super Green Cloud: lessons learned from the operation and the performanc...AIST Super Green Cloud: lessons learned from the operation and the performanc...
AIST Super Green Cloud: lessons learned from the operation and the performanc...Ryousei Takano
 
IEEE CloudCom 2014参加報告
IEEE CloudCom 2014参加報告IEEE CloudCom 2014参加報告
IEEE CloudCom 2014参加報告Ryousei Takano
 
Exploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudExploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudRyousei Takano
 
不揮発メモリとOS研究にまつわる何か
不揮発メモリとOS研究にまつわる何か不揮発メモリとOS研究にまつわる何か
不揮発メモリとOS研究にまつわる何かRyousei Takano
 
High-resolution Timer-based Packet Pacing Mechanism on the Linux Operating Sy...
High-resolution Timer-based Packet Pacing Mechanism on the Linux Operating Sy...High-resolution Timer-based Packet Pacing Mechanism on the Linux Operating Sy...
High-resolution Timer-based Packet Pacing Mechanism on the Linux Operating Sy...Ryousei Takano
 
クラウドの垣根を超えた高性能計算に向けて~AIST Super Green Cloudでの試み~
クラウドの垣根を超えた高性能計算に向けて~AIST Super Green Cloudでの試み~クラウドの垣根を超えた高性能計算に向けて~AIST Super Green Cloudでの試み~
クラウドの垣根を超えた高性能計算に向けて~AIST Super Green Cloudでの試み~Ryousei Takano
 
From Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersFrom Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersRyousei Takano
 
高性能かつスケールアウト可能なHPCクラウド AIST Super Green Cloud
高性能かつスケールアウト可能なHPCクラウド AIST Super Green Cloud高性能かつスケールアウト可能なHPCクラウド AIST Super Green Cloud
高性能かつスケールアウト可能なHPCクラウド AIST Super Green CloudRyousei Takano
 
Iris: Inter-cloud Resource Integration System for Elastic Cloud Data Center
Iris: Inter-cloud Resource Integration System for Elastic Cloud Data CenterIris: Inter-cloud Resource Integration System for Elastic Cloud Data Center
Iris: Inter-cloud Resource Integration System for Elastic Cloud Data CenterRyousei Takano
 

Plus de Ryousei Takano (20)

Error Permissive Computing
Error Permissive ComputingError Permissive Computing
Error Permissive Computing
 
Opportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCIOpportunities of ML-based data analytics in ABCI
Opportunities of ML-based data analytics in ABCI
 
ABCI: An Open Innovation Platform for Advancing AI Research and Deployment
ABCI: An Open Innovation Platform for Advancing AI Research and DeploymentABCI: An Open Innovation Platform for Advancing AI Research and Deployment
ABCI: An Open Innovation Platform for Advancing AI Research and Deployment
 
ABCI Data Center
ABCI Data CenterABCI Data Center
ABCI Data Center
 
クラウド環境におけるキャッシュメモリQoS制御の評価
クラウド環境におけるキャッシュメモリQoS制御の評価クラウド環境におけるキャッシュメモリQoS制御の評価
クラウド環境におけるキャッシュメモリQoS制御の評価
 
USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)USENIX NSDI 2016 (Session: Resource Sharing)
USENIX NSDI 2016 (Session: Resource Sharing)
 
User-space Network Processing
User-space Network ProcessingUser-space Network Processing
User-space Network Processing
 
Flow-centric Computing - A Datacenter Architecture in the Post Moore Era
Flow-centric Computing - A Datacenter Architecture in the Post Moore EraFlow-centric Computing - A Datacenter Architecture in the Post Moore Era
Flow-centric Computing - A Datacenter Architecture in the Post Moore Era
 
A Look Inside Google’s Data Center Networks
A Look Inside Google’s Data Center NetworksA Look Inside Google’s Data Center Networks
A Look Inside Google’s Data Center Networks
 
クラウド時代の半導体メモリー技術
クラウド時代の半導体メモリー技術クラウド時代の半導体メモリー技術
クラウド時代の半導体メモリー技術
 
AIST Super Green Cloud: lessons learned from the operation and the performanc...
AIST Super Green Cloud: lessons learned from the operation and the performanc...AIST Super Green Cloud: lessons learned from the operation and the performanc...
AIST Super Green Cloud: lessons learned from the operation and the performanc...
 
IEEE CloudCom 2014参加報告
IEEE CloudCom 2014参加報告IEEE CloudCom 2014参加報告
IEEE CloudCom 2014参加報告
 
Exploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC CloudExploring the Performance Impact of Virtualization on an HPC Cloud
Exploring the Performance Impact of Virtualization on an HPC Cloud
 
不揮発メモリとOS研究にまつわる何か
不揮発メモリとOS研究にまつわる何か不揮発メモリとOS研究にまつわる何か
不揮発メモリとOS研究にまつわる何か
 
High-resolution Timer-based Packet Pacing Mechanism on the Linux Operating Sy...
High-resolution Timer-based Packet Pacing Mechanism on the Linux Operating Sy...High-resolution Timer-based Packet Pacing Mechanism on the Linux Operating Sy...
High-resolution Timer-based Packet Pacing Mechanism on the Linux Operating Sy...
 
クラウドの垣根を超えた高性能計算に向けて~AIST Super Green Cloudでの試み~
クラウドの垣根を超えた高性能計算に向けて~AIST Super Green Cloudでの試み~クラウドの垣根を超えた高性能計算に向けて~AIST Super Green Cloudでの試み~
クラウドの垣根を超えた高性能計算に向けて~AIST Super Green Cloudでの試み~
 
From Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computersFrom Rack scale computers to Warehouse scale computers
From Rack scale computers to Warehouse scale computers
 
高性能かつスケールアウト可能なHPCクラウド AIST Super Green Cloud
高性能かつスケールアウト可能なHPCクラウド AIST Super Green Cloud高性能かつスケールアウト可能なHPCクラウド AIST Super Green Cloud
高性能かつスケールアウト可能なHPCクラウド AIST Super Green Cloud
 
Iris: Inter-cloud Resource Integration System for Elastic Cloud Data Center
Iris: Inter-cloud Resource Integration System for Elastic Cloud Data CenterIris: Inter-cloud Resource Integration System for Elastic Cloud Data Center
Iris: Inter-cloud Resource Integration System for Elastic Cloud Data Center
 
IEEE/ACM SC2013報告
IEEE/ACM SC2013報告IEEE/ACM SC2013報告
IEEE/ACM SC2013報告
 

Dernier

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 

Dernier (20)

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 

Cooperative VM Migration for a virtualized HPC Cluster with VMM-bypass I/O devices

  • 1. Cooperative VM Migration for a virtualized HPC Cluster with VMM-bypass I/O devices Ryousei Takano, Hidemoto Nakada, Takahiro Hirofuchi, Yoshio Tanaka, and Tomohiro Kudoh Information Technology Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Japan IEEE eScience 2012, Oct. 11 2012, Chicago
  • 2. Background •  HPC cloud is a promising e-Science platform. –  HPC users begin to take an interest in Cloud computing, e.g., Amazon EC2 Cluster Compute Instances. •  Virtualization is a key technology. –  Pro: It makes migration of computing elements easy. •  VM migration is useful for achieving fault tolerance, server consolidation, etc. –  Con: It introduces a large overhead, spoiling I/O performance. •  VMM-bypass I/O technologies, e.g., PCI passthrough and SR-IOV, can significantly mitigate the overhead. VMM-bypass I/O makes it impossible to migrate a VM. 2
  • 3. Contribution •  Goal: –  To realize VM migration and checkpoint/restart on a virtualized cluster with VMM-bypass I/O devices. •  E.g., VM migration on an Infiniband cluster •  Contributions: –  We propose cooperative VM migration based on the Symbiotic Virtualization (SymVirt) mechanism. –  We demonstrate reactive/proactive fault tolerant (FT) systems. –  We show postcopy migration helps to reduce the service downtime in the proactive FT system. 3
  • 4. Agenda •  Background and Motivation •  SymVirt: Symbiotic Virtualization Mechanism •  Experiment •  Related Work •  Conclusion and Future Work 4
  • 5. Motivating Observation •  Performance evaluation of HPC cloud –  (Para-)virtualized I/O incurs a large overhead. –  PCI passthrough significantly mitigate the overhead. KVM (IB) KVM (virtio) 300 BMM (IB) BMM (10GbE) VM1 VM1 KVM (IB) KVM (virtio) 250 Guest OS Guest OS Execution time [seconds] 200 Physical Guest driver driver 150 100 VMM VMM 50 Physical driver 0 BT CG EP FT LU The overhead of I/O virtualization on the NAS IB QDR HCA 10GbE NIC Parallel Benchmarks 3.3.1 class C, 64 processes. BMM: Bare Metal Machine 5
  • 6. Para-virtualized VMM-Bypass I/O device (virtio_net) PCI passthrough SR-IOV VM1 VM2 VM1 VM2 VM1 VM2 Guest OS Guest OS Guest OS … … … Guest Physical Physical driver driver driver VMM VMM VMM vSwitch Physical driver NIC NIC NIC Switch (VEB) Para-virt PCI SR-IOV device passthrough We address Performance this issue! Device sharing VM migration 6
  • 7. Problem VMM-bypass I/O technologies make VM migration and checkpoint/restart impossible. 1.  VMM does not know the time when VMM-bypass I/O devices are detached safely. •  To perform such migration without losing in-flight data, packet transmission to/from the VM should be stopped prior to detaching. •  With a VMM, it is hard to know the communication status of an application inside the VM, especially if VMM-bypass I/O devices are used. 2.  VMM cannot migrate the state of VMM-bypass I/O devices from the source to the destination. •  With Infiniband, Local ID, QueuePair Numbers, etc. 7
  • 8. Goal migration Ethernet Cluster 1 Cluster 2 VM1 VM2 VM3 VM4 detach re-attach Infiniband Infiniband We need a mechanism of combining VM migration and PCI device hot-plugging. 8
  • 10. Cooperative VM Migration Existing VM Migration Cooperative VM Migration (Black-box approach) (Gray-box approach) Pro: portability Pro: performance Guest OS Guest OS Global coordination Application Application Device setup Cooperation Global VMM coordination VMM VMM-bypass Device setup I/O Migration Migration NIC NIC 10
  • 11. SymVirt: Symbiotic Virtualization •  We focus on MPI programs. •  We design and implement a symbiotic virtualization Guest OS Global (SymVirt) mechanism. coordination Application –  It is a cross-layer mechanism Device setup between a VMM and an MPI runtime system. SymVirt Cooperation VMM VMM-bypass I/O Migration NIC 11
  • 12. SymVirt: Overview node #0 node #n ... Guest OS Guest OS MPI app. MPI app. MPI app. Cloud scheduler MPI lib. MPI lib. MPI lib. 1) trigger events SymVirt SymVirt 2) coordination SymVirt coordinator coordinator coordinator 3) SymVirt wait 5) SymVirt signal VMM VMM 4) do something, e.g., SymVirt SymVirt migration/checkpointing SymVirt controller agent agent 12
  • 13. SymVirt wait and signal calls •  SymVirt provides a simple Guest OS-to-VMM communication mechanism. •  SymVirt coordinator issues a SymVirt wait call, the guest OS is blocked until a SymVirt signal call is issued. •  In the meantime, SymVirt agent controls the VM via a VMM monitor interface. Application confirm confirm linkup SymVirt coordinator SymVirt wait SymVirt Guest OS mode signal VMM mode detach migration re-attach SymVirt controller/agent 13
  • 14. SymVirt: Implementation •  We implemented SymVirt on top of QEMU/KVM and the Open MPI system. •  User application and the MPI runtime system can work without any modifications. •  QEMU/KVM is slightly modified for supporting SymVirt wait and signal calls. –  A SymVirt wait call is implemented by SymVirt coordinator using a VMCALL Intel VT-x instruction. SymVirt SymVirt –  A SymVirt signal call is implemented as wait signal a new QEMU/KVM monitor command. SymVirt agent 14
  • 15. SymVirt: Implementation (cont’d) •  SymVirt coordinator is heavily relied on the Open MPI checkpoint/restart (C/R) framework. –  Global coordination of SymVirt is the same as a coordination protocol for MPI programs. –  SymVirt executes VM-level migration or C/R instead of process-level C/R using the BLCR system. –  SymVirt does not need to take care of changing LIDs and QPNs after a migration, because Open MPI’s BTL modules are re-constructed and connections are re-established at continue or restart phases. BTL: Point-to-Point Byte Transfer Layer 15
  • 16. SymVirt: Implementation (cont’d) •  SymVirt controller and agent are written in Python. import symvirt! ! agent_list = [migrate_from]! # device attach! ctl = symvirt.Controller(agent_list)! ctl.append_agent(migrate_to)! ! ctl.wait_all()! # device detach! kwargs = {'pci_id':'04:00.0', ctl.wait_all()! 'tag':'vf0'}! kwargs = {'tag':'vf0'}! ctl.device_attach(**kwargs)! ctl.device_detach(**kwargs)! ctl.signal()! ctl.signal()! ! ! ctl.close() # vm migration! ctl.wait_all()! kwargs = {'postcopy':True, 'uri':'tcp:%s:%d' ! % (migrate_to[0], migrate_port)}! ctl.migrate(**kwargs)! ctl.remove_agent(migrate_from)! 16
  • 17. SymPFT: Proactive FT system •  A VM-level fault tolerant (FT) system is a use case of SymVirt. Cloud&scheduler allocation User requires a virtualized cluster consists of 4 nodes (16 CPUs). global&storage& (VM&images) 17
  • 18. SymPFT: Proactive FT system •  A VM-level fault tolerant (FT) system is a use case of SymVirt. •  A VM is migrated from a “unhealthy” node to a “healthy” node before the node crashes. Cloud&scheduler Cloud&scheduler allocation re-allocation Failure re!! Failu prediction VM migration global&storage& global&storage& (VM&images) (VM&images) 18
  • 20. Experiment •  The overhead of SymPFT –  We used 8 VMs on an Infiniband cluster. –  We migrated a VM once during a benchmark execution. •  Two benchmark programs written in MPI –  memtest: a simple memory intensive benchmark –  NAS Parallel Benchmarks (NPB) version 3.3.1 •  Overhead reduction using postcopy migration 20
  • 21. Experimental setting We used a 16 node Infiniband cluster, which is a part of the AIST Green Cloud. Blade server Dell PowerEdge M610 Host machine environment CPU Intel quad-core Xeon E5540/2.53GHz x2 OS Debian 7.0 Chipset Intel 5520 Linux kernel 3.2.18 Memory 48 GB DDR3 QEMU/KVM 1.1-rc3 InfiniBand Mellanox ConnectX (MT26428) MPI Open MPI 1.6 OFED 1.5.4.1 Blade switch Compiler gcc/gfortran 4.4.6 InfiniBand Mellanox M3601Q (QDR 16 ports) VM environment VCPU 8 Only 1 VM runs on 1 host, and an IB HCA is ! assigned to the VM by using PCI passthrough. Memory 20 GB 21
  • 22. Result: memtest •  The migration time is dependent on the memory footprint. –  The migration throughput is less than 3 Gbps. •  Both hotplug and link-up times are approximately constant. –  The link-up time is not a negligible overhead. c.f., Ethernet 100 migration hotplug linkup 80 Execution Time [Seconds] 44.2 53.7 60 35.9 38.7 40 14.6 13.5 12.5 11.3 20 28.5 28.5 28.5 28.6 0 2GB 4GB 8GB 16GB memory footprint This result does not include our proceeding. 22
  • 23. Result: NAS Parallel Benchmarks 1400 linkup postcopy migration 1200 hotplug precopy migration +105 s Execution time [seconds] application 1000 There is no overhead +103 s 800 during normal operations +97 s +299 s 600 400 The overhead is proportional 200 to the memory footprint. 0 baseline precopy postcopy baseline precopy postcopy baseline precopy postcopy baseline precopy postcopy BT CG FT LU Transferred Memory Size during VM Migration [MB] BT CG FT LU 4417 3394 15678 2348 23
  • 24. Integration with postcopy migration •  In contrast to precopy migration, postcopy migration transfers memory pages on demand after the execution node is switched to the destination. •  Postcopy migration can hide the overhead of the hot-add and link-up times by overlapping them and migration. •  We used our postcopy migration implementation for QEMU/KVM, Yabusame. SymPFT a) hot-del b) migration c) hot-add d) link-up (precopy) SymPFT overhead mitigation (postcopy) 24
  • 25. Result: Effect of postcopy migration 1400 linkup postcopy migration -15 % 1200 hotplug precopy migration Execution time [seconds] application -13 s 1000 800 -14 % -53 % 600 400 Postcopy migration can hide the 200 overhead of hotplug and link-up by 0 overlapping them and migration. baseline precopy postcopy baseline precopy postcopy baseline precopy postcopy baseline precopy postcopy BT CG FT LU Transferred Memory Size during VM Migration [MB] BT CG FT LU 4417 3394 15678 2348 25
  • 26. Related Work •  Some VM-level reactive and proactive FT systems have been proposed for HPC systems. –  E.g., VNsnap: a distributed snapshots of VMs •  The coordination is executed by snooping the traffic of a software switch outside the VMs. –  They do not support VMM-bypass I/O devices. •  Mercury: a self-virtualization technique –  An OS can turn virtualization on and off on demand. –  It lacks a coordination mechanism among distributed VMM. •  SymCall: an upcall mechanism from a VMM to the guest OS, using a nested VM Exit call –  SymVirt is a simple hypercall mechanism from a guest OS to the VMM, assuming it works in cooperation with a cloud scheduler. 26
  • 28. Conclustion •  We have proposed a cooperative VM migration mechanism that enables us to migrate VMs with VMM- bypass I/O devices, using a simple Guest OS-to-VMM communication mechanism, called SymVirt. •  Using the proposed mechanism, we demonstrated a proactive FT system in a virtualized Infiniband cluster. •  We also confirmed that postcopy migration helps to reduce the downtime in the proactive FT system. •  SymVirt can be useful for not only fault tolerant but also load balancing and server consolidation. 28
  • 29. Future Work •  Interconnect-transparent migration, called “Ninja migration” –  We have submitted another conference paper. •  Overhead mitigation of SymVirt –  Very long link-up time problem –  Better integration with postcopy migration •  A generic communication layer supporting cooperative VM migration –  It is independent on an MPI runtime system. 29
  • 30. Thanks for your attention! This work was partly supported by JSPS KAKENHI 24700040 and ARGO GRAPHICS, Inc. 30