SlideShare une entreprise Scribd logo
1  sur  15
Télécharger pour lire hors ligne
Building an Outsourcing Ecosystem
                for Science

                           Kate Keahey
                       keahey@mcs.anl.gov
                  Argonne National Laboratory
             Computation Institute, University of Chicago


11/24/2011                   www.nimbusproject.org          1
Outsourcing Computing for Science

                      • Control over environment
                          – Complexity
                          – Consistency
                      • Control over availability
                          – Many real-time scenarios
                          – Naturally bursty character




             www.nimbusproject.org
Introducing Nimbus
                      High-quality, extensible, customizable,
                           open source implementation
                                 Nimbus Platform
                 Context                                        Elastic
                                      Cloudinit.d
                 Broker                                      Scaling Tools
                           Enable users to use IaaS clouds


                 Nimbus Infrastructure
                  Workspace
                                     Cumulus
                   Service
             Enable providers to build IaaS clouds

                 Enable developers to extend,
                 experiment and customize


11/24/2011                           www.nimbusproject.org                   3
On the Nimbus Team…


                     Applications and Patterns




             Providing infrastructure outsourcing for science


11/24/2011                  www.nimbusproject.org               4
Applications and Patterns




11/24/2011            www.nimbusproject.org   5
Scientific Cloud Resources
• Science Clouds                                        Magellan
     – UC, UFL, Wispy@Purdue
     – ~300 cores
• Magellan
     – DOE cloud @ ANL&LBNL                           FutureGrid
     – ~4000 cores@ANL
• FutureGrid
     – ~6000 cores
• MRI projects, e.g., DIAG = has cloud cycles for you!
     – Data Intensive Academic Grid
     – U of Maryland School of                        DIAG
       Medicine in Baltimore
     – ~1200-1500 cores
• Outside of US:
     – WestGrid, Grid’5000


11/24/2011                    www.nimbusproject.org                6
Sam Angiuoli
                                       Institute for Genome Sciences
                                       University of Maryland School of Medicine


• The emergent need for
  processing
• A virtual appliance for
  automated and portable
  sequence analysis
• Approach:
     – Running on user’s desktop
     – Then running on Nimbus
       Science Clouds, Magellan and
       EC2
     – A platform for building
       appliances representing push-
       button pipelines
• Impact
     – From desktop to cloud
     – http://clovr.org



11/24/2011                       www.nimbusproject.org                         7
Work by the UVIC team



• Detailed analysis of data from
  the MACHO experiment Dark
  Matter search
• Provide infrastructure for six
  observational astronomy
  survey projects
• Approach:
    – Appliance creation and
      management
    – Running on a Nimbus cloud on
      WestGrid
    – Dynamic Condor pool for
      astronomy
• Status:
    – In production operation since
      July 2010



 11/24/2011                           www.nimbusproject.org                    8
Work by Pierre Riteau et al,
    Sky Computing                                        University of Rennes 1

• Sky Computing = a Federation              “Sky Computing”
                                            IEEE Internet Computing, September 2009
  of Clouds
• Approach:
   – Combine resources obtained in
     multiple Nimbus clouds in
     FutureGrid and Grid’ 5000
   – Combine Context Broker, ViNe,
     fast image deployment
   – Deployed a virtual cluster of over
     1000 cores on Grid5000 and
     FutureGrid – largest of this type at
     the time
• Grid’5000 Large Scale
  Deployment Challenge award
• Demonstrated at OGF 29 06/10
• TeraGrid ’10 poster
• More at: www.isgtw.org/?pid=1002832

   11/24/2011                     www.nimbusproject.org                               9
Work by the UVIC team
               Canadian Efforts
• BarBar Experiment at
  SLAC in Stanford, CA
• Using clouds to simulate
  electron-positron
  collisions in their detector
• Exploring virtualization as
  a vehicle for data
  preservation
• Approach:
     – Appliance preparation and
       management
     – Distributed Nimbus clouds
     – Cloud Scheduler
• Running production
  BaBar workloads
11/24/2011                   www.nimbusproject.org                   10
Work by J. Lauret, J. Balewski
                                                         and the STAR team


           Challenge: what makes a proton spin?




• Why run in the cloud?
   – Reduce "time to science”
   – Near real-time processing
                                                              Images courtesy of J. Balewski


   11/24/2011                      www.nimbusproject.org                            11
• Towards Observatory
        Science
      • Sensor-driven processing
      • Scalable and Highly
        Available (HA) services
      • Nimbus team leading the
        development of Common
        Execution Infrastructure
      • From regional Nimbus clouds
        to commercial clouds
      • Building platform services for
        integrated, repeatable
        support for on-demand
        science


www.nimbusproject.org               12
Array Network Facility (ANF)
 Using elastic processing to operate on seismic data




                        www.nimbusproject.org          13
Trends and Patterns
• On-demand, elastic processing
     – Observatories, experiments, conference deadlines, growth
       management…
• Multiple providers
     – Risk-mitigation: not enough cycles, failure, market factors
• From the desktop to the cloud
     – Escalation pattern: seamlessly integrating more resources
• Ease of use/low barrier
     – Automated provisioning of infrastructure resources
• From one-offs to production runs
     – Steadily increasing in both size and buy-in
• Emphasis still on loosely-coupled
     – … EC2 ClusterCompute virtual clusters on Top500 list


11/24/2011                   www.nimbusproject.org              14
Outsourcing for Science:
    Building an Infrastructure Platform




11/24/2011        www.nimbusproject.org   15

Contenu connexe

Tendances

2011 boston open stack meetup 11 29_r1jmm
2011 boston open stack meetup 11 29_r1jmm2011 boston open stack meetup 11 29_r1jmm
2011 boston open stack meetup 11 29_r1jmm
DellCloudEdge
 
Accumulo Summit 2014: Addressing big data challenges through innovative archi...
Accumulo Summit 2014: Addressing big data challenges through innovative archi...Accumulo Summit 2014: Addressing big data challenges through innovative archi...
Accumulo Summit 2014: Addressing big data challenges through innovative archi...
Accumulo Summit
 
Synergy 2014 - Syn122 Moving Australian National Research into the Cloud
Synergy 2014 - Syn122 Moving Australian National Research into the CloudSynergy 2014 - Syn122 Moving Australian National Research into the Cloud
Synergy 2014 - Syn122 Moving Australian National Research into the Cloud
Citrix
 
A tutorial on GreenCloud
A tutorial on GreenCloudA tutorial on GreenCloud
A tutorial on GreenCloud
Habibur Rahman
 

Tendances (16)

20181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v320181219 ucc open stack 5 years v3
20181219 ucc open stack 5 years v3
 
Hypriot Cluster Lab – An ARM-Powered Cloud Solution Utilizing Docker
Hypriot Cluster Lab – An ARM-Powered Cloud Solution Utilizing DockerHypriot Cluster Lab – An ARM-Powered Cloud Solution Utilizing Docker
Hypriot Cluster Lab – An ARM-Powered Cloud Solution Utilizing Docker
 
Hybrid Cloud for CERN
Hybrid Cloud for CERN Hybrid Cloud for CERN
Hybrid Cloud for CERN
 
Calit2 Technology Overview for New Channels for Bio Com
Calit2 Technology Overview for New Channels for Bio ComCalit2 Technology Overview for New Channels for Bio Com
Calit2 Technology Overview for New Channels for Bio Com
 
Adventures in Research
Adventures in ResearchAdventures in Research
Adventures in Research
 
g-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionalityg-Social - Enhancing e-Science Tools with Social Networking Functionality
g-Social - Enhancing e-Science Tools with Social Networking Functionality
 
2011 boston open stack meetup 11 29_r1jmm
2011 boston open stack meetup 11 29_r1jmm2011 boston open stack meetup 11 29_r1jmm
2011 boston open stack meetup 11 29_r1jmm
 
GigaOM 2013 highlights
GigaOM 2013 highlightsGigaOM 2013 highlights
GigaOM 2013 highlights
 
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
 
Impact of Grid Computing on Network Operators and HW Vendors
Impact of Grid Computing on Network Operators and HW VendorsImpact of Grid Computing on Network Operators and HW Vendors
Impact of Grid Computing on Network Operators and HW Vendors
 
Accumulo Summit 2014: Addressing big data challenges through innovative archi...
Accumulo Summit 2014: Addressing big data challenges through innovative archi...Accumulo Summit 2014: Addressing big data challenges through innovative archi...
Accumulo Summit 2014: Addressing big data challenges through innovative archi...
 
Synergy 2014 - Syn122 Moving Australian National Research into the Cloud
Synergy 2014 - Syn122 Moving Australian National Research into the CloudSynergy 2014 - Syn122 Moving Australian National Research into the Cloud
Synergy 2014 - Syn122 Moving Australian National Research into the Cloud
 
TOWARDS Hybrid OpenStack Clouds in the Real World
TOWARDS Hybrid OpenStack Clouds in the Real WorldTOWARDS Hybrid OpenStack Clouds in the Real World
TOWARDS Hybrid OpenStack Clouds in the Real World
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
A tutorial on GreenCloud
A tutorial on GreenCloudA tutorial on GreenCloud
A tutorial on GreenCloud
 
What the cloud has to do with a burning house?
What the cloud has to do with a burning house?What the cloud has to do with a burning house?
What the cloud has to do with a burning house?
 

En vedette (9)

Robert Tomkinson Presentation
Robert Tomkinson PresentationRobert Tomkinson Presentation
Robert Tomkinson Presentation
 
Seth Lieberman
Seth LiebermanSeth Lieberman
Seth Lieberman
 
Wisconsin Birds 09 10
Wisconsin Birds 09 10Wisconsin Birds 09 10
Wisconsin Birds 09 10
 
1. five habits of highly successful clouds
1. five habits of highly successful clouds1. five habits of highly successful clouds
1. five habits of highly successful clouds
 
Birds Of Wisconsin by 3rd Graders
Birds Of Wisconsin by 3rd GradersBirds Of Wisconsin by 3rd Graders
Birds Of Wisconsin by 3rd Graders
 
ESU 10 Staff BBP Annual Review
ESU 10 Staff BBP Annual ReviewESU 10 Staff BBP Annual Review
ESU 10 Staff BBP Annual Review
 
David Steinberger Presentation
David Steinberger PresentationDavid Steinberger Presentation
David Steinberger Presentation
 
Marty Weintraub
Marty WeintraubMarty Weintraub
Marty Weintraub
 
Inside3DPrinting_bramdezwart
Inside3DPrinting_bramdezwartInside3DPrinting_bramdezwart
Inside3DPrinting_bramdezwart
 

Similaire à Building an Outsourcing Ecosystem for Science

The Future of R&E networks and cyber-infrastructure
The Future of R&E networks and cyber-infrastructureThe Future of R&E networks and cyber-infrastructure
The Future of R&E networks and cyber-infrastructure
Bill St. Arnaud
 
Opening the Path to Technical Excellence
Opening the Path to Technical ExcellenceOpening the Path to Technical Excellence
Opening the Path to Technical Excellence
NETWAYS
 
Clould Computing and its application in Libraries
Clould Computing and its application in LibrariesClould Computing and its application in Libraries
Clould Computing and its application in Libraries
Amit Shaw
 

Similaire à Building an Outsourcing Ecosystem for Science (20)

Grid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the CloudGrid is Dead ? Nimrod on the Cloud
Grid is Dead ? Nimrod on the Cloud
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific Research
 
Federated Cloud Computing
Federated Cloud ComputingFederated Cloud Computing
Federated Cloud Computing
 
Research in the Cloud
Research in the CloudResearch in the Cloud
Research in the Cloud
 
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
End-to-end Optical Fiber Cyberinfrastructure for Data-Intensive Research: Imp...
 
Cloud101-Introduction to cloud
Cloud101-Introduction to cloud Cloud101-Introduction to cloud
Cloud101-Introduction to cloud
 
ApacheCon NA 2013
ApacheCon NA 2013ApacheCon NA 2013
ApacheCon NA 2013
 
Cloud computing and bioinformatics
Cloud computing and bioinformaticsCloud computing and bioinformatics
Cloud computing and bioinformatics
 
The Pacific Research Platform: a Science-Driven Big-Data Freeway System
The Pacific Research Platform: a Science-Driven Big-Data Freeway SystemThe Pacific Research Platform: a Science-Driven Big-Data Freeway System
The Pacific Research Platform: a Science-Driven Big-Data Freeway System
 
GSA Presentation - MILLER 251-4.pdf
GSA Presentation - MILLER 251-4.pdfGSA Presentation - MILLER 251-4.pdf
GSA Presentation - MILLER 251-4.pdf
 
Virtualization for HPC at NCI
Virtualization for HPC at NCIVirtualization for HPC at NCI
Virtualization for HPC at NCI
 
Summit 16: Multi-site OPNFV Testing Challenges
Summit 16: Multi-site OPNFV Testing ChallengesSummit 16: Multi-site OPNFV Testing Challenges
Summit 16: Multi-site OPNFV Testing Challenges
 
The Future of R&E networks and cyber-infrastructure
The Future of R&E networks and cyber-infrastructureThe Future of R&E networks and cyber-infrastructure
The Future of R&E networks and cyber-infrastructure
 
Chep2012
Chep2012Chep2012
Chep2012
 
OpenNebulaConf 2013 - Keynote: Opening the Path to Technical Excellence by Jo...
OpenNebulaConf 2013 - Keynote: Opening the Path to Technical Excellence by Jo...OpenNebulaConf 2013 - Keynote: Opening the Path to Technical Excellence by Jo...
OpenNebulaConf 2013 - Keynote: Opening the Path to Technical Excellence by Jo...
 
Opening the Path to Technical Excellence
Opening the Path to Technical ExcellenceOpening the Path to Technical Excellence
Opening the Path to Technical Excellence
 
Session19 Globus
Session19 GlobusSession19 Globus
Session19 Globus
 
Towards-cloud-native-HPC.pdf
Towards-cloud-native-HPC.pdfTowards-cloud-native-HPC.pdf
Towards-cloud-native-HPC.pdf
 
Clould Computing and its application in Libraries
Clould Computing and its application in LibrariesClould Computing and its application in Libraries
Clould Computing and its application in Libraries
 
Cloud Standards in the Real World: Cloud Standards Testing for Developers
Cloud Standards in the Real World: Cloud Standards Testing for DevelopersCloud Standards in the Real World: Cloud Standards Testing for Developers
Cloud Standards in the Real World: Cloud Standards Testing for Developers
 

Plus de EuroCloud

Cloudy Datacenter Survey
Cloudy Datacenter SurveyCloudy Datacenter Survey
Cloudy Datacenter Survey
EuroCloud
 
A Mobile Sensing Architecture for Massive Urban Scanning
A Mobile Sensing Architecture for Massive Urban ScanningA Mobile Sensing Architecture for Massive Urban Scanning
A Mobile Sensing Architecture for Massive Urban Scanning
EuroCloud
 
Evaluation of Virtual Clusters Performance on a Cloud Computing Infrastructure
Evaluation of Virtual Clusters Performance on a Cloud Computing InfrastructureEvaluation of Virtual Clusters Performance on a Cloud Computing Infrastructure
Evaluation of Virtual Clusters Performance on a Cloud Computing Infrastructure
EuroCloud
 
Self Optimizing transactional data grids for elastic cloud environments
Self Optimizing transactional data grids for elastic cloud environmentsSelf Optimizing transactional data grids for elastic cloud environments
Self Optimizing transactional data grids for elastic cloud environments
EuroCloud
 
Cloudviews eurocloud rcosta
Cloudviews eurocloud rcostaCloudviews eurocloud rcosta
Cloudviews eurocloud rcosta
EuroCloud
 
Cloud views2010 google docs privacy
Cloud views2010   google docs privacyCloud views2010   google docs privacy
Cloud views2010 google docs privacy
EuroCloud
 
Cil 2010 cloud comp1.0
Cil 2010 cloud comp1.0Cil 2010 cloud comp1.0
Cil 2010 cloud comp1.0
EuroCloud
 
CardMobili @ CloudViews2010
CardMobili @ CloudViews2010CardMobili @ CloudViews2010
CardMobili @ CloudViews2010
EuroCloud
 
Hive solutions cloudviews 2010 presentation
Hive solutions cloudviews 2010 presentationHive solutions cloudviews 2010 presentation
Hive solutions cloudviews 2010 presentation
EuroCloud
 
Closetask 10 mins en
Closetask 10 mins enClosetask 10 mins en
Closetask 10 mins en
EuroCloud
 
Apresentacao produtiv cloud views
Apresentacao   produtiv cloud viewsApresentacao   produtiv cloud views
Apresentacao produtiv cloud views
EuroCloud
 
Apresentação novastic mp
Apresentação novastic mpApresentação novastic mp
Apresentação novastic mp
EuroCloud
 
Ap4 construction platform_presentation_cloud_views_2010
Ap4 construction platform_presentation_cloud_views_2010Ap4 construction platform_presentation_cloud_views_2010
Ap4 construction platform_presentation_cloud_views_2010
EuroCloud
 
2010.05.21 invicta angels cloud views.callforbusiness
2010.05.21 invicta angels cloud views.callforbusiness2010.05.21 invicta angels cloud views.callforbusiness
2010.05.21 invicta angels cloud views.callforbusiness
EuroCloud
 
Luis lima v3
Luis lima v3Luis lima v3
Luis lima v3
EuroCloud
 
Fx apresentacao evento cloudview 2010
Fx apresentacao evento cloudview 2010Fx apresentacao evento cloudview 2010
Fx apresentacao evento cloudview 2010
EuroCloud
 

Plus de EuroCloud (20)

Cloudy Datacenter Survey
Cloudy Datacenter SurveyCloudy Datacenter Survey
Cloudy Datacenter Survey
 
A Mobile Sensing Architecture for Massive Urban Scanning
A Mobile Sensing Architecture for Massive Urban ScanningA Mobile Sensing Architecture for Massive Urban Scanning
A Mobile Sensing Architecture for Massive Urban Scanning
 
Cities in the Cloud
Cities in the CloudCities in the Cloud
Cities in the Cloud
 
Evaluation of Virtual Clusters Performance on a Cloud Computing Infrastructure
Evaluation of Virtual Clusters Performance on a Cloud Computing InfrastructureEvaluation of Virtual Clusters Performance on a Cloud Computing Infrastructure
Evaluation of Virtual Clusters Performance on a Cloud Computing Infrastructure
 
Self Optimizing transactional data grids for elastic cloud environments
Self Optimizing transactional data grids for elastic cloud environmentsSelf Optimizing transactional data grids for elastic cloud environments
Self Optimizing transactional data grids for elastic cloud environments
 
Cloudviews eurocloud rcosta
Cloudviews eurocloud rcostaCloudviews eurocloud rcosta
Cloudviews eurocloud rcosta
 
Cloud views2010 google docs privacy
Cloud views2010   google docs privacyCloud views2010   google docs privacy
Cloud views2010 google docs privacy
 
Cil 2010 cloud comp1.0
Cil 2010 cloud comp1.0Cil 2010 cloud comp1.0
Cil 2010 cloud comp1.0
 
CardMobili @ CloudViews2010
CardMobili @ CloudViews2010CardMobili @ CloudViews2010
CardMobili @ CloudViews2010
 
Muchbeta
MuchbetaMuchbeta
Muchbeta
 
Hive solutions cloudviews 2010 presentation
Hive solutions cloudviews 2010 presentationHive solutions cloudviews 2010 presentation
Hive solutions cloudviews 2010 presentation
 
Closetask 10 mins en
Closetask 10 mins enClosetask 10 mins en
Closetask 10 mins en
 
Cardmobili
CardmobiliCardmobili
Cardmobili
 
Apresentacao produtiv cloud views
Apresentacao   produtiv cloud viewsApresentacao   produtiv cloud views
Apresentacao produtiv cloud views
 
Apresentação novastic mp
Apresentação novastic mpApresentação novastic mp
Apresentação novastic mp
 
Ap4 construction platform_presentation_cloud_views_2010
Ap4 construction platform_presentation_cloud_views_2010Ap4 construction platform_presentation_cloud_views_2010
Ap4 construction platform_presentation_cloud_views_2010
 
2010.05.21 invicta angels cloud views.callforbusiness
2010.05.21 invicta angels cloud views.callforbusiness2010.05.21 invicta angels cloud views.callforbusiness
2010.05.21 invicta angels cloud views.callforbusiness
 
Jorge gomes
Jorge gomesJorge gomes
Jorge gomes
 
Luis lima v3
Luis lima v3Luis lima v3
Luis lima v3
 
Fx apresentacao evento cloudview 2010
Fx apresentacao evento cloudview 2010Fx apresentacao evento cloudview 2010
Fx apresentacao evento cloudview 2010
 

Dernier

Dernier (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Building an Outsourcing Ecosystem for Science

  • 1. Building an Outsourcing Ecosystem for Science Kate Keahey keahey@mcs.anl.gov Argonne National Laboratory Computation Institute, University of Chicago 11/24/2011 www.nimbusproject.org 1
  • 2. Outsourcing Computing for Science • Control over environment – Complexity – Consistency • Control over availability – Many real-time scenarios – Naturally bursty character www.nimbusproject.org
  • 3. Introducing Nimbus High-quality, extensible, customizable, open source implementation Nimbus Platform Context Elastic Cloudinit.d Broker Scaling Tools Enable users to use IaaS clouds Nimbus Infrastructure Workspace Cumulus Service Enable providers to build IaaS clouds Enable developers to extend, experiment and customize 11/24/2011 www.nimbusproject.org 3
  • 4. On the Nimbus Team… Applications and Patterns Providing infrastructure outsourcing for science 11/24/2011 www.nimbusproject.org 4
  • 5. Applications and Patterns 11/24/2011 www.nimbusproject.org 5
  • 6. Scientific Cloud Resources • Science Clouds Magellan – UC, UFL, Wispy@Purdue – ~300 cores • Magellan – DOE cloud @ ANL&LBNL FutureGrid – ~4000 cores@ANL • FutureGrid – ~6000 cores • MRI projects, e.g., DIAG = has cloud cycles for you! – Data Intensive Academic Grid – U of Maryland School of DIAG Medicine in Baltimore – ~1200-1500 cores • Outside of US: – WestGrid, Grid’5000 11/24/2011 www.nimbusproject.org 6
  • 7. Sam Angiuoli Institute for Genome Sciences University of Maryland School of Medicine • The emergent need for processing • A virtual appliance for automated and portable sequence analysis • Approach: – Running on user’s desktop – Then running on Nimbus Science Clouds, Magellan and EC2 – A platform for building appliances representing push- button pipelines • Impact – From desktop to cloud – http://clovr.org 11/24/2011 www.nimbusproject.org 7
  • 8. Work by the UVIC team • Detailed analysis of data from the MACHO experiment Dark Matter search • Provide infrastructure for six observational astronomy survey projects • Approach: – Appliance creation and management – Running on a Nimbus cloud on WestGrid – Dynamic Condor pool for astronomy • Status: – In production operation since July 2010 11/24/2011 www.nimbusproject.org 8
  • 9. Work by Pierre Riteau et al, Sky Computing University of Rennes 1 • Sky Computing = a Federation “Sky Computing” IEEE Internet Computing, September 2009 of Clouds • Approach: – Combine resources obtained in multiple Nimbus clouds in FutureGrid and Grid’ 5000 – Combine Context Broker, ViNe, fast image deployment – Deployed a virtual cluster of over 1000 cores on Grid5000 and FutureGrid – largest of this type at the time • Grid’5000 Large Scale Deployment Challenge award • Demonstrated at OGF 29 06/10 • TeraGrid ’10 poster • More at: www.isgtw.org/?pid=1002832 11/24/2011 www.nimbusproject.org 9
  • 10. Work by the UVIC team Canadian Efforts • BarBar Experiment at SLAC in Stanford, CA • Using clouds to simulate electron-positron collisions in their detector • Exploring virtualization as a vehicle for data preservation • Approach: – Appliance preparation and management – Distributed Nimbus clouds – Cloud Scheduler • Running production BaBar workloads 11/24/2011 www.nimbusproject.org 10
  • 11. Work by J. Lauret, J. Balewski and the STAR team Challenge: what makes a proton spin? • Why run in the cloud? – Reduce "time to science” – Near real-time processing Images courtesy of J. Balewski 11/24/2011 www.nimbusproject.org 11
  • 12. • Towards Observatory Science • Sensor-driven processing • Scalable and Highly Available (HA) services • Nimbus team leading the development of Common Execution Infrastructure • From regional Nimbus clouds to commercial clouds • Building platform services for integrated, repeatable support for on-demand science www.nimbusproject.org 12
  • 13. Array Network Facility (ANF) Using elastic processing to operate on seismic data www.nimbusproject.org 13
  • 14. Trends and Patterns • On-demand, elastic processing – Observatories, experiments, conference deadlines, growth management… • Multiple providers – Risk-mitigation: not enough cycles, failure, market factors • From the desktop to the cloud – Escalation pattern: seamlessly integrating more resources • Ease of use/low barrier – Automated provisioning of infrastructure resources • From one-offs to production runs – Steadily increasing in both size and buy-in • Emphasis still on loosely-coupled – … EC2 ClusterCompute virtual clusters on Top500 list 11/24/2011 www.nimbusproject.org 14
  • 15. Outsourcing for Science: Building an Infrastructure Platform 11/24/2011 www.nimbusproject.org 15