SlideShare une entreprise Scribd logo
1  sur  15
Cloud BioLinux: Standardized, Pre-Configured and On-Demand
            Computing for Genomics and Beyond



                    Ntino Krampis, PhD
                         GSC 2011
                        Hinxton, UK
Expensive sequencing and large organizations
                    Commodity sequencing and small labs

●
    large sequencing center, multi-million, broad-impact sequencing projects
●   dedicated bioinformatics department, coordination with other centers


●   small-factor, bench-top sequencer available: GS Junior by 454
●   sequencing as a standard technique in basic biology and genetics research
●   RNAseq and ChiPseq, and each biologist will be tackling a metagenome
“Bioinformatics nation is a land of city-states” Lincoln Stein

●   smaller labs building small-scale bioinformatics infrastructures
●   duplication of effort in compiling and installing software tools
●   some labs have no hardware, expertise, or time to install and run software


●   early pioneer in this area was NEBC BioLinux ( tinyurl.com/BioLinux-NEBC )
●
    desktop linux with with 100+ pre-configured bioinformatics tools
●   example: glimmer, hmmer, phylip, rasmol, genespring, clustalw, EMBOSS


                                  how about large-scale sequence
                                  datasets ?
Cloud BioLinux
standardized, pre-configured and on-demand bioinformatics computing on the cloud


                                 ●   JCVI's cloud computing expertise
                                 ●   NEBC's bioinformatics software repository
                                 ●   community effort – ISMB / BOSC 2010
                                 ●   standardized, pre-configured Virtual Machine (VM, image)

      +                          ●   VM: emulates a computer server, encapsulates operating
                                     system, software libraries and bioinformatics tools
                                 ●   Amazon EC2 computational capacity as a utility, on-demand
                                 ●   rich interface through a remote desktop client

      =

tinyurl.com/CloudBioLinux-JCVI
http://cloudbiolinux.com
Cloud BioLinux and Genomic Standards
      framework to distribute bioinformatics tools, data and analysis results


    create cloud VM / images with standardized software configurations
● customize Cloud BioLinux VMs, based on community requirements
● share customized VMs with collaborators, avoiding effort duplication

● mix and match software from NEBC or other (DebianMed, Scientific Linux etc.)




    whole system snapshot exchange (Dudley and Butte 2010)
● capture the state of the computing system and data
● software execution parameters and “massaged” input datasets

● save into cloud VM / image and share along with analysis results




    democratize access to computing resources
● large-scale computing independently of institutional or geographic boundaries
● only need a desktop computer with internet access
Cloud BioLinux and Genomic Standards
        create cloud VM / images with standard software configurations

●   framework to describe software components in cloud VM / image
●   based on python-fabric automated deployment tool
●   software components listed in simple text files
●   edit the files to mix and match software according to your community needs
●   community members use files to share descriptions of customized systems
●   start with a bare-bones VM, fabric downloads and installs specified software
●   Labs with sensitive data and capacity for private clouds: works identically on
Amazon EC2 or Eucalyptus open-source cloud




tinyurl.com/python-fabric       open.eucalyptus.com
software domains in bioinformatics: nextgen
sequencing, de novo assembly, annotation, phylogeny,
    molecular structures, gene expression analysis

 high-level configuration describing software groups
    for each group individual bioinformatics tools
         tinyurl.com/CloudBioLinux-github
Cloud BioLinux and Genomic Standards
          whole system snapshot exchange


                                                 simply signup at

                                                aws.amazon.com
                                                      then
                                             aws.amazon.com/console
                                                      and




http://tinyurl.com/cloud-biolinux-tutorial
Cloud BioLinux and Genomic Standards
       whole system snapshot exchange



                                              find Cloud Biolinux
                                                   using ID

                                                  enter desired
                                              password for remote
                                                 desktop login

                                                all other default
 http://tinyurl.com/cloud-biolinux-tutorial
free remote desktop client:
nomachine.com/download.php

  simply enter VM IP address
     and your password
What if I want to
    share my
alignments with
a collaborator?

save your data as
   a new VM

  0.10$ / GB /
     month

at 15GB, it costs
  1.5$ / month
Cloud BioLinux and Genomic Standards
                                whole system snapshot exchange
share your analysis results: publicly or only with your
                     collaborators

authorized users can access the cloud VM/image with
       all the software, data, analysis results
Cloud BioLinux and Genomic Standards
                        whole system snapshot exchange



                start VM / image           share


                perform analysis           snapshot           researcher B
researcher A

                snapshot                   perform analysis


                share                      start VM / image
Cloud Biolinux
                                  The future


●   expand community, receive feedback, add more software to the VM

●   analysis pipelines that are used by large sequencing centers

●   actively seeking funding to put major effort in development

●   2011 ISMB/BOSC in Vienna, Austria, http://metalab.at/

●   tinyurl.com/cloudbiolinux-lists or community@cloudbiolinux.com
Acknowledgments & Credits
Brad Chapman      - development of the fabric scripts and community organizer
Tim Booth, Bela Tiwari, Dawn Field – BioLinux 6.0 development and EC2 documentation
Deepak Singh and AWS - education grant supporting ISMB / BOSC workshop
Justin Johnson    –   community and sponsorship of cloudbiolinux.com
J. Craig Venter Inst. - time allowed to work on an open-source project
D. Gomez, E. Navarro, J. Shao, I. Singh – JCVI technology innovation

Members of the Cloud Biolinux community:
Enis Afgan
Michael Heuer
Richard Holland
Mark Jensen                                        Thank you !
Dave Messina
Steffen Möller
Roman Valls

Contenu connexe

En vedette

powerpoint
powerpointpowerpoint
powerpointUruguayo
 
Lee.Portfolio 09
Lee.Portfolio 09Lee.Portfolio 09
Lee.Portfolio 09leesalcone
 
ARINDON - Corporate Presentation
ARINDON - Corporate PresentationARINDON - Corporate Presentation
ARINDON - Corporate PresentationTIJJAY MITCHELL
 
How the Tablet Shopping Experience Will Impact Holiday Retail Sales
How the Tablet Shopping Experience Will Impact Holiday Retail SalesHow the Tablet Shopping Experience Will Impact Holiday Retail Sales
How the Tablet Shopping Experience Will Impact Holiday Retail SalesUserZoom
 
Dokumentacia sommeliers seminars1
Dokumentacia sommeliers seminars1Dokumentacia sommeliers seminars1
Dokumentacia sommeliers seminars1Nural Tataoglu
 
Innovation Benefits Realization for Industrial Research (Part-6)
Innovation Benefits Realization for Industrial Research (Part-6)Innovation Benefits Realization for Industrial Research (Part-6)
Innovation Benefits Realization for Industrial Research (Part-6)Iain Sanders
 
Exposicion de financiamientos
Exposicion de financiamientosExposicion de financiamientos
Exposicion de financiamientosAlita Orpe
 
Laporan observasi rpp dan laboratorium
Laporan observasi rpp dan laboratoriumLaporan observasi rpp dan laboratorium
Laporan observasi rpp dan laboratoriumSamantars17
 
εργασία: κριτήρια επιλογής, συνέπειες βιομηχανικής επανάστασης
εργασία: κριτήρια επιλογής, συνέπειες βιομηχανικής επανάστασηςεργασία: κριτήρια επιλογής, συνέπειες βιομηχανικής επανάστασης
εργασία: κριτήρια επιλογής, συνέπειες βιομηχανικής επανάστασηςπεντάλ σχολικό
 
Understanding Android Handling of Touch Events
Understanding Android Handling of Touch EventsUnderstanding Android Handling of Touch Events
Understanding Android Handling of Touch Eventsjensmohr
 
Mobile note mobile by MAdvertise et Bemobee
Mobile note mobile by MAdvertise et BemobeeMobile note mobile by MAdvertise et Bemobee
Mobile note mobile by MAdvertise et BemobeeFranck Deville
 
Comparison of Jacob's Creek, Rosemount Estate, McGuigan Wines, Lindeman's and...
Comparison of Jacob's Creek, Rosemount Estate, McGuigan Wines, Lindeman's and...Comparison of Jacob's Creek, Rosemount Estate, McGuigan Wines, Lindeman's and...
Comparison of Jacob's Creek, Rosemount Estate, McGuigan Wines, Lindeman's and...Unmetric
 

En vedette (20)

Les xarxes socials
Les xarxes socialsLes xarxes socials
Les xarxes socials
 
powerpoint
powerpointpowerpoint
powerpoint
 
Lee.Portfolio 09
Lee.Portfolio 09Lee.Portfolio 09
Lee.Portfolio 09
 
ARINDON - Corporate Presentation
ARINDON - Corporate PresentationARINDON - Corporate Presentation
ARINDON - Corporate Presentation
 
How the Tablet Shopping Experience Will Impact Holiday Retail Sales
How the Tablet Shopping Experience Will Impact Holiday Retail SalesHow the Tablet Shopping Experience Will Impact Holiday Retail Sales
How the Tablet Shopping Experience Will Impact Holiday Retail Sales
 
Portfolio
PortfolioPortfolio
Portfolio
 
Dokumentacia sommeliers seminars1
Dokumentacia sommeliers seminars1Dokumentacia sommeliers seminars1
Dokumentacia sommeliers seminars1
 
2013 eswc-bio2rdf-r2
2013 eswc-bio2rdf-r22013 eswc-bio2rdf-r2
2013 eswc-bio2rdf-r2
 
Innovation Benefits Realization for Industrial Research (Part-6)
Innovation Benefits Realization for Industrial Research (Part-6)Innovation Benefits Realization for Industrial Research (Part-6)
Innovation Benefits Realization for Industrial Research (Part-6)
 
Ivan Pellegrin
Ivan PellegrinIvan Pellegrin
Ivan Pellegrin
 
Baby talk 2
Baby talk 2Baby talk 2
Baby talk 2
 
resume
resumeresume
resume
 
Great quotes of bruce lee
Great quotes of bruce leeGreat quotes of bruce lee
Great quotes of bruce lee
 
Exposicion de financiamientos
Exposicion de financiamientosExposicion de financiamientos
Exposicion de financiamientos
 
Clases de noviembre 5 ños 1
Clases de noviembre 5 ños 1Clases de noviembre 5 ños 1
Clases de noviembre 5 ños 1
 
Laporan observasi rpp dan laboratorium
Laporan observasi rpp dan laboratoriumLaporan observasi rpp dan laboratorium
Laporan observasi rpp dan laboratorium
 
εργασία: κριτήρια επιλογής, συνέπειες βιομηχανικής επανάστασης
εργασία: κριτήρια επιλογής, συνέπειες βιομηχανικής επανάστασηςεργασία: κριτήρια επιλογής, συνέπειες βιομηχανικής επανάστασης
εργασία: κριτήρια επιλογής, συνέπειες βιομηχανικής επανάστασης
 
Understanding Android Handling of Touch Events
Understanding Android Handling of Touch EventsUnderstanding Android Handling of Touch Events
Understanding Android Handling of Touch Events
 
Mobile note mobile by MAdvertise et Bemobee
Mobile note mobile by MAdvertise et BemobeeMobile note mobile by MAdvertise et Bemobee
Mobile note mobile by MAdvertise et Bemobee
 
Comparison of Jacob's Creek, Rosemount Estate, McGuigan Wines, Lindeman's and...
Comparison of Jacob's Creek, Rosemount Estate, McGuigan Wines, Lindeman's and...Comparison of Jacob's Creek, Rosemount Estate, McGuigan Wines, Lindeman's and...
Comparison of Jacob's Creek, Rosemount Estate, McGuigan Wines, Lindeman's and...
 

Similaire à Ntino Krampis GSC 2011

Cloud BioLinux S.Africa
Cloud BioLinux S.AfricaCloud BioLinux S.Africa
Cloud BioLinux S.AfricaNtino Krampis
 
Principles of Reproducible Workflows (U-DAWS) nfcamp2019
Principles of Reproducible Workflows (U-DAWS) nfcamp2019Principles of Reproducible Workflows (U-DAWS) nfcamp2019
Principles of Reproducible Workflows (U-DAWS) nfcamp2019Venkat Malladi
 
Oscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectPatrick Chanezon
 
Kitware: Qt and Scientific Computing
Kitware: Qt and Scientific ComputingKitware: Qt and Scientific Computing
Kitware: Qt and Scientific Computingaccount inactive
 
Machine Learning , Analytics & Cyber Security the Next Level Threat Analytics...
Machine Learning , Analytics & Cyber Security the Next Level Threat Analytics...Machine Learning , Analytics & Cyber Security the Next Level Threat Analytics...
Machine Learning , Analytics & Cyber Security the Next Level Threat Analytics...PranavPatil822557
 
Clc Bio Basic Company Presentation
Clc Bio Basic Company PresentationClc Bio Basic Company Presentation
Clc Bio Basic Company Presentationclcbio
 
CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018Krishna-Kumar
 
Federating Infrastructure as a Service cloud computing systems to create a un...
Federating Infrastructure as a Service cloud computing systems to create a un...Federating Infrastructure as a Service cloud computing systems to create a un...
Federating Infrastructure as a Service cloud computing systems to create a un...David Wallom
 
Executive Briefing: The Why, What, and Where of Containers
Executive Briefing: The Why, What, and Where of ContainersExecutive Briefing: The Why, What, and Where of Containers
Executive Briefing: The Why, What, and Where of ContainersNVISIA
 
Enabling Production Grade Containerized Applications through Policy Based Inf...
Enabling Production Grade Containerized Applications through Policy Based Inf...Enabling Production Grade Containerized Applications through Policy Based Inf...
Enabling Production Grade Containerized Applications through Policy Based Inf...Docker, Inc.
 
Desktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDesktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDavid Wallom
 
20160629 Habitat Introduction: Austin DevOps/Mesos User Group
20160629 Habitat Introduction: Austin DevOps/Mesos User Group 20160629 Habitat Introduction: Austin DevOps/Mesos User Group
20160629 Habitat Introduction: Austin DevOps/Mesos User Group Matt Ray
 
InfoSec 2011: Crash Course Open Source Cloud Computing
InfoSec 2011: Crash Course Open Source Cloud ComputingInfoSec 2011: Crash Course Open Source Cloud Computing
InfoSec 2011: Crash Course Open Source Cloud ComputingMark Hinkle
 
Understanding Kubernetes
Understanding KubernetesUnderstanding Kubernetes
Understanding KubernetesTu Pham
 
IBM Multicloud Management on the OpenShift Container Platform
IBM Multicloud Management on theOpenShift Container PlatformIBM Multicloud Management on theOpenShift Container Platform
IBM Multicloud Management on the OpenShift Container PlatformMichael Elder
 
Cloud foundry Docker Openstack - Leading Open Source Triumvirate
Cloud foundry Docker Openstack - Leading Open Source TriumvirateCloud foundry Docker Openstack - Leading Open Source Triumvirate
Cloud foundry Docker Openstack - Leading Open Source TriumvirateAnimesh Singh
 
Continuous Integration for Oracle Database Development
Continuous Integration for Oracle Database DevelopmentContinuous Integration for Oracle Database Development
Continuous Integration for Oracle Database DevelopmentVladimir Bakhov
 
Docker Roadshow 2016
Docker Roadshow 2016Docker Roadshow 2016
Docker Roadshow 2016Docker, Inc.
 

Similaire à Ntino Krampis GSC 2011 (20)

Cloud BioLinux S.Africa
Cloud BioLinux S.AfricaCloud BioLinux S.Africa
Cloud BioLinux S.Africa
 
Principles of Reproducible Workflows (U-DAWS) nfcamp2019
Principles of Reproducible Workflows (U-DAWS) nfcamp2019Principles of Reproducible Workflows (U-DAWS) nfcamp2019
Principles of Reproducible Workflows (U-DAWS) nfcamp2019
 
Oscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby project
 
Kitware: Qt and Scientific Computing
Kitware: Qt and Scientific ComputingKitware: Qt and Scientific Computing
Kitware: Qt and Scientific Computing
 
Machine Learning , Analytics & Cyber Security the Next Level Threat Analytics...
Machine Learning , Analytics & Cyber Security the Next Level Threat Analytics...Machine Learning , Analytics & Cyber Security the Next Level Threat Analytics...
Machine Learning , Analytics & Cyber Security the Next Level Threat Analytics...
 
Clc Bio Basic Company Presentation
Clc Bio Basic Company PresentationClc Bio Basic Company Presentation
Clc Bio Basic Company Presentation
 
Domestic cloud
Domestic cloudDomestic cloud
Domestic cloud
 
CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018
 
Federating Infrastructure as a Service cloud computing systems to create a un...
Federating Infrastructure as a Service cloud computing systems to create a un...Federating Infrastructure as a Service cloud computing systems to create a un...
Federating Infrastructure as a Service cloud computing systems to create a un...
 
Executive Briefing: The Why, What, and Where of Containers
Executive Briefing: The Why, What, and Where of ContainersExecutive Briefing: The Why, What, and Where of Containers
Executive Briefing: The Why, What, and Where of Containers
 
Enabling Production Grade Containerized Applications through Policy Based Inf...
Enabling Production Grade Containerized Applications through Policy Based Inf...Enabling Production Grade Containerized Applications through Policy Based Inf...
Enabling Production Grade Containerized Applications through Policy Based Inf...
 
Desktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omicsDesktop as a Service supporting Environmental ‘omics
Desktop as a Service supporting Environmental ‘omics
 
20160629 Habitat Introduction: Austin DevOps/Mesos User Group
20160629 Habitat Introduction: Austin DevOps/Mesos User Group 20160629 Habitat Introduction: Austin DevOps/Mesos User Group
20160629 Habitat Introduction: Austin DevOps/Mesos User Group
 
InfoSec 2011: Crash Course Open Source Cloud Computing
InfoSec 2011: Crash Course Open Source Cloud ComputingInfoSec 2011: Crash Course Open Source Cloud Computing
InfoSec 2011: Crash Course Open Source Cloud Computing
 
final proposal-cloud storage
final proposal-cloud storagefinal proposal-cloud storage
final proposal-cloud storage
 
Understanding Kubernetes
Understanding KubernetesUnderstanding Kubernetes
Understanding Kubernetes
 
IBM Multicloud Management on the OpenShift Container Platform
IBM Multicloud Management on theOpenShift Container PlatformIBM Multicloud Management on theOpenShift Container Platform
IBM Multicloud Management on the OpenShift Container Platform
 
Cloud foundry Docker Openstack - Leading Open Source Triumvirate
Cloud foundry Docker Openstack - Leading Open Source TriumvirateCloud foundry Docker Openstack - Leading Open Source Triumvirate
Cloud foundry Docker Openstack - Leading Open Source Triumvirate
 
Continuous Integration for Oracle Database Development
Continuous Integration for Oracle Database DevelopmentContinuous Integration for Oracle Database Development
Continuous Integration for Oracle Database Development
 
Docker Roadshow 2016
Docker Roadshow 2016Docker Roadshow 2016
Docker Roadshow 2016
 

Dernier

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 

Dernier (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 

Ntino Krampis GSC 2011

  • 1. Cloud BioLinux: Standardized, Pre-Configured and On-Demand Computing for Genomics and Beyond Ntino Krampis, PhD GSC 2011 Hinxton, UK
  • 2. Expensive sequencing and large organizations Commodity sequencing and small labs ● large sequencing center, multi-million, broad-impact sequencing projects ● dedicated bioinformatics department, coordination with other centers ● small-factor, bench-top sequencer available: GS Junior by 454 ● sequencing as a standard technique in basic biology and genetics research ● RNAseq and ChiPseq, and each biologist will be tackling a metagenome
  • 3. “Bioinformatics nation is a land of city-states” Lincoln Stein ● smaller labs building small-scale bioinformatics infrastructures ● duplication of effort in compiling and installing software tools ● some labs have no hardware, expertise, or time to install and run software ● early pioneer in this area was NEBC BioLinux ( tinyurl.com/BioLinux-NEBC ) ● desktop linux with with 100+ pre-configured bioinformatics tools ● example: glimmer, hmmer, phylip, rasmol, genespring, clustalw, EMBOSS how about large-scale sequence datasets ?
  • 4. Cloud BioLinux standardized, pre-configured and on-demand bioinformatics computing on the cloud ● JCVI's cloud computing expertise ● NEBC's bioinformatics software repository ● community effort – ISMB / BOSC 2010 ● standardized, pre-configured Virtual Machine (VM, image) + ● VM: emulates a computer server, encapsulates operating system, software libraries and bioinformatics tools ● Amazon EC2 computational capacity as a utility, on-demand ● rich interface through a remote desktop client = tinyurl.com/CloudBioLinux-JCVI http://cloudbiolinux.com
  • 5. Cloud BioLinux and Genomic Standards framework to distribute bioinformatics tools, data and analysis results create cloud VM / images with standardized software configurations ● customize Cloud BioLinux VMs, based on community requirements ● share customized VMs with collaborators, avoiding effort duplication ● mix and match software from NEBC or other (DebianMed, Scientific Linux etc.) whole system snapshot exchange (Dudley and Butte 2010) ● capture the state of the computing system and data ● software execution parameters and “massaged” input datasets ● save into cloud VM / image and share along with analysis results democratize access to computing resources ● large-scale computing independently of institutional or geographic boundaries ● only need a desktop computer with internet access
  • 6. Cloud BioLinux and Genomic Standards create cloud VM / images with standard software configurations ● framework to describe software components in cloud VM / image ● based on python-fabric automated deployment tool ● software components listed in simple text files ● edit the files to mix and match software according to your community needs ● community members use files to share descriptions of customized systems ● start with a bare-bones VM, fabric downloads and installs specified software ● Labs with sensitive data and capacity for private clouds: works identically on Amazon EC2 or Eucalyptus open-source cloud tinyurl.com/python-fabric open.eucalyptus.com
  • 7. software domains in bioinformatics: nextgen sequencing, de novo assembly, annotation, phylogeny, molecular structures, gene expression analysis high-level configuration describing software groups for each group individual bioinformatics tools tinyurl.com/CloudBioLinux-github
  • 8. Cloud BioLinux and Genomic Standards whole system snapshot exchange simply signup at aws.amazon.com then aws.amazon.com/console and http://tinyurl.com/cloud-biolinux-tutorial
  • 9. Cloud BioLinux and Genomic Standards whole system snapshot exchange find Cloud Biolinux using ID enter desired password for remote desktop login all other default http://tinyurl.com/cloud-biolinux-tutorial
  • 10. free remote desktop client: nomachine.com/download.php simply enter VM IP address and your password
  • 11. What if I want to share my alignments with a collaborator? save your data as a new VM 0.10$ / GB / month at 15GB, it costs 1.5$ / month
  • 12. Cloud BioLinux and Genomic Standards whole system snapshot exchange share your analysis results: publicly or only with your collaborators authorized users can access the cloud VM/image with all the software, data, analysis results
  • 13. Cloud BioLinux and Genomic Standards whole system snapshot exchange start VM / image share perform analysis snapshot researcher B researcher A snapshot perform analysis share start VM / image
  • 14. Cloud Biolinux The future ● expand community, receive feedback, add more software to the VM ● analysis pipelines that are used by large sequencing centers ● actively seeking funding to put major effort in development ● 2011 ISMB/BOSC in Vienna, Austria, http://metalab.at/ ● tinyurl.com/cloudbiolinux-lists or community@cloudbiolinux.com
  • 15. Acknowledgments & Credits Brad Chapman - development of the fabric scripts and community organizer Tim Booth, Bela Tiwari, Dawn Field – BioLinux 6.0 development and EC2 documentation Deepak Singh and AWS - education grant supporting ISMB / BOSC workshop Justin Johnson – community and sponsorship of cloudbiolinux.com J. Craig Venter Inst. - time allowed to work on an open-source project D. Gomez, E. Navarro, J. Shao, I. Singh – JCVI technology innovation Members of the Cloud Biolinux community: Enis Afgan Michael Heuer Richard Holland Mark Jensen Thank you ! Dave Messina Steffen Möller Roman Valls