SlideShare a Scribd company logo
1 of 19
Download to read offline
C A M P A I G N
                                                        Chief Development Officer
                                                        Emory University
                       C A M P A I G N
                                                        School of Nursing Development
                                                        Emory University
                                                        1520 Clifton Road, NE
                                                        School of Nursing Development
                                                        Atlanta, Georgia 30322-4207
                                                        1520 Clifton Road, NE
                                                        P 404.727.1234
                                                        Atlanta, Georgia 30322-4207
                                                        E adorrill@emory.edu
                Discovery of human heteroplasmic sites  P 404.727.1234

              Enabling of human heteroplasmic sites
                 enabled NGS analysis with(out)
                                 E adorrill@emory.edu
                Discovery by an accessible interface to
                    cloud computing infrastructure to
                 enabled by infrastructure
                       the an accessible interface
                    cloud computing infrastructure
                   Enis Afgan, the Galaxy Team,Francesca Chiaromonte
                    Enis Afgan, Hiroki Goto, Ian Paul,
                                                       Anton Nekrutenko,
                                        James Taylor
                       Kateryna Makova, Anton Nekrutenko, James Taylor
                    Enis Afgan, Hiroki Goto, Ian Paul, Francesca Chiaromonte
                       Kateryna Makova, Anton Nekrutenko, James Taylor
                            Bioinformatics Open Source Conference, July 16, 2011, Vienna




missions: you are free to      blog or    live-blog about this presentation as long as you attribute the work to its au
CloudMan-as-a-Bridge
A. Users in different labs   B. Isolated Galaxy instance(s)   C. Dense data center
CloudMan-as-a-Bridge
A. Users in different labs   B. Isolated Galaxy instance(s)   C. Dense data center




SaaS
CloudMan-as-a-Bridge
A. Users in different labs   B. Isolated Galaxy instance(s)   C. Dense data center




SaaS                                                                         IaaS
CloudMan-as-a-Bridge
A. Users in different labs   B. Isolated Galaxy instance(s)   C. Dense data center




SaaS                         CloudMan + Galaxy                               IaaS
CloudMan Platform

A complete solution for instantiating and
managing cloud resources
  With automatically configured Galaxy (if desired)
  Scope of tools and reference datasets exceed Galaxy Main



Deploy a (Galaxy) cluster in minutes!
CloudMan features
•   Deployment on Amazon Web Services Cloud
    •   Wizard-guided setup: requires no computational expertise, no
        infrastructure, no software
•   Automated (thus reproducible) configuration for machine image, tools,
    and data
•   Four modes of cluster type setup
•   Dynamic persistent storage
•   Elastic resource scaling: manual or automatic based on workload
•   Standalone deployment, requiring no external dependencies or services
•   Customizable by individual users
•   Sharing of derived cluster instances -> even the customized ones!
CloudMan features
•   Deployment on Amazon Web Services Cloud
    •   Wizard-guided setup: requires no computational expertise, no
        infrastructure, no software
•   Automated (thus reproducible) configuration for machine image, tools,
    and data
•   Four modes of cluster type setup
•   Dynamic persistent storage
•   Elastic resource scaling: manual or automatic based on workload
•   Standalone deployment, requiring no external dependencies or services
•   Customizable by individual users
•   Sharing of derived cluster instances -> even the customized ones!
Deploying a cluster on AWS




1.
Deploying a cluster on AWS




1.
     2.
Deploying a cluster on AWS




1.
     2.
          3.
Deploying a cluster on AWS




1.
     2.
          3.
                 4.
CloudBioLinux + Galaxy
        + CloudMan =
     •   A lot of (NGS) tools immediately available
         and easily accessible
     •   700GB of reference genome data

Bowtie, BWA, Samtools, MAQ, BFAST, ABySS, Velvet,
MACS, Tophat, Cufflinks, MegaBLAST, BLAST, Sputnik,
Taxonomy, HyPhy, Lastz, Perm, GATK, Srma, Beam, Pass,
LPS, Plink, Haploview, Freebayes, Mosaik, Picard, ...
But what if your tool
  (or data) is missing?
1. Add it! (via automation)
   - CloudMan instances
   are self-contained



2. Save & share               Add tools
   - With individual users
   or make it public
                              Add data
Deployment sharing
Deployment sharing
Deployment sharing
Use CloudMan as SaaS

     Use CloudMan as PaaS

It’s automated, reproducible, extensible, and
                 transparent.
Enis Afgan          Dannon Baker               Dan Blankenberg         Nate Coraor




        Dave Clements        Jeremy Goecks               Jennifer Jackson      Greg von Kuster




          Kanwei Li           James Taylor                Kelly Vincent       Anton Nekrutenko




Supported by the NHGRI (HG005542, HG004909, HG005133), NSF (DBI-0850103), Penn State University,
                Emory University, and the Pennsylvania Department of Public Health

More Related Content

Similar to F06-Cloud-Enabling NGS

Afgan bosc2010 galaxy_cloud
Afgan bosc2010 galaxy_cloudAfgan bosc2010 galaxy_cloud
Afgan bosc2010 galaxy_cloud
BOSC 2010
 
resume_Yuli_Liang
resume_Yuli_Liangresume_Yuli_Liang
resume_Yuli_Liang
Yuli Liang
 
REU_2015_poster_DustynTubbs_PatrickStreifel
REU_2015_poster_DustynTubbs_PatrickStreifelREU_2015_poster_DustynTubbs_PatrickStreifel
REU_2015_poster_DustynTubbs_PatrickStreifel
Patrick Streifel
 
Services For Science April 2009
Services For Science April 2009Services For Science April 2009
Services For Science April 2009
Ian Foster
 

Similar to F06-Cloud-Enabling NGS (20)

Afgan bosc2010 galaxy_cloud
Afgan bosc2010 galaxy_cloudAfgan bosc2010 galaxy_cloud
Afgan bosc2010 galaxy_cloud
 
Communicating trust, enabling criticism
Communicating trust, enabling criticismCommunicating trust, enabling criticism
Communicating trust, enabling criticism
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper Provenance
 
GENI Engineering Conference -- Ian Foster
GENI Engineering Conference -- Ian FosterGENI Engineering Conference -- Ian Foster
GENI Engineering Conference -- Ian Foster
 
resume_Yuli_Liang
resume_Yuli_Liangresume_Yuli_Liang
resume_Yuli_Liang
 
(SEC313) Security & Compliance at the Petabyte Scale
(SEC313) Security & Compliance at the Petabyte Scale(SEC313) Security & Compliance at the Petabyte Scale
(SEC313) Security & Compliance at the Petabyte Scale
 
REU_2015_poster_DustynTubbs_PatrickStreifel
REU_2015_poster_DustynTubbs_PatrickStreifelREU_2015_poster_DustynTubbs_PatrickStreifel
REU_2015_poster_DustynTubbs_PatrickStreifel
 
The W3C PROV standard: data model for the provenance of information, and enab...
The W3C PROV standard:data model for the provenance of information, and enab...The W3C PROV standard:data model for the provenance of information, and enab...
The W3C PROV standard: data model for the provenance of information, and enab...
 
CILogon and InCommon: Technical Update
CILogon and InCommon: Technical UpdateCILogon and InCommon: Technical Update
CILogon and InCommon: Technical Update
 
Services For Science April 2009
Services For Science April 2009Services For Science April 2009
Services For Science April 2009
 
The need to redefine genomic data sharing - moving towards Open Science Oct ...
The need to redefine genomic data sharing - moving towards Open Science  Oct ...The need to redefine genomic data sharing - moving towards Open Science  Oct ...
The need to redefine genomic data sharing - moving towards Open Science Oct ...
 
Cyberinfrastructure for Ocean Cabled Observatories
Cyberinfrastructure for Ocean Cabled ObservatoriesCyberinfrastructure for Ocean Cabled Observatories
Cyberinfrastructure for Ocean Cabled Observatories
 
Accelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy ScienceAccelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy Science
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
 
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterNIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
 
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce Hoff
 
re:Invent 2013-foster-madduri
re:Invent 2013-foster-maddurire:Invent 2013-foster-madduri
re:Invent 2013-foster-madduri
 
myExperiment @ Nettab
myExperiment @ NettabmyExperiment @ Nettab
myExperiment @ Nettab
 
Jim Basney
Jim BasneyJim Basney
Jim Basney
 

More from Bioinformatics Open Source Conference

More from Bioinformatics Open Source Conference (20)

Bosc mercer
Bosc mercerBosc mercer
Bosc mercer
 
Bosc2011 arakawa
Bosc2011 arakawaBosc2011 arakawa
Bosc2011 arakawa
 
Bosc2011 isobar-fbp
Bosc2011 isobar-fbpBosc2011 isobar-fbp
Bosc2011 isobar-fbp
 
Talk6 biopython bosc2011
Talk6 biopython bosc2011Talk6 biopython bosc2011
Talk6 biopython bosc2011
 
Unipro ugene bosc 2011 update
Unipro ugene bosc 2011 updateUnipro ugene bosc 2011 update
Unipro ugene bosc 2011 update
 
Bosc2011 ntino-krampis-full
Bosc2011 ntino-krampis-fullBosc2011 ntino-krampis-full
Bosc2011 ntino-krampis-full
 
Bosc talk 7-15-2011x
Bosc talk 7-15-2011xBosc talk 7-15-2011x
Bosc talk 7-15-2011x
 
F02-Cloud-Cloud BioLinux
F02-Cloud-Cloud BioLinuxF02-Cloud-Cloud BioLinux
F02-Cloud-Cloud BioLinux
 
B07-GenomeContent-Biomart
B07-GenomeContent-BiomartB07-GenomeContent-Biomart
B07-GenomeContent-Biomart
 
B03-GenomeContent-Intermine
B03-GenomeContent-IntermineB03-GenomeContent-Intermine
B03-GenomeContent-Intermine
 
D03-NextGen-Bio-NGS
D03-NextGen-Bio-NGSD03-NextGen-Bio-NGS
D03-NextGen-Bio-NGS
 
F07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAMF07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAM
 
C03-Visualization-Webapollo
C03-Visualization-WebapolloC03-Visualization-Webapollo
C03-Visualization-Webapollo
 
F01-Cloud-Mygene.info
F01-Cloud-Mygene.infoF01-Cloud-Mygene.info
F01-Cloud-Mygene.info
 
A01-Openness in knowledge-based systems
A01-Openness in knowledge-based systemsA01-Openness in knowledge-based systems
A01-Openness in knowledge-based systems
 
F03-Cloud-Obiwee
F03-Cloud-ObiweeF03-Cloud-Obiwee
F03-Cloud-Obiwee
 
F05-Cloud-Sequencescape
F05-Cloud-SequencescapeF05-Cloud-Sequencescape
F05-Cloud-Sequencescape
 
C02-Visualization-Applying visual analytics
C02-Visualization-Applying visual analyticsC02-Visualization-Applying visual analytics
C02-Visualization-Applying visual analytics
 
B04-GenomeContent-EasyDAS
B04-GenomeContent-EasyDASB04-GenomeContent-EasyDAS
B04-GenomeContent-EasyDAS
 
G04-Misc-Debianmed
G04-Misc-DebianmedG04-Misc-Debianmed
G04-Misc-Debianmed
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

F06-Cloud-Enabling NGS

  • 1. C A M P A I G N Chief Development Officer Emory University C A M P A I G N School of Nursing Development Emory University 1520 Clifton Road, NE School of Nursing Development Atlanta, Georgia 30322-4207 1520 Clifton Road, NE P 404.727.1234 Atlanta, Georgia 30322-4207 E adorrill@emory.edu Discovery of human heteroplasmic sites P 404.727.1234 Enabling of human heteroplasmic sites enabled NGS analysis with(out) E adorrill@emory.edu Discovery by an accessible interface to cloud computing infrastructure to enabled by infrastructure the an accessible interface cloud computing infrastructure Enis Afgan, the Galaxy Team,Francesca Chiaromonte Enis Afgan, Hiroki Goto, Ian Paul, Anton Nekrutenko, James Taylor Kateryna Makova, Anton Nekrutenko, James Taylor Enis Afgan, Hiroki Goto, Ian Paul, Francesca Chiaromonte Kateryna Makova, Anton Nekrutenko, James Taylor Bioinformatics Open Source Conference, July 16, 2011, Vienna missions: you are free to blog or live-blog about this presentation as long as you attribute the work to its au
  • 2. CloudMan-as-a-Bridge A. Users in different labs B. Isolated Galaxy instance(s) C. Dense data center
  • 3. CloudMan-as-a-Bridge A. Users in different labs B. Isolated Galaxy instance(s) C. Dense data center SaaS
  • 4. CloudMan-as-a-Bridge A. Users in different labs B. Isolated Galaxy instance(s) C. Dense data center SaaS IaaS
  • 5. CloudMan-as-a-Bridge A. Users in different labs B. Isolated Galaxy instance(s) C. Dense data center SaaS CloudMan + Galaxy IaaS
  • 6. CloudMan Platform A complete solution for instantiating and managing cloud resources With automatically configured Galaxy (if desired) Scope of tools and reference datasets exceed Galaxy Main Deploy a (Galaxy) cluster in minutes!
  • 7. CloudMan features • Deployment on Amazon Web Services Cloud • Wizard-guided setup: requires no computational expertise, no infrastructure, no software • Automated (thus reproducible) configuration for machine image, tools, and data • Four modes of cluster type setup • Dynamic persistent storage • Elastic resource scaling: manual or automatic based on workload • Standalone deployment, requiring no external dependencies or services • Customizable by individual users • Sharing of derived cluster instances -> even the customized ones!
  • 8. CloudMan features • Deployment on Amazon Web Services Cloud • Wizard-guided setup: requires no computational expertise, no infrastructure, no software • Automated (thus reproducible) configuration for machine image, tools, and data • Four modes of cluster type setup • Dynamic persistent storage • Elastic resource scaling: manual or automatic based on workload • Standalone deployment, requiring no external dependencies or services • Customizable by individual users • Sharing of derived cluster instances -> even the customized ones!
  • 10. Deploying a cluster on AWS 1. 2.
  • 11. Deploying a cluster on AWS 1. 2. 3.
  • 12. Deploying a cluster on AWS 1. 2. 3. 4.
  • 13. CloudBioLinux + Galaxy + CloudMan = • A lot of (NGS) tools immediately available and easily accessible • 700GB of reference genome data Bowtie, BWA, Samtools, MAQ, BFAST, ABySS, Velvet, MACS, Tophat, Cufflinks, MegaBLAST, BLAST, Sputnik, Taxonomy, HyPhy, Lastz, Perm, GATK, Srma, Beam, Pass, LPS, Plink, Haploview, Freebayes, Mosaik, Picard, ...
  • 14. But what if your tool (or data) is missing? 1. Add it! (via automation) - CloudMan instances are self-contained 2. Save & share Add tools - With individual users or make it public Add data
  • 18. Use CloudMan as SaaS Use CloudMan as PaaS It’s automated, reproducible, extensible, and transparent.
  • 19. Enis Afgan Dannon Baker Dan Blankenberg Nate Coraor Dave Clements Jeremy Goecks Jennifer Jackson Greg von Kuster Kanwei Li James Taylor Kelly Vincent Anton Nekrutenko Supported by the NHGRI (HG005542, HG004909, HG005133), NSF (DBI-0850103), Penn State University, Emory University, and the Pennsylvania Department of Public Health