Cloud BioLinux: pre-configured and on-demand computing for genomics without institutional, geographic or economic boundaries
Ntino Krampis, PhD
JCVI-NIAID-UL workshop, S. Africa 2011

Low-cost sequencing technology
  • example: GS Junior by 454
  • sequencing is becoming standard in biology and genetics research
  • besides whole genomes: RNA-seq, ChIP-seq, and metagenomics

Acquiring the sequence data is only the first step
  • Problem 1: sequence data analysis requires high-performance, expensive computing hardware
  • Problem 2: many commonly used bioinformatics tools are difficult to install; they are usually available only as source code and need technical expertise

Solving problem 1: computational capacity on the cloud
  • we are all using the cloud: Gmail, Google Docs, Yahoo! Mail, Facebook; you store and access data on a remote computer
  • cloud computers are rented pay-as-you-go from service providers such as Amazon Elastic Compute Cloud (EC2)

Cloud computing with Amazon EC2
  • cloud computers cost $0.085 - $2 per hour (max 64 GB memory and 8 processors)
  • used by companies that need additional computers without investing in hardware
  • physical locations: US East / West regions, EU, Singapore, Japan
  • researchers work at the closest location, then distribute results world-wide
  • democratizes access to computing resources across institutional, economic or national boundaries
  • 750 hours free for new users: http://aws.amazon.com/free/
  • additional services besides computing and storage: http://aws.amazon.com

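To make the pay-as-you-go pricing concrete, here is a minimal sketch of the arithmetic, using only the hourly price range quoted on the slide. The function name `ec2_cost` is hypothetical, and the sketch assumes cost is simply rate × hours × number of instances; it ignores billing granularity, storage, data transfer and the free tier.

```python
def ec2_cost(hourly_rate_usd: float, hours: float, instances: int = 1) -> float:
    """Estimate pay-as-you-go compute cost (assumption: rate x hours x instances)."""
    if hourly_rate_usd < 0 or hours < 0 or instances < 0:
        raise ValueError("rate, hours and instances must be non-negative")
    return hourly_rate_usd * hours * instances


if __name__ == "__main__":
    # Price range taken from the slide: $0.085 to $2 per hour.
    low = ec2_cost(0.085, hours=24)
    high = ec2_cost(2.00, hours=24)
    print(f"One instance for 24 hours: ${low:.2f} to ${high:.2f}")
```

The same function covers the large-scale case by raising `instances`, which is the point of renting capacity only for the duration of an analysis.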
How does cloud computing work?
  • a VM is uploaded to the cloud and runs using on-demand computing capacity from the EC2 cloud service
  • can be accessed world-wide through a desktop / laptop computer with Internet access
  • removes the need for local computing infrastructure at each laboratory
  [Diagram: local desktop computers connect over the Internet to VMs on the remote Amazon EC2 cloud computing service]

Solving problem 2: Cloud BioLinux
  • Cloud BioLinux offers a VM on the cloud with 100+ pre-installed and configured bioinformatics tools
  • sequence analysis, de novo assembly, annotation, phylogeny, molecular modeling, gene expression
  • a researcher can initiate a practically unlimited number of VMs for large-scale data analysis

Starting our tutorial: using the cloud
  • sign in to the Amazon EC2 cloud control console: http://aws.amazon.com/console
  • Username: [email_address]
  • Password: SAcloud!

Launch Cloud BioLinux through the EC2 cloud console
  • click the Launch Instance button

Cloud BioLinux launch wizard: steps 1 & 2
  • 2. select computational capacity: Large - 2 CPU cores, 7.5 GB memory

Cloud BioLinux launch wizard: step 3

Cloud BioLinux launch wizard: steps 4 & 5
  • 5. select “Proceed without a Key Pair”

Cloud BioLinux launch wizard: steps 6 & 7

Cloud BioLinux launch status
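
The wizard choices above could also be written down as a launch request. The sketch below only assembles the parameters and makes no network call. The keyword names follow boto3's `run_instances`; everything else is an assumption: the helper name `launch_request` is hypothetical, `ami-00000000` is a placeholder rather than the real Cloud BioLinux image ID, and `m1.large` is assumed to be the instance type behind the "Large" option on the slide.

```python
def launch_request(image_id: str, instance_type: str = "m1.large", count: int = 1) -> dict:
    """Build keyword arguments for an EC2 launch (names as in boto3 run_instances).

    No "KeyName" entry is added, mirroring the wizard's
    "Proceed without a Key Pair" choice.
    """
    if count < 1:
        raise ValueError("count must be at least 1")
    return {
        "ImageId": image_id,            # placeholder; look up the real image ID
        "InstanceType": instance_type,  # assumed name for "Large: 2 cores, 7.5 GB"
        "MinCount": count,
        "MaxCount": count,
    }


if __name__ == "__main__":
    params = launch_request("ami-00000000")  # hypothetical image ID
    print(params)
    # With AWS credentials configured, these could be passed on as:
    #   boto3.client("ec2").run_instances(**params)
```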

Cloud BioLinux S.Africa

  • GenBank and Ensembl databases, 1000 Genomes Project (human), influenza
  • data hosted for free; users pay only for the computing time used
  • community program: http://aws.amazon.com/datasets/submit
  • advantage: putting the data where computational capacity is available
  • Amazon EC2 education-research grants: http://aws.amazon.com/education/

Any questions before we get to the exercises?

Connecting remotely to Cloud BioLinux
  • click the NX client icon on your computer's desktop:
  • A. paste the instance's DNS name in the “Host” box
  • B. select “Unix”, “Gnome”, and the remote desktop size
  • C. “ubuntu” is the default user login; “workshop” is the password we set

  • two S. aureus strains and one S. carnosus species
  • drag & drop the .fna files onto the Cloud BioLinux desktop

  • save and share the Virtual Machine (VM) containing your analysis results with a collaborator
  • storage costs: $0.10 / GB / month
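
As a rough illustration of the storage price, the sketch below multiplies the quoted rate by snapshot size and duration. The function name `storage_cost` and the 20 GB / 6 month figures are made up for the example; the rate default is the $0.10 per GB per month from the slide.

```python
def storage_cost(size_gb: float, months: float, rate_usd_per_gb_month: float = 0.10) -> float:
    """Estimate the cost of keeping a saved VM (assumption: size x months x rate)."""
    if size_gb < 0 or months < 0 or rate_usd_per_gb_month < 0:
        raise ValueError("size, months and rate must be non-negative")
    return size_gb * months * rate_usd_per_gb_month


if __name__ == "__main__":
    # Hypothetical example: a 20 GB VM snapshot kept for 6 months.
    print(f"Estimated storage cost: ${storage_cost(20, 6):.2f}")
```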

Cloud BioLinux: whole system snapshot exchange
  • authorize access to the VM: public or for certain users
  • other researchers can access the VM with all the software, data, and analysis results directly on the cloud

Acknowledgments & Credits
  • Brad Chapman, Tim Booth, Bela Tiwari, Dawn Field: Cloud BioLinux development
  • Deepak Singh and AWS: compute credits on EC2 supporting initial development
  • J. Craig Venter Inst.: sponsorship / time allowed to work on this project
  • D. Gomez, E. Navarro, J. Shao, I. Singh, D. Edwards, M. Stout: JCVI tech innovation
  • Members of the Cloud BioLinux community: Enis Afgan, Michael Heuer, Richard Holland, Mark Jensen, Dave Messina, Steffen Möller, Roman Valls

Thank you!