Assembly tools and Visualisation
Matthias Haimel




                          EBI is an Outstation of the European Molecular Biology Laboratory.
Overview
    • Assemblers
               • ABySS
               • SOAPdenovo
    • Visualisation
               • Tablet
               • ABySS-Explorer
    • Read mapping
               • SAM / BAM
    • Visualisation
               • Artemis
               • IGV - Integrative Genomics Viewer



2   25.04.11             Assemblers
ABySS                                 Assembly By Short Sequences

    • Genome Sciences Centre, Vancouver
               • http://www.bcgsc.ca/platform/bioinfo/software/abyss
               • Open source, BCCA Licence
    • de Bruijn graph
               • Trimming (tip clipping), bubble popping
               • Use paired-end information: resolve ambiguities between contigs
               • parallel (use cluster)
    • Files
               • Fasta / Fastq
               • Sam/Bam
               • colour-space
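
As a rough illustration of the de Bruijn graph idea that ABySS builds on, the following Python sketch constructs such a graph from two toy reads. It is not ABySS's implementation: the function name, the reads and the choice of k=4 are assumptions made for illustration, and tip clipping, bubble popping and reverse complements are left out.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer to the set of (k-1)-mers that follow it in some read."""
    graph = defaultdict(set)
    for read in reads:
        # Slide a window of length k over the read; each k-mer becomes one edge
        # from its (k-1)-base prefix to its (k-1)-base suffix.
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


# Hypothetical toy reads that overlap to spell ACGTACC.
reads = ["ACGTA", "GTACC"]
graph = build_de_bruijn(reads, k=4)
for node in sorted(graph):
    print(node, "->", sorted(graph[node]))
```

Walking the single path through this graph recovers the sequence shared by the two reads; in real data, sequencing errors add short dead-end branches (tips) and parallel paths (bubbles) that the assembler has to remove.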



ABySS
    • ABYSS (single end)
               • e.g. ABYSS -k27 single.fastq -o contigs.fa
    • abyss-pe (paired end)
               • e.g. abyss-pe k=27 n=10 in='read_1.fastq read_2.fastq' name=ecli
    • Multiple libraries
               •   ... lib='read1 read2' read1='read1_1.fa read1_2.fa' read2='read2_1.fa read2_2.fa'
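
The multiple-library form can be easier to see when the arguments are assembled programmatically. The sketch below builds an abyss-pe argument list using only the parameters shown on the slide (k, name, lib and one entry per library); the helper name and the file names are hypothetical.

```python
def abyss_pe_args(name, k, libraries):
    """Build an abyss-pe argument list for one or more paired-end libraries."""
    # lib= lists the library names; each name then gets its own list of files.
    args = ["abyss-pe", f"k={k}", f"name={name}", "lib=" + " ".join(libraries)]
    for lib_name, files in libraries.items():
        args.append(f"{lib_name}=" + " ".join(files))
    return args


# Hypothetical file names for two paired-end libraries.
libraries = {
    "read1": ["read1_1.fa", "read1_2.fa"],
    "read2": ["read2_1.fa", "read2_2.fa"],
}
args = abyss_pe_args("ecli", 27, libraries)
print(args)
```

Passing such a list to `subprocess.run(args)` avoids the shell, so the quotes needed on the command line are not required here.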




SOAPdenovo
    • Beijing Genomics Institute (BGI), China
               • http://soap.genomics.org.cn/soapdenovo.html
               • Panda genome
               • Source available
    • de Bruijn graph
               • pre-set Kmer frequency threshold
               • Bubble removing
    • Build scaffold
               • mapping reads to contigs
               • gap filling
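
The k-mer frequency threshold can be sketched in a few lines of Python: k-mers seen fewer times than the threshold are treated as likely sequencing errors and dropped before the graph is built. This is only an illustration of the idea, not SOAPdenovo's code; the function names, reads and threshold value are assumptions.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer occurring in the reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def solid_kmers(counts, min_count):
    """Keep only k-mers observed at least min_count times."""
    return {kmer for kmer, n in counts.items() if n >= min_count}


# The third read carries one hypothetical sequencing error (A -> T).
reads = ["ACGTAC", "ACGTAC", "ACGTTC"]
counts = count_kmers(reads, k=4)
kept = solid_kmers(counts, min_count=2)
print(sorted(kept))
```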




SOAPdenovo
    • Full run
               • e.g. SOAPdenovo all -s read.config -K 27 -o contigs.fa
    • Run sub steps
               •   pregraph    comparable to velveth (build the k-mer graph)
               •   contig      comparable to velvetg (build contigs)
               •   map         map reads to contigs
               •   scaff       scaffolding
    • Configuration
               • Config file input instead of read files
               • Specify rank, usage (assembly/scaffolding), insert size
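
A configuration file of this kind might be generated as follows. The key names (`max_rd_len`, `[LIB]`, `avg_ins`, `asm_flags`, `rank`, `q1`, `q2`) are written as I recall them from the SOAPdenovo documentation and should be checked against the version in use; the helper name, the values and the file names are hypothetical.

```python
def soap_config(max_rd_len, libraries):
    """Render a SOAPdenovo-style config with one [LIB] section per library."""
    lines = [f"max_rd_len={max_rd_len}"]
    for lib in libraries:
        lines += [
            "[LIB]",
            f"avg_ins={lib['avg_ins']}",      # average insert size
            f"asm_flags={lib['asm_flags']}",  # usage (assumed: 1 contigs, 2 scaffolds, 3 both)
            f"rank={lib['rank']}",            # order in which libraries are used for scaffolding
            f"q1={lib['q1']}",                # FASTQ file, first read of each pair
            f"q2={lib['q2']}",                # FASTQ file, second read of each pair
        ]
    return "\n".join(lines) + "\n"


# Hypothetical single library with a 200 bp insert size.
config = soap_config(100, [
    {"avg_ins": 200, "asm_flags": 3, "rank": 1,
     "q1": "read_1.fastq", "q2": "read_2.fastq"},
])
print(config)
```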




Visualisation                     http://bioinf.scri.ac.uk/tablet/

    • Tablet
               • Lightweight
               • Easy to use
    • Formats
               •   ACE
               •   AFG
               •   BAM
               •   BANK (AMOS)




Visualisation - Velvet
    • Tablet
               • velvetg ... -amos_file yes
    • GraphViz
               •   Transform velvet graph into GraphViz format
               •   Contributed by Paul Harrison
               •   <velvet>/contrib/layout/
               •   Velvet -> .dot file (Python script)
               •   .dot -> png (graphviz)
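
To show what the intermediate .dot file looks like, here is a minimal sketch that writes a directed graph in GraphViz DOT syntax. It is not the script from `<velvet>/contrib/layout/`; the function name and the node numbers are made up for illustration.

```python
def to_dot(edges, name="assembly"):
    """Render a list of (source, target) node pairs as GraphViz DOT text."""
    lines = [f"digraph {name} {{"]
    for src, dst in edges:
        lines.append(f'    "{src}" -> "{dst}";')
    lines.append("}")
    return "\n".join(lines)


# Hypothetical graph: node 2 branches into nodes 3 and 4.
edges = [(1, 2), (2, 3), (2, 4)]
dot_text = to_dot(edges)
print(dot_text)
```

Saved to a file such as `graph.dot`, the output can be rendered with GraphViz, for example `dot -Tpng graph.dot -o graph.png`.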




Visualisation                   http://www.bcgsc.ca/platform/bioinfo/software/abyss-explorer


    • ABySS-Explorer
               • Visualizes ABySS assemblies
               • Interactive graph structure
               • Filter contigs




Assembler - Practical
     • Assemblers
                • ABySS
                • SOAPdenovo
     • Visualisation
                • Tablet
                • ABySS-Explorer




Read mapping                             http://samtools.sourceforge.net/SAM1.pdf

     • SAM / BAM
                •   Sequence Alignment / Map format (SAM)
                •   Binary form of SAM (BAM)
                •   generic format
                •   Flexible and simple
                •   Compact (BAM)
                •   Allow indexing
                •   Load regions
                •   Support streaming




SAM
     • Header
                •   File format version information
                •   Sequence dictionary (name/length/..)
                •   Read group (platform/library/...)
                •   Program info
     • Body
                • Alignment information




SAM Header
     • '@' followed by record type (two characters)
                @HD   VN:1.0
                @SQ   SN:chr20 LN:62435964
                @RG   ID:L1 PU:SC_1_10 LB:SC_1 SM:NA12891
                @RG   ID:L2 PU:SC_2_12 LB:SC_2 SM:NA12891
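
Header lines like these are tab-delimited and can be split into a record type plus TAG:VALUE pairs. The parser below is a minimal sketch (the function name is an assumption); it does not handle `@CO` comment lines, which carry free text rather than tags.

```python
def parse_header_line(line):
    """Split one SAM header line into (record_type, {tag: value})."""
    fields = line.rstrip("\n").split("\t")
    record_type = fields[0][1:]  # drop the leading '@'
    # Split on the first ':' only, since values may themselves contain colons.
    tags = dict(field.split(":", 1) for field in fields[1:])
    return record_type, tags


header = [
    "@HD\tVN:1.0",
    "@SQ\tSN:chr20\tLN:62435964",
    "@RG\tID:L1\tPU:SC_1_10\tLB:SC_1\tSM:NA12891",
]
for line in header:
    print(parse_header_line(line))
```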




SAM Alignment
     • Tab delimited lines

      Read_28833_29006_6945 99 chr20 28833 20 10M1D25M = 28993 195 AGCT... <<<<... NM:i:1 RG:Z:L1
      read_28701_28881_323b 147 chr20 28834 30 35M = 28701 -168 ACCT... <<7;:... MF:i:18 RG:Z:L2
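
The alignment lines above can be unpacked into the eleven mandatory SAM fields plus optional tags. The sketch below also decodes the common FLAG bits and computes how many reference bases a CIGAR string covers. Function names are assumptions, only the most common flag bits are listed, and the sequence and quality strings are left elided exactly as on the slide.

```python
import re

# Common FLAG bits as defined in the SAM specification (dict order = bit order).
FLAG_BITS = {
    0x1: "paired",
    0x2: "proper_pair",
    0x4: "unmapped",
    0x8: "mate_unmapped",
    0x10: "reverse",
    0x20: "mate_reverse",
    0x40: "first_in_pair",
    0x80: "second_in_pair",
}


def decode_flag(flag):
    """Return the names of all bits set in a SAM FLAG value."""
    return [name for bit, name in FLAG_BITS.items() if flag & bit]


def reference_span(cigar):
    """Number of reference bases covered (M, D, N, = and X consume reference)."""
    ops = re.findall(r"(\d+)([MIDNSHP=X])", cigar)
    return sum(int(length) for length, op in ops if op in "MDN=X")


def parse_alignment(line):
    """Split one tab-delimited SAM alignment line into named fields."""
    f = line.rstrip("\n").split("\t")
    return {
        "qname": f[0], "flag": int(f[1]), "rname": f[2], "pos": int(f[3]),
        "mapq": int(f[4]), "cigar": f[5], "rnext": f[6], "pnext": int(f[7]),
        "tlen": int(f[8]), "seq": f[9], "qual": f[10], "tags": f[11:],
    }


line = ("Read_28833_29006_6945\t99\tchr20\t28833\t20\t10M1D25M\t=\t28993\t195\t"
        "AGCT...\t<<<<...\tNM:i:1\tRG:Z:L1")
record = parse_alignment(line)
print(decode_flag(record["flag"]))
print(reference_span(record["cigar"]))
```

For the first read, FLAG 99 is 1 + 2 + 32 + 64, and the CIGAR 10M1D25M covers 10 + 1 + 25 = 36 reference bases, so the alignment spans positions 28833 to 28868 on chr20.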




Tools
     • Mapping Reads
                • BWA
                • Bowtie
                • SSAHA2
     • Manipulate SAM/BAM
                • SAM Tools package
                • Picard




BWA
     • Burrows-Wheeler Alignment Tool
                • Map (single/paired-end/long) reads to a sequence
     • Index database
                • bwa index -a bwtsw database.fasta
     • Align reads
                • bwa aln database.fasta short_read.fastq > aln_sa.sai
     • Generate alignments
                •   bwa sampe database.fasta aln_sa1.sai aln_sa2.sai read1.fq read2.fq > aln.sam
     • Long reads
                • bwa bwasw database.fasta long_read.fastq > aln.sam
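
For paired-end data the steps above chain together, with `bwa aln` run once per read file so that `bwa sampe` receives two .sai files. The sketch below only assembles the commands from the slide, in order, without executing anything; the function name and the file names are assumptions.

```python
def bwa_paired_end_commands(reference, read1, read2, out_sam="aln.sam"):
    """Return (argv, stdout_file) pairs for a paired-end bwa aln/sampe run."""
    sai1, sai2 = "aln_sa1.sai", "aln_sa2.sai"
    return [
        (["bwa", "index", "-a", "bwtsw", reference], None),  # build the index
        (["bwa", "aln", reference, read1], sai1),            # align first reads
        (["bwa", "aln", reference, read2], sai2),            # align second reads
        (["bwa", "sampe", reference, sai1, sai2, read1, read2], out_sam),
    ]


commands = bwa_paired_end_commands("database.fasta", "read1.fq", "read2.fq")
for argv, stdout_file in commands:
    redirect = f" > {stdout_file}" if stdout_file else ""
    print(" ".join(argv) + redirect)
```

To actually run a step, each `argv` could be passed to `subprocess.run` with `stdout` pointing at the named output file.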




SAM tools
     • Utilities for SAM format
                • samtools <command> ...
     • Commands:
                •   view: SAM <-> BAM
                •   sort: sort BAM file
                •   index: build BAM file index
                •   merge: merges x BAM files
                •   pileup: alignment in the pileup format
                •   tview: integrated Text alignment viewer
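
A typical use of these commands is turning a SAM file into a sorted, indexed BAM file that a viewer can load by region. The sketch below assembles the view, sort and index calls in that order. The option syntax follows recent samtools releases (1.x) and is an assumption with respect to these slides: the 0.1.x releases current at the time used `samtools view -bS` and `samtools sort in.bam out_prefix`. The function and file names are hypothetical.

```python
def samtools_prepare_commands(sam_path, prefix):
    """Return argv lists converting SAM to a sorted, indexed BAM file."""
    bam = f"{prefix}.bam"
    sorted_bam = f"{prefix}.sorted.bam"
    return [
        ["samtools", "view", "-b", "-o", bam, sam_path],  # SAM -> BAM
        ["samtools", "sort", "-o", sorted_bam, bam],      # sort by coordinate
        ["samtools", "index", sorted_bam],                # build the BAM index
    ]


commands = samtools_prepare_commands("aln.sam", "aln")
for argv in commands:
    print(" ".join(argv))
```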




Visualisation                    Integrative Genomics Viewer
                                       http://www.broadinstitute.org/igv/
     • IGV
                • Good integration
     • Formats
                •   DAS
                •   BAM
                •   GFF
                •   ...
     • Tools
                • Run scripts
                • Export region
                • ...



Visualisation
                                   http://www.sanger.ac.uk/resources/software/artemis/

     • Artemis
          • Sequence Viewer
          • Annotation tool
     • Formats
          •     EMBL
          •     GENBANK
          •     GFF
          •     FASTA
          •     BAM




Mapping - Practical
     • Mapping reads + prepare for visualisation
                • BWA
                • samtools
     • Visualisation
                • IGV




