Next Gen Sequencing [NGS]
• History of DNA Sequencing
– Maxam-Gilbert
– Sanger
– ABI
• NGS Technologies:
– 454, Illumina, PacBio, ABI, Helicos,
– Ion Torrent, Nanopores
• Applications:
– Genomes, RNA-Seq, ChIP-Seq, CGH, Cancer Genome, Environmental
Human Genome: 1990-2000
Presented by Dominic Suciu, Ph.D.
Preliminaries: Central Dogma
Gene ~ Protein ~ Enzyme
Gene (DNA)
[Program in directory]
Protein (PolyPeptide)
[Program in RAM]
~~ Enzyme ~~
Functional agent
Messenger RNA
Genome (DNA)
[Hard drive]
Preliminaries: Phages
BacterioPhages are viruses that infect bacteria
Some Bacteria are immune to certain phages
[Hamilton O. Smith, early '70s]
Restriction Endonucleases: Enzymes that specifically
cleave certain DNA sequences.
Bacterial cells use these as a crude anti-phage
defense mechanism
Preliminaries: Restriction Enzymes
• Molecular scissors
• Their discovery allowed researchers to physically map genomes
• Big confirmatory clue that Genome sequence determines species and even individuals
Preliminaries: Cloning
Start with picograms of DNA
End up with micrograms of highly purified copies
Each Colony is highly enriched
Each colony is endlessly amplifiable
pBR322 is a vector, an
engineered plasmid.
It can reproduce itself
inside a bacterial host
and do nothing else.
Preliminaries: PCR [1985]
As long as you know
the beginning and
end of a sequence,
you can amplify
anything
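The arithmetic behind PCR amplification can be sketched as follows. This is a minimal model, assuming every cycle multiplies the template by a fixed factor; the function name `pcr_copies` and the `efficiency` parameter are illustrative and not from the slides, and the late-cycle plateau seen in real reactions is not modeled.

```python
def pcr_copies(initial_copies: int, cycles: int, efficiency: float = 1.0) -> float:
    """Expected copy number after `cycles` rounds of PCR.

    efficiency = 1.0 models perfect doubling per cycle (an idealization);
    lower values model a reaction that copies only a fraction of templates.
    """
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


if __name__ == "__main__":
    # Copies produced from a single template molecule under ideal doubling.
    for n in (10, 20, 30):
        print(n, f"{pcr_copies(1, n):.3g}")
```

Under ideal doubling, 30 cycles turn one molecule into roughly a billion copies, which is why a few template molecules are enough to start from.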
Deconstructing Sequencing
• DNA source: gel-purified fragment, cloning
product, random fragmentation.
• DNA Amplification: need enough to be able to detect
signal given off by base interrogation
• DNA Seq Method: Base interrogation method to
uniquely detect G,A,T,C bases.
• Sequence Positioning: Need an organizing principle to
place these bases into a sequence.
The methods presented here represent unique ways to solve each of these issues
Maxam-Gilbert 1975
Fragment
population
distribution
corresponds to
appearance of
base within
sequence
Maxam-Gilbert 1975
Chemical Sequencing
Issues:
• Need perfectly pure single species of DNA
• Nasty Chemicals
• Radioactive End-labeling
• 4-lanes/read
• Sequence only what you can purify
Advantages:
- First DNA sequencing method available
- 200-300 bp/read
Sanger “Sequencing-by-Synthesis” 1977
Issues:
- Radioactive End-labeling
- 4-lanes/read
- Sequencing gels
Advantages:
- 400-500 bp/read
- Radioactive Incorporation
- Primer gives you control
dNTP ddNTP
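How the fragment population encodes the sequence can be illustrated with a toy chain-termination model. It assumes an idealized gel in which every fragment length is resolved and each ddNTP lane is read separately; the function names are hypothetical.

```python
from collections import defaultdict


def terminated_fragments(synthesized: str) -> dict[str, list[int]]:
    """For each ddNTP lane, list the lengths of fragments ending in that base.

    A fragment of length n in lane X means base n of the synthesized
    strand is X, because a ddX terminator was incorporated there.
    """
    lanes = defaultdict(list)
    for position, base in enumerate(synthesized, start=1):
        lanes[base].append(position)
    return dict(lanes)


def read_gel(lanes: dict[str, list[int]]) -> str:
    """Read the lanes from shortest to longest fragment to recover the sequence."""
    by_length = {length: base for base, lengths in lanes.items() for length in lengths}
    return "".join(by_length[length] for length in sorted(by_length))


if __name__ == "__main__":
    lanes = terminated_fragments("GATTACA")
    print(lanes)            # fragment lengths observed in each lane
    print(read_gel(lanes))  # sequence reconstructed from fragment lengths
```

The dye-terminator chemistry on the following slide runs the same logic in a single lane, with the dye color taking the place of the lane identity.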
PCR Dye-Terminator 1990s
Issues:
- Sequencing gels
- 1 run/day
Advantages:
- 600-700 bp/read
- 96 reads/run
- Each terminator dye has a different
color. Lets you combine all 4 reactions
in one lane.
- Single lane/read
- Primer gives you control
Human Genome Project (15 years) Hierarchical
Shotgun Sequencing [start 1990]
- Randomly insert Human DNA into BAC clones (~150kbp each)
- Combine these BAC clones to create a scaffold of the human
genome. Each BAC clone will be mapped to a region on a
Human Chromosome
- Pass BAC clones to different Genome Centers throughout US
- At each center, each vector is sequenced using shotgun sequencing
- Wait 15 years for results.
Issues with Shotgun Sequencing
• Reads -> contigs -> scaffolds -> genome reconstruction
• Repeat regions can confuse Contig assemblers.
• It was hoped that by focusing each shotgun run to a single 40-150kb region, these
issues would be minimized.
• According to Venter, it simply multiplied the number of times one encountered the
same problem
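The reads-to-contigs step can be sketched with a naive greedy overlap merge. This is a toy illustration, not the assembler used by any of the projects mentioned: it assumes error-free reads on a single strand, and the names `overlap` and `greedy_assemble` are hypothetical.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of `a` that equals a prefix of `b`."""
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(reads: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of sequences with the largest overlap."""
    contigs = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    while True:
        best_k, best_i, best_j = 0, None, None
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            return contigs  # nothing left to merge
        merged = contigs[best_i] + contigs[best_j][best_k:]
        contigs = [c for idx, c in enumerate(contigs)
                   if idx not in (best_i, best_j)] + [merged]


if __name__ == "__main__":
    print(greedy_assemble(["ATGCGTAC", "GTACGTTAG", "TTAGCCA"]))
```

A repeat longer than the read length produces identical overlaps at several genomic locations, so a merge rule like this one cannot tell which join is correct. That is the confusion the slide refers to.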
Shotgun Sequencing: Venter 1997
Same approach is used throughout NGS
Paired-end sequencing:
1. Randomly cut genomic DNA.
2. Use gel purification to make three
libraries of random DNA fragments:
2kb, 10kb, 50kb
3. Sequence from both ends.
4. Use distance information to assemble
contigs into scaffolds.
Distance information allows you to
'jump' over repeat regions.
This approach allowed Venter to 'jump'
over the federal sequencing project
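How the distance information links two contigs can be sketched as a gap estimate. The coordinate convention here is an assumption made for illustration: each pair is given as the start of the forward read on contig A and the end coordinate of its mate on contig B, and `estimate_gap` is a hypothetical helper.

```python
from statistics import median


def estimate_gap(pairs: list[tuple[int, int]], contig_a_len: int, insert_size: int) -> float:
    """Estimate the unsequenced gap between contig A and contig B.

    Each pair is (start_on_a, end_on_b). The known library insert size
    spans: the tail of contig A after the read start, the gap, and the
    head of contig B up to the mate's end.
    """
    gaps = [insert_size - (contig_a_len - start_a) - end_b
            for start_a, end_b in pairs]
    return median(gaps)  # median is robust to a few mis-sized fragments


if __name__ == "__main__":
    # Three pairs from a 2 kb library bridging a 5000 bp contig A and contig B.
    pairs = [(4000, 600), (4500, 1100), (4200, 790)]
    print(estimate_gap(pairs, contig_a_len=5000, insert_size=2000))
```

When the gap holds a repeat, the pairs still fix the order and spacing of the two contigs even though the repeat itself cannot be assembled.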
NGS Revolution: Roche / 454 -> [2005]
ABI 3700 state of the art
in 1997
- 1 sample per rxn (96
rxns) in 2 hrs
- Each sample had to be
individually manipulated
454 solved both these problems
PPi + H+
Paired-end reads can be done by including both primers on each micro-bead
Emulsion PCR:
Roche / 454 -> [2005]
• emPCR: No need for
cells
• Each well is a single
sequencing run.
• Very fast reaction
Illumina [Solexa 2007]
No need for Cell-based
amplification
Bridge Amplification: PCR on
a surface
Illumina
Advantages:
• No need for cells
• Each cluster of DNA
molecules is a single reaction.
• Enormous amounts of reads
• Paired ends Sequence from
both sides.
Disadvantages:
• Slow
• Short reads
• Reagent costs
Ion Torrent/LifeTechnologies [2010]
Method:
• Emulsion PCR
• Each bead is placed in a
single well.
• CHEAP/Rugged
Disadvantages:
• Low density
• Sample prep
PPi + H+
ABI-SOLiD
Advantages:
• Extremely accurate
Disadvantages:
• Takes a long time
• Expensive reagent costs
12/cycles/position
Complete Genomics
Advantages:
• Whole genome in 3 months
• 40x coverage!!!
Disadvantages:
• Labor intensive
• Takes a long time: 3 months sample prep
• Expensive: $10-20k/GENOME
• No Instrument: CRO model
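What a figure like 40x coverage implies for read counts can be worked out with the usual mean-coverage relation, coverage = reads × read length / genome size. The sketch below assumes uniform sampling; the 3-gigabase genome size and the 100 bp read length are round illustrative numbers, not values from the slides.

```python
import math


def mean_coverage(n_reads: int, read_len: int, genome_size: int) -> float:
    """Average number of times each base is sequenced."""
    return n_reads * read_len / genome_size


def reads_needed(coverage: float, genome_size: int, read_len: int) -> int:
    """Reads required to reach a target mean coverage."""
    return math.ceil(coverage * genome_size / read_len)


if __name__ == "__main__":
    # Assumed: ~3e9 bp human genome, 100 bp reads.
    print(reads_needed(40, 3_000_000_000, 100))
```

Mean coverage says nothing about evenness, so some regions will be sampled far less often than the average.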
Helicos
Advantages:
• No amplification
Single Molecule Detection
Disadvantages:
• It doesn't work
8-10
days
PacBio
Key Factors:
• Zero-mode waveguide
• Zeptoliter vol
• Continuous process
• Lariat sequencing
• Low reagent costs
Disadvantages:
• Low number of reads
Next-Next Generation:
NanoPores
Illumina/Oxford Nanopore
Roche/IBM all-semiconductor
Stratos genomics
NabSys (Graphene monolayer)
Applications: Genome Sequencing
Sequencing of whole genomes: bacterial, animal, human.
De novo Genome Sequencing: Even with the large number of
reads, putting a genome together from raw sequence reads is still
a non-trivial task, due to sample prep and inherent complexity.
Re-sequencing:
Sequencing individual with a genetic disease in
order to find hereditary mutations.
Read depth allows one to compute allele frequencies.
454: Due to its long reads, this method is best for de novo.
Useful for scaffolding.
SOLiD, Illumina: used for re-sequencing
SOLiD: wins on accuracy, loses on
complexity/cost
Complete Genomics: CRO model, depth 40x
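Computing allele frequencies from read depth can be sketched as counting the bases that aligned reads report at one genomic position. The input format (a string of base calls, one per read) and the function name are assumptions for illustration; base qualities and sequencing errors are ignored.

```python
from collections import Counter


def allele_frequencies(pileup_bases: str) -> dict[str, float]:
    """Fraction of reads supporting each base at a single position."""
    counts = Counter(pileup_bases.upper())
    depth = sum(counts.values())
    if depth == 0:
        return {}
    return {base: n / depth for base, n in counts.items()}


if __name__ == "__main__":
    # 40 reads at one site: half report A, half report G (heterozygous site).
    print(allele_frequencies("AAAAAGGGGG" * 4))
```

At low depth the observed fractions are noisy, which is one reason re-sequencing projects aim for deep coverage.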
Applications: Exon Sequencing
Mutational screening: what are the mutations in the actual
coding regions?
Most heritable disease models have mutations in the
coding regions.
Use enrichment to focus sequencing to expressed space.
Then make as many reads as possible in order to
accurately compute mutations.
Illumina, 454, ABI
Enrichment: Microarrays are Not dead!
Why?:
In order to focus sequencing run on the
region you are interested in.
Ex:
• Expressed region of genome (1%)
• Genes of interest: mutational studies.
Four ways:
• Micro-droplet PCR: each droplet has
unique set of amplification primers.
• MIP-PCR
• On-chip enrichment, using
microarrays.
• On-bead enrichment: make oligo
pools, use them to capture targets for
sequencing.
Two approaches for finding causative mutation responsible
for Miller Syndrome
Sequence Whole Genome: Complete Genomics
• Sequenced Mother, Father and 2 kids (both affected) 1 kindred
• Regions where they share both copies from parents (22%)
• Both diseases are rare: look for locations with low prevalence
SNPs (dbSNP)
• Narrowed down to 4 genes
• 2 of these were found to be causative agent in exome sequencing
study
Exome Array: Just sequence expressed sequence space
(1%): Illumina GAII
• Sequenced genomes from 4 affected individuals in 3
kindreds
• Found 4600 mutants
• Ignored any previously discovered SNPs from dbSNP
• Looked for mutations that appeared in all 3 kindreds
• Focused on damaging mutations: non-synonymous, stop
codon
• Discovered causative locus by elimination
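The elimination strategy in the exome study can be sketched as a chain of set filters. This is an illustration only: the `Variant` record, the effect labels, and the placeholder gene names are hypothetical, and the sharing step is applied at the gene level, which is an assumption about how "appeared in all 3 kindreds" was evaluated.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    gene: str        # placeholder gene name
    variant_id: str  # placeholder identifier
    effect: str      # e.g. "nonsynonymous", "stop_gained", "synonymous"


DAMAGING = {"nonsynonymous", "stop_gained"}


def candidate_genes(kindreds: list[list[Variant]], known_snps: set[str]) -> set[str]:
    """Genes carrying a novel, damaging variant in every kindred."""
    if not kindreds:
        return set()
    per_kindred = [
        {v.gene for v in variants
         if v.variant_id not in known_snps and v.effect in DAMAGING}
        for variants in kindreds
    ]
    return set.intersection(*per_kindred)


if __name__ == "__main__":
    k1 = [Variant("GENE_A", "v1", "nonsynonymous"), Variant("GENE_B", "v2", "stop_gained")]
    k2 = [Variant("GENE_B", "v3", "nonsynonymous"), Variant("GENE_C", "v4", "synonymous")]
    k3 = [Variant("GENE_B", "v5", "nonsynonymous"), Variant("GENE_A", "v6", "nonsynonymous")]
    print(candidate_genes([k1, k2, k3], known_snps={"v6"}))
```

Each filter shrinks the candidate list, so thousands of raw variants can collapse to a handful of genes.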
Applications: RNA-Seq
Microarrays are Dead!
Don't have to design probes ahead of time, just sequence
mRNA and count number of sequences for each gene.
Read count ~ Expression level
In environmental genomics, sequencing can be used to
determine which genes are being expressed in a sample.
Illumina: Only method that has the read depth to get
useful spread between high and low-expressed
genes.
Its Dynamic Range far surpasses microarrays in this
respect, especially for smaller genomes.
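The "read count ~ expression level" idea can be sketched as counting reads per gene and scaling by library size. The sketch assumes each read has already been assigned to a gene by an aligner; counts per million is one common normalization chosen here for illustration, and gene length is not corrected for.

```python
from collections import Counter


def expression_counts(read_gene_assignments: list[str]) -> Counter:
    """Number of reads assigned to each gene."""
    return Counter(read_gene_assignments)


def counts_per_million(counts: Counter) -> dict[str, float]:
    """Scale raw counts so that libraries of different depth are comparable."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gene: n * 1_000_000 / total for gene, n in counts.items()}


if __name__ == "__main__":
    reads = ["geneA", "geneA", "geneA", "geneB"]  # placeholder gene names
    print(counts_per_million(expression_counts(reads)))
```

Because the measurement is a count, the achievable dynamic range grows with sequencing depth: rare transcripts become visible only once enough reads are collected.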
Applications: ChIP-Seq
ChIP: Chromatin Immunoprecipitation
Illumina, ABI-SOLiD
Where does my DNA-
binding transcription factor
bind within the genome?
Environmental Genomics
GAM: Genome Annotation Machine:
• Genome Annotation
• Gene Identification
• Comparative Genomics
• Functional characterization
• Phylogenetic char.
• Protein Structural char.
Who / What
Summary
