Surya Saha
Cornell University & Boyce Thompson Institute
suryasaha@cornell.edu @SahaSurya
Directorate of Soybean Research, Indore
June 7, 2014
Slides: http://bit.ly/Soybean_Indore_2014
http://www.acgt.me/blog/2014/3/7/next-generation-sequencing-must-die
6/6/2014 Directorate of Soybean Research, Indore 2
You are free to:
Copy, share, adapt, or re-mix;
Photograph, film, or broadcast;
Blog, live-blog, or post video of;
This presentation. Provided that:
You attribute the work to its author and respect the rights
and licenses associated with its components.
Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero. Social Media Icons adapted with
permission from originals by Christopher Ross. Original images are available under GPL at
http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
Sequencing over the Ages
1953: DNA structure discovery
1977: Sanger DNA sequencing by chain-terminating inhibitors
1984: Epstein-Barr virus (170 Kb)
1987: ABI 370 Sequencer
1995: Haemophilus influenzae (1.83 Mb)
2001: Homo sapiens (3.0 Gb)
2005 onward (timeline marks at 2005, 2007, 2011, 2012, 2013): 454, Solexa, SOLiD, Illumina, Ion Torrent, PacBio, Illumina HiSeq X
Slide credit: Aureliano Bombarely
The Next Generation
2014: Pinus taeda (24 Gb), MinION
It's all about the $£€¥
http://www.genome.gov/sequencingcosts/
First generation sequencing
Sanger method
Frederick Sanger
13 Aug 1918 – 19 Nov 2013
Won the Nobel Prize for Chemistry in 1958 and
1980. Published the dideoxy chain termination
method or “Sanger method” in 1977
http://dailym.ai/1f1XeTB
Sanger method
http://bit.ly/1g6Cudq
http://bit.ly/1lcQO4J
First generation sequencing
• Very high quality sequences (99.999%)
• Very low throughput
Platform                           Run Time   Read Length   Reads / Run   Total nucleotides sequenced   Cost / Mb
Capillary Sequencing (ABI3730xl)   20m-3h     400-900 bp    96 or 384     1.9-84 Kb                     $2400
http://bit.ly/1clLps3
http://1.usa.gov/1cLqIRd
Next generation sequencing
http://bit.ly/1keDtZQ
• Second generation
• Third generation
• Fourth generation
• Next-next-generation
• Next-next-next-generation
http://www.acgt.me/blog/2014/3/10/next-generation-sequencing-must-diepart-2
Name the specific technology used to generate the data
– Illumina HiSeq/MiSeq/NextSeq
– Pacific Biosciences RS I/RS II
– Ion Torrent Proton/PGM
– SOLiD
– 454
http://www.acgt.me/blog/2014/3/10/next-generation-sequencing-must-diepart-2
454 Pyrosequencing
One purified DNA fragment, to one bead, to one read.
http://bit.ly/1ehwxWN
GS FLX
Titanium
http://bit.ly/1ehAcEh
Illumina
Output            15 Gb        120 Gb        1000 Gb                             1800 Gb
Number of Reads   25 Million   400 Million   4 Billion                           6 Billion
Read Length       2x300 bp     2x150 bp      2x125 bp (2x250 update mid-2014)    2x150 bp
Cost              $99K         $250K         $740K                               $10M
Source: Illumina
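The output figures in this table can be reproduced from the read counts and read lengths. The sketch below is only a worked check of that arithmetic. The column labels A to D are placeholders, since the instrument names are not part of the table text. Treating "Number of Reads" as read pairs is an assumption, chosen because it reproduces the quoted outputs.

```python
# Read counts and read lengths as quoted in the table (source: Illumina).
# Keys "A".."D" are placeholder labels for the four table columns.
COLUMNS = {
    "A": {"read_pairs": 25_000_000, "read_length_bp": 300},
    "B": {"read_pairs": 400_000_000, "read_length_bp": 150},
    "C": {"read_pairs": 4_000_000_000, "read_length_bp": 125},
    "D": {"read_pairs": 6_000_000_000, "read_length_bp": 150},
}


def paired_end_output_gb(read_pairs: int, read_length_bp: int) -> float:
    """Total sequenced bases in gigabases (Gb) for a 2 x read_length run."""
    total_bases = read_pairs * 2 * read_length_bp
    return total_bases / 1_000_000_000


outputs_gb = {name: paired_end_output_gb(**spec) for name, spec in COLUMNS.items()}

for name, gb in outputs_gb.items():
    print(f"column {name}: {gb:.0f} Gb")
```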
Illumina
$1000 human genome??
Illumina
http://1.usa.gov/1fP9ybl
Illumina: Moleculo
http://bit.ly/1aEPOBn
Pacific Biosciences SMRT sequencing
Single Molecule Real Time sequencing
http://bit.ly/1naxgTe
Pacific Biosciences SMRT sequencing
Error correction methods
Hierarchical genome-assembly process (HGAP)
PBJelly (English et al., PLOS ONE, 2012)
Pacific Biosciences SMRT sequencing
Read Lengths
http://www.igs.umaryland.edu/labs/grc/
Mean Read Length: 8391 bp
Maximum Subread Length: 24585 bp
Oxford Nanopore
https://www.nanoporetech.com/
• No data yet
• Error model
http://erlichya.tumblr.com/post/66376172948/hands-on-experience-with-oxford-nanopore-minion
Others
• Ion Torrent Proton/PGM
• Nabsys
• SOLiD
Comparison
Next generation sequencing
Platform               Run Time   Read Length   Quality                        Total nucleotides sequenced   Cost / Mb
454 Pyrosequencing     24h        700 bp        Q20-Q30                        0.7 Gb                        $10
Illumina MiSeq         27h        2x250 bp      >Q30                           15 Gb                         $0.15
Illumina HiSeq 2500    11 days    2x125 bp      >Q30                           1000 Gb                       $0.05
Ion Torrent            2h         400 bp        >Q20                           50 Mb-1 Gb                    $1
Pacific Biosciences    2h         5.5-8.5 kb    >Q30 consensus, >Q10 single    400-800 Mb / SMRT cell        $0.33-$1
http://bit.ly/1clLps3
http://1.usa.gov/1cLqIRd
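The cost column can be turned into a rough per-genome estimate. The sketch below multiplies genome size, target coverage and cost per megabase, using the per-megabase prices from the comparison table and the 3.0 Gb Homo sapiens genome size from the timeline slide. The 30x coverage is an assumption for illustration, and the function name is hypothetical.

```python
# Cost per megabase (USD) as quoted in the comparison table.
# For Pacific Biosciences the table gives a range; the upper bound is used here.
COST_PER_MB_USD = {
    "454 Pyrosequencing": 10.0,
    "Illumina MiSeq": 0.15,
    "Illumina HiSeq 2500": 0.05,
    "Ion Torrent": 1.0,
    "Pacific Biosciences": 1.0,
}


def sequencing_cost_usd(genome_size_mb: float, coverage: float, cost_per_mb: float) -> float:
    """Raw sequencing cost: megabases needed times the cost per megabase."""
    return genome_size_mb * coverage * cost_per_mb


GENOME_SIZE_MB = 3000.0  # Homo sapiens, 3.0 Gb
COVERAGE = 30.0          # assumed target depth, not taken from the slides

estimates = {
    platform: sequencing_cost_usd(GENOME_SIZE_MB, COVERAGE, cost)
    for platform, cost in COST_PER_MB_USD.items()
}

for platform, usd in sorted(estimates.items(), key=lambda item: item[1]):
    print(f"{platform:22s} ${usd:,.0f}")
```

This counts only the per-base sequencing price. Library preparation, storage and analysis are not included.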
http://omicsmaps.com/
Next Generation Genomics:
World Map of High-throughput Sequencers
http://bit.ly/18pfUId
Real cost of Sequencing!!
Sboner et al., Genome Biology, 2011
Library Types
Single end
Paired end (PE, 150-800 bp, Fwd:/1, Rev:/2)
Mate pair (MP, 2 Kb to 20 Kb)
[Diagram: forward (F) and reverse (R) read orientations for each library type, with 454/Roche and Illumina variants]
Slide credit: Aureliano Bombarely
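The "Fwd:/1, Rev:/2" note refers to the convention of marking the two mates of a pair with /1 and /2 at the end of the read name. A minimal sketch of grouping reads into pairs on that basis is given below. The read names are made up, and the function is a hypothetical helper rather than part of any particular toolkit.

```python
from collections import defaultdict


def pair_reads(read_names):
    """Group read names by template, using the /1 (forward) and /2 (reverse) suffixes.

    Returns (pairs, orphans): pairs maps template name -> (forward, reverse);
    orphans lists reads whose mate is missing or whose name has no /1 or /2 suffix.
    """
    by_template = defaultdict(dict)
    orphans = []
    for name in read_names:
        template, sep, mate = name.rpartition("/")
        if sep and mate in ("1", "2"):
            by_template[template][mate] = name
        else:
            orphans.append(name)

    pairs = {}
    for template, mates in by_template.items():
        if "1" in mates and "2" in mates:
            pairs[template] = (mates["1"], mates["2"])
        else:
            orphans.extend(mates.values())
    return pairs, orphans


# Hypothetical read names for illustration only.
names = ["read_001/1", "read_001/2", "read_002/1", "read_003/2", "read_003/1"]
pairs, orphans = pair_reads(names)
print(pairs)
print(orphans)
```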
Implications of Choice of Library
Reads -> Consensus sequence (Contig)
Contigs + Pair Read information -> Scaffold (or Supercontig), with gaps shown as NNNNN
Scaffolds + Genetic information (markers) -> Pseudomolecule (or ultracontig)
Slide credit: Aureliano Bombarely
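A scaffold is an ordered and oriented set of contigs, with the unsequenced stretch between neighbours written as a run of N. The sketch below shows only that final joining step. The contig sequences, orientations and gap sizes are invented for illustration. Real scaffolders infer them from paired-read information.

```python
def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA sequence made of A, C, G, T and N."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def build_scaffold(placements, contigs):
    """Join contigs into one scaffold sequence.

    placements: list of (contig_name, orientation, gap_after), where
    orientation is "+" or "-" and gap_after is the estimated number of
    unknown bases before the next contig (ignored for the last contig).
    """
    parts = []
    for index, (name, orientation, gap_after) in enumerate(placements):
        seq = contigs[name].upper()
        if orientation == "-":
            seq = reverse_complement(seq)
        parts.append(seq)
        if index < len(placements) - 1:
            parts.append("N" * gap_after)
    return "".join(parts)


# Hypothetical contigs and gap estimates for illustration only.
contigs = {"contig1": "ACGTAC", "contig2": "GGATCC", "contig3": "TTGA"}
placements = [("contig1", "+", 5), ("contig3", "-", 2), ("contig2", "+", 0)]
scaffold = build_scaffold(placements, contigs)
print(scaffold)
```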
Quality control: Encoding
http://bit.ly/N28yUd
The Phred score of a base is:
Q_phred = -10 log10(e)
where e is the estimated probability of the base being incorrect.
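The formula can be checked with a few lines of Python. The sketch below converts between error probability and Phred score and decodes a FASTQ quality string. The ASCII offsets (33 and 64) come from common FASTQ conventions rather than from this slide, and the quality string is made up.

```python
import math


def phred_from_error(e: float) -> float:
    """Q = -10 * log10(e), where e is the probability that the base call is wrong."""
    if not 0.0 < e <= 1.0:
        raise ValueError("error probability must be in (0, 1]")
    return -10.0 * math.log10(e)


def error_from_phred(q: float) -> float:
    """Inverse of the formula above: e = 10 ** (-Q / 10)."""
    return 10.0 ** (-q / 10.0)


def decode_quality_string(qualities: str, offset: int = 33) -> list:
    """Turn a FASTQ quality string into Phred scores.

    offset=33 is the Sanger / Illumina 1.8+ encoding; older Illumina
    pipelines (1.3 to 1.7) used offset=64.
    """
    return [ord(char) - offset for char in qualities]


print(round(phred_from_error(0.001)))   # error rate of 1 in 1000
print(error_from_phred(20))             # Q20 as an error probability
print(decode_quality_string("II5+!"))   # hypothetical quality string
```

On this scale, the 99.999% accuracy quoted for Sanger reads corresponds to an error probability of 0.00001, or about Q50.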
Which technology to use??
• Microbial genomes
• Eukaryotic genomes
• Resequencing genomes
• RNAseq and other XXXseq methods
http://bit.ly/1ko9Kgh
SOL Genomics Network
The SGN Team!!
Surya Saha, Tom Fisher-York, Hartmut Foerster, Suzy Strickler, Jeremy Edwards,
Noe Fernandez, Naama Menda, Aure Bombarely, Aimin Yan, Isaak Tecle
What's new on SGN?
• Tomato genome release 2.5
• Incorporates results from FISH
• Nicotiana benthamiana genome sequence
• Genome sequence and annotation
• VIGS Tool
• Select specific probes for VIGS
• New BLAST interface
• New Breeder functions
• Later this year: Tomato genome release 3.0
SGN Website
http://solgenomics.net
Main web page (front page):
WEB ICONS
TOOL BAR
Main web page (front page):
TOOL BAR
(MENUS)
But the DATA can also be edited
Locus, Locus Editor, Data
Community Data Curation
You need
• An SGN account.
• Submitter / Locus Editor privileges, activated by an SGN curator
Locus, Locus Editor, Data
Tools
Genome Browser
Genomes in SGN
CassavaBase
Cassava
● Tropical and subtropical regions
● Mainly grown for starchy roots
● Native to South America
● Major crop in Africa
● Food for 500 million people around the world
● Clonally propagated
● Accumulates toxic cyanogenic glucosides
● Requires processing before consumption
NextGen Cassava Project
● Project: Adapt SGN database for Cassava Breeding
● Goal: Apply Genomic Selection to cassava breeding
● Predict breeding values from genotype information
● Shorten the breeding cycle
● Massive amounts of genotypic data (GBS)
● Phenotypic data
● Data management challenge
● Improve flowering
● http://nextgencassava.org
CassavaBase
http://cassavabase.org/
SGN/Cassavabase behind the scenes
● Perl/Catalyst MVC Framework
● PostgreSQL Database
● Generic Model Organism Database (GMOD)
– Chado relational database schema
– GBrowse
– JBrowse
● R
– Experimental design
– QTL mapping
– Genomic selection
Objectives
Provide cassava breeders and researchers access to data and tools in a centralized, user-friendly and reliable database.
– Improve partner breeding program information tracking
– Streamline management of genotypic and phenotypic data
– Pipeline genotypic and phenotypic data through Genomic Selection prediction analyses
Genomic Selection
The 'training population' is genotyped and phenotyped to 'train'
the genomic selection (GS) prediction model. Genotypic
information from the breeding material is then fed into the
model to calculate genomic estimated breeding values (GEBV)
for these lines. From Heffner et al. 2009 Crop Sci. 49:1–12
Information from a majority of lines in the breeding population (the training set) is used to create the
prediction model. The model is then used to predict the phenotypes of the remaining lines (the validation
set), using genotypic information only. The results from the model are compared to the actual data to give
the prediction accuracy. Image courtesy of Martha Hamblin, Cornell University
Flow diagram of a genomic selection breeding program.
Breeding cycle time is shortened by removing phenotypic
evaluation of lines before selection as parents for the next
cycle. From Heffner et al. 2009 Crop Sci. 49:1–12
Slide credit: Jeremy Edwards
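The training and validation scheme described above can be sketched on simulated data. The slides do not name a prediction model, so this example uses ridge regression on marker genotypes as a stand-in. The data, the penalty value and all function names are assumptions made for illustration.

```python
import random


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        known = sum(aug[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (aug[row][n] - known) / aug[row][row]
    return x


def fit_ridge(genotypes, phenotypes, penalty):
    """Estimate marker effects by ridge regression on centered data."""
    n, m = len(genotypes), len(genotypes[0])
    marker_means = [sum(row[j] for row in genotypes) / n for j in range(m)]
    pheno_mean = sum(phenotypes) / n
    x = [[row[j] - marker_means[j] for j in range(m)] for row in genotypes]
    y = [value - pheno_mean for value in phenotypes]
    xtx = [[sum(x[i][a] * x[i][b] for i in range(n)) + (penalty if a == b else 0.0)
            for b in range(m)] for a in range(m)]
    xty = [sum(x[i][a] * y[i] for i in range(n)) for a in range(m)]
    return solve_linear_system(xtx, xty), marker_means, pheno_mean


def predict_gebv(genotypes, effects, marker_means, pheno_mean):
    """Genomic estimated breeding values from genotypes only."""
    return [pheno_mean + sum(e * (g - mu) for e, g, mu in zip(effects, row, marker_means))
            for row in genotypes]


def pearson(u, v):
    """Pearson correlation, used here as the prediction accuracy."""
    mean_u, mean_v = sum(u) / len(u), sum(v) / len(v)
    cov = sum((p - mean_u) * (q - mean_v) for p, q in zip(u, v))
    var_u = sum((p - mean_u) ** 2 for p in u)
    var_v = sum((q - mean_v) ** 2 for q in v)
    return cov / (var_u * var_v) ** 0.5


# Simulated lines: genotypes coded 0/1/2, phenotype = additive effects + noise.
rng = random.Random(42)
N_LINES, N_MARKERS, N_TRAIN = 120, 10, 90
true_effects = [rng.gauss(0.0, 1.0) for _ in range(N_MARKERS)]
genotypes = [[rng.choice((0, 1, 2)) for _ in range(N_MARKERS)] for _ in range(N_LINES)]
phenotypes = [sum(e * g for e, g in zip(true_effects, row)) + rng.gauss(0.0, 0.5)
              for row in genotypes]

# Training set: genotyped and phenotyped. Validation set: predicted from genotypes.
train_g, valid_g = genotypes[:N_TRAIN], genotypes[N_TRAIN:]
train_y, valid_y = phenotypes[:N_TRAIN], phenotypes[N_TRAIN:]

effects, marker_means, pheno_mean = fit_ridge(train_g, train_y, penalty=1.0)
gebv = predict_gebv(valid_g, effects, marker_means, pheno_mean)
accuracy = pearson(gebv, valid_y)
print(f"prediction accuracy (r) on the validation set: {accuracy:.2f}")
```

Real breeding data have far more markers than lines, which is why a penalized or mixed-model approach is needed. The small simulated set here only shows the workflow.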
Data collection in the field
● Android tablets
● Field Book app
– Jesse Poland's group at USDA-ARS / Kansas State University
Slide credit: Jeremy Edwards
Genotyping by sequencing (GBS)
● TASSEL 4 pipeline from Ed Buckler's group
● Discovery vs production
● Filtering
● Imputation
● Storing in Cassavabase
Slide credit: Jeremy Edwards
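The filtering and imputation steps listed for GBS data can be illustrated with a toy genotype matrix. This is not the TASSEL pipeline. The missing-rate and minor-allele-frequency thresholds, the marker-mean imputation and the data are all assumptions chosen to keep the sketch short.

```python
def filter_markers(matrix, max_missing=0.2, min_maf=0.05):
    """Return indexes of marker columns passing missing-rate and MAF filters.

    matrix: rows are lines, columns are markers; genotypes are coded as
    0, 1 or 2 copies of the alternate allele, with None for a missing call.
    """
    kept = []
    n_lines = len(matrix)
    for j in range(len(matrix[0])):
        calls = [row[j] for row in matrix if row[j] is not None]
        missing_rate = 1.0 - len(calls) / n_lines
        if not calls or missing_rate > max_missing:
            continue
        alt_freq = sum(calls) / (2.0 * len(calls))
        maf = min(alt_freq, 1.0 - alt_freq)
        if maf >= min_maf:
            kept.append(j)
    return kept


def impute_with_marker_mean(matrix, columns):
    """Keep only the selected columns and replace missing calls by the marker mean."""
    means = {}
    for j in columns:
        calls = [row[j] for row in matrix if row[j] is not None]
        means[j] = sum(calls) / len(calls)
    return [[row[j] if row[j] is not None else means[j] for j in columns]
            for row in matrix]


# Hypothetical genotype calls: 5 lines x 4 markers.
raw = [
    [0, 2, None, 0],
    [1, 2, None, 0],
    [2, 2, 1, 0],
    [None, 2, None, 0],
    [1, 2, 0, 1],
]
kept = filter_markers(raw, max_missing=0.25, min_maf=0.05)
clean = impute_with_marker_mean(raw, kept)
print(kept)
print(clean)
```

In this toy matrix the second marker is dropped because it is monomorphic and the third because most of its calls are missing.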
SolGS: A tool for genomic selection
Phenotyped & Genotyped Lines -> Prediction Model
Genotyped Lines -> Prediction Model -> Predicted Breeding Values
Slide credit: Jeremy Edwards
Cassava Trait Ontology
Kulakow et al. 2011
● Standard terminology
● Facilitate the sharing of information
● Allow users to query keywords related to traits
Slide credit: Jeremy Edwards
Position available at Solgenomics
Cassavabase project
Plant Breeding + Bioinformatician
● Familiar with breeding
● Programming in Perl, R, SQL, Hadoop
● Linux
● Africa
● Genius
http://www.cassavabase.org/forum/posts.pl?topic_id=9
Thank you!!
Questions??
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
 
办理麦克马斯特大学毕业证成绩单|购买加拿大文凭证书
办理麦克马斯特大学毕业证成绩单|购买加拿大文凭证书办理麦克马斯特大学毕业证成绩单|购买加拿大文凭证书
办理麦克马斯特大学毕业证成绩单|购买加拿大文凭证书
 
Gas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptx
 
Loudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptxLoudspeaker- direct radiating type and horn type.pptx
Loudspeaker- direct radiating type and horn type.pptx
 
Abnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptxAbnormal LFTs rate of deco and NAFLD.pptx
Abnormal LFTs rate of deco and NAFLD.pptx
 
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editingBase editing, prime editing, Cas13 & RNA editing and organelle base editing
Base editing, prime editing, Cas13 & RNA editing and organelle base editing
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
PROJECTILE MOTION-Horizontal and Vertical
PROJECTILE MOTION-Horizontal and VerticalPROJECTILE MOTION-Horizontal and Vertical
PROJECTILE MOTION-Horizontal and Vertical
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
 
The Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and FunctionThe Sensory Organs, Anatomy and Function
The Sensory Organs, Anatomy and Function
 
Quarter 4_Grade 8_Digestive System Structure and Functions
Quarter 4_Grade 8_Digestive System Structure and FunctionsQuarter 4_Grade 8_Digestive System Structure and Functions
Quarter 4_Grade 8_Digestive System Structure and Functions
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 

ICAR Soybean Indore 2014

  • 1. Surya Saha, Cornell University & Boyce Thompson Institute, suryasaha@cornell.edu, @SahaSurya. Directorate of Soybean Research, Indore, June 7, 2014. Slides: http://bit.ly/Soybean_Indore_2014 http://www.acgt.me/blog/2014/3/7/next-generation-sequencing-must-die
  • 2. 6/6/2014 Directorate of Soybean Research, Indore 2 You are free to: Copy, share, adapt, or re-mix; Photograph, film, or broadcast; Blog, live-blog, or post video of; This presentation. Provided that: You attribute the work to its author and respect the rights and licenses associated with its components. Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero. Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
  • 3. Sequencing
  • 4. Sequencing over the Ages (timeline figure, 1953 to 2014). Milestones: DNA structure discovery (1953); Sanger DNA sequencing by chain-terminating inhibitors (1977); Epstein-Barr virus genome, 170 Kb (1984); ABI 370 sequencer (1987); Haemophilus influenzae genome, 1.83 Mb (1995); Homo sapiens genome, 3.0 Gb (2001); platforms 454, Solexa, SOLiD, Illumina, Ion Torrent and PacBio (2005 to 2013); Pinus taeda genome, 24 Gb, Illumina HiSeq X and MinION, "The Next Generation" (2014). Slide credit: Aureliano Bombarely
  • 5. It's all about the $£€¥ http://www.genome.gov/sequencingcosts/
  • 6. First generation sequencing
  • 7. Sanger method. Frederick Sanger, 13 Aug 1918 – 19 Nov 2013. Won the Nobel Prize for Chemistry in 1958 and 1980. Published the dideoxy chain termination method or “Sanger method” in 1977. http://dailym.ai/1f1XeTB
  • 8. Sanger method http://bit.ly/1g6Cudq http://bit.ly/1lcQO4J
  • 9. First generation sequencing • Very high quality sequences (99.999%) • Very low throughput
      Capillary Sequencing (ABI3730xl): run time 20 min to 3 h; read length 400-900 bp; reads per run 96 or 384; total nucleotides sequenced 1.9-84 Kb; cost per Mb $2400
      http://bit.ly/1clLps3 http://1.usa.gov/1cLqIRd
  • 10. Next generation sequencing
  • 11. http://bit.ly/1keDtZQ • Second generation • Third generation • Fourth generation • Next-next-generation • Next-next-next generation http://www.acgt.me/blog/2014/3/10/next-generation-sequencing-must-diepart-2
  • 12. Name the specific technology used to generate the data: – Illumina HiSeq/MiSeq/NextSeq – Pacific Biosciences RS I/RS II – Ion Torrent Proton/PGM – SOLiD – 454 http://www.acgt.me/blog/2014/3/10/next-generation-sequencing-must-diepart-2
  • 13. 454 Pyrosequencing: one purified DNA fragment, to one bead, to one read. http://bit.ly/1ehwxWN GS FLX Titanium http://bit.ly/1ehAcEh
  • 14. Illumina (four instruments, one per column)
      Output: 15 Gb | 120 Gb | 1000 Gb | 1800 Gb
      Number of reads: 25 million | 400 million | 4 billion | 6 billion
      Read length: 2x300 bp | 2x150 bp | 2x125 bp (2x250 update mid-2014) | 2x150 bp
      Cost: $99K | $250K | $740K | $10M
      Source: Illumina
  • 15. Illumina: the same table as slide 14, now with the question "$1000 human genome??" Source: Illumina
  • 16. Illumina http://1.usa.gov/1fP9ybl
  • 17. Illumina: Moleculo http://bit.ly/1aEPOBn
  • 18. Pacific Biosciences SMRT sequencing: Single Molecule Real Time sequencing http://bit.ly/1naxgTe
  • 19. Pacific Biosciences SMRT sequencing: Error correction methods • Hierarchical genome-assembly process (HGAP) • PBJelly (English et al., PLOS One, 2012)
  • 20. Pacific Biosciences SMRT sequencing: Read Lengths. Mean Read Length: 8391 bp. Maximum Subread Length: 24585 bp. http://www.igs.umaryland.edu/labs/grc/
  • 21. Oxford Nanopore https://www.nanoporetech.com/ • No data yet • Error model http://erlichya.tumblr.com/post/66376172948/hands-on-experience-with-oxford-nanopore-minion
  • 22. Others • Ion Torrent Proton/PGM • Nabsys • SOLiD
  • 23. Comparison
  • 24. Next generation sequencing (columns: run time; read length; quality; total nucleotides sequenced; cost per Mb)
      454 Pyrosequencing: 24 h; 700 bp; Q20-Q30; 0.7 Gb; $10
      Illumina MiSeq: 27 h; 2x250 bp; >Q30; 15 Gb; $0.15
      Illumina HiSeq 2500: 11 days; 2x125 bp; >Q30; 1000 Gb; $0.05
      Ion Torrent: 2 h; 400 bp; >Q20; 50 Mb-1 Gb; $1
      Pacific Biosciences: 2 h; 5.5-8.5 kb; >Q30 consensus, >Q10 single read; 400-800 Mb per SMRT cell; $0.33-$1
      http://bit.ly/1clLps3 http://1.usa.gov/1cLqIRd
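The cost-per-Mb figures in slide 24 can be turned into a rough per-genome estimate. The sketch below is only back-of-the-envelope arithmetic: the 30x coverage is an assumed depth (it is not stated in the slides), the genome size is the 3.0 Gb Homo sapiens figure from slide 4, and the function name `sequencing_cost` is hypothetical. It ignores library preparation, labour and analysis, which slide 27 argues are a large part of the real cost.

```python
# Cost per Mb taken from the slide 24 comparison (US dollars).
COST_PER_MB = {
    "454 Pyrosequencing": 10.0,
    "Illumina MiSeq": 0.15,
    "Illumina HiSeq 2500": 0.05,
    "Ion Torrent": 1.0,
}


def sequencing_cost(genome_size_mb, coverage, cost_per_mb):
    """Estimated raw sequencing cost in dollars: bases needed times price per Mb."""
    return genome_size_mb * coverage * cost_per_mb


HUMAN_GENOME_MB = 3000  # Homo sapiens, 3.0 Gb (slide 4)
ASSUMED_COVERAGE = 30   # assumption for illustration only

for platform, price in COST_PER_MB.items():
    cost = sequencing_cost(HUMAN_GENOME_MB, ASSUMED_COVERAGE, price)
    print(f"{platform}: ~${cost:,.0f}")
```

Under these assumptions the per-genome cost differs by more than two orders of magnitude between platforms, which is the point the comparison table makes.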
  • 25. Next Generation Genomics: World Map of High-throughput Sequencers http://omicsmaps.com/
  • 26. http://bit.ly/18pfUId
  • 27. Real cost of Sequencing!! Sboner et al., Genome Biology, 2011
  • 28. Library Types • Single end • Paired end (PE, 150-800 bp, Fwd: /1, Rev: /2) • Mate pair (MP, 2 Kb to 20 Kb). (Diagram of forward (F) and reverse (R) read orientations for 454/Roche and Illumina libraries.) Slide credit: Aureliano Bombarely
  • 29. Implications of Choice of Library. Reads are assembled into a consensus sequence (contig); paired-read information links contigs into a scaffold (or supercontig), with gaps shown as NNNNN; genetic information (markers) orders scaffolds into a pseudomolecule (or ultracontig). Slide credit: Aureliano Bombarely
  • 30. Quality control: Encoding. The Phred score of a base is Q_phred = -10 log10(e), where e is the estimated probability of the base being incorrect. http://bit.ly/N28yUd
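The Phred formula on slide 30 can be sketched in a few lines of Python. This is a minimal illustration, not part of the original talk: the function names are hypothetical, and the ASCII offset of 33 assumes the Sanger / Illumina 1.8+ FASTQ encoding (older Illumina pipelines used an offset of 64).

```python
import math


def error_to_phred(e):
    """Q = -10 * log10(e), where e is the probability that the base call is wrong."""
    return -10 * math.log10(e)


def phred_to_error(q):
    """Invert the Phred formula: e = 10 ** (-Q / 10)."""
    return 10 ** (-q / 10)


def decode_quality(quality_string, offset=33):
    """Convert a FASTQ quality string to Phred scores (offset 33 is an assumption)."""
    return [ord(char) - offset for char in quality_string]


# Q20 means 1 error in 100 calls, Q30 means 1 in 1000.
for q in (10, 20, 30):
    print(f"Q{q}: error probability {phred_to_error(q):.4f}")

# 'I' is ASCII 73, '5' is 53, '+' is 43, so with offset 33 the scores are 40, 20, 10.
print(decode_quality("I5+"))
```

The thresholds quoted in the platform comparison (Q20, Q30) map directly onto these error rates, which is why read trimming is usually expressed as a minimum Phred score.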
  • 31. Which technology to use?? • Microbial genomes • Eukaryotic genomes • Resequencing genomes • RNAseq and other XXXseq methods http://bit.ly/1ko9Kgh
  • 32. SOL Genomics Network
  • 33.
  • 34. The SGN Team!! Surya Saha, Tom Fisher-York, Hartmut Foerster, Suzy Strickler, Jeremy Edwards, Noe Fernandez, Naama Menda, Aure Bombarely, Aimin Yan, Isaak Tecle
  • 35. What's new on SGN? • Tomato genome release 2.5 (incorporates results from FISH) • Nicotiana benthamiana genome (sequence and annotation) • VIGS Tool (select specific probes for VIGS) • New BLAST interface • New Breeder functions • Later this year: Tomato genome release 3.0
  • 36. SGN Website http://solgenomics.net
  • 37. Main web page (front page): WEB ICONS TOOL BAR
  • 38. Main web page (front page): TOOL BAR (MENUS)
  • 39. But the DATA can also be edited: Community Data Curation (Locus; Locus Editor; Data)
  • 40. You need: • An SGN account • Submitter / Locus Editor privileges, activated by an SGN curator (Locus; Locus Editor; Data)
  • 41. Tools
  • 42. Genome Browser
  • 43. Genomes in SGN
  • 44.
  • 45. CassavaBase
  • 46. Cassava ● Tropical and subtropical regions ● Mainly grown for starchy roots ● Native to South America ● Major crop in Africa ● Food for 500 million people around the world ● Clonally propagated ● Accumulates toxic cyanogenic glucosides ● Requires processing before consumption
  • 47. NextGen Cassava Project ● Project: Adapt SGN database for Cassava Breeding ● Goal: Apply Genomic Selection to cassava breeding ● Predict breeding values from genotype information ● Shorten the breeding cycle ● Massive amounts of genotypic data (GBS) ● Phenotypic data ● Data management challenge ● Improve flowering ● http://nextgencassava.org
  • 48. CassavaBase http://cassavabase.org/
  • 49. SGN/Cassavabase behind the scenes ● Perl/Catalyst MVC Framework ● PostgreSQL Database ● Generic Model Organism Database (GMOD) – Chado relational database schema – GBrowse – JBrowse ● R – Experimental design – QTL mapping – Genomic selection
  • 50. Objectives: Provide cassava breeders and researchers access to data and tools in a centralized, user-friendly and reliable database. – Improve partner breeding program information tracking – Streamline management of genotypic and phenotypic data – Pipeline genotypic and phenotypic data through Genomic Selection prediction analyses
  • 51. Genomic Selection. Figure 1: The 'training population' is genotyped and phenotyped to 'train' the genomic selection (GS) prediction model. Genotypic information from the breeding material is then fed into the model to calculate genomic estimated breeding values (GEBV) for these lines. (From Heffner et al. 2009 Crop Sci. 49:1–12.) Figure 2: Information from a majority of lines in the breeding population (the training set) is used to create the prediction model. The model is then used to predict the phenotypes of the remaining lines (the validation set), using genotypic information only. The results from the model are compared to the actual data to give the prediction accuracy. (Image courtesy of Martha Hamblin, Cornell University.) Figure 3: Flow diagram of a genomic selection breeding program. Breeding cycle time is shortened by removing phenotypic evaluation of lines before selection as parents for the next cycle. (From Heffner et al. 2009 Crop Sci. 49:1–12.) Slide credit: Jeremy Edwards
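The training-set / validation-set procedure described on slide 51 can be sketched as follows. This is a toy illustration under stated assumptions, not the model used by SolGS or the NextGen Cassava project: the genotypes and phenotypes are simulated, the marker effects are invented, and ridge regression is swapped in as one simple way to estimate marker effects. All function names are hypothetical.

```python
import random


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_ridge(genotypes, phenotypes, lam=1.0):
    """Estimate marker effects from the training set: (X'X + lam*I) beta = X'y on centered data."""
    n, p = len(genotypes), len(genotypes[0])
    mean_x = [sum(row[j] for row in genotypes) / n for j in range(p)]
    mean_y = sum(phenotypes) / n
    xc = [[row[j] - mean_x[j] for j in range(p)] for row in genotypes]
    yc = [y - mean_y for y in phenotypes]
    xtx = [[sum(xc[i][j] * xc[i][k] for i in range(n)) + (lam if j == k else 0.0)
            for k in range(p)] for j in range(p)]
    xty = [sum(xc[i][j] * yc[i] for i in range(n)) for j in range(p)]
    return mean_x, mean_y, solve(xtx, xty)


def predict(model, genotypes):
    """Genomic estimated breeding values (GEBV) from genotypes only."""
    mean_x, mean_y, beta = model
    return [mean_y + sum((row[j] - mean_x[j]) * beta[j] for j in range(len(beta)))
            for row in genotypes]


def pearson(a, b):
    """Pearson correlation, used here as the prediction accuracy."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5


# Simulated data: 120 lines, 8 markers coded 0/1/2, invented marker effects.
random.seed(1)
TRUE_EFFECTS = [1.0, -0.8, 0.5, 0.0, 0.3, -0.4, 0.9, -0.2]
lines = []
for _ in range(120):
    geno = [random.choice([0, 1, 2]) for _ in TRUE_EFFECTS]
    pheno = sum(g * b for g, b in zip(geno, TRUE_EFFECTS)) + random.gauss(0, 0.5)
    lines.append((geno, pheno))

train, valid = lines[:100], lines[100:]  # training set and validation set
model = fit_ridge([g for g, _ in train], [p for _, p in train])
gebv = predict(model, [g for g, _ in valid])
accuracy = pearson(gebv, [p for _, p in valid])
print(f"prediction accuracy (correlation): {accuracy:.2f}")
```

The validation phenotypes are used only at the last step, to score the predictions. In a breeding program the same `predict` step would be applied to genotyped lines that have not been phenotyped at all, which is what shortens the cycle.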
  • 52. Data collection in the field ● Android tablets ● Field Book app – Jesse Poland's group at USDA-ARS / Kansas State University. Slide credit: Jeremy Edwards
  • 53. Genotyping by sequencing (GBS) ● TASSEL 4 pipeline from Ed Buckler's group ● Discovery vs production ● Filtering ● Imputation ● Storing in Cassavabase. Slide credit: Jeremy Edwards
  • 54. Genotyping by sequencing (GBS)
  • 55. SolGS: A tool for genomic selection. Diagram labels: Phenotyped & Genotyped Lines; Prediction Model; Genotyped Lines; Predicted Breeding Values. Slide credit: Jeremy Edwards
  • 56. Cassava Trait Ontology ● Standard terminology ● Facilitate the sharing of information ● Allow users to query keywords related to traits. Kulakow et al. 2011. Slide credit: Jeremy Edwards
  • 57.
  • 58. Position available at the Solgenomics Cassavabase project: Plant Breeding + Bioinformatician ● Familiar with breeding ● Programming in Perl, R, SQL, Hadoop ● Linux ● Africa ● Genius http://www.cassavabase.org/forum/posts.pl?topic_id=9
  • 59. Thank you!! Questions??