Neurodevelopmental disorders according to the dsm 5 tr
Using Long Reads, Optical Maps and Long-Range Scaffolding to improve the Diaphorina citri Genome: An update
1. www.citrusgreening.org
Using Long Reads, Optical Maps and Long-
Range Scaffolding to improve the Diaphorina
citri Genome: An update
Surya Saha1, Susan J. Brown2 and Lukas Mueller1
1Boyce Thompson Institute; 2Kansas State Univeristy
ss2489@cornell.edu @SahaSurya
PAG XXV
Arthropod Genomics Workshop
3. www.citrusgreening.org
Citrus Greening: Huanglongbing
• Most significant disease of citrus worldwide
• More than $4.5 billion in lost citrus production and more than 8,200 lost jobs
(2006/07 to 2010/11)
• Associated with gram negative bacterium Candidatus Liberibacter asiaticus (CLas)
• Spread by insect vector, Diaphorina citri (Asian citrus psyllid, ACP)
Annie Kruse
10. Miniasm Assembly (Raw Reads)
https://github.com/lh3/miniasm/blob/master/README.md
No error correction
Very fast
Contig N50: 83,490bp
(was 34,407bp)
Counts
Number of
contigs
8,060
Total bases 458,143,096 or 458 Mb
Longest 1,188,453 bp
Shortest 5, 633 bp
Average length 56,841.6 bp
11. CANU Assembly
http://canu.readthedocs.io/en/stable/
Error correction of 40% longest reads
26.5X coverage after correction
Error rate 0.013 Error rate 0.015
Number of
contigs
7,832 8,030
Total bases 462,838,769 or
462 Mb
493,169,880 or
493.1 Mb
Longest 1,677,652 bp 1,757,402 bp
Shortest 4,425 bp 5,079 bp
Average length 59,095.9 bp 61,415.9 bp
Contig N50 85,832 bp 92,630 bp
12. PBJelly Scaffolding of
CANU Err 0.013 Assembly
Error rate 0.013 Error rate 0.015 Scaffolded 0.03
Number of
contigs
7,832 8,030 8,352
Total bases 462,838,769 or
462 Mb
493,169,880 or
493.1 Mb
591,730,999 bp
or 591.7 Mb
Longest 1,677,652 bp 1,757,402 bp 2,096,698 bp
Shortest 4,425 bp 5,079 bp 1,547 bp
Average
length
59,095.9 bp 61,415.9 bp 70,849.0 bp
Contig N50 85,832 bp 92,630 bp 115,896 bp
5,290 gap extensions; 535 gaps filled; Number of Ns: 0 bp
13. www.citrusgreening.org
Benchmarking
Complete Fragmented Missing
Diaci 1.1 90% 6% 4%
Diaci 1.9 92% 1% 7%
White fly 98.2% 0.5% 1.3%
PE RNAseq
622 Mill reads
Overall
alignment rate
Concordant
alignment rate
Diaci 1.1 82% 0.62%
Diaci 1.9 88% 79%
Benchmarking sets of Universal Single-Copy Orthologs based on a set of 1,066
single-copy orthologs from 133 arthropods species
17. www.citrusgreening.org
Haplotyping Contigs with 10X
Long read information from short reads using 14bp bar codes
Very low input DNA (0.625 ng for ACP)
1ng of DNA is split across 100,000 Gel Coated Beads (GEMs)
Chromium instrument
http://www.10xgenomics.com/products/
19. www.citrusgreening.org
Example: Human MHC map
• Sample prep requires very high molecular weight DNA
• Nicks at 10 sites / 100kb
• Individual molecules are assembles into optical maps (Cmaps)
• Optical maps and sequences are merged in a hybrid assembly
http://www.bionanogenomics.com/technology/why-genome-mapping/
ACP molecule N50: 240 kb
1 Mb DNA fragments are ideal
Optimizing enzymes for ACP