- PacBio HiFi reads are long (>10 kb) and accurate (>99%). HiFi reads are available now for HG002 and soon for HG001 and HG005.
- HiFi reads will be useful for comprehensive variant detection and phasing. Plans are outlined to apply HiFi reads to structural variant benchmarking and expand small variant calling to difficult regions.
8. HG002 HIFI DATASETS
Coverage    Average Read Length
30-fold     9.6 kb
28-fold     13.5 kb
- Reads on SRA and NIST FTP
- Alignments on NIST FTP
Richard Hall, Paul Peluso, Yufeng Qian, David Rank, Billy Rowell
https://bit.ly/2THv47q (NIST FTP)
9. SEQUEL II SYSTEM
Sequel System (1M)
Sequel II System (8M)
Subread Yield 318 Gb
CCS Yield 16 Gb
CCS Accuracy 99.8%
[Figure: yield per unit read length (kb) plotted against read length (kb); x-axis 0 to 300, y-axis 0 to 175]
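As a rough worked example of the numbers on this slide, the sketch below converts the stated CCS accuracy into a Phred-scaled quality and the stated CCS yield into approximate fold coverage. The 3.1 Gb human genome size and the function names are assumptions for illustration; they do not come from the slides, and the slide does not say whether the yields are per SMRT Cell or per run.

```python
import math

# Assumption (not from the slides): approximate human genome size in Gb.
HUMAN_GENOME_GB = 3.1


def phred_from_accuracy(accuracy: float) -> float:
    """Convert a per-base accuracy (0 < accuracy < 1) to a Phred-scaled quality."""
    return -10.0 * math.log10(1.0 - accuracy)


def fold_coverage(yield_gb: float, genome_gb: float = HUMAN_GENOME_GB) -> float:
    """Approximate fold coverage as sequenced bases divided by genome size."""
    return yield_gb / genome_gb


# Values taken from the slide.
subread_yield_gb = 318.0
ccs_yield_gb = 16.0
ccs_accuracy = 0.998

print(f"CCS quality: Q{phred_from_accuracy(ccs_accuracy):.1f}")
print(f"CCS coverage of a {HUMAN_GENOME_GB} Gb genome: {fold_coverage(ccs_yield_gb):.1f}-fold")
print(f"CCS yield as a fraction of subread yield: {ccs_yield_gb / subread_yield_gb:.1%}")
```

Under these assumptions, 99.8% accuracy corresponds to roughly Q27, and 16 Gb of CCS reads covers a human genome about 5-fold, so a 30-fold dataset would need about six times that yield.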
11. PLANS FOR NEXT QUARTER
Sample   Sequencer          CCS Dataset
HG002    Sequel System      30-fold with 9.6 kb reads
HG002    Sequel System      28-fold with 13.5 kb reads
HG002    Sequel II System   30-fold with 11.0 kb reads
HG001    Sequel II System   30-fold with 10 kb reads
HG005    Sequel II System   30-fold with 10 kb reads
Submitted
In Progress
Primo Baybayan, Shreya Chakraborty, Alicia Yang
12. APPLICATIONS OF HIFI READS FOR GENOME IN A BOTTLE
Structural variant benchmark
- HG002: HiFi structural variant callset rivals multi-technology benchmark
- Extend to HG001, HG005
Small variant benchmark (v4α for all samples)
- Expand to difficult regions
- Correct mistakes
Phasing variants
14. SUMMARY
bioRxiv 519025
doi:10.1101/519025
Baylor – Medhat Mahmoud, Fritz Sedlazeck
Dana-Farber – Heng Li
Chinese Academy of Agricultural Sciences – Jue Ruan
DNAnexus – Chen-Shan Chin, Arkarachai Fungtammasan
Google – Andrew Carroll, Pi-Chuan Chang, Mark DePristo, Alexey Kolesnikov
Johns Hopkins – Michael Alonge, Michael Schatz
Max Planck Dresden – Gene Myers
NIH/NHGRI – Sergey Koren, Adam Phillippy
NIST – Nathan Olson, Justin Zook
PacBio – Greg Concepcion, Richard Hall, Paul Peluso, Yufeng Qian, David Rank, William Rowell, Armin Töpfer, Aaron Wenger
Saarland University – Jana Ebler, Tobias Marschall
- PacBio HiFi reads are long (>10 kb) and accurate (>99%).
- HiFi reads are available now for HG002 and soon for HG001 and HG005.
- HiFi reads will be useful for comprehensive variant detection and phasing.