SlideShare une entreprise Scribd logo
1  sur  11
Télécharger pour lire hors ligne
For Research Use Only. Not for use in diagnostic procedures. © Copyright 2019 by Pacific Biosciences of California, Inc. All rights reserved.
Reviewing benchmark v3.3.2 PacBio CCS
discrepancies in benchmark v4!
Billy Rowell
@nothingclever
OVERALL BENCHMARK CHANGES
Type Metric v3.3.2 v4! "
SNP
TP 3,034,399 3,404,145 369,746
FN 13,438 29,593 16,155
FP 16,248 17,351 1,103
INDEL
TP 377,613 431,725 54,112
FN 87,151 104,232 17,081
FP 103,619 106,628 3,009
Type Metric v3.3.2 v4! "
SNP
Recall 99.56% 99.14% -0.42%
Precision 99.47% 99.49% 0.02%
F1 Score 99.51% 99.32% -0.19%
INDEL
Recall 81.25% 80.55% -0.70%
Precision 78.98% 80.68% 1.70%
F1 Score 80.10% 80.61% 0.51%
Comparing GATK4HC calls on Sequel CCS 13.5kb library
SUPPLEMENTARY TABLE 9
- Supplementary Table 9. Manual curation of
small variant discrepancies between CCS
callset and Genome in a Bottle benchmark.
For the “Discrepancy” column, “AM” means
genotype difference, “FN” means false negative
(in benchmark but not callset), and “FP s” means
false positive (in callset but not benchmark).
“Repeat family” column is from the
RepeatMasker track from the UCSC Genome
Browser. “Correct Call” column is “GIAB” when
the benchmark was deemed correct by expert
curators, and “CCS” when the CCS callset was
deemed correct. Rows where the correct call is
from the CCS callset are colored blue.
bioRxiv 519025
doi:10.1101/519025
UPDATES TO DISCREPANCIES BETWEEN CCS AND V3.3.2
CHR POS Discrepancy
4 11468804AM
5 42740225AM
2 5143996AM
13 48291499AM
8 5930728FN
15 41943823FN
6 9737425FN
7 157385671FN
17 32064214FN
1 94256825FP
2 153864971FP
4 112819087FP
4 165026074FP
11 23338682FP
1 35034071FP
3 79181734FP
4 94532444FP
8 46873565FP
9 22350168FP
21 42288851FP
13/20 SITES WERE REMOVED FROM HIGH CONFIDENCE
REGION
CHR POS Discrepancy high conf
4 11468804 AM BORDER
5 42740225 AM FALSE
2 5143996 AM FALSE
13 48291499 AM FALSE
8 5930728 FN FALSE
15 41943823 FN FALSE
6 9737425 FN FALSE
7 157385671 FN TRUE
17 32064214 FN FALSE
1 94256825 FP TRUE
2 153864971 FP FALSE
4 112819087 FP FALSE
4 165026074 FP TRUE
11 23338682 FP TRUE
1 35034071 FP FALSE
3 79181734 FP TRUE
4 94532444 FP FALSE
8 46873565 FP FALSE
9 22350168 FP TRUE
21 42288851 FP FALSE
7/20 SITES WERE MODIFIED IN V4! TO MATCH GATK CALLS
CHR POS Discrepancy high conf
4 11468804 AM BORDER
5 42740225 AM FALSE
2 5143996 AM FALSE
13 48291499 AM FALSE
8 5930728 FN FALSE
15 41943823 FN FALSE
6 9737425 FN FALSE
7 157385671 FN TRUE
17 32064214 FN FALSE
1 94256825 FP TRUE
2 153864971 FP FALSE
4 112819087 FP FALSE
4 165026074 FP TRUE
11 23338682 FP TRUE
1 35034071 FP FALSE
3 79181734 FP TRUE
4 94532444 FP FALSE
8 46873565 FP FALSE
9 22350168 FP TRUE
21 42288851 FP FALSE
SOME OF THESE REGIONS MAY BE ADDED BACK TO HIGH
CONFIDENCE AFTER CLOSER INSPECTION
CHR POS Discrepancy high conf Notes
4 11468804 AM BORDER fixed
5 42740225 AM FALSE complex variant; L1PA2
2 5143996 AM FALSE mis-mapped short reads; HERVH-int
13 48291499 AM FALSE highly variable region; L1PA3
8 5930728 FN FALSE long reads identify long insertion
15 41943823 FN FALSE simple repeat with some variability causes alignment issues
6 9737425 FN FALSE segmental dup
7 157385671 FN TRUE segmental dup, fixed
17 32064214 FN FALSE
1 94256825 FP TRUE fixed; L1PA2
2 153864971 FP FALSE supported by CCS, mate pairs, 10X, ONT; L1HS
4 112819087 FP FALSE L1HS
4 165026074 FP TRUE fixed; L1PA2
11 23338682 FP TRUE fixed; L1P1
1 35034071 FP FALSE L1HS
3 79181734 FP TRUE fixed; L1HS
4 94532444 FP FALSE supported by CCS, mate pairs, ONT; L1HS
8 46873565 FP FALSE ALR/Alpha
9 22350168 FP TRUE fixed; L1PA2
21 42288851 FP FALSE supported by CCS, mate pairs, ONT; L1PA2
CHR6:9737425
-V3.3.2 HET
-V4 – removed from high confidence
region
-segmental duplication15kb CCS
2x250
6kb mate pair
Linked-read
Ultralong
Seg dup
V3.3.2 high conf
V4 high conf
CHR13:48291499
-V3.3.2 - HOMALT
-V4 - removed from high confidence
region
-Support for HET in CCS, linked-
reads, mate pairs, and ultralong
reads
-L1PA3
15kb CCS
2x250
6kb mate pair
Linked-read
Ultralong
Seg dup
V3.3.2 high conf
V4 high conf
CONCLUSION
-v4! increases TP variant calls in CCS-GATK4HC call set by > 400k
-slight decreases in recall, but increases in precision
-v4! resolves all discrepancies reported in Wenger & Peluso et al.
-There may be some cases where the high confidence region could be further
expanded, based on agreement between CCS, 10X, ONT, and 6kb mate
pairs.
For Research Use Only. Not for use in diagnostic procedures. © Copyright 2019 by Pacific Biosciences of California, Inc. All rights reserved. Pacific Biosciences, the Pacific Biosciences logo, PacBio,
SMRT, SMRTbell, Iso-Seq, and Sequel are trademarks of Pacific Biosciences. BluePippin and SageELF are trademarks of Sage Science. NGS-go and NGSengine are trademarks of GenDx. FEMTO
Pulse and Fragment Analyzer are trademarks of Advanced Analytical Technologies.
All other trademarks are the sole property of their respective owners.
www.pacb.com

Contenu connexe

Tendances (7)

Ciclotron dynamic 12000 h 2ω
Ciclotron   dynamic 12000 h 2ωCiclotron   dynamic 12000 h 2ω
Ciclotron dynamic 12000 h 2ω
 
Dc mol testing7oct14
Dc mol testing7oct14Dc mol testing7oct14
Dc mol testing7oct14
 
5LINX-SECURITY-SYSTEMS-Sales-Flyer_en
5LINX-SECURITY-SYSTEMS-Sales-Flyer_en5LINX-SECURITY-SYSTEMS-Sales-Flyer_en
5LINX-SECURITY-SYSTEMS-Sales-Flyer_en
 
5LINX-SECURITY-SYSTEMS-Sales-Flyer_en (1)
5LINX-SECURITY-SYSTEMS-Sales-Flyer_en (1)5LINX-SECURITY-SYSTEMS-Sales-Flyer_en (1)
5LINX-SECURITY-SYSTEMS-Sales-Flyer_en (1)
 
Termometro
TermometroTermometro
Termometro
 
Ciclotron tip2000 2 h ii
Ciclotron   tip2000 2 h iiCiclotron   tip2000 2 h ii
Ciclotron tip2000 2 h ii
 
M61266 Datasheet PDF
M61266 Datasheet PDFM61266 Datasheet PDF
M61266 Datasheet PDF
 

Similaire à New methods pacbio evaluation of draft v4alpha

Electrónica: Kit de radio AM/FM, La radio superhet modelo AM / FM-108CK conti...
Electrónica: Kit de radio AM/FM, La radio superhet modelo AM / FM-108CK conti...Electrónica: Kit de radio AM/FM, La radio superhet modelo AM / FM-108CK conti...
Electrónica: Kit de radio AM/FM, La radio superhet modelo AM / FM-108CK conti...
SANTIAGO PABLO ALBERTO
 
DCW Spring 2010 - 415VAC Power Distribution
DCW Spring 2010 - 415VAC Power DistributionDCW Spring 2010 - 415VAC Power Distribution
DCW Spring 2010 - 415VAC Power Distribution
miller0204
 
iPhone 6 Schematic
iPhone 6 SchematiciPhone 6 Schematic
iPhone 6 Schematic
Daniel Ferch
 

Similaire à New methods pacbio evaluation of draft v4alpha (20)

Cee Nergy Presentation
Cee Nergy PresentationCee Nergy Presentation
Cee Nergy Presentation
 
RFNT_CDR
RFNT_CDRRFNT_CDR
RFNT_CDR
 
Solution on Portable Blood Pressure Monitor System
Solution on Portable Blood Pressure Monitor SystemSolution on Portable Blood Pressure Monitor System
Solution on Portable Blood Pressure Monitor System
 
18 electronics
18 electronics18 electronics
18 electronics
 
Electrónica: Kit de radio AM/FM, La radio superhet modelo AM / FM-108CK conti...
Electrónica: Kit de radio AM/FM, La radio superhet modelo AM / FM-108CK conti...Electrónica: Kit de radio AM/FM, La radio superhet modelo AM / FM-108CK conti...
Electrónica: Kit de radio AM/FM, La radio superhet modelo AM / FM-108CK conti...
 
ISL97684 Led Driver Evaluation Board User Manual
ISL97684 Led Driver Evaluation Board User ManualISL97684 Led Driver Evaluation Board User Manual
ISL97684 Led Driver Evaluation Board User Manual
 
nonsyndromic orofacial cleft and palate
nonsyndromic orofacial cleft and palatenonsyndromic orofacial cleft and palate
nonsyndromic orofacial cleft and palate
 
Vialourdes1.3 PLANO ALUMBRADO
Vialourdes1.3 PLANO ALUMBRADOVialourdes1.3 PLANO ALUMBRADO
Vialourdes1.3 PLANO ALUMBRADO
 
ONEAL+OCM550.pdf
ONEAL+OCM550.pdfONEAL+OCM550.pdf
ONEAL+OCM550.pdf
 
70483100
7048310070483100
70483100
 
LG CRT TV MODEL Ct 21q92ke
LG CRT TV MODEL Ct 21q92keLG CRT TV MODEL Ct 21q92ke
LG CRT TV MODEL Ct 21q92ke
 
SprinklerApplicationChartFlipbook.PDF
SprinklerApplicationChartFlipbook.PDFSprinklerApplicationChartFlipbook.PDF
SprinklerApplicationChartFlipbook.PDF
 
I phone 5 full Schematic Diagram 820 3141-b
I phone 5 full Schematic Diagram 820 3141-bI phone 5 full Schematic Diagram 820 3141-b
I phone 5 full Schematic Diagram 820 3141-b
 
DCW Spring 2010 - 415VAC Power Distribution
DCW Spring 2010 - 415VAC Power DistributionDCW Spring 2010 - 415VAC Power Distribution
DCW Spring 2010 - 415VAC Power Distribution
 
Nodal Insights for Generation Owners and Modeling with AURORAxmp
Nodal Insights for Generation Owners and Modeling with AURORAxmpNodal Insights for Generation Owners and Modeling with AURORAxmp
Nodal Insights for Generation Owners and Modeling with AURORAxmp
 
iPhone 6 Schematic
iPhone 6 SchematiciPhone 6 Schematic
iPhone 6 Schematic
 
SDH ALARMS
SDH ALARMSSDH ALARMS
SDH ALARMS
 
Bảng giá mitsubishi 2020
Bảng giá mitsubishi 2020Bảng giá mitsubishi 2020
Bảng giá mitsubishi 2020
 
Handheld Point of Sale Terminal
Handheld Point of Sale TerminalHandheld Point of Sale Terminal
Handheld Point of Sale Terminal
 
Lc1 d
Lc1 dLc1 d
Lc1 d
 

Plus de GenomeInABottle

Plus de GenomeInABottle (20)

2023 GIAB AMP Update
2023 GIAB AMP Update2023 GIAB AMP Update
2023 GIAB AMP Update
 
GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023GIAB Tumor Normal ASHG 2023
GIAB Tumor Normal ASHG 2023
 
Stratomod ASHG 2023
Stratomod ASHG 2023Stratomod ASHG 2023
Stratomod ASHG 2023
 
GIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdfGIAB_ASHG_JZook_2023.pdf
GIAB_ASHG_JZook_2023.pdf
 
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
Using accurate long reads to improve Genome in a Bottle Benchmarks 220923
 
Benchmarking with GIAB 220907
Benchmarking with GIAB 220907Benchmarking with GIAB 220907
Benchmarking with GIAB 220907
 
Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...
 
GIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussionGIAB Technical Germline Benchmark roadmap discussion
GIAB Technical Germline Benchmark roadmap discussion
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
 
Giab agbt small_var_2020
Giab agbt small_var_2020Giab agbt small_var_2020
Giab agbt small_var_2020
 
GIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM ForumGIAB for AMP GeT-RM Forum
GIAB for AMP GeT-RM Forum
 
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GHGa4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
Ga4gh 2019 - Assuring data quality with benchmarking tools from GIAB and GA4GH
 
GIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant posterGIAB ASHG 2019 Structural Variant poster
GIAB ASHG 2019 Structural Variant poster
 
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATKGIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
GIAB GRC Workshop ASHG 2019 Billy Rowell Evaluation of v4 with CCS GATK
 
GIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant poster
 
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant BenchmarkGRC GIAB Workshop ASHG 2019 Small Variant Benchmark
GRC GIAB Workshop ASHG 2019 Small Variant Benchmark
 
Jason Chin MHC diploid assembly
Jason Chin MHC diploid assemblyJason Chin MHC diploid assembly
Jason Chin MHC diploid assembly
 
GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
 

Dernier

Premium Call Girls Nagpur {9xx000xx09} ❤️VVIP POOJA Call Girls in Nagpur Maha...
Premium Call Girls Nagpur {9xx000xx09} ❤️VVIP POOJA Call Girls in Nagpur Maha...Premium Call Girls Nagpur {9xx000xx09} ❤️VVIP POOJA Call Girls in Nagpur Maha...
Premium Call Girls Nagpur {9xx000xx09} ❤️VVIP POOJA Call Girls in Nagpur Maha...
Sheetaleventcompany
 
👉 Chennai Sexy Aunty’s WhatsApp Number 👉📞 7427069034 👉📞 Just📲 Call Ruhi Colle...
👉 Chennai Sexy Aunty’s WhatsApp Number 👉📞 7427069034 👉📞 Just📲 Call Ruhi Colle...👉 Chennai Sexy Aunty’s WhatsApp Number 👉📞 7427069034 👉📞 Just📲 Call Ruhi Colle...
👉 Chennai Sexy Aunty’s WhatsApp Number 👉📞 7427069034 👉📞 Just📲 Call Ruhi Colle...
rajnisinghkjn
 
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Sheetaleventcompany
 
Ahmedabad Call Girls Book Now 8980367676 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 8980367676 Top Class Ahmedabad Escort Service A...Ahmedabad Call Girls Book Now 8980367676 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 8980367676 Top Class Ahmedabad Escort Service A...
Genuine Call Girls
 
Difference Between Skeletal Smooth and Cardiac Muscles
Difference Between Skeletal Smooth and Cardiac MusclesDifference Between Skeletal Smooth and Cardiac Muscles
Difference Between Skeletal Smooth and Cardiac Muscles
MedicoseAcademics
 
Kolkata Call Girls Service ❤️🍑 9xx000xx09 👄🫦 Independent Escort Service Kolka...
Kolkata Call Girls Service ❤️🍑 9xx000xx09 👄🫦 Independent Escort Service Kolka...Kolkata Call Girls Service ❤️🍑 9xx000xx09 👄🫦 Independent Escort Service Kolka...
Kolkata Call Girls Service ❤️🍑 9xx000xx09 👄🫦 Independent Escort Service Kolka...
Sheetaleventcompany
 
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Sheetaleventcompany
 

Dernier (20)

Premium Call Girls Nagpur {9xx000xx09} ❤️VVIP POOJA Call Girls in Nagpur Maha...
Premium Call Girls Nagpur {9xx000xx09} ❤️VVIP POOJA Call Girls in Nagpur Maha...Premium Call Girls Nagpur {9xx000xx09} ❤️VVIP POOJA Call Girls in Nagpur Maha...
Premium Call Girls Nagpur {9xx000xx09} ❤️VVIP POOJA Call Girls in Nagpur Maha...
 
👉 Chennai Sexy Aunty’s WhatsApp Number 👉📞 7427069034 👉📞 Just📲 Call Ruhi Colle...
👉 Chennai Sexy Aunty’s WhatsApp Number 👉📞 7427069034 👉📞 Just📲 Call Ruhi Colle...👉 Chennai Sexy Aunty’s WhatsApp Number 👉📞 7427069034 👉📞 Just📲 Call Ruhi Colle...
👉 Chennai Sexy Aunty’s WhatsApp Number 👉📞 7427069034 👉📞 Just📲 Call Ruhi Colle...
 
Call Girls Kathua Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kathua Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Kathua Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Kathua Just Call 8250077686 Top Class Call Girl Service Available
 
Cardiac Output, Venous Return, and Their Regulation
Cardiac Output, Venous Return, and Their RegulationCardiac Output, Venous Return, and Their Regulation
Cardiac Output, Venous Return, and Their Regulation
 
Kolkata Call Girls Naktala 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
Kolkata Call Girls Naktala  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Girl Se...Kolkata Call Girls Naktala  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Girl Se...
Kolkata Call Girls Naktala 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
 
Low Cost Call Girls Bangalore {9179660964} ❤️VVIP NISHA Call Girls in Bangalo...
Low Cost Call Girls Bangalore {9179660964} ❤️VVIP NISHA Call Girls in Bangalo...Low Cost Call Girls Bangalore {9179660964} ❤️VVIP NISHA Call Girls in Bangalo...
Low Cost Call Girls Bangalore {9179660964} ❤️VVIP NISHA Call Girls in Bangalo...
 
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
 
Exclusive Call Girls Bangalore {7304373326} ❤️VVIP POOJA Call Girls in Bangal...
Exclusive Call Girls Bangalore {7304373326} ❤️VVIP POOJA Call Girls in Bangal...Exclusive Call Girls Bangalore {7304373326} ❤️VVIP POOJA Call Girls in Bangal...
Exclusive Call Girls Bangalore {7304373326} ❤️VVIP POOJA Call Girls in Bangal...
 
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
 
tongue disease lecture Dr Assadawy legacy
tongue disease lecture Dr Assadawy legacytongue disease lecture Dr Assadawy legacy
tongue disease lecture Dr Assadawy legacy
 
Ahmedabad Call Girls Book Now 8980367676 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 8980367676 Top Class Ahmedabad Escort Service A...Ahmedabad Call Girls Book Now 8980367676 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 8980367676 Top Class Ahmedabad Escort Service A...
 
Call Girls in Lucknow Just Call 👉👉 8875999948 Top Class Call Girl Service Ava...
Call Girls in Lucknow Just Call 👉👉 8875999948 Top Class Call Girl Service Ava...Call Girls in Lucknow Just Call 👉👉 8875999948 Top Class Call Girl Service Ava...
Call Girls in Lucknow Just Call 👉👉 8875999948 Top Class Call Girl Service Ava...
 
Difference Between Skeletal Smooth and Cardiac Muscles
Difference Between Skeletal Smooth and Cardiac MusclesDifference Between Skeletal Smooth and Cardiac Muscles
Difference Between Skeletal Smooth and Cardiac Muscles
 
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service AvailableCall Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
 
Kolkata Call Girls Shobhabazar 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Gir...
Kolkata Call Girls Shobhabazar  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Gir...Kolkata Call Girls Shobhabazar  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Gir...
Kolkata Call Girls Shobhabazar 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Gir...
 
Race Course Road } Book Call Girls in Bangalore | Whatsapp No 6378878445 VIP ...
Race Course Road } Book Call Girls in Bangalore | Whatsapp No 6378878445 VIP ...Race Course Road } Book Call Girls in Bangalore | Whatsapp No 6378878445 VIP ...
Race Course Road } Book Call Girls in Bangalore | Whatsapp No 6378878445 VIP ...
 
(RIYA)🎄Airhostess Call Girl Jaipur Call Now 8445551418 Premium Collection Of ...
(RIYA)🎄Airhostess Call Girl Jaipur Call Now 8445551418 Premium Collection Of ...(RIYA)🎄Airhostess Call Girl Jaipur Call Now 8445551418 Premium Collection Of ...
(RIYA)🎄Airhostess Call Girl Jaipur Call Now 8445551418 Premium Collection Of ...
 
Circulatory Shock, types and stages, compensatory mechanisms
Circulatory Shock, types and stages, compensatory mechanismsCirculatory Shock, types and stages, compensatory mechanisms
Circulatory Shock, types and stages, compensatory mechanisms
 
Kolkata Call Girls Service ❤️🍑 9xx000xx09 👄🫦 Independent Escort Service Kolka...
Kolkata Call Girls Service ❤️🍑 9xx000xx09 👄🫦 Independent Escort Service Kolka...Kolkata Call Girls Service ❤️🍑 9xx000xx09 👄🫦 Independent Escort Service Kolka...
Kolkata Call Girls Service ❤️🍑 9xx000xx09 👄🫦 Independent Escort Service Kolka...
 
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
 

New methods pacbio evaluation of draft v4alpha

  • 1. For Research Use Only. Not for use in diagnostic procedures. © Copyright 2019 by Pacific Biosciences of California, Inc. All rights reserved. Reviewing benchmark v3.3.2 PacBio CCS discrepancies in benchmark v4! Billy Rowell @nothingclever
  • 2. OVERALL BENCHMARK CHANGES Type Metric v3.3.2 v4! " SNP TP 3,034,399 3,404,145 369,746 FN 13,438 29,593 16,155 FP 16,248 17,351 1,103 INDEL TP 377,613 431,725 54,112 FN 87,151 104,232 17,081 FP 103,619 106,628 3,009 Type Metric v3.3.2 v4! " SNP Recall 99.56% 99.14% -0.42% Precision 99.47% 99.49% 0.02% F1 Score 99.51% 99.32% -0.19% INDEL Recall 81.25% 80.55% -0.70% Precision 78.98% 80.68% 1.70% F1 Score 80.10% 80.61% 0.51% Comparing GATK4HC calls on Sequel CCS 13.5kb library
  • 3. SUPPLEMENTARY TABLE 9 - Supplementary Table 9. Manual curation of small variant discrepancies between CCS callset and Genome in a Bottle benchmark. For the “Discrepancy” column, “AM” means genotype difference, “FN” means false negative (in benchmark but not callset), and “FP s” means false positive (in callset but not benchmark). “Repeat family” column is from the RepeatMasker track from the UCSC Genome Browser. “Correct Call” column is “GIAB” when the benchmark was deemed correct by expert curators, and “CCS” when the CCS callset was deemed correct. Rows where the correct call is from the CCS callset are colored blue. bioRxiv 519025 doi:10.1101/519025
  • 4. UPDATES TO DISCREPANCIES BETWEEN CCS AND V3.3.2 CHR POS Discrepancy 4 11468804AM 5 42740225AM 2 5143996AM 13 48291499AM 8 5930728FN 15 41943823FN 6 9737425FN 7 157385671FN 17 32064214FN 1 94256825FP 2 153864971FP 4 112819087FP 4 165026074FP 11 23338682FP 1 35034071FP 3 79181734FP 4 94532444FP 8 46873565FP 9 22350168FP 21 42288851FP
  • 5. 13/20 SITES WERE REMOVED FROM HIGH CONFIDENCE REGION CHR POS Discrepancy high conf 4 11468804 AM BORDER 5 42740225 AM FALSE 2 5143996 AM FALSE 13 48291499 AM FALSE 8 5930728 FN FALSE 15 41943823 FN FALSE 6 9737425 FN FALSE 7 157385671 FN TRUE 17 32064214 FN FALSE 1 94256825 FP TRUE 2 153864971 FP FALSE 4 112819087 FP FALSE 4 165026074 FP TRUE 11 23338682 FP TRUE 1 35034071 FP FALSE 3 79181734 FP TRUE 4 94532444 FP FALSE 8 46873565 FP FALSE 9 22350168 FP TRUE 21 42288851 FP FALSE
  • 6. 7/20 SITES WERE MODIFIED IN V4! TO MATCH GATK CALLS CHR POS Discrepancy high conf 4 11468804 AM BORDER 5 42740225 AM FALSE 2 5143996 AM FALSE 13 48291499 AM FALSE 8 5930728 FN FALSE 15 41943823 FN FALSE 6 9737425 FN FALSE 7 157385671 FN TRUE 17 32064214 FN FALSE 1 94256825 FP TRUE 2 153864971 FP FALSE 4 112819087 FP FALSE 4 165026074 FP TRUE 11 23338682 FP TRUE 1 35034071 FP FALSE 3 79181734 FP TRUE 4 94532444 FP FALSE 8 46873565 FP FALSE 9 22350168 FP TRUE 21 42288851 FP FALSE
  • 7. SOME OF THESE REGIONS MAY BE ADDED BACK TO HIGH CONFIDENCE AFTER CLOSER INSPECTION CHR POS Discrepancy high conf Notes 4 11468804 AM BORDER fixed 5 42740225 AM FALSE complex variant; L1PA2 2 5143996 AM FALSE mis-mapped short reads; HERVH-int 13 48291499 AM FALSE highly variable region; L1PA3 8 5930728 FN FALSE long reads identify long insertion 15 41943823 FN FALSE simple repeat with some variability causes alignment issues 6 9737425 FN FALSE segmental dup 7 157385671 FN TRUE segmental dup, fixed 17 32064214 FN FALSE 1 94256825 FP TRUE fixed; L1PA2 2 153864971 FP FALSE supported by CCS, mate pairs, 10X, ONT; L1HS 4 112819087 FP FALSE L1HS 4 165026074 FP TRUE fixed; L1PA2 11 23338682 FP TRUE fixed; L1P1 1 35034071 FP FALSE L1HS 3 79181734 FP TRUE fixed; L1HS 4 94532444 FP FALSE supported by CCS, mate pairs, ONT; L1HS 8 46873565 FP FALSE ALR/Alpha 9 22350168 FP TRUE fixed; L1PA2 21 42288851 FP FALSE supported by CCS, mate pairs, ONT; L1PA2
  • 8. CHR6:9737425 -V3.3.2 HET -V4 – removed from high confidence region -segmental duplication15kb CCS 2x250 6kb mate pair Linked-read Ultralong Seg dup V3.3.2 high conf V4 high conf
  • 9. CHR13:48291499 -V3.3.2 - HOMALT -V4 - removed from high confidence region -Support for HET in CCS, linked- reads, mate pairs, and ultralong reads -L1PA3 15kb CCS 2x250 6kb mate pair Linked-read Ultralong Seg dup V3.3.2 high conf V4 high conf
  • 10. CONCLUSION -v4! increases TP variant calls in CCS-GATK4HC call set by > 400k -slight decreases in recall, but increases in precision -v4! resolves all discrepancies reported in Wenger & Peluso et al. -There may be some cases where the high confidence region could be further expanded, based on agreement between CCS, 10X, ONT, and 6kb mate pairs.
  • 11. For Research Use Only. Not for use in diagnostic procedures. © Copyright 2019 by Pacific Biosciences of California, Inc. All rights reserved. Pacific Biosciences, the Pacific Biosciences logo, PacBio, SMRT, SMRTbell, Iso-Seq, and Sequel are trademarks of Pacific Biosciences. BluePippin and SageELF are trademarks of Sage Science. NGS-go and NGSengine are trademarks of GenDx. FEMTO Pulse and Fragment Analyzer are trademarks of Advanced Analytical Technologies. All other trademarks are the sole property of their respective owners. www.pacb.com