SlideShare a Scribd company logo
1 of 22
Improving the accuracy of
         variant identification

                Deanna M. Church
                     NCBI




@deannachurch
Image credit: http://www.tohlejokes.com
ISCA submitted variant: nsv534088
CDC27


                           rs11491191




http://www.ncbi.nlm.nih.gov/variation/tools/1000genomes
APOL1




 APOL1




CEPH:
 A=1.000
 G=0
 http://www.ncbi.nlm.nih.gov/variation/tools/1000genomes
APOL1




                            Multiple                Suspect
                           submissions
                                                1000G
                                    Frequency
                                       Data



                                            YRI:
                                             A=0.5852
                                             G=0.4148
                                    Sudmant et al., 2010
http://www.ncbi.nlm.nih.gov/variation/tools/1000genomes
http://genomereference.org
Submitted on NCBI35 (hg17)
nsv832911 (nstd68)




http://www.ncbi.nlm.nih.gov/dbvar
Moved approximately 2 Mb
                            distal on chr15
NCBI35 (hg17) Tiling Path
NC_0000015.8 (chr15)




            Gap Inserted              Removed from assembly




GRCh37 (hg19) Tiling Path              Added to assembly
NC_0000015.9 (chr15)



HG-24
GRCh37.p10
(160 regions: 2.89% of chromosomes)



111 Fix PATCHES: Chromosome update in GRCh38
  (adds >5 Mb of novel sequence to the assembly)

71 Novel PATCHES: Additional sequence added
  (adds >800K of novel sequence to the assembly)

Releasing patches quarterly                   Summer of 2013
MHC (chr6)
Chr 6 representation (PGF)




Alt_Ref_Locus_2 (COX)
Richa Agarwala
Eugene Yaschenko
1q32       1q21    1p21

1p21 patch alignment to chromosome 1




Dennis et al., 2012
http://www.lrg-sequence.org/
http://www.ncbi.nlm.nih.gov/refseq/rsg
RefSeq Gene




  L         R




http://www.ncbi.nlm.nih.gov/refseq/rsg
Assembly<->RefSeqGene
   (including transcripts and coding sequences)
http://www.ncbi.nlm.nih.gov/genome/tools/remap
Coordinate Remapping




            NCBI36 <->GRCh37

http://www.ncbi.nlm.nih.gov/genome/tools/remap
The Reference Assembly is Evolving

  Centralization of assembly and sequence data facilitates:
    Reporting problems
    Implementing fixes
    Building tools
    Data management
Thanks!

Genome Reference Consortium
 The Genome Institute at Washington University
 The Wellcome Trust Sanger Institute
 European Bioinformatics Institute
 National Center for Biotechnology Information

NCBI
 Assembly Group
 RefSeq Group
 Genome Annotation Group

More Related Content

Viewers also liked

Viewers also liked (16)

Church ngs
Church ngsChurch ngs
Church ngs
 
Church_GenomeAccess_2013_genome2013
Church_GenomeAccess_2013_genome2013Church_GenomeAccess_2013_genome2013
Church_GenomeAccess_2013_genome2013
 
Church SFAF2014 keynote
Church SFAF2014 keynoteChurch SFAF2014 keynote
Church SFAF2014 keynote
 
Wwcode2
Wwcode2Wwcode2
Wwcode2
 
Feels Like Ruby - Ruby Kaigi 2010
Feels Like Ruby - Ruby Kaigi 2010Feels Like Ruby - Ruby Kaigi 2010
Feels Like Ruby - Ruby Kaigi 2010
 
Ruby APIs for NoSQL - Polyglot Persistence - GoGaRuCo 2010
Ruby APIs for NoSQL - Polyglot Persistence - GoGaRuCo 2010Ruby APIs for NoSQL - Polyglot Persistence - GoGaRuCo 2010
Ruby APIs for NoSQL - Polyglot Persistence - GoGaRuCo 2010
 
Teaching Ruby to Kids
Teaching Ruby to KidsTeaching Ruby to Kids
Teaching Ruby to Kids
 
Moving the Needle: How SF Ruby Got to 18%
Moving the Needle: How SF Ruby Got to 18%Moving the Needle: How SF Ruby Got to 18%
Moving the Needle: How SF Ruby Got to 18%
 
Beyond (No)SQL
Beyond (No)SQLBeyond (No)SQL
Beyond (No)SQL
 
Corporate Name 2008 2
Corporate Name 2008 2Corporate Name 2008 2
Corporate Name 2008 2
 
Csr
CsrCsr
Csr
 
Iv Tyrimai
Iv TyrimaiIv Tyrimai
Iv Tyrimai
 
Ivadas
IvadasIvadas
Ivadas
 
Identitetas
IdentitetasIdentitetas
Identitetas
 
Organizacijos ivaizdis
Organizacijos ivaizdisOrganizacijos ivaizdis
Organizacijos ivaizdis
 
rysiai su visuomene - organizacijos ivaizdzio (reputacijos) vadybos irankis
rysiai su visuomene - organizacijos ivaizdzio (reputacijos) vadybos irankisrysiai su visuomene - organizacijos ivaizdzio (reputacijos) vadybos irankis
rysiai su visuomene - organizacijos ivaizdzio (reputacijos) vadybos irankis
 

Similar to Church clinical2012

Concordance_of_HTA_array_and_real_time_qPCR_results
Concordance_of_HTA_array_and_real_time_qPCR_resultsConcordance_of_HTA_array_and_real_time_qPCR_results
Concordance_of_HTA_array_and_real_time_qPCR_results
Andrea Ujvari
 
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-SeqNUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
Himanshu Sethi
 

Similar to Church clinical2012 (20)

26072016 uc davis_small
26072016 uc davis_small26072016 uc davis_small
26072016 uc davis_small
 
Concordance_of_HTA_array_and_real_time_qPCR_results
Concordance_of_HTA_array_and_real_time_qPCR_resultsConcordance_of_HTA_array_and_real_time_qPCR_results
Concordance_of_HTA_array_and_real_time_qPCR_results
 
Assign 2.0 software for the analysis of Phred quality values for quality con...
Assign 2.0  software for the analysis of Phred quality values for quality con...Assign 2.0  software for the analysis of Phred quality values for quality con...
Assign 2.0 software for the analysis of Phred quality values for quality con...
 
Cross-Kingdom Standards in Genomics, Epigenomics and Metagenomics
Cross-Kingdom Standards in Genomics, Epigenomics and MetagenomicsCross-Kingdom Standards in Genomics, Epigenomics and Metagenomics
Cross-Kingdom Standards in Genomics, Epigenomics and Metagenomics
 
Jan2016 pac bio giab
Jan2016 pac bio giabJan2016 pac bio giab
Jan2016 pac bio giab
 
CDAC 2018 Boeva discovery
CDAC 2018 Boeva discoveryCDAC 2018 Boeva discovery
CDAC 2018 Boeva discovery
 
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
 
Genotype imputation study in Gir dairy cattle of Gujarat
Genotype imputation study in Gir dairy cattle of GujaratGenotype imputation study in Gir dairy cattle of Gujarat
Genotype imputation study in Gir dairy cattle of Gujarat
 
Saliva Magnetic Bead v3
Saliva Magnetic Bead v3Saliva Magnetic Bead v3
Saliva Magnetic Bead v3
 
Church isca2012
Church isca2012Church isca2012
Church isca2012
 
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-SeqNUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
NUGEN-X-Gen_2011_poster_trancriptome_sequencing_RNA-Seq
 
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
 
155 dna microarray
155 dna microarray155 dna microarray
155 dna microarray
 
155 dna microarray
155 dna microarray155 dna microarray
155 dna microarray
 
Dna microarray mehran
Dna microarray  mehranDna microarray  mehran
Dna microarray mehran
 
Functional annotation of invertebrate genomes
Functional annotation of invertebrate genomesFunctional annotation of invertebrate genomes
Functional annotation of invertebrate genomes
 
Real-time Phylogenomics: Joe Parker
Real-time Phylogenomics: Joe ParkerReal-time Phylogenomics: Joe Parker
Real-time Phylogenomics: Joe Parker
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
nextgenerationsequencing-170606100132.pdf
nextgenerationsequencing-170606100132.pdfnextgenerationsequencing-170606100132.pdf
nextgenerationsequencing-170606100132.pdf
 
Sleep and the Gut Microbiome-bioRxiv-199075 1
Sleep and the Gut Microbiome-bioRxiv-199075 1Sleep and the Gut Microbiome-bioRxiv-199075 1
Sleep and the Gut Microbiome-bioRxiv-199075 1
 

More from Deanna Church (12)

Church_NCBIvariation2013
Church_NCBIvariation2013Church_NCBIvariation2013
Church_NCBIvariation2013
 
Church iowa2013
Church iowa2013Church iowa2013
Church iowa2013
 
Church GeT-RM
Church GeT-RMChurch GeT-RM
Church GeT-RM
 
Church sfaf13
Church sfaf13Church sfaf13
Church sfaf13
 
Church gia13
Church gia13Church gia13
Church gia13
 
Church apr2013
Church apr2013Church apr2013
Church apr2013
 
Church agbt13 merge
Church agbt13 mergeChurch agbt13 merge
Church agbt13 merge
 
Church nhgri 2012
Church nhgri 2012Church nhgri 2012
Church nhgri 2012
 
Church gmod2012 pt2
Church gmod2012 pt2Church gmod2012 pt2
Church gmod2012 pt2
 
Church gmod2012 pt1
Church gmod2012 pt1Church gmod2012 pt1
Church gmod2012 pt1
 
Imgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialImgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorial
 
Church Fif2009
Church Fif2009Church Fif2009
Church Fif2009
 

Recently uploaded

College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
perfect solution
 

Recently uploaded (20)

Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Call Girls Bareilly Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 8250077686 Top Class Call Girl Service Available
 
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
 
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
 
Top Rated Hyderabad Call Girls Erragadda ⟟ 6297143586 ⟟ Call Me For Genuine ...
Top Rated  Hyderabad Call Girls Erragadda ⟟ 6297143586 ⟟ Call Me For Genuine ...Top Rated  Hyderabad Call Girls Erragadda ⟟ 6297143586 ⟟ Call Me For Genuine ...
Top Rated Hyderabad Call Girls Erragadda ⟟ 6297143586 ⟟ Call Me For Genuine ...
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
 
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Ooty Just Call 8250077686 Top Class Call Girl Service Available
 
Bangalore Call Girls Nelamangala Number 9332606886 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 9332606886  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 9332606886  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 9332606886 Meetin With Bangalore Esc...
 
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
 
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
 
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...
 

Church clinical2012

Editor's Notes

  1. What is variant calling? Identifying differences from a reference.
  2. Technical noise: Ideogram showing gaps in human genome
  3. Technical noise example: ISCA variant submitted that was completely within a gap. Note: most people don’t look at assembly tracks when they review data- you might not ever see anything odd about this unless you did (this is actually what caused us to start doing validation on variant definition based on the assembly.
  4. Technical issue: We get tiny reads (relative to the genome) that we have to align to a reference and interpret.Screen shot from 1000 genomes data in the 1000 genomes browser. Tenth exon of CDC27. Highlighting samples that have been sequencing using one technology and aligned using two different methods. Not lack of 1000 genomes calls in this region as well as questionable het SNP (see in BWA, but not Mosaik)
  5. This problem can also arise due to population specific issues. APOL1 genes seems to be correctly assembled but there may be an African specific copy number duplication that causes a SNP to be called- this may not be a SNP but rather a difference between paralogous gene copies.
  6. Focal segmental glomerulosclerosis 4: utility of having data in a centralized resource: we were able to add annotation from multiple sources onto this genome location. Utility of having variant calls in a central repository to allow for addition of knowledge. Early in the new year we will be adding tracks for Evan’s SUN data and alignment of known paralogs/pseudogenes to the genome.
  7. To address assembly issues the GRC to centralize the production of the reference assembly. This gives the community a single point of contact for reporting problems and finding information about the assembly. Additionally, we serve as an aggregator of information- as individual labs find or fix problems, we can integrate this information into the reference assembly so everyone can have access to this data.
  8. Region curation slide: We curate the genome region by region, and make this information available to users on a web site (and as downloadable files for integration with browsers).
  9. The GRC releases patches to the human assembly on a quarterly cycle. There are two varieties of patches:FIX patches correct existing assembly problems: chromosome will update, patches integrated in GRCh38NOVEL patches add new sequence representations: will become alternate lociApproximately 3% of the current public human assembly GRCh37 is associated with a region that is represented by a patch or alternate locus.As you can see, the GRC has been very busy with updating assemblies. I’d now like to talk about the tools and software we use to do this.
  10. If you are not using the entire assembly in your efforts, you may be missing genes in your exome capture reagents.
  11. Example of a fix patches- no one is really screening for these right now despite clear importance in neuronal development.
  12. RefSeqGene/LRG screen shot: stable coordinate system for gene level reporting. Gene centric genomic sequences.
  13. Distribution of RefSeqGenes on GRCh37