SlideShare une entreprise Scribd logo
1  sur  9
Télécharger pour lire hors ligne
Analyzing Copy Number
Variation from a
Bioinformatics Approach
July 24th, 2009
by Gagandeep Singh Anand
ganand@uwaterloo.ca
General Overview - Study
The Study:
• Identify and Annotate germline genomic alterations that
predispose to familial pancreatic cancer (FPC)
• Determine: functionally inactivated / over-expressed genes in pancreatic
adenocarcinoma tumours
• Considered as candidate tumour suppressor or oncogenic genes in FPC
• By finding copy number variation (variability in copy [>2
or <2] of DNA regions) in FPC Case vs. Controls to find
“interesting” regions of inactivation / over expression
My Contributions:
• Assist with data mining and pipelining – a technique to
find “interesting” genes
• Make data analysis a more efficient process with the aid
of numerous programming languages and bioinformatics
tools
General Overview - Bioinformatics
• Compare the Database of Genomic Variants (+ our
Controls) with our FPC CNV regions – find overlaps/non-
overlaps.
• Compare all regions (i.e. Somatic CNVs, miRNAs,
Hypo/Methylated regions) from other papers – also
based on pancreatic adenocarcinoma – with our FPC
Cases.
• Determine all the Genes coinciding with our FPC CNV
regions
DGV + our Controls vs. FPC Cases...
Overlap of FPC CNVs vs. All Controls:
565 FPC gain CNVs /
235 FPC deleted CNVs coinciding with any control
Non-overlap of FPC CNVs vs. All Controls:
240 FPC gain CNVs /
55 FPC deleted CNVs not overlapping with controls
FPC
Amplifications
(631)
FPC Deletions
(266)
DGV + Controls
(25700)
...DGV + our Controls vs. FPC Cases
Database of Genomic Variants (DGV) Tracks:
• Using the DGV - Genomic Browser, we can view tracks
(list of sequence annotations) on the same page:
• FPC Case CNVs as well as all (DGV + our controls)
control regions
• CNV Non-Overlaps with all (DGV + our controls)
control regions
• This gives us the ability to visualize a multitude of
different annotations on the same page
• For example…
Somatic CNVs, miRNA's &
Methylated regions vs. FPC CNVs
• Analyzed 12 different papers containing 1265 (27
hypo/methylated, 12 miRNA’s, 1233 somatic PC CNVs)
interesting regions (regions that had high affinity in
pancreatic adenocarcinoma cases)
• Observed 63 FPC deleted CNVs overlapping with other
regions (somatic CNVs, miRNA's or Methylated regions)
• Observed 240 FPC gain CNVs overlapping with other
regions (somatic CNVs, miRNA's or Methylated regions)
• Ongoing, as we search for more papers
Annotate FPC Regions with Standard
Gene IDs
• Get a list of genes that coincide with a set of CNVs /
Controls
• Get a list of CNVs / Controls that coincide with a set
of genes
Ensembl ID Entrez
ID
Chr. Gene
Start
Gene End Known Gene
Aliases
# of
CNVs
CNV Names
ENSG00000005
206
56928 19 2279629 2306099 AC004410.1 3 G_247,G_281,
G_177
ENSG00000005
844
16 30391572 30442006 ITGAL 2 G_171,G_241
Temp_ID Chr. Case Start Case End Case Size # of
Genes
Ensembl Gene
Names (Gene
Alias)
G_151 14 106223861 106246130 22270 2 ENSG0000021937
3 (IGHV1-68),…
G_152 7 61718176 62343621 625446 2 ENSG0000019923
1 (snoU2_19),…
Annotate FPC Regions with Standard
Gene IDs
• Next we would like to annotate "interesting" genes in
detail and prioritize them based on their relevance to
FPC – I hope to automate this pipeline!
• Previously, we determined regions of non-overlaps
based on regions of overlapping CNVs vs. Controls
• Now, using a variety of genome browsers, we find the
genes that coincide with our FPC CNVs and Our
Controls then find the set of disjoint genes
Only FPC AMP
Genes (1864)
Only FPC Del
Genes (84)
Only CTRL
AMP Genes
(14272)
Only CTRL DEL
Genes (1257)
Genes in
FPC &
CTRL AMPs
(1678)
Genes in
FPC &
CTRL DELs
(428)
Thank you for listening…
Acknowledgements: Dr. Wigdan Al-Sukhni

Contenu connexe

En vedette

Single Nucleotide Polymorphism Genotyping Using Kompetitive Allele Specific ...
Single Nucleotide Polymorphism Genotyping Using Kompetitive Allele Specific ...Single Nucleotide Polymorphism Genotyping Using Kompetitive Allele Specific ...
Single Nucleotide Polymorphism Genotyping Using Kompetitive Allele Specific ...MANGLAM ARYA
 
Single Nucleotide Polymorphism Analysis (SNPs)
Single Nucleotide Polymorphism Analysis (SNPs)Single Nucleotide Polymorphism Analysis (SNPs)
Single Nucleotide Polymorphism Analysis (SNPs)Data Science Thailand
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphismBipul Das
 

En vedette (6)

SNP Genotyping Technologies
SNP Genotyping TechnologiesSNP Genotyping Technologies
SNP Genotyping Technologies
 
Single Nucleotide Polymorphism Genotyping Using Kompetitive Allele Specific ...
Single Nucleotide Polymorphism Genotyping Using Kompetitive Allele Specific ...Single Nucleotide Polymorphism Genotyping Using Kompetitive Allele Specific ...
Single Nucleotide Polymorphism Genotyping Using Kompetitive Allele Specific ...
 
Single Nucleotide Polymorphism Analysis (SNPs)
Single Nucleotide Polymorphism Analysis (SNPs)Single Nucleotide Polymorphism Analysis (SNPs)
Single Nucleotide Polymorphism Analysis (SNPs)
 
Snp
SnpSnp
Snp
 
Wigard P. Kloosterman
Wigard P. KloostermanWigard P. Kloosterman
Wigard P. Kloosterman
 
Single nucleotide polymorphism
Single nucleotide polymorphismSingle nucleotide polymorphism
Single nucleotide polymorphism
 

Similaire à Analyzing Copy Number Variation from a Bioinformatics Approach

2017 molecular profiling_wim_vancriekinge
2017 molecular profiling_wim_vancriekinge2017 molecular profiling_wim_vancriekinge
2017 molecular profiling_wim_vancriekingeProf. Wim Van Criekinge
 
20170209 ngs for_cancer_genomics_101
20170209 ngs for_cancer_genomics_10120170209 ngs for_cancer_genomics_101
20170209 ngs for_cancer_genomics_101Ino de Bruijn
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxDESMONDEZIEKE1
 
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...QIAGEN
 
Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...Malachi Griffith
 
Annotation capabilities
Annotation capabilitiesAnnotation capabilities
Annotation capabilitiesGolden Helix
 
Experimental methods and the big data sets
Experimental methods and the big data sets Experimental methods and the big data sets
Experimental methods and the big data sets improvemed
 
Exploiting microRNAs for precision oncology
Exploiting microRNAs for precision oncologyExploiting microRNAs for precision oncology
Exploiting microRNAs for precision oncologyGhent University
 
GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GenomeInABottle
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...VHIR Vall d’Hebron Institut de Recerca
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GenomeInABottle
 
Genome in a bottle for ashg grc giab workshop 181016
Genome in a bottle for ashg grc giab workshop 181016Genome in a bottle for ashg grc giab workshop 181016
Genome in a bottle for ashg grc giab workshop 181016GenomeInABottle
 
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Intel IT Center
 
Día 19 - Noel Chen - Introducción a Novogene
Día 19 - Noel Chen - Introducción a Novogene Día 19 - Noel Chen - Introducción a Novogene
Día 19 - Noel Chen - Introducción a Novogene Alejandro Borges
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...VHIR Vall d’Hebron Institut de Recerca
 
VarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
VarSeq 2.4.0: VSClinical ACMG Workflow from the User PerspectiveVarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
VarSeq 2.4.0: VSClinical ACMG Workflow from the User PerspectiveGolden Helix
 
VarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
VarSeq 2.4.0: VSClinical ACMG Workflow from the User PerspectiveVarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
VarSeq 2.4.0: VSClinical ACMG Workflow from the User PerspectiveGolden Helix
 

Similaire à Analyzing Copy Number Variation from a Bioinformatics Approach (20)

2017 molecular profiling_wim_vancriekinge
2017 molecular profiling_wim_vancriekinge2017 molecular profiling_wim_vancriekinge
2017 molecular profiling_wim_vancriekinge
 
20170209 ngs for_cancer_genomics_101
20170209 ngs for_cancer_genomics_10120170209 ngs for_cancer_genomics_101
20170209 ngs for_cancer_genomics_101
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptx
 
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
Utilization of NGS to Identify Clinically-Relevant Mutations in cfDNA: Meet t...
 
Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...
 
Annotation capabilities
Annotation capabilitiesAnnotation capabilities
Annotation capabilities
 
Experimental methods and the big data sets
Experimental methods and the big data sets Experimental methods and the big data sets
Experimental methods and the big data sets
 
Rna seq
Rna seq Rna seq
Rna seq
 
Exploiting microRNAs for precision oncology
Exploiting microRNAs for precision oncologyExploiting microRNAs for precision oncology
Exploiting microRNAs for precision oncology
 
GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015GIAB update for GRC GIAB workshop 191015
GIAB update for GRC GIAB workshop 191015
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
Genomics experimental-methods
Genomics experimental-methodsGenomics experimental-methods
Genomics experimental-methods
 
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
GIAB Benchmarks for SVs and Repeats for stanford genetics sv 200511
 
Genome in a bottle for ashg grc giab workshop 181016
Genome in a bottle for ashg grc giab workshop 181016Genome in a bottle for ashg grc giab workshop 181016
Genome in a bottle for ashg grc giab workshop 181016
 
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
 
Día 19 - Noel Chen - Introducción a Novogene
Día 19 - Noel Chen - Introducción a Novogene Día 19 - Noel Chen - Introducción a Novogene
Día 19 - Noel Chen - Introducción a Novogene
 
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
NGS Applications II (UEB-UAT Bioinformatics Course - Session 2.1.3 - VHIR, Ba...
 
VarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
VarSeq 2.4.0: VSClinical ACMG Workflow from the User PerspectiveVarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
VarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
 
VarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
VarSeq 2.4.0: VSClinical ACMG Workflow from the User PerspectiveVarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
VarSeq 2.4.0: VSClinical ACMG Workflow from the User Perspective
 
2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented
 

Analyzing Copy Number Variation from a Bioinformatics Approach

  • 1. Analyzing Copy Number Variation from a Bioinformatics Approach July 24th, 2009 by Gagandeep Singh Anand ganand@uwaterloo.ca
  • 2. General Overview - Study The Study: • Identify and Annotate germline genomic alterations that predispose to familial pancreatic cancer (FPC) • Determine: functionally inactivated / over-expressed genes in pancreatic adenocarcinoma tumours • Considered as candidate tumour suppressor or oncogenic genes in FPC • By finding copy number variation (variability in copy [>2 or <2] of DNA regions) in FPC Case vs. Controls to find “interesting” regions of inactivation / over expression My Contributions: • Assist with data mining and pipelining – a technique to find “interesting” genes • Make data analysis a more efficient process with the aid of numerous programming languages and bioinformatics tools
  • 3. General Overview - Bioinformatics • Compare the Database of Genomic Variants (+ our Controls) with our FPC CNV regions – find overlaps/non- overlaps. • Compare all regions (i.e. Somatic CNVs, miRNAs, Hypo/Methylated regions) from other papers – also based on pancreatic adenocarcinoma – with our FPC Cases. • Determine all the Genes coinciding with our FPC CNV regions
  • 4. DGV + our Controls vs. FPC Cases... Overlap of FPC CNVs vs. All Controls: 565 FPC gain CNVs / 235 FPC deleted CNVs coinciding with any control Non-overlap of FPC CNVs vs. All Controls: 240 FPC gain CNVs / 55 FPC deleted CNVs not overlapping with controls FPC Amplifications (631) FPC Deletions (266) DGV + Controls (25700)
  • 5. ...DGV + our Controls vs. FPC Cases Database of Genomic Variants (DGV) Tracks: • Using the DGV - Genomic Browser, we can view tracks (list of sequence annotations) on the same page: • FPC Case CNVs as well as all (DGV + our controls) control regions • CNV Non-Overlaps with all (DGV + our controls) control regions • This gives us the ability to visualize a multitude of different annotations on the same page • For example…
  • 6. Somatic CNVs, miRNA's & Methylated regions vs. FPC CNVs • Analyzed 12 different papers containing 1265 (27 hypo/methylated, 12 miRNA’s, 1233 somatic PC CNVs) interesting regions (regions that had high affinity in pancreatic adenocarcinoma cases) • Observed 63 FPC deleted CNVs overlapping with other regions (somatic CNVs, miRNA's or Methylated regions) • Observed 240 FPC gain CNVs overlapping with other regions (somatic CNVs, miRNA's or Methylated regions) • Ongoing, as we search for more papers
  • 7. Annotate FPC Regions with Standard Gene IDs • Get a list of genes that coincide with a set of CNVs / Controls • Get a list of CNVs / Controls that coincide with a set of genes Ensembl ID Entrez ID Chr. Gene Start Gene End Known Gene Aliases # of CNVs CNV Names ENSG00000005 206 56928 19 2279629 2306099 AC004410.1 3 G_247,G_281, G_177 ENSG00000005 844 16 30391572 30442006 ITGAL 2 G_171,G_241 Temp_ID Chr. Case Start Case End Case Size # of Genes Ensembl Gene Names (Gene Alias) G_151 14 106223861 106246130 22270 2 ENSG0000021937 3 (IGHV1-68),… G_152 7 61718176 62343621 625446 2 ENSG0000019923 1 (snoU2_19),…
  • 8. Annotate FPC Regions with Standard Gene IDs • Next we would like to annotate "interesting" genes in detail and prioritize them based on their relevance to FPC – I hope to automate this pipeline! • Previously, we determined regions of non-overlaps based on regions of overlapping CNVs vs. Controls • Now, using a variety of genome browsers, we find the genes that coincide with our FPC CNVs and Our Controls then find the set of disjoint genes Only FPC AMP Genes (1864) Only FPC Del Genes (84) Only CTRL AMP Genes (14272) Only CTRL DEL Genes (1257) Genes in FPC & CTRL AMPs (1678) Genes in FPC & CTRL DELs (428)
  • 9. Thank you for listening… Acknowledgements: Dr. Wigdan Al-Sukhni