SlideShare a Scribd company logo
1 of 108
Genomics:
Experimental methods
Slides available
www.bioinformatics.be
Lab for Bioinformatics and
computational genomics

10 “genome hackers”
mostly engineers (statistics)

42 scientists
technicians, geneticists, clinicians

>100 people
hardware engineers,
mathematicians, molecular biologists
Overview

Personalized Medicine,
Biomarkers …
… Molecular Profiling
First Generation Molecular Profiling
Next Generation Molecular Profiling
Next Generation Epigenetic Profiling
Concluding Remarks
Overview

Personalized Medicine,
Biomarkers …
… Molecular Profiling
First Generation Molecular Profiling
Next Generation Molecular Profiling
Next Generation Epigenetic Profiling
Concluding Remarks
Personalized Medicine
• The use of diagnostic tests (aka biomarkers) to identify in advance
which patients are likely to respond well to a therapy
• The benefits of this approach are to
– avoid adverse drug reactions
– improve efficacy
– adjust the dose to suit the patient
– differentiate a product in a competitive market
– meet future legal or regulatory requirements
• Potential uses of biomarkers
– Risk assessment
– Initial/early detection
– Prognosis
– Prediction/therapy selection
– Response assessment
– Monitoring for recurrence
Biomarker

First used in 1971 … An objective and
« predictive » measure … at the molecular
level … of normal and pathogenic processes
and responses to therapeutic interventions
Characteristic that is objectively measured and
evaluated as an indicator of normal biologic
or pathogenic processes or pharmacologic
response to a drug
A biomarker is valid if:
– It can be measured in a test system with well
established performance characteristics
– Evidence for its clinical significance has been
established
Rationale 1:
Why now ? Regulatory path becoming more clear
There is more at stake than
efficient drug
development. FDA
« critical path initiative »
Pharmacogenomics
guideline
Biomarkers are the
foundation of « evidence
based medicine » - who
should be treated, how
and with what.
Without Biomarkers
advances in targeted
therapy will be limited and
treatment remain largely
emperical. It is imperative
that Biomarker
development be
accelarated along with
therapeutics
Why now ?

First and maturing second generation molecular
profiling methodologies allow to stratify clinical
trial participants to include those most likely to
benefit from the drug candidate—and exclude
those who likely will not—pharmacogenomicsbased
Clinical trials should attain more specific results
with smaller numbers of patients. Smaller
numbers mean fewer costs (factor 2-10)
An additional benefit for trial participants and
internal review boards (IRBs) is that
stratification, given the correct biomarker, may
reduce or eliminate adverse events.
Molecular Profiling

The study of specific patterns (fingerprints) of proteins,
DNA, and/or mRNA and how these patterns correlate
with an individual's physical characteristics or
symptoms of disease.
Generic Health advice

• Exercise (Hypertrophic Cardiomyopathy)
• Drink your milk (MCM6 Lactose intolarance)
• Eat your green beans (glucose-6-phosphate
dehydrogenase Deficiency)
• & your grains (HLA-DQ2 – Celiac disease)
• & your iron (HFE - Hemochromatosis)
• Get more rest (HLA-DR2 - Narcolepsy)
Generic Health advice (UNLESS)

• Exercise (Hypertrophic Cardiomyopathy)
• Drink your milk (MCM6 Lactose intolarance)
• Eat your green beans (glucose-6-phosphate
dehydrogenase Deficiency)
• & your grains (HLA-DQ2 – Celiac disease)
• & your iron (HFE - Hemochromatosis)
• Get more rest (HLA-DR2 - Narcolepsy)
Generic Health advice (UNLESS)

• Exercise (Hypertrophic Cardiomyopathy)
• Drink your milk (MCM6 Lactose intolerance)
• Eat your green beans (glucose-6-phosphate
dehydrogenase Deficiency)
• & your grains (HLA-DQ2 – Celiac disease)
• & your iron (HFE - Hemochromatosis)
• Get more rest (HLA-DR2 - Narcolepsy)
Generic Health advice (UNLESS)

• Exercise (Hypertrophic Cardiomyopathy)
• Drink your milk (MCM6 Lactose intolerance)
• Eat your green beans (glucose-6-phosphate
dehydrogenase Deficiency)
• & your grains (HLA-DQ2 – Celiac disease)
• & your iron (HFE - Hemochromatosis)
• Get more rest (HLA-DR2 - Narcolepsy)
EGFR based therapy in mCRC
Overview

Personalized Medicine,
Biomarkers …
… Molecular Profiling
First Generation Molecular Profiling
Next Generation Molecular Profiling
Next Generation Epigenetic Profiling
Concluding Remarks
Before molecular profiling …
Before molecular profiling …
Before molecular profiling …
First Generation Molecular Profiling

• Flow cytometry correlates surface markers,
cell size and other parameters
• Circulating tumor cell assays (CTC’s)
quantitate the number of tumor cells in the
peripheral blood.
• Exosomes are 30-90 nm vesicles secreted by
a wide range of mammalian cell types.
• Immunohistochemistry (IHC) measures
protein expression, usually on the cell
surface.
First Generation Molecular Profiling

• Gene sequencing for mutation detection
• Microarray for m-RNA message detection
• RT-PCR for gene expression
• FISH analysis for gene copy number
• Comparative Genome Hybridization (CGH) for
gene copy number
Basics of the ―old‖ technology

• Clone the DNA.
• Generate a ladder of labeled (colored)
molecules that are different by 1 nucleotide.
• Separate mixture on some matrix.
• Detect fluorochrome by laser.
• Interpret peaks as string of DNA.
• Strings are 500 to 1,000 letters long
• 1 machine generates 57,000 nucleotides/run
• Assemble all strings into a genome.
Genetic Variation
Among People
Single nucleotide polymorphisms
(SNPs)

GATTTAGATCGCGATAGAG
GATTTAGATCTCGATAGAG

0.1% difference among
people
The genome fits as an e-mail attachment
First Generation Molecular Profiling

• Gene sequencing for mutation detection
• Microarray for m-RNA message detection
• RT-PCR for gene expression
• FISH analysis for gene copy number
• Comparative Genome Hybridization (CGH) for
gene copy number
mRNA Expression Microarray
First Generation Molecular Profiling

• Gene sequencing for mutation detection
• Microarray for m-RNA message detection
• RT-PCR for gene expression
• FISH analysis for gene copy number
• Comparative Genome Hybridization (CGH) for
gene copy number
Overview

Personalized Medicine,
Biomarkers …
… Molecular Profiling
First Generation Molecular Profiling
Next Generation Molecular Profiling
Next Generation Epigenetic Profiling
Concluding Remarks
Basics of the ―new‖ technology

• Get DNA.
• Attach it to something.
• Extend and amplify signal with some color
scheme.
• Detect fluorochrome by microscopy.
• Interpret series of spots as short strings of
DNA.
• Strings are 30-300 letters long
• Multiple images are interpreted as 0.4 to 1.2
GB/run (1,200,000,000 letters/day).
• Map or align strings to one or many genome.
Next Generation Technologies

• Roche (454)
–Emulsion PCR
–Polymerase
–Natural Nucleotides

• 100-500 Mb for 5-15k
–1% error rate
–Homopolymers
One additional insight ...
% of Paired K-mers with Uniquely
Assignable Location

Read Length is Not As Important For Resequencing

100%
90%
80%
70%
60%
E.COLI

50%

HUMAN

40%
30%
20%
10%
0%
8

Jay Shendure

10

12

14 16

18

20

Length of K-mer Reads (bp)
Short Read Techologies

• Illumina GA (HiSeq, MySeq)

• ABI SOLID
Other second generation technology: (ABI) SOLID
So what ?
Second generation DNA/RNA profiling
Second Generation DNA profiling

• Enrichment Sequencing
• ChIP-Seq (Chromosome
Immunoprecipitation)
• A substitute for ChIP-chip
• Eg. to find the binding sequence of
proteins (TFBS)
Paired End Reads are Important!
Known Distance

Repetitive DNA
Read 1Unique DNA 2
Read

Single read maps to
multiple positions
Paired End Reads are Important!
Known Distance

Repetitive DNA
Read 1Unique DNA 2
Read

Single read maps to
multiple positions
Second Generation DNA profiling

• Exome Sequencing (aka known as
targeted exome capture) is an
efficient strategy to selectively
sequence the coding regions of the
genome to identify novel genes
associated with rare and common
disorders.
• 160K exons
Second Generation DNA profiling
Second Generation DNA profiling
Bioinformatics tools
Bioinformatics tools
Second Generation RNA profiling

Besides the 6000 protein coding-genes …
140 ribosomal RNA genes
275 transfer RNA gnes
40 small nuclear RNA genes
>100 small nucleolar genes

Contents-Schedule

Function of RNA genes
pRNA in 29 rotary packaging motor (Simpson
et el. Nature 408:745-750,2000)
Cartilage-hair hypoplasmia mapped to an RNA
(Ridanpoa et al. Cell 104:195-203,2001)
The human Prader-Willi ciritical region (Cavaille
et al. PNAS 97:14035-7, 2000)
Second Generation RNA profiling

RNA genes can be hard to detects
UGAGGUAGUAGGUUGUAUAGU
C.elegans let-27; 21 nt
(Pasquinelli et al. Nature 408:86-89,2000)

Often small
Sometimes multicopy and redundant
Often not polyadenylated
(not represented in ESTs)
Immune to frameshift and nonsense
mutations
No open reading frame, no codon bias
Often evolving rapidly in primary sequence
Second Generation RNA profiling

Although details of the methods vary, the concept
behind RNA-seq is simple:
• isolate all mRNA
• convert to cDNA using reverse transcriptase
• sequence the cDNA
• map sequences to the genome
The more times a given sequence is detected, the
more abundantly transcribed it is. If enough
sequences are generated, a comprehensive and
quantitative view of the entire transcriptome of an
organism or tissue can be obtained.
Second Generation RNA profiling

• Comparing to microarray
– Microarray
• Closed technology: Prior knowledge required
• Affected by pseudo-genes (homologous of real genes)
• Low sensitivity

– RNA-Seq
• Open technology: No prior knowledge required
• Not affected by pseudo-genes because exact
sequence is measured
• Other information could be yielded (SNP, Alternative
splicing)
ncRNAs in human genome

tRNA
18S rRNA
5.8S rRNA
28S rRNA
5S rRNA
snoRNA
miRNA
U1
U2
U4
U5
U6
U4atac
U6atac
U11
U12

600
200
200
200
200
300
250
40
30
30
30
20
5
5
5
5

SRP RNA

1

RNase P RNA

1

Telomerase RNA

1

RNase MRP

1

Y RNA

5

Vault

4

7SK RNA

1

Xist

1

H19

1

BIC

1

Antisense RNAs 1000s?
Cis reg regions
Others

100s?
?
Mapping Structural Variation in Humans
>1 kb segments
- Thought to be Common
12% of the genome
(Redon et al. 2006)
- Likely involved in phenotype
variation and disease
CNVs

- Until recently most methods for
detection were low resolution
(>50 kb)
Size Distribution of CNV in a Human Genome
Next next generation sequencing
Third generation sequencing
Now sequencing
Ultra-low-cost SINGLE molecule sequencing
Pacific Biosciences: A Third Generation Sequencing Technology

Eid et al 2008
Complete genomics
Nanopore Sequencing
Second Generation Protein profiling

• Proteomics MS-MS-based
exclusively in discovery mode
• Automate diagnostics assay
generation (next generation
proteomics)
• Aptamers as alternative to antibodies
• ImmunoPCR
MS/MS identification
pipeline

pipeline overview

Bonanza

Bonanza + IggyPep

Goal
filter dataset
prior to
database
Goal
search
define PTMs
profile
prior to
database
search

Goal
multi-tiered
database
search
Second Generation Protein profiling

• Proteomics MS-MS-based
exclusively in discovery mode
• Automate diagnostics assay
generation (next generation
proteomics)
• Aptamers as alternative to antibodies
• ImmunoPCR
Overview

Personalized Medicine,
Biomarkers …
… Molecular Profiling
First Generation Molecular Profiling
Next Generation Molecular Profiling
Next Generation Epigenetic Profiling
Concluding Remarks
Genome-wide methylation
…. by next generation sequencing
# markers

3 000 000

MethylCap_Seq

6 000
EpiHealth

50
Deep_Seq
5

Discovery

<50
only models
and fresh frozen

Verification

Validation

> 50

# samples

CONFIDENTIAL
Molecular
Unification
E
P
I

Whole-genome
Bisulphite seq

G
E
N
E
T
I
C

Whole-genome
sequencing

Enrichment seq
(MBD, RRBS)

Probes
(450-27K)

Ultra
Deep

Enrichment
Targeted Panels

Enrichment seq
(Exome)

Enrichment
Targeted Panels

Deep
Seq
bp

Full genome

109

108

RUO

107

106

105

104

103

Sequencing

102

101

1

Clinical
CONFIDENTIAL
Overview

Personalized Medicine,
Biomarkers …
… Molecular Profiling
First Generation Molecular Profiling
Next Generation Molecular Profiling
Next Generation Epigenetic Profiling
Concluding Remarks
Bioinformatics, a life science discipline …

Math

Informatics

(Molecular)
Biology
Bioinformatics, a life science discipline …

Math

Computer Science

Theoretical Biology

Informatics
Computational Biology

(Molecular)
Biology
Bioinformatics, a life science discipline …

Math

Theoretical Biology

Computer Science

Bioinformatics

Informatics
Computational Biology

(Molecular)
Biology
Bioinformatics, a life science discipline … management of expectations

Math

Theoretical Biology

Computer Science
NP
Datamining

AI, Image Analysis
structure prediction (HTX)

Bioinformatics
Interface Design

Expert Annotation

Sequence Analysis
Informatics
Computational Biology

(Molecular)
Biology
Bioinformatics, a life science discipline … management of expectations

Math

Theoretical Biology

Computer Science
NP
Datamining

AI, Image Analysis
structure prediction (HTX)

Bioinformatics
Discovery Informatics – Computational Genomics
Interface Design

Expert Annotation

Sequence Analysis
Informatics
Computational Biology

(Molecular)
Biology
Translational Medicine: An inconvenient truth

• 1% of genome codes for proteins, however
more than 90% is transcribed
• Less than 10% of protein experimentally
measured can be ―explained‖ from the
genome
• 1 genome ? Structural variation
• > 200 Epigenomes ??
• Space/time continuum …
Translational Medicine: An inconvenient truth

• 1% of genome codes for proteins, however
more than 90% is transcribed
• Less than 10% of protein experimentally
measured can be ―explained‖ from the
genome
• 1 genome ? Structural variation
• > 200 Epigenomes …
• ―space/time‖ continuum
Cellular programming

Epigenetic (meta)information = stem cells
Cellular reprogramming

Tumor
Tumor
Development
and
Growth

Epigenetically
altered, selfrenewing cancer
stem cells
Cellular reprogramming

Gene-specific
Epigenetic
reprogramming
Wobblebase Mission
provide tools to both specialists (researchers,
bioinformaticians, health care providers) and
individual consumers that unlock the power of
genomic data to the USER
enable personalized genomics today by simplifying
the way we organize, visualize and manage
genomic data.
PGM: Personal Genomics Manifesto

Everybody who wants to get his genome sequenced has the human right to do so.
No third party can own your genetic data, your genetic data is exclusively yours.

Nobody can be forced to get his genome analyzed or to reveal his genome to a
third party.
Your genome should allways be treated as confidential, private information.
People should be advised not to share their identity AND their entire genome on a
public forum.
People should be advised to use secure technologies that allow to maximally
protect phenotypic and/or genotype data.
People should be able to actively explore, manage and get updated interpretation
on their genomic data.
Wobblebase Mission
•

change the
diagnostic/healthcare industry
forever by setting a new
standard and empowering the
user
Choosing the Red Pill
The Technical Feasibility Argument
The Quality Argument
The Price Argument
The Logistics around the sample on howto
manage the data Argument
The Ethical debate
The Privacy/Security concern
Notifications
Updates are the single moste
important feature of Wobblebase
#Rs1805007
Bioinformatics
Analysis
pipelines

Social
network
twitter

Wobblebase

Updates
Notifications

eHealth
(fixed
vocabulary)
Comparison
biobix
wvcrieki

biobix.be
bioinformatics.be
108

More Related Content

What's hot

Statistical significance of alignments
Statistical significance of alignmentsStatistical significance of alignments
Statistical significance of alignments
avrilcoghlan
 

What's hot (20)

Integrative omics approches
Integrative omics approches   Integrative omics approches
Integrative omics approches
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomics
 
What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomics
 
Transcriptomics approaches
Transcriptomics approachesTranscriptomics approaches
Transcriptomics approaches
 
Genome
GenomeGenome
Genome
 
Challenges and opportunities in personal omics profiling
Challenges and opportunities in personal omics profilingChallenges and opportunities in personal omics profiling
Challenges and opportunities in personal omics profiling
 
Genomics
GenomicsGenomics
Genomics
 
RNA-seq Data Analysis Overview
RNA-seq Data Analysis OverviewRNA-seq Data Analysis Overview
RNA-seq Data Analysis Overview
 
Statistical significance of alignments
Statistical significance of alignmentsStatistical significance of alignments
Statistical significance of alignments
 
Transcriptome analysis
Transcriptome analysisTranscriptome analysis
Transcriptome analysis
 
Comparative Genomics and Visualisation BS32010
Comparative Genomics and Visualisation BS32010Comparative Genomics and Visualisation BS32010
Comparative Genomics and Visualisation BS32010
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentation
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...
Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...
Genomics, Transcriptomics, Proteomics, Metabolomics - Basic concepts for clin...
 
Transcriptomics
TranscriptomicsTranscriptomics
Transcriptomics
 
Applications of genomics and proteomics ppt
Applications of genomics and  proteomics pptApplications of genomics and  proteomics ppt
Applications of genomics and proteomics ppt
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Whole Genome Sequencing Analysis
Whole Genome Sequencing AnalysisWhole Genome Sequencing Analysis
Whole Genome Sequencing Analysis
 
Impacts of genomics, proteomics, and metabolomics ppt
Impacts of genomics, proteomics, and metabolomics pptImpacts of genomics, proteomics, and metabolomics ppt
Impacts of genomics, proteomics, and metabolomics ppt
 

Similar to Genomics experimental-methods

Molecular and genomic surgery
Molecular and genomic surgeryMolecular and genomic surgery
Molecular and genomic surgery
thaannush
 
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Intel IT Center
 
Microarrays;application
Microarrays;applicationMicroarrays;application
Microarrays;application
Fyzah Bashir
 

Similar to Genomics experimental-methods (20)

Molecular profiling 2013
Molecular profiling 2013Molecular profiling 2013
Molecular profiling 2013
 
2017 molecular profiling_wim_vancriekinge
2017 molecular profiling_wim_vancriekinge2017 molecular profiling_wim_vancriekinge
2017 molecular profiling_wim_vancriekinge
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdf
 
PadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptxPadminiNarayanan-Intro-2018.pptx
PadminiNarayanan-Intro-2018.pptx
 
Total RNA Discovery for RNA Biomarker Development Webinar
Total RNA Discovery for RNA Biomarker Development WebinarTotal RNA Discovery for RNA Biomarker Development Webinar
Total RNA Discovery for RNA Biomarker Development Webinar
 
2011 10 26_quantitative_cell_biology_molecular_profiling_v_twitter
2011 10 26_quantitative_cell_biology_molecular_profiling_v_twitter2011 10 26_quantitative_cell_biology_molecular_profiling_v_twitter
2011 10 26_quantitative_cell_biology_molecular_profiling_v_twitter
 
Molecular profiling 2012
Molecular profiling 2012Molecular profiling 2012
Molecular profiling 2012
 
2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented2013 10 23_dna_for_dummies_v_presented
2013 10 23_dna_for_dummies_v_presented
 
6. Molek tech.pptx [Repaired].pptx
6. Molek tech.pptx [Repaired].pptx6. Molek tech.pptx [Repaired].pptx
6. Molek tech.pptx [Repaired].pptx
 
Brian_Strahl 2013_class_on_genomics_and_proteomics
Brian_Strahl 2013_class_on_genomics_and_proteomicsBrian_Strahl 2013_class_on_genomics_and_proteomics
Brian_Strahl 2013_class_on_genomics_and_proteomics
 
Dr. John Svaren - 'Neuropatías periféricas hereditarias'
Dr. John Svaren - 'Neuropatías periféricas hereditarias'Dr. John Svaren - 'Neuropatías periféricas hereditarias'
Dr. John Svaren - 'Neuropatías periféricas hereditarias'
 
Microarry andd NGS.pdf
Microarry andd NGS.pdfMicroarry andd NGS.pdf
Microarry andd NGS.pdf
 
Molecular and genomic surgery
Molecular and genomic surgeryMolecular and genomic surgery
Molecular and genomic surgery
 
Axt microarrays
Axt microarraysAxt microarrays
Axt microarrays
 
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
Developing tools & Methodologies for the NExt Generation of Genomics & Bio In...
 
2015 bioinformatics personal_genomics_wim_vancriekinge
2015 bioinformatics personal_genomics_wim_vancriekinge2015 bioinformatics personal_genomics_wim_vancriekinge
2015 bioinformatics personal_genomics_wim_vancriekinge
 
Clinical molecular diagnostics for drug guidance
Clinical molecular diagnostics for drug guidanceClinical molecular diagnostics for drug guidance
Clinical molecular diagnostics for drug guidance
 
Clinical Applications of Next Generation Sequencing
Clinical Applications of Next Generation SequencingClinical Applications of Next Generation Sequencing
Clinical Applications of Next Generation Sequencing
 
Diagnosis of cancer
Diagnosis of cancerDiagnosis of cancer
Diagnosis of cancer
 
Microarrays;application
Microarrays;applicationMicroarrays;application
Microarrays;application
 

More from Prof. Wim Van Criekinge

More from Prof. Wim Van Criekinge (20)

2020 02 11_biological_databases_part1
2020 02 11_biological_databases_part12020 02 11_biological_databases_part1
2020 02 11_biological_databases_part1
 
2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload
 
2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part4_v_upload2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part4_v_upload
 
2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload
 
2019 02 21_biological_databases_part2_v_upload
2019 02 21_biological_databases_part2_v_upload2019 02 21_biological_databases_part2_v_upload
2019 02 21_biological_databases_part2_v_upload
 
2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload
 
P7 2018 biopython3
P7 2018 biopython3P7 2018 biopython3
P7 2018 biopython3
 
P6 2018 biopython2b
P6 2018 biopython2bP6 2018 biopython2b
P6 2018 biopython2b
 
P4 2018 io_functions
P4 2018 io_functionsP4 2018 io_functions
P4 2018 io_functions
 
P3 2018 python_regexes
P3 2018 python_regexesP3 2018 python_regexes
P3 2018 python_regexes
 
T1 2018 bioinformatics
T1 2018 bioinformaticsT1 2018 bioinformatics
T1 2018 bioinformatics
 
P1 2018 python
P1 2018 pythonP1 2018 python
P1 2018 python
 
Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]Bio ontologies and semantic technologies[2]
Bio ontologies and semantic technologies[2]
 
2018 05 08_biological_databases_no_sql
2018 05 08_biological_databases_no_sql2018 05 08_biological_databases_no_sql
2018 05 08_biological_databases_no_sql
 
2018 03 27_biological_databases_part4_v_upload
2018 03 27_biological_databases_part4_v_upload2018 03 27_biological_databases_part4_v_upload
2018 03 27_biological_databases_part4_v_upload
 
2018 03 20_biological_databases_part3
2018 03 20_biological_databases_part32018 03 20_biological_databases_part3
2018 03 20_biological_databases_part3
 
2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part2_v_upload2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part2_v_upload
 
2018 02 20_biological_databases_part1_v_upload
2018 02 20_biological_databases_part1_v_upload2018 02 20_biological_databases_part1_v_upload
2018 02 20_biological_databases_part1_v_upload
 
P7 2017 biopython3
P7 2017 biopython3P7 2017 biopython3
P7 2017 biopython3
 
P6 2017 biopython2
P6 2017 biopython2P6 2017 biopython2
P6 2017 biopython2
 

Recently uploaded

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 

Recently uploaded (20)

SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 

Genomics experimental-methods

  • 2. Lab for Bioinformatics and computational genomics 10 “genome hackers” mostly engineers (statistics) 42 scientists technicians, geneticists, clinicians >100 people hardware engineers, mathematicians, molecular biologists
  • 3. Overview Personalized Medicine, Biomarkers … … Molecular Profiling First Generation Molecular Profiling Next Generation Molecular Profiling Next Generation Epigenetic Profiling Concluding Remarks
  • 4. Overview Personalized Medicine, Biomarkers … … Molecular Profiling First Generation Molecular Profiling Next Generation Molecular Profiling Next Generation Epigenetic Profiling Concluding Remarks
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. Personalized Medicine • The use of diagnostic tests (aka biomarkers) to identify in advance which patients are likely to respond well to a therapy • The benefits of this approach are to – avoid adverse drug reactions – improve efficacy – adjust the dose to suit the patient – differentiate a product in a competitive market – meet future legal or regulatory requirements • Potential uses of biomarkers – Risk assessment – Initial/early detection – Prognosis – Prediction/therapy selection – Response assessment – Monitoring for recurrence
  • 11. Biomarker First used in 1971 … An objective and « predictive » measure … at the molecular level … of normal and pathogenic processes and responses to therapeutic interventions Characteristic that is objectively measured and evaluated as an indicator of normal biologic or pathogenic processes or pharmacologic response to a drug A biomarker is valid if: – It can be measured in a test system with well established performance characteristics – Evidence for its clinical significance has been established
  • 12. Rationale 1: Why now ? Regulatory path becoming more clear There is more at stake than efficient drug development. FDA « critical path initiative » Pharmacogenomics guideline Biomarkers are the foundation of « evidence based medicine » - who should be treated, how and with what. Without Biomarkers advances in targeted therapy will be limited and treatment remain largely emperical. It is imperative that Biomarker development be accelarated along with therapeutics
  • 13. Why now ? First and maturing second generation molecular profiling methodologies allow to stratify clinical trial participants to include those most likely to benefit from the drug candidate—and exclude those who likely will not—pharmacogenomicsbased Clinical trials should attain more specific results with smaller numbers of patients. Smaller numbers mean fewer costs (factor 2-10) An additional benefit for trial participants and internal review boards (IRBs) is that stratification, given the correct biomarker, may reduce or eliminate adverse events.
  • 14. Molecular Profiling The study of specific patterns (fingerprints) of proteins, DNA, and/or mRNA and how these patterns correlate with an individual's physical characteristics or symptoms of disease.
  • 15. Generic Health advice • Exercise (Hypertrophic Cardiomyopathy) • Drink your milk (MCM6 Lactose intolarance) • Eat your green beans (glucose-6-phosphate dehydrogenase Deficiency) • & your grains (HLA-DQ2 – Celiac disease) • & your iron (HFE - Hemochromatosis) • Get more rest (HLA-DR2 - Narcolepsy)
  • 16. Generic Health advice (UNLESS) • Exercise (Hypertrophic Cardiomyopathy) • Drink your milk (MCM6 Lactose intolarance) • Eat your green beans (glucose-6-phosphate dehydrogenase Deficiency) • & your grains (HLA-DQ2 – Celiac disease) • & your iron (HFE - Hemochromatosis) • Get more rest (HLA-DR2 - Narcolepsy)
  • 17. Generic Health advice (UNLESS) • Exercise (Hypertrophic Cardiomyopathy) • Drink your milk (MCM6 Lactose intolerance) • Eat your green beans (glucose-6-phosphate dehydrogenase Deficiency) • & your grains (HLA-DQ2 – Celiac disease) • & your iron (HFE - Hemochromatosis) • Get more rest (HLA-DR2 - Narcolepsy)
  • 18. Generic Health advice (UNLESS) • Exercise (Hypertrophic Cardiomyopathy) • Drink your milk (MCM6 Lactose intolerance) • Eat your green beans (glucose-6-phosphate dehydrogenase Deficiency) • & your grains (HLA-DQ2 – Celiac disease) • & your iron (HFE - Hemochromatosis) • Get more rest (HLA-DR2 - Narcolepsy)
  • 20. Overview Personalized Medicine, Biomarkers … … Molecular Profiling First Generation Molecular Profiling Next Generation Molecular Profiling Next Generation Epigenetic Profiling Concluding Remarks
  • 22.
  • 23.
  • 24.
  • 27.
  • 28. First Generation Molecular Profiling • Flow cytometry correlates surface markers, cell size and other parameters • Circulating tumor cell assays (CTC’s) quantitate the number of tumor cells in the peripheral blood. • Exosomes are 30-90 nm vesicles secreted by a wide range of mammalian cell types. • Immunohistochemistry (IHC) measures protein expression, usually on the cell surface.
  • 29.
  • 30.
  • 31.
  • 32. First Generation Molecular Profiling • Gene sequencing for mutation detection • Microarray for m-RNA message detection • RT-PCR for gene expression • FISH analysis for gene copy number • Comparative Genome Hybridization (CGH) for gene copy number
  • 33. Basics of the ―old‖ technology • Clone the DNA. • Generate a ladder of labeled (colored) molecules that are different by 1 nucleotide. • Separate mixture on some matrix. • Detect fluorochrome by laser. • Interpret peaks as string of DNA. • Strings are 500 to 1,000 letters long • 1 machine generates 57,000 nucleotides/run • Assemble all strings into a genome.
  • 34.
  • 35. Genetic Variation Among People Single nucleotide polymorphisms (SNPs) GATTTAGATCGCGATAGAG GATTTAGATCTCGATAGAG 0.1% difference among people
  • 36. The genome fits as an e-mail attachment
  • 37. First Generation Molecular Profiling • Gene sequencing for mutation detection • Microarray for m-RNA message detection • RT-PCR for gene expression • FISH analysis for gene copy number • Comparative Genome Hybridization (CGH) for gene copy number
  • 39. First Generation Molecular Profiling • Gene sequencing for mutation detection • Microarray for m-RNA message detection • RT-PCR for gene expression • FISH analysis for gene copy number • Comparative Genome Hybridization (CGH) for gene copy number
  • 40.
  • 41. Overview Personalized Medicine, Biomarkers … … Molecular Profiling First Generation Molecular Profiling Next Generation Molecular Profiling Next Generation Epigenetic Profiling Concluding Remarks
  • 42. Basics of the ―new‖ technology • Get DNA. • Attach it to something. • Extend and amplify signal with some color scheme. • Detect fluorochrome by microscopy. • Interpret series of spots as short strings of DNA. • Strings are 30-300 letters long • Multiple images are interpreted as 0.4 to 1.2 GB/run (1,200,000,000 letters/day). • Map or align strings to one or many genome.
  • 43. Next Generation Technologies • Roche (454) –Emulsion PCR –Polymerase –Natural Nucleotides • 100-500 Mb for 5-15k –1% error rate –Homopolymers
  • 44.
  • 45.
  • 46.
  • 47.
  • 49. % of Paired K-mers with Uniquely Assignable Location Read Length is Not As Important For Resequencing 100% 90% 80% 70% 60% E.COLI 50% HUMAN 40% 30% 20% 10% 0% 8 Jay Shendure 10 12 14 16 18 20 Length of K-mer Reads (bp)
  • 50. Short Read Techologies • Illumina GA (HiSeq, MySeq) • ABI SOLID
  • 51.
  • 52.
  • 53.
  • 54. Other second generation technology: (ABI) SOLID
  • 55.
  • 57.
  • 59. Second Generation DNA profiling • Enrichment Sequencing • ChIP-Seq (Chromosome Immunoprecipitation) • A substitute for ChIP-chip • Eg. to find the binding sequence of proteins (TFBS)
  • 60. Paired End Reads are Important! Known Distance Repetitive DNA Read 1Unique DNA 2 Read Single read maps to multiple positions
  • 61. Paired End Reads are Important! Known Distance Repetitive DNA Read 1Unique DNA 2 Read Single read maps to multiple positions
  • 62. Second Generation DNA profiling • Exome Sequencing (aka known as targeted exome capture) is an efficient strategy to selectively sequence the coding regions of the genome to identify novel genes associated with rare and common disorders. • 160K exons
  • 67. Second Generation RNA profiling Besides the 6000 protein coding-genes … 140 ribosomal RNA genes 275 transfer RNA gnes 40 small nuclear RNA genes >100 small nucleolar genes Contents-Schedule Function of RNA genes pRNA in 29 rotary packaging motor (Simpson et el. Nature 408:745-750,2000) Cartilage-hair hypoplasmia mapped to an RNA (Ridanpoa et al. Cell 104:195-203,2001) The human Prader-Willi ciritical region (Cavaille et al. PNAS 97:14035-7, 2000)
  • 68. Second Generation RNA profiling RNA genes can be hard to detects UGAGGUAGUAGGUUGUAUAGU C.elegans let-27; 21 nt (Pasquinelli et al. Nature 408:86-89,2000) Often small Sometimes multicopy and redundant Often not polyadenylated (not represented in ESTs) Immune to frameshift and nonsense mutations No open reading frame, no codon bias Often evolving rapidly in primary sequence
  • 69. Second Generation RNA profiling Although details of the methods vary, the concept behind RNA-seq is simple: • isolate all mRNA • convert to cDNA using reverse transcriptase • sequence the cDNA • map sequences to the genome The more times a given sequence is detected, the more abundantly transcribed it is. If enough sequences are generated, a comprehensive and quantitative view of the entire transcriptome of an organism or tissue can be obtained.
  • 70. Second Generation RNA profiling • Comparing to microarray – Microarray • Closed technology: Prior knowledge required • Affected by pseudo-genes (homologous of real genes) • Low sensitivity – RNA-Seq • Open technology: No prior knowledge required • Not affected by pseudo-genes because exact sequence is measured • Other information could be yielded (SNP, Alternative splicing)
  • 71. ncRNAs in human genome tRNA 18S rRNA 5.8S rRNA 28S rRNA 5S rRNA snoRNA miRNA U1 U2 U4 U5 U6 U4atac U6atac U11 U12 600 200 200 200 200 300 250 40 30 30 30 20 5 5 5 5 SRP RNA 1 RNase P RNA 1 Telomerase RNA 1 RNase MRP 1 Y RNA 5 Vault 4 7SK RNA 1 Xist 1 H19 1 BIC 1 Antisense RNAs 1000s? Cis reg regions Others 100s? ?
  • 72.
  • 73. Mapping Structural Variation in Humans >1 kb segments - Thought to be Common 12% of the genome (Redon et al. 2006) - Likely involved in phenotype variation and disease CNVs - Until recently most methods for detection were low resolution (>50 kb)
  • 74. Size Distribution of CNV in a Human Genome
  • 75.
  • 76. Next next generation sequencing Third generation sequencing Now sequencing
  • 78. Pacific Biosciences: A Third Generation Sequencing Technology Eid et al 2008
  • 81. Second Generation Protein profiling • Proteomics MS-MS-based exclusively in discovery mode • Automate diagnostics assay generation (next generation proteomics) • Aptamers as alternative to antibodies • ImmunoPCR
  • 82. MS/MS identification pipeline pipeline overview Bonanza Bonanza + IggyPep Goal filter dataset prior to database Goal search define PTMs profile prior to database search Goal multi-tiered database search
  • 83. Second Generation Protein profiling • Proteomics MS-MS-based exclusively in discovery mode • Automate diagnostics assay generation (next generation proteomics) • Aptamers as alternative to antibodies • ImmunoPCR
  • 84. Overview Personalized Medicine, Biomarkers … … Molecular Profiling First Generation Molecular Profiling Next Generation Molecular Profiling Next Generation Epigenetic Profiling Concluding Remarks
  • 85. Genome-wide methylation …. by next generation sequencing # markers 3 000 000 MethylCap_Seq 6 000 EpiHealth 50 Deep_Seq 5 Discovery <50 only models and fresh frozen Verification Validation > 50 # samples CONFIDENTIAL
  • 86. Molecular Unification E P I Whole-genome Bisulphite seq G E N E T I C Whole-genome sequencing Enrichment seq (MBD, RRBS) Probes (450-27K) Ultra Deep Enrichment Targeted Panels Enrichment seq (Exome) Enrichment Targeted Panels Deep Seq bp Full genome 109 108 RUO 107 106 105 104 103 Sequencing 102 101 1 Clinical CONFIDENTIAL
  • 87.
  • 88. Overview Personalized Medicine, Biomarkers … … Molecular Profiling First Generation Molecular Profiling Next Generation Molecular Profiling Next Generation Epigenetic Profiling Concluding Remarks
  • 89. Bioinformatics, a life science discipline … Math Informatics (Molecular) Biology
  • 90. Bioinformatics, a life science discipline … Math Computer Science Theoretical Biology Informatics Computational Biology (Molecular) Biology
  • 91. Bioinformatics, a life science discipline … Math Theoretical Biology Computer Science Bioinformatics Informatics Computational Biology (Molecular) Biology
  • 92. Bioinformatics, a life science discipline … management of expectations Math Theoretical Biology Computer Science NP Datamining AI, Image Analysis structure prediction (HTX) Bioinformatics Interface Design Expert Annotation Sequence Analysis Informatics Computational Biology (Molecular) Biology
  • 93. Bioinformatics, a life science discipline … management of expectations Math Theoretical Biology Computer Science NP Datamining AI, Image Analysis structure prediction (HTX) Bioinformatics Discovery Informatics – Computational Genomics Interface Design Expert Annotation Sequence Analysis Informatics Computational Biology (Molecular) Biology
  • 94. Translational Medicine: An inconvenient truth • 1% of genome codes for proteins, however more than 90% is transcribed • Less than 10% of protein experimentally measured can be ―explained‖ from the genome • 1 genome ? Structural variation • > 200 Epigenomes ?? • Space/time continuum …
  • 95. Translational Medicine: An inconvenient truth • 1% of genome codes for proteins, however more than 90% is transcribed • Less than 10% of protein experimentally measured can be ―explained‖ from the genome • 1 genome ? Structural variation • > 200 Epigenomes … • ―space/time‖ continuum
  • 96.
  • 97.
  • 101. Wobblebase Mission provide tools to both specialists (researchers, bioinformaticians, health care providers) and individual consumers that unlock the power of genomic data to the USER enable personalized genomics today by simplifying the way we organize, visualize and manage genomic data.
  • 102. PGM: Personal Genomics Manifesto Everybody who wants to get his genome sequenced has the human right to do so. No third party can own your genetic data, your genetic data is exclusively yours. Nobody can be forced to get his genome analyzed or to reveal his genome to a third party. Your genome should allways be treated as confidential, private information. People should be advised not to share their identity AND their entire genome on a public forum. People should be advised to use secure technologies that allow to maximally protect phenotypic and/or genotype data. People should be able to actively explore, manage and get updated interpretation on their genomic data.
  • 103. Wobblebase Mission • change the diagnostic/healthcare industry forever by setting a new standard and empowering the user
  • 104. Choosing the Red Pill The Technical Feasibility Argument The Quality Argument The Price Argument The Logistics around the sample on howto manage the data Argument The Ethical debate The Privacy/Security concern
  • 105. Notifications Updates are the single moste important feature of Wobblebase