SlideShare une entreprise Scribd logo
1  sur  11
Télécharger pour lire hors ligne
Introduction to RNA-seq
Paul Gardner
July 6, 2015
Paul Gardner RNA-seq intro
Where on the bio/math spectrum do you lie?
Paul Gardner RNA-seq intro
What is RNA?
��
���
2R
1R
�
�
��
�
���
�
��
�
�
�
��
�
�
�
�
�
��
��
�
1R
1R
1R
���
�
�
�
�
�
��
���
���
1
2R
R : −OH −H
: −H −CH3
�������
�����������
�� ����
����
��� �
�����
��� ��
��� ��
��� ��
��� ����
���
���
���
���
���
���
���
�����
�����
�����
�����
�������
��������
�������
�������
������������������������
RNA DNA
������������������
IUPAC ambiguity chars:
Paul Gardner RNA-seq intro
Crick’s “central dogma of molecular biology”
Paul Gardner RNA-seq intro
Types of RNA
Protein coding RNA:
Messenger RNA
Noncoding RNA:
Ribosomal RNA & Transfer RNA
Spliceosomal RNAs (U1, U2, U4, U5, & U6)
SRP RNA (protein export)
RNase P RNA
snoRNAs & microRNAs (David Humphreys)
Cis-regulatory RNA (riboswitches,
thermosensors, leaders)
Self-splicing introns
“Long” non-coding RNAs (lncRNA)
Clustered regularly interspaced short
palindromic repeats (CRISPR)
RNAs of Unknown Function (RUFs)
Paul Gardner RNA-seq intro
What is RNA-seq?
Martin & Wang (2011) Next-generation transcriptome
assembly. Nature Reviews Genetics.
Paul Gardner RNA-seq intro
Run the best statistical test in the universe:
Eye-ball results: positive & negative controls
Remember: only RNAs expressed under exp. conditions will
be observed
Paul Gardner RNA-seq intro
Applications and extensions of RNA-seq
Applications
Genome annotation (mRNAs, ncRNAs, spliceforms, UTRs)
Quantification (Listen to Alicia Oshlack)
Extensions
Infer RNA structure (SHAPE) (Lucks et al. (2011))
RNA:RNA (CLASH) (Travis et al. (2014))
RNA:protein (RIP-seq) (Cook et al. (2015))
Paul Gardner RNA-seq intro
RNA-seq identifies 1,000s of new RNAs
SraB yceD rpmF E.RUF plsX
E.coli K12
E.coli E24377A
C.rodentium
S.enterica
K.pneumoniae
rmf RNA motif rmf P.RUF pyrD
S.maltophilia
X.axonopodis
secY X.RUF rpsM
D. Enterobacteriaceae RUF E. Pseudomonas RUF
F. Xanthomonadaceae RUF
G A U U A C C A
G
C
A
C
G
C
C
C
Y
A U C
C
G
G
G
C
G
G
C
G G G C
RGCCC
A
G
G
G
G
C
U
C
C
Y
YR
R
G
G
A
G
C
C
C
Y
U
UUUU
5'
terminator
G
U
C
U
C
GY
G
C
G
C
GG
G
U
G
G
A
Y
G
G
Y
G
G
UC
C
U
G
C
G
C
Y
G
GA
G
U
A
G
C
G
C
G
G
G
C
GRY
C
G
R
R
R
Y
Y Y C R
G G
Y C
A R
C
Y
R
U
C
C
G
G
C
G
C
C
G
G
A G
C
R
U
GGG
CA
C
A
C U C C C
C
A
Y
GC C G
G
G
U
Y
C
R
Y
G
G
A
A
C
C
R A
G
U
U
C
C
R
Y
G
G
G
C
U
U
C
C
A
G
Y
A
A
Y
CC
G
R
G
A
C
C U U G Y U A
A
U
U
C
A
G
U
U
C
A
C
U
U
5´
U A
A
U
C
A
C
G
C
R
Y
G
C
G
U
G
A
U
G A A
G
C
U
U
A
G
U
G
A
G
G
A
Y
U
U
C
C
C
C
G
G
C A
A
Y
G
G
G
G
A
A
Y
A
C
C
G
A
A
C
C
R
G
G
C
R G C G A C G A U A C C U U G5´
GNRA tetraloop
0-48 BPs
A. Enterobacteriaceae RUF
B. Pseudomonas RUF
C. Xanthomonadaceae RUF
Gaps
0-9
10-99
100-999
1,000-4,999
>5,000
Number of RNA-seq reads
80%
90% 70%
40%
nucleotide
present
nucleotide
identity
N
N 90%
N 80%
covarying mutations
base pair annotations
compatible mutations
no mutations observed
R = A or G. Y = C or U.
Legend
70%
P.putida
P.aeruginosa-PAO1
P.aeruginosa-PA14
T
T
A
G
C
G
C
C
G
G
A
A
A
C
C
A
G
G
C
G
T
C
A
T
G
A
G
C
C
T
G
C
A
A
C
A
T
A
T
G
G
C
C
C
T
A
T
C
G
A
C
G
A
A
A
G
C
G
T
T
A
A
G
T
C
T
T
T
A
T
G
A
C
A
A
A
T
C
G
G
T
C
A
T
T
C
A
C
A
C
G
C
C
T
G
A
A
C
G
C
T
T
T
G
G
T
T
A
G
A
A
C
T
C
C
A
G
T
T
A
A
T
C
C
G
C
C
C
A
C
C
G
C
A
A
C
G
G
T
G
T
C
G
G
G
C
G
A
−
−
G
G
G
T
C
G
T
C
A
C
G
C
C
G
G
C
A
A
C
G
A
C
C
C
C
T
T
−
T
C
G
G
C
G
A
A
A
−
−
G
C
T
T
C
G
C
C
A
G
G
C
C
T
C
C
C
C
T
G
G
G
G
G
C
C
A
A
C
G
G
G
A
C
A
T
A
A
C
A
G
T
C
A
A
C
A
A
G
T
G
A
G
G
G
C
A
A
C
A
C
C
C
T
A
T
G
A
G
A
A
G
A
C
T
T
A
A
G
C
G
T
G
A
T
C
C
G
T
T
G
G
A
A
A
G
A
G
C
C
T
T
C
T
T
G
C
G
T
G
G
T
T
A
T
C
A
G
A
A
C
G
G
C
A
T
A
A
C
C
G
G
T
A
A
A
T
C
T
C
G
T
G
A
T
C
T
T
T
G
T
C
C
G
T
T
C
A
C
C
C
A
T
C
C
T
A
C
G
A
C
G
C
G
G
C
A
G
T
C
C
T
G
G
C
T
C
A
A
C
G
G
C
T
G
G
C
G
C
G
A
G
G
G
C
C
G
T
G
G
C
G
A
C
A
A
C
T
G
G
G
A
C
G
G
C
C
T
C
A
C
T
G
G
C
A
C
G
G
C
C
G
G
C
T
T
A
C
A
A
C
G
T
C
T
C
A
A
T
C
A
A
C
T
C
C
A
G
C
A
C
G
T
G
T
A
A
G
C
G
A
C
A
A
C
A
C
G
G
A
T
A
G
C
A
C
C
G
A
T
T
T
C
C
C
C
A
A
G
G
C
A
C
G
C
C
C
C
A
T
C
C
G
G
G
C
G
G
C
G
G
G
C
G
C
A
A
G
C
C
C
A
A
G
G
G
C
T
C
C
G
C
−
A
A
G
G
A
G
C
C
C
T
T
T
T
C
A
A
T
T
C
C
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
G
C
C
G
C
G
G
C
A
A
T
G
C
G
G
C
G
A
T
G
G
C
G
T
C
C
A
C
C
G
C
T
T
C
G
C
G
G
A
T
C
A
A
C
G
C
C
G
G
T
C
C
C
T
T
G
T
A
G
A
T
G
A
A
A
C
C
C
G
A
A
T
A
G
A
T
C
T
G
C
A
C
C
A
G
G
C
T
C
G
C
C
C
C
G
G
C
G
G
C
G
A
T
C
T
T
C
T
TAGGCATATTTTTTTCCATCAGATATAGCGTATTGATGATAGCCATTTTAAACTATGCGC−−−TTCGTTTTGCAGGTTGATGTTTGTTATCAGCACTGAACGAAAATAAAGCAGTAACCCGCAATGTGTGCGAATTATTGGCAAAAGGCAACCACAGGCTGCCTTTTTCTTTGACTCTATGACGTTACAAAGTTAATATGCGCGCCCTATGCAAAAGGTAAAATTACCCCTGACTCTCGATCCGGTTCGTACGGCTCAAAAACGCCTTGATTACCAGGGTATCTATACCCCTGATCAGGTTGAGCGCGTCGCCGAATCCGTAGTCAGTGTGGACAGTGATGTGGAATGCTCCATGTCGTTCGCTATCGATAACCAACGTCTCGCAGTGTTAAACGGCGATGCGAAGGTGACGGTAACGCTCGAGTGTCAGCGTTGCGGGAAGCCGTTTACTCATCAGGTCTACACAACGTATTGTTTTAGTCCTGTGCGTTCAGACGAACAGGCTGAAGCACTGCCGGAAGCGTATGAACCGATTGAGGTTAACGAATTCGGTGAAATCGATCTGCTTGCAATGGTTGAAGATGAAATCATCCTCGCCTTGCCGGTAGTTCCGGTGCATGATTCTGAACACTGTGAAGTGTCCGAAGCGGACATGGTCTTTGGTGAACTGCCTGAAGAAGCGCAAAAGCCAAACCCATTTGCCGTATTAGCCAGCTTAAAGCGTAAGTAATTGGTGCTCCCCGTTGGATCGGGGATAAACCGTAATTGAGGAGTAAGGTCCATGGCCGTACAACAGAATAAACCAACCCGTTCCAAACGTGGCATGCGTCGTTCCCATGACGCGCTGACCGCAGTCACCAGCCTGTCTGTAGACAAAACTTCTGGTGAAAAACACCTGCGTCACCACATCACTGCCGACGGTTACTACCGCGGCCGCAAGGTCATCGCTAAGTAATCACGCA−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−TCTGC−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−GTGATGAAGCTTAGTGAGGATTTTCCCCAGGCAACTGGGGAAAGACCAAACCGGGCGGCGACGATACCTTGACACGTCTAACCCTGGCGTTAGATGTCATGGGAGGGGATTTTGGCCCTTCCGTGACAGTGCCTGCAGCATTGCAGGCACTGAATTCTAATTCGCAACTCACTCTTCTTTTAGTCGGCAATTCCGACGCCATCACGCCATTACTTGCTAAAGCTGACTTTGAACAACGTTCGCGTCTGCAGATTATTCCTGCGCAGTCAGT
ATGGCGCAGGCTGGCATTGGTAACCTCGGCGGCGGGCTCGGCAAGTTCACGGAACTTCGCCAGCGGTTGCTGTTCGTCCTCGGGGCATTGATCGTTTATCGCATCGGCTGCTATGTGCCGGTGCCTGGCGTGAATCCCGATGCCATGCTTTCGTTGATGCAGGCGCAGGGCGGCGGCATCGTGGACATGTTCAACATGTTCTCGGGCGGCGCCCTGCACCGTTTCAGTATTTTTGCATTGAACGTGATGCCGTATATCTCGGCATCGATCGTGATCCAGTTGGCCACGCACATCTTTCCCGCCCTCAAGGCGATGCAGAAAGAAGGCGAATCGGGCCGACGCAAGATCACCCAATATTCGCGCATCGGTGCGGTGTTGCTGGCGGTGGTGCAGGGCGGCAGTATCGCGCTGGCACTGCAGAACCAGACCGCCCCTGGTGGCGCTCCGGTGGTGTATGCGCCGGGCATGGGCTTCGTGCTCACCGCGGTGATCGCTTTGACCGCTGGTACCATCTTCCTGATGTGGGTAGGCGAGCAGGTTACCGAGCGCGGCATCGGTAACGGCGTATCGCTGATCATCTTTGCCGGCATCGTGGCTGGCCTGCCGTCGGCGGCCATCCAGACGGTCGAAGCCTTCCGCGAAGGCAATCTGAGCTTCATTTCGCTGTTGTTGATCGTCATCACCATCCTGGCGTTCACGCTGTTCGTCGTGTTTGTCGAGCGTGGGCAGCGGCGGATCACGGTCAACTACGCGCGCCGCCAGGGCGGTCGCAATGCGTACATGAACCAGACCTCGTTCTTGCCGCTCAAGCTGAACATGGCCGGTGTGATTCCGCCGATCTTTGCGTCCAGCATCCTGGCATTCCCGGCAACGTTGTCGATGTGGTCGGGTCAGGCTGC−−ATCGG−GTGGTATCGGCTCGTGGCTGCAGAAGATTGCCAACGCGCTTGGCCCCGGTGAGCCGGTACACATGCTGGTCTTCGCTGCGCTGATCATCGGTTTTGCATTCTTCTACACCGCGCTGGTGTTCAACTCGCAGGAAACCGCCGACAACCTCAAGAAATCGGGCGCGCTGATTCCGGGCATCCGTCCAGGCAAGGCCACCGCAGATTACGTCGATGGCGTACTGACGCGCCTGACAGCTGCCGGTTCGTTGTACCTGGTAATCGTCTGCCTGCTGCCGGAAATCATGCGCACGCAGCTCGGCACTTCGTTCCACTTCGGGGGCACCTCGCTATTGATTGCAGTGGTGGTGGTGATGGACTTCATTGCGCAGATCCAGGCGCACCTGATGTCGCACCAGTATGAGAGCTTGCTGAAGAAGGCCAACCTCAAGGGCGGCTCACGCGGCGGTCTTGCGCGCGGTTAAGTGGTACACTAGATCTTCATC−−−−−−ACGTGAAGACGGC−CTGGTTCCCGGGCCACGATCTTCCGATCAGAAGGGCGGCTCGCGCGACG−TCTCGCGCGCGGGTGTGACGGGGTGGTTCTGTGCGGGAGTAGCACAGGCGATTC−GGAGTGGTTTTCTGGATCAGCACCGTCCGGCGCCGGAGCGAGGGCACACTCCCCACGCCGGGTCCATGGAACCTCTGGTTCCACGGGCTTCAAAGCAATCCGAGGCCTTGCTATAATTCCGAGTTCACTTT−−TGATCCATCCTGCCGGATGG−−−CGCCTGGG−−−CGCTGTCGGGCCATCACTCAGTTGGAGAATCGCGTCATGGCGCGTATTGCAGGCGTCAACCTGCCAGCCCAGAAGCACGTCTGGGTCGGGTTGCAAAGCATCTACGGCATCGGCCGTACCCGTTCAAAGAAGCTCTGCGAATCCGCAGGCGTTACCTCGACCACGAAGATTCGTGATCTGTCCGAACCCGAAATCGAGCGCCTGCGCGCCGAAGTCGGCAAGTATGTCGTCGAAGGCGACCTGCGCCGCGAAATCGGTATCGCGATCAAGCGACTGATGGACCTCGGCTGCTATCGCGGTCTGCGTCATCGCCGTGGTCTTCCGCTGCGTGGTCAGCGCACCCGTACCAACGCCCGCACCCGCAAGGGTCCGCGCAAGGCGATCAGGAAGTAA
Lindgreen et al. (2014) Robust identification of noncoding RNA from transcriptomes requires
phylogenetically-informed sampling. PLOS Computational Biology.
Paul Gardner RNA-seq intro
Some open questions
How much transcription is ”functional”?
What’s a good negative control for transcriptome
experiments?
What causes variation in [protein]:[mRNA] ratios?
Lu, Vogel et al. (2007) Absolute protein expression profiling estimates the relative contributions of transcriptional
and translational regulation. Nature Biotechnology
Paul Gardner RNA-seq intro
Thanks & a plug
QMB 2015
Computational genomics
Proteins, animal genetics, genomes & disease, microbes &
disease
Paul Gardner RNA-seq intro

Contenu connexe

Tendances

Assembly and finishing
Assembly and finishingAssembly and finishing
Assembly and finishing
Nikolay Vyahhi
 

Tendances (20)

Introduction to real-Time Quantitative PCR (qPCR) - Download the slides
Introduction to real-Time Quantitative PCR (qPCR) - Download the slidesIntroduction to real-Time Quantitative PCR (qPCR) - Download the slides
Introduction to real-Time Quantitative PCR (qPCR) - Download the slides
 
RNA-Seq
RNA-SeqRNA-Seq
RNA-Seq
 
Assembly and finishing
Assembly and finishingAssembly and finishing
Assembly and finishing
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
SNP
SNPSNP
SNP
 
Single Nucleotide Polymorphism (SNP)
Single Nucleotide Polymorphism (SNP)Single Nucleotide Polymorphism (SNP)
Single Nucleotide Polymorphism (SNP)
 
LncRNA (Long noncoding RNA)
LncRNA (Long noncoding RNA)LncRNA (Long noncoding RNA)
LncRNA (Long noncoding RNA)
 
Long non coding rna
Long non coding rnaLong non coding rna
Long non coding rna
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Microarray (DNA and SNP microarray)
Microarray (DNA and SNP microarray)Microarray (DNA and SNP microarray)
Microarray (DNA and SNP microarray)
 
Gene expression concept and analysis
Gene expression concept and analysisGene expression concept and analysis
Gene expression concept and analysis
 
ChIP-seq
ChIP-seqChIP-seq
ChIP-seq
 
Long non coding RNA lncRNAs
Long non coding RNA lncRNAsLong non coding RNA lncRNAs
Long non coding RNA lncRNAs
 
Next generation sequencing technologies for crop improvement
Next generation sequencing technologies for crop improvementNext generation sequencing technologies for crop improvement
Next generation sequencing technologies for crop improvement
 
SiRNA
SiRNASiRNA
SiRNA
 
Next Generation Sequencing of DNA
Next Generation Sequencing of DNANext Generation Sequencing of DNA
Next Generation Sequencing of DNA
 
An introduction to RNA-seq data analysis
An introduction to RNA-seq data analysisAn introduction to RNA-seq data analysis
An introduction to RNA-seq data analysis
 
Introduction to Next Generation Sequencing
Introduction to Next Generation SequencingIntroduction to Next Generation Sequencing
Introduction to Next Generation Sequencing
 
NEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCINGNEXT GENERATION SEQUENCING
NEXT GENERATION SEQUENCING
 
Restriction fragment length polymorphism
Restriction fragment length polymorphismRestriction fragment length polymorphism
Restriction fragment length polymorphism
 

En vedette

Role of transcriptomics in gene expression studies and
Role of transcriptomics in gene expression studies andRole of transcriptomics in gene expression studies and
Role of transcriptomics in gene expression studies and
Sarla Rao
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
Bia Khan
 
Next-generation genomics: an integrative approach
Next-generation genomics: an integrative approachNext-generation genomics: an integrative approach
Next-generation genomics: an integrative approach
Hong ChangBum
 
Comparison between RNASeq and Microarray for Gene Expression Analysis
Comparison between RNASeq and Microarray for Gene Expression AnalysisComparison between RNASeq and Microarray for Gene Expression Analysis
Comparison between RNASeq and Microarray for Gene Expression Analysis
Yaoyu Wang
 

En vedette (17)

Random RNA interactions control protein expression in prokaryotes
Random RNA interactions control protein expression in prokaryotesRandom RNA interactions control protein expression in prokaryotes
Random RNA interactions control protein expression in prokaryotes
 
Transcriptomics sequencing
Transcriptomics sequencingTranscriptomics sequencing
Transcriptomics sequencing
 
Role of transcriptomics in gene expression studies and
Role of transcriptomics in gene expression studies andRole of transcriptomics in gene expression studies and
Role of transcriptomics in gene expression studies and
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomics
 
Transcriptomics
TranscriptomicsTranscriptomics
Transcriptomics
 
Transcriptomica PDF
Transcriptomica PDFTranscriptomica PDF
Transcriptomica PDF
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
 
"Overview of the European Union reference laboratory (eu rl) for Listeria mon...
"Overview of the European Union reference laboratory (eu rl) for Listeria mon..."Overview of the European Union reference laboratory (eu rl) for Listeria mon...
"Overview of the European Union reference laboratory (eu rl) for Listeria mon...
 
Listeria and Omics Approaches for Understanding Its Biology
Listeria and Omics Approaches for Understanding Its BiologyListeria and Omics Approaches for Understanding Its Biology
Listeria and Omics Approaches for Understanding Its Biology
 
Next-generation genomics: an integrative approach
Next-generation genomics: an integrative approachNext-generation genomics: an integrative approach
Next-generation genomics: an integrative approach
 
Comparison between RNASeq and Microarray for Gene Expression Analysis
Comparison between RNASeq and Microarray for Gene Expression AnalysisComparison between RNASeq and Microarray for Gene Expression Analysis
Comparison between RNASeq and Microarray for Gene Expression Analysis
 
Microarray
Microarray Microarray
Microarray
 
Deep learning with Tensorflow in R
Deep learning with Tensorflow in RDeep learning with Tensorflow in R
Deep learning with Tensorflow in R
 
Introduction to data integration in bioinformatics
Introduction to data integration in bioinformaticsIntroduction to data integration in bioinformatics
Introduction to data integration in bioinformatics
 
DNA Chip
DNA ChipDNA Chip
DNA Chip
 
Rnaseq basics ngs_application1
Rnaseq basics ngs_application1Rnaseq basics ngs_application1
Rnaseq basics ngs_application1
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysis
 

Similaire à Introduction to RNA-seq

Kathleen big Poster 2016 final copy
Kathleen big Poster 2016 final copyKathleen big Poster 2016 final copy
Kathleen big Poster 2016 final copy
Kathleen Barakat
 
Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs
Paul Gardner
 
2013 transcription
2013 transcription2013 transcription
2013 transcription
kuldip sodhi
 
2013 transcription
2013 transcription2013 transcription
2013 transcription
kuldip sodhi
 
Lecture 9 10 dna-genes_chromosomes
Lecture 9 10 dna-genes_chromosomesLecture 9 10 dna-genes_chromosomes
Lecture 9 10 dna-genes_chromosomes
Abo Ali
 

Similaire à Introduction to RNA-seq (20)

01 nc rna-intro
01 nc rna-intro01 nc rna-intro
01 nc rna-intro
 
Cell 671
Cell 671Cell 671
Cell 671
 
Cell 672
Cell 672Cell 672
Cell 672
 
1-s2.0-S0167488913002401-main
1-s2.0-S0167488913002401-main1-s2.0-S0167488913002401-main
1-s2.0-S0167488913002401-main
 
Kathleen big Poster 2016 final copy
Kathleen big Poster 2016 final copyKathleen big Poster 2016 final copy
Kathleen big Poster 2016 final copy
 
Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs Benasque RNA 2012: RNA Motifs
Benasque RNA 2012: RNA Motifs
 
RNA and Dendritic Granules
RNA and Dendritic GranulesRNA and Dendritic Granules
RNA and Dendritic Granules
 
R_loops_presentation_v4.0
R_loops_presentation_v4.0R_loops_presentation_v4.0
R_loops_presentation_v4.0
 
RFLP & RAPD
RFLP & RAPDRFLP & RAPD
RFLP & RAPD
 
An introduction to RNAi technology - Petr Svoboda - Institute of Molecular Ge...
An introduction to RNAi technology - Petr Svoboda - Institute of Molecular Ge...An introduction to RNAi technology - Petr Svoboda - Institute of Molecular Ge...
An introduction to RNAi technology - Petr Svoboda - Institute of Molecular Ge...
 
Sars co v-2 polymerase inhibition with remdesivir
Sars co v-2 polymerase inhibition with remdesivirSars co v-2 polymerase inhibition with remdesivir
Sars co v-2 polymerase inhibition with remdesivir
 
2013 transcription
2013 transcription2013 transcription
2013 transcription
 
2013 transcription
2013 transcription2013 transcription
2013 transcription
 
Visualizing SNVs to quantify allele-specific expression in single cells
Visualizing SNVs to quantify allele-specific expression in single cellsVisualizing SNVs to quantify allele-specific expression in single cells
Visualizing SNVs to quantify allele-specific expression in single cells
 
Replication of RNA viruses.ppt
Replication of RNA viruses.pptReplication of RNA viruses.ppt
Replication of RNA viruses.ppt
 
Functions of micro RNA
Functions of micro RNAFunctions of micro RNA
Functions of micro RNA
 
Lecture 9 10 dna-genes_chromosomes
Lecture 9 10 dna-genes_chromosomesLecture 9 10 dna-genes_chromosomes
Lecture 9 10 dna-genes_chromosomes
 
Seminario Biologia molecular
Seminario Biologia molecularSeminario Biologia molecular
Seminario Biologia molecular
 
DNA-based methods for bioaerosol analysis
DNA-based methods for bioaerosol analysisDNA-based methods for bioaerosol analysis
DNA-based methods for bioaerosol analysis
 
CRISPR Cas System concept
CRISPR Cas System conceptCRISPR Cas System concept
CRISPR Cas System concept
 

Plus de Paul Gardner

Plus de Paul Gardner (20)

ppgardner-lecture07-genome-function.pdf
ppgardner-lecture07-genome-function.pdfppgardner-lecture07-genome-function.pdf
ppgardner-lecture07-genome-function.pdf
 
ppgardner-lecture06-homologysearch.pdf
ppgardner-lecture06-homologysearch.pdfppgardner-lecture06-homologysearch.pdf
ppgardner-lecture06-homologysearch.pdf
 
ppgardner-lecture05-alignment-comparativegenomics.pdf
ppgardner-lecture05-alignment-comparativegenomics.pdfppgardner-lecture05-alignment-comparativegenomics.pdf
ppgardner-lecture05-alignment-comparativegenomics.pdf
 
ppgardner-lecture04-annotation-comparativegenomics.pdf
ppgardner-lecture04-annotation-comparativegenomics.pdfppgardner-lecture04-annotation-comparativegenomics.pdf
ppgardner-lecture04-annotation-comparativegenomics.pdf
 
ppgardner-lecture03-genomesize-complexity.pdf
ppgardner-lecture03-genomesize-complexity.pdfppgardner-lecture03-genomesize-complexity.pdf
ppgardner-lecture03-genomesize-complexity.pdf
 
Does RNA avoidance dictate protein expression level?
Does RNA avoidance dictate protein expression level?Does RNA avoidance dictate protein expression level?
Does RNA avoidance dictate protein expression level?
 
Machine learning methods
Machine learning methodsMachine learning methods
Machine learning methods
 
Clustering
ClusteringClustering
Clustering
 
Monte Carlo methods
Monte Carlo methodsMonte Carlo methods
Monte Carlo methods
 
The jackknife and bootstrap
The jackknife and bootstrapThe jackknife and bootstrap
The jackknife and bootstrap
 
Contingency tables
Contingency tablesContingency tables
Contingency tables
 
Regression (II)
Regression (II)Regression (II)
Regression (II)
 
Regression (I)
Regression (I)Regression (I)
Regression (I)
 
Analysis of covariation and correlation
Analysis of covariation and correlationAnalysis of covariation and correlation
Analysis of covariation and correlation
 
Analysis of two samples
Analysis of two samplesAnalysis of two samples
Analysis of two samples
 
Analysis of single samples
Analysis of single samplesAnalysis of single samples
Analysis of single samples
 
Centrality and spread
Centrality and spreadCentrality and spread
Centrality and spread
 
Fundamentals of statistical analysis
Fundamentals of statistical analysisFundamentals of statistical analysis
Fundamentals of statistical analysis
 
Avoidance of stochastic RNA interactions can be harnessed to control protein ...
Avoidance of stochastic RNA interactions can be harnessed to control protein ...Avoidance of stochastic RNA interactions can be harnessed to control protein ...
Avoidance of stochastic RNA interactions can be harnessed to control protein ...
 
A meta-analysis of computational biology benchmarks reveals predictors of pro...
A meta-analysis of computational biology benchmarks reveals predictors of pro...A meta-analysis of computational biology benchmarks reveals predictors of pro...
A meta-analysis of computational biology benchmarks reveals predictors of pro...
 

Introduction to RNA-seq

  • 1. Introduction to RNA-seq Paul Gardner July 6, 2015 Paul Gardner RNA-seq intro
  • 2. Where on the bio/math spectrum do you lie? Paul Gardner RNA-seq intro
  • 3. What is RNA? �� ��� 2R 1R � � �� � ��� � �� � � � �� � � � � � �� �� � 1R 1R 1R ��� � � � � � �� ��� ��� 1 2R R : −OH −H : −H −CH3 ������� ����������� �� ���� ���� ��� � ����� ��� �� ��� �� ��� �� ��� ���� ��� ��� ��� ��� ��� ��� ��� ����� ����� ����� ����� ������� �������� ������� ������� ������������������������ RNA DNA ������������������ IUPAC ambiguity chars: Paul Gardner RNA-seq intro
  • 4. Crick’s “central dogma of molecular biology” Paul Gardner RNA-seq intro
  • 5. Types of RNA Protein coding RNA: Messenger RNA Noncoding RNA: Ribosomal RNA & Transfer RNA Spliceosomal RNAs (U1, U2, U4, U5, & U6) SRP RNA (protein export) RNase P RNA snoRNAs & microRNAs (David Humphreys) Cis-regulatory RNA (riboswitches, thermosensors, leaders) Self-splicing introns “Long” non-coding RNAs (lncRNA) Clustered regularly interspaced short palindromic repeats (CRISPR) RNAs of Unknown Function (RUFs) Paul Gardner RNA-seq intro
  • 6. What is RNA-seq? Martin & Wang (2011) Next-generation transcriptome assembly. Nature Reviews Genetics. Paul Gardner RNA-seq intro
  • 7. Run the best statistical test in the universe: Eye-ball results: positive & negative controls Remember: only RNAs expressed under exp. conditions will be observed Paul Gardner RNA-seq intro
  • 8. Applications and extensions of RNA-seq Applications Genome annotation (mRNAs, ncRNAs, spliceforms, UTRs) Quantification (Listen to Alicia Oshlack) Extensions Infer RNA structure (SHAPE) (Lucks et al. (2011)) RNA:RNA (CLASH) (Travis et al. (2014)) RNA:protein (RIP-seq) (Cook et al. (2015)) Paul Gardner RNA-seq intro
  • 9. RNA-seq identifies 1,000s of new RNAs SraB yceD rpmF E.RUF plsX E.coli K12 E.coli E24377A C.rodentium S.enterica K.pneumoniae rmf RNA motif rmf P.RUF pyrD S.maltophilia X.axonopodis secY X.RUF rpsM D. Enterobacteriaceae RUF E. Pseudomonas RUF F. Xanthomonadaceae RUF G A U U A C C A G C A C G C C C Y A U C C G G G C G G C G G G C RGCCC A G G G G C U C C Y YR R G G A G C C C Y U UUUU 5' terminator G U C U C GY G C G C GG G U G G A Y G G Y G G UC C U G C G C Y G GA G U A G C G C G G G C GRY C G R R R Y Y Y C R G G Y C A R C Y R U C C G G C G C C G G A G C R U GGG CA C A C U C C C C A Y GC C G G G U Y C R Y G G A A C C R A G U U C C R Y G G G C U U C C A G Y A A Y CC G R G A C C U U G Y U A A U U C A G U U C A C U U 5´ U A A U C A C G C R Y G C G U G A U G A A G C U U A G U G A G G A Y U U C C C C G G C A A Y G G G G A A Y A C C G A A C C R G G C R G C G A C G A U A C C U U G5´ GNRA tetraloop 0-48 BPs A. Enterobacteriaceae RUF B. Pseudomonas RUF C. Xanthomonadaceae RUF Gaps 0-9 10-99 100-999 1,000-4,999 >5,000 Number of RNA-seq reads 80% 90% 70% 40% nucleotide present nucleotide identity N N 90% N 80% covarying mutations base pair annotations compatible mutations no mutations observed R = A or G. Y = C or U. Legend 70% P.putida P.aeruginosa-PAO1 P.aeruginosa-PA14 T T A G C G C C G G A A A C C A G G C G T C A T G A G C C T G C A A C A T A T G G C C C T A T C G A C G A A A G C G T T A A G T C T T T A T G A C A A A T C G G T C A T T C A C A C G C C T G A A C G C T T T G G T T A G A A C T C C A G T T A A T C C G C C C A C C G C A A C G G T G T C G G G C G A − − G G G T C G T C A C G C C G G C A A C G A C C C C T T − T C G G C G A A A − − G C T T C G C C A G G C C T C C C C T G G G G G C C A A C G G G A C A T A A C A G T C A A C A A G T G A G G G C A A C A C C C T A T G A G A A G A C T T A A G C G T G A T C C G T T G G A A A G A G C C T T C T T G C G T G G T T A T C A G A A C G G C A T A A C C G G T A A A T C T C G T G A T C T T T G T C C G T T C A C C C A T C C T A C G A C G C G G C A G T C C T G G C T C A A C G G C T G G C G C G A G G G C C G T G G C G A C A A C T G G G A C G G C C T C A C T G G C A C G G C C G G C T T A C A A C G T C T C A A T C A A C T C C A G C A C G T G T A A G C G A C A A C A C G G A T A G C A C C G A T T T C C C C A A G G C A C G C C C C A T C C G G G C G G C G G G C G C A A G C C C A A G G G C T C C G C − A A G G A G C C C T T T T C A A T T C C − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − G C C G C G G C A A T G C G G C G A T G G C G T C C A C C G C T T C G C G G A T C A A C G C C G G T C C C T T G T A G A T G A A A C C C G A A T A G A T C T G C A C C A G G C T C G C C C C G G C G G C G A T C T T C T TAGGCATATTTTTTTCCATCAGATATAGCGTATTGATGATAGCCATTTTAAACTATGCGC−−−TTCGTTTTGCAGGTTGATGTTTGTTATCAGCACTGAACGAAAATAAAGCAGTAACCCGCAATGTGTGCGAATTATTGGCAAAAGGCAACCACAGGCTGCCTTTTTCTTTGACTCTATGACGTTACAAAGTTAATATGCGCGCCCTATGCAAAAGGTAAAATTACCCCTGACTCTCGATCCGGTTCGTACGGCTCAAAAACGCCTTGATTACCAGGGTATCTATACCCCTGATCAGGTTGAGCGCGTCGCCGAATCCGTAGTCAGTGTGGACAGTGATGTGGAATGCTCCATGTCGTTCGCTATCGATAACCAACGTCTCGCAGTGTTAAACGGCGATGCGAAGGTGACGGTAACGCTCGAGTGTCAGCGTTGCGGGAAGCCGTTTACTCATCAGGTCTACACAACGTATTGTTTTAGTCCTGTGCGTTCAGACGAACAGGCTGAAGCACTGCCGGAAGCGTATGAACCGATTGAGGTTAACGAATTCGGTGAAATCGATCTGCTTGCAATGGTTGAAGATGAAATCATCCTCGCCTTGCCGGTAGTTCCGGTGCATGATTCTGAACACTGTGAAGTGTCCGAAGCGGACATGGTCTTTGGTGAACTGCCTGAAGAAGCGCAAAAGCCAAACCCATTTGCCGTATTAGCCAGCTTAAAGCGTAAGTAATTGGTGCTCCCCGTTGGATCGGGGATAAACCGTAATTGAGGAGTAAGGTCCATGGCCGTACAACAGAATAAACCAACCCGTTCCAAACGTGGCATGCGTCGTTCCCATGACGCGCTGACCGCAGTCACCAGCCTGTCTGTAGACAAAACTTCTGGTGAAAAACACCTGCGTCACCACATCACTGCCGACGGTTACTACCGCGGCCGCAAGGTCATCGCTAAGTAATCACGCA−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−TCTGC−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−GTGATGAAGCTTAGTGAGGATTTTCCCCAGGCAACTGGGGAAAGACCAAACCGGGCGGCGACGATACCTTGACACGTCTAACCCTGGCGTTAGATGTCATGGGAGGGGATTTTGGCCCTTCCGTGACAGTGCCTGCAGCATTGCAGGCACTGAATTCTAATTCGCAACTCACTCTTCTTTTAGTCGGCAATTCCGACGCCATCACGCCATTACTTGCTAAAGCTGACTTTGAACAACGTTCGCGTCTGCAGATTATTCCTGCGCAGTCAGT ATGGCGCAGGCTGGCATTGGTAACCTCGGCGGCGGGCTCGGCAAGTTCACGGAACTTCGCCAGCGGTTGCTGTTCGTCCTCGGGGCATTGATCGTTTATCGCATCGGCTGCTATGTGCCGGTGCCTGGCGTGAATCCCGATGCCATGCTTTCGTTGATGCAGGCGCAGGGCGGCGGCATCGTGGACATGTTCAACATGTTCTCGGGCGGCGCCCTGCACCGTTTCAGTATTTTTGCATTGAACGTGATGCCGTATATCTCGGCATCGATCGTGATCCAGTTGGCCACGCACATCTTTCCCGCCCTCAAGGCGATGCAGAAAGAAGGCGAATCGGGCCGACGCAAGATCACCCAATATTCGCGCATCGGTGCGGTGTTGCTGGCGGTGGTGCAGGGCGGCAGTATCGCGCTGGCACTGCAGAACCAGACCGCCCCTGGTGGCGCTCCGGTGGTGTATGCGCCGGGCATGGGCTTCGTGCTCACCGCGGTGATCGCTTTGACCGCTGGTACCATCTTCCTGATGTGGGTAGGCGAGCAGGTTACCGAGCGCGGCATCGGTAACGGCGTATCGCTGATCATCTTTGCCGGCATCGTGGCTGGCCTGCCGTCGGCGGCCATCCAGACGGTCGAAGCCTTCCGCGAAGGCAATCTGAGCTTCATTTCGCTGTTGTTGATCGTCATCACCATCCTGGCGTTCACGCTGTTCGTCGTGTTTGTCGAGCGTGGGCAGCGGCGGATCACGGTCAACTACGCGCGCCGCCAGGGCGGTCGCAATGCGTACATGAACCAGACCTCGTTCTTGCCGCTCAAGCTGAACATGGCCGGTGTGATTCCGCCGATCTTTGCGTCCAGCATCCTGGCATTCCCGGCAACGTTGTCGATGTGGTCGGGTCAGGCTGC−−ATCGG−GTGGTATCGGCTCGTGGCTGCAGAAGATTGCCAACGCGCTTGGCCCCGGTGAGCCGGTACACATGCTGGTCTTCGCTGCGCTGATCATCGGTTTTGCATTCTTCTACACCGCGCTGGTGTTCAACTCGCAGGAAACCGCCGACAACCTCAAGAAATCGGGCGCGCTGATTCCGGGCATCCGTCCAGGCAAGGCCACCGCAGATTACGTCGATGGCGTACTGACGCGCCTGACAGCTGCCGGTTCGTTGTACCTGGTAATCGTCTGCCTGCTGCCGGAAATCATGCGCACGCAGCTCGGCACTTCGTTCCACTTCGGGGGCACCTCGCTATTGATTGCAGTGGTGGTGGTGATGGACTTCATTGCGCAGATCCAGGCGCACCTGATGTCGCACCAGTATGAGAGCTTGCTGAAGAAGGCCAACCTCAAGGGCGGCTCACGCGGCGGTCTTGCGCGCGGTTAAGTGGTACACTAGATCTTCATC−−−−−−ACGTGAAGACGGC−CTGGTTCCCGGGCCACGATCTTCCGATCAGAAGGGCGGCTCGCGCGACG−TCTCGCGCGCGGGTGTGACGGGGTGGTTCTGTGCGGGAGTAGCACAGGCGATTC−GGAGTGGTTTTCTGGATCAGCACCGTCCGGCGCCGGAGCGAGGGCACACTCCCCACGCCGGGTCCATGGAACCTCTGGTTCCACGGGCTTCAAAGCAATCCGAGGCCTTGCTATAATTCCGAGTTCACTTT−−TGATCCATCCTGCCGGATGG−−−CGCCTGGG−−−CGCTGTCGGGCCATCACTCAGTTGGAGAATCGCGTCATGGCGCGTATTGCAGGCGTCAACCTGCCAGCCCAGAAGCACGTCTGGGTCGGGTTGCAAAGCATCTACGGCATCGGCCGTACCCGTTCAAAGAAGCTCTGCGAATCCGCAGGCGTTACCTCGACCACGAAGATTCGTGATCTGTCCGAACCCGAAATCGAGCGCCTGCGCGCCGAAGTCGGCAAGTATGTCGTCGAAGGCGACCTGCGCCGCGAAATCGGTATCGCGATCAAGCGACTGATGGACCTCGGCTGCTATCGCGGTCTGCGTCATCGCCGTGGTCTTCCGCTGCGTGGTCAGCGCACCCGTACCAACGCCCGCACCCGCAAGGGTCCGCGCAAGGCGATCAGGAAGTAA Lindgreen et al. (2014) Robust identification of noncoding RNA from transcriptomes requires phylogenetically-informed sampling. PLOS Computational Biology. Paul Gardner RNA-seq intro
  • 10. Some open questions How much transcription is ”functional”? What’s a good negative control for transcriptome experiments? What causes variation in [protein]:[mRNA] ratios? Lu, Vogel et al. (2007) Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation. Nature Biotechnology Paul Gardner RNA-seq intro
  • 11. Thanks & a plug QMB 2015 Computational genomics Proteins, animal genetics, genomes & disease, microbes & disease Paul Gardner RNA-seq intro