5. Types of RNA
Protein coding RNA:
Messenger RNA
Noncoding RNA:
Ribosomal RNA & Transfer RNA
Spliceosomal RNAs (U1, U2, U4, U5, & U6)
SRP RNA (protein export)
RNase P RNA
snoRNAs & microRNAs (David Humphreys)
Cis-regulatory RNA (riboswitches,
thermosensors, leaders)
Self-splicing introns
“Long” non-coding RNAs (lncRNA)
Clustered regularly interspaced short
palindromic repeats (CRISPR)
RNAs of Unknown Function (RUFs)
Paul Gardner RNA-seq intro
6. What is RNA-seq?
Martin & Wang (2011) Next-generation transcriptome
assembly. Nature Reviews Genetics.
Paul Gardner RNA-seq intro
7. Run the best statistical test in the universe:
Eye-ball results: positive & negative controls
Remember: only RNAs expressed under exp. conditions will
be observed
Paul Gardner RNA-seq intro
8. Applications and extensions of RNA-seq
Applications
Genome annotation (mRNAs, ncRNAs, spliceforms, UTRs)
Quantification (Listen to Alicia Oshlack)
Extensions
Infer RNA structure (SHAPE) (Lucks et al. (2011))
RNA:RNA (CLASH) (Travis et al. (2014))
RNA:protein (RIP-seq) (Cook et al. (2015))
Paul Gardner RNA-seq intro
9. RNA-seq identifies 1,000s of new RNAs
SraB yceD rpmF E.RUF plsX
E.coli K12
E.coli E24377A
C.rodentium
S.enterica
K.pneumoniae
rmf RNA motif rmf P.RUF pyrD
S.maltophilia
X.axonopodis
secY X.RUF rpsM
D. Enterobacteriaceae RUF E. Pseudomonas RUF
F. Xanthomonadaceae RUF
G A U U A C C A
G
C
A
C
G
C
C
C
Y
A U C
C
G
G
G
C
G
G
C
G G G C
RGCCC
A
G
G
G
G
C
U
C
C
Y
YR
R
G
G
A
G
C
C
C
Y
U
UUUU
5'
terminator
G
U
C
U
C
GY
G
C
G
C
GG
G
U
G
G
A
Y
G
G
Y
G
G
UC
C
U
G
C
G
C
Y
G
GA
G
U
A
G
C
G
C
G
G
G
C
GRY
C
G
R
R
R
Y
Y Y C R
G G
Y C
A R
C
Y
R
U
C
C
G
G
C
G
C
C
G
G
A G
C
R
U
GGG
CA
C
A
C U C C C
C
A
Y
GC C G
G
G
U
Y
C
R
Y
G
G
A
A
C
C
R A
G
U
U
C
C
R
Y
G
G
G
C
U
U
C
C
A
G
Y
A
A
Y
CC
G
R
G
A
C
C U U G Y U A
A
U
U
C
A
G
U
U
C
A
C
U
U
5´
U A
A
U
C
A
C
G
C
R
Y
G
C
G
U
G
A
U
G A A
G
C
U
U
A
G
U
G
A
G
G
A
Y
U
U
C
C
C
C
G
G
C A
A
Y
G
G
G
G
A
A
Y
A
C
C
G
A
A
C
C
R
G
G
C
R G C G A C G A U A C C U U G5´
GNRA tetraloop
0-48 BPs
A. Enterobacteriaceae RUF
B. Pseudomonas RUF
C. Xanthomonadaceae RUF
Gaps
0-9
10-99
100-999
1,000-4,999
>5,000
Number of RNA-seq reads
80%
90% 70%
40%
nucleotide
present
nucleotide
identity
N
N 90%
N 80%
covarying mutations
base pair annotations
compatible mutations
no mutations observed
R = A or G. Y = C or U.
Legend
70%
P.putida
P.aeruginosa-PAO1
P.aeruginosa-PA14
T
T
A
G
C
G
C
C
G
G
A
A
A
C
C
A
G
G
C
G
T
C
A
T
G
A
G
C
C
T
G
C
A
A
C
A
T
A
T
G
G
C
C
C
T
A
T
C
G
A
C
G
A
A
A
G
C
G
T
T
A
A
G
T
C
T
T
T
A
T
G
A
C
A
A
A
T
C
G
G
T
C
A
T
T
C
A
C
A
C
G
C
C
T
G
A
A
C
G
C
T
T
T
G
G
T
T
A
G
A
A
C
T
C
C
A
G
T
T
A
A
T
C
C
G
C
C
C
A
C
C
G
C
A
A
C
G
G
T
G
T
C
G
G
G
C
G
A
−
−
G
G
G
T
C
G
T
C
A
C
G
C
C
G
G
C
A
A
C
G
A
C
C
C
C
T
T
−
T
C
G
G
C
G
A
A
A
−
−
G
C
T
T
C
G
C
C
A
G
G
C
C
T
C
C
C
C
T
G
G
G
G
G
C
C
A
A
C
G
G
G
A
C
A
T
A
A
C
A
G
T
C
A
A
C
A
A
G
T
G
A
G
G
G
C
A
A
C
A
C
C
C
T
A
T
G
A
G
A
A
G
A
C
T
T
A
A
G
C
G
T
G
A
T
C
C
G
T
T
G
G
A
A
A
G
A
G
C
C
T
T
C
T
T
G
C
G
T
G
G
T
T
A
T
C
A
G
A
A
C
G
G
C
A
T
A
A
C
C
G
G
T
A
A
A
T
C
T
C
G
T
G
A
T
C
T
T
T
G
T
C
C
G
T
T
C
A
C
C
C
A
T
C
C
T
A
C
G
A
C
G
C
G
G
C
A
G
T
C
C
T
G
G
C
T
C
A
A
C
G
G
C
T
G
G
C
G
C
G
A
G
G
G
C
C
G
T
G
G
C
G
A
C
A
A
C
T
G
G
G
A
C
G
G
C
C
T
C
A
C
T
G
G
C
A
C
G
G
C
C
G
G
C
T
T
A
C
A
A
C
G
T
C
T
C
A
A
T
C
A
A
C
T
C
C
A
G
C
A
C
G
T
G
T
A
A
G
C
G
A
C
A
A
C
A
C
G
G
A
T
A
G
C
A
C
C
G
A
T
T
T
C
C
C
C
A
A
G
G
C
A
C
G
C
C
C
C
A
T
C
C
G
G
G
C
G
G
C
G
G
G
C
G
C
A
A
G
C
C
C
A
A
G
G
G
C
T
C
C
G
C
−
A
A
G
G
A
G
C
C
C
T
T
T
T
C
A
A
T
T
C
C
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
−
G
C
C
G
C
G
G
C
A
A
T
G
C
G
G
C
G
A
T
G
G
C
G
T
C
C
A
C
C
G
C
T
T
C
G
C
G
G
A
T
C
A
A
C
G
C
C
G
G
T
C
C
C
T
T
G
T
A
G
A
T
G
A
A
A
C
C
C
G
A
A
T
A
G
A
T
C
T
G
C
A
C
C
A
G
G
C
T
C
G
C
C
C
C
G
G
C
G
G
C
G
A
T
C
T
T
C
T
TAGGCATATTTTTTTCCATCAGATATAGCGTATTGATGATAGCCATTTTAAACTATGCGC−−−TTCGTTTTGCAGGTTGATGTTTGTTATCAGCACTGAACGAAAATAAAGCAGTAACCCGCAATGTGTGCGAATTATTGGCAAAAGGCAACCACAGGCTGCCTTTTTCTTTGACTCTATGACGTTACAAAGTTAATATGCGCGCCCTATGCAAAAGGTAAAATTACCCCTGACTCTCGATCCGGTTCGTACGGCTCAAAAACGCCTTGATTACCAGGGTATCTATACCCCTGATCAGGTTGAGCGCGTCGCCGAATCCGTAGTCAGTGTGGACAGTGATGTGGAATGCTCCATGTCGTTCGCTATCGATAACCAACGTCTCGCAGTGTTAAACGGCGATGCGAAGGTGACGGTAACGCTCGAGTGTCAGCGTTGCGGGAAGCCGTTTACTCATCAGGTCTACACAACGTATTGTTTTAGTCCTGTGCGTTCAGACGAACAGGCTGAAGCACTGCCGGAAGCGTATGAACCGATTGAGGTTAACGAATTCGGTGAAATCGATCTGCTTGCAATGGTTGAAGATGAAATCATCCTCGCCTTGCCGGTAGTTCCGGTGCATGATTCTGAACACTGTGAAGTGTCCGAAGCGGACATGGTCTTTGGTGAACTGCCTGAAGAAGCGCAAAAGCCAAACCCATTTGCCGTATTAGCCAGCTTAAAGCGTAAGTAATTGGTGCTCCCCGTTGGATCGGGGATAAACCGTAATTGAGGAGTAAGGTCCATGGCCGTACAACAGAATAAACCAACCCGTTCCAAACGTGGCATGCGTCGTTCCCATGACGCGCTGACCGCAGTCACCAGCCTGTCTGTAGACAAAACTTCTGGTGAAAAACACCTGCGTCACCACATCACTGCCGACGGTTACTACCGCGGCCGCAAGGTCATCGCTAAGTAATCACGCA−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−TCTGC−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−GTGATGAAGCTTAGTGAGGATTTTCCCCAGGCAACTGGGGAAAGACCAAACCGGGCGGCGACGATACCTTGACACGTCTAACCCTGGCGTTAGATGTCATGGGAGGGGATTTTGGCCCTTCCGTGACAGTGCCTGCAGCATTGCAGGCACTGAATTCTAATTCGCAACTCACTCTTCTTTTAGTCGGCAATTCCGACGCCATCACGCCATTACTTGCTAAAGCTGACTTTGAACAACGTTCGCGTCTGCAGATTATTCCTGCGCAGTCAGT
ATGGCGCAGGCTGGCATTGGTAACCTCGGCGGCGGGCTCGGCAAGTTCACGGAACTTCGCCAGCGGTTGCTGTTCGTCCTCGGGGCATTGATCGTTTATCGCATCGGCTGCTATGTGCCGGTGCCTGGCGTGAATCCCGATGCCATGCTTTCGTTGATGCAGGCGCAGGGCGGCGGCATCGTGGACATGTTCAACATGTTCTCGGGCGGCGCCCTGCACCGTTTCAGTATTTTTGCATTGAACGTGATGCCGTATATCTCGGCATCGATCGTGATCCAGTTGGCCACGCACATCTTTCCCGCCCTCAAGGCGATGCAGAAAGAAGGCGAATCGGGCCGACGCAAGATCACCCAATATTCGCGCATCGGTGCGGTGTTGCTGGCGGTGGTGCAGGGCGGCAGTATCGCGCTGGCACTGCAGAACCAGACCGCCCCTGGTGGCGCTCCGGTGGTGTATGCGCCGGGCATGGGCTTCGTGCTCACCGCGGTGATCGCTTTGACCGCTGGTACCATCTTCCTGATGTGGGTAGGCGAGCAGGTTACCGAGCGCGGCATCGGTAACGGCGTATCGCTGATCATCTTTGCCGGCATCGTGGCTGGCCTGCCGTCGGCGGCCATCCAGACGGTCGAAGCCTTCCGCGAAGGCAATCTGAGCTTCATTTCGCTGTTGTTGATCGTCATCACCATCCTGGCGTTCACGCTGTTCGTCGTGTTTGTCGAGCGTGGGCAGCGGCGGATCACGGTCAACTACGCGCGCCGCCAGGGCGGTCGCAATGCGTACATGAACCAGACCTCGTTCTTGCCGCTCAAGCTGAACATGGCCGGTGTGATTCCGCCGATCTTTGCGTCCAGCATCCTGGCATTCCCGGCAACGTTGTCGATGTGGTCGGGTCAGGCTGC−−ATCGG−GTGGTATCGGCTCGTGGCTGCAGAAGATTGCCAACGCGCTTGGCCCCGGTGAGCCGGTACACATGCTGGTCTTCGCTGCGCTGATCATCGGTTTTGCATTCTTCTACACCGCGCTGGTGTTCAACTCGCAGGAAACCGCCGACAACCTCAAGAAATCGGGCGCGCTGATTCCGGGCATCCGTCCAGGCAAGGCCACCGCAGATTACGTCGATGGCGTACTGACGCGCCTGACAGCTGCCGGTTCGTTGTACCTGGTAATCGTCTGCCTGCTGCCGGAAATCATGCGCACGCAGCTCGGCACTTCGTTCCACTTCGGGGGCACCTCGCTATTGATTGCAGTGGTGGTGGTGATGGACTTCATTGCGCAGATCCAGGCGCACCTGATGTCGCACCAGTATGAGAGCTTGCTGAAGAAGGCCAACCTCAAGGGCGGCTCACGCGGCGGTCTTGCGCGCGGTTAAGTGGTACACTAGATCTTCATC−−−−−−ACGTGAAGACGGC−CTGGTTCCCGGGCCACGATCTTCCGATCAGAAGGGCGGCTCGCGCGACG−TCTCGCGCGCGGGTGTGACGGGGTGGTTCTGTGCGGGAGTAGCACAGGCGATTC−GGAGTGGTTTTCTGGATCAGCACCGTCCGGCGCCGGAGCGAGGGCACACTCCCCACGCCGGGTCCATGGAACCTCTGGTTCCACGGGCTTCAAAGCAATCCGAGGCCTTGCTATAATTCCGAGTTCACTTT−−TGATCCATCCTGCCGGATGG−−−CGCCTGGG−−−CGCTGTCGGGCCATCACTCAGTTGGAGAATCGCGTCATGGCGCGTATTGCAGGCGTCAACCTGCCAGCCCAGAAGCACGTCTGGGTCGGGTTGCAAAGCATCTACGGCATCGGCCGTACCCGTTCAAAGAAGCTCTGCGAATCCGCAGGCGTTACCTCGACCACGAAGATTCGTGATCTGTCCGAACCCGAAATCGAGCGCCTGCGCGCCGAAGTCGGCAAGTATGTCGTCGAAGGCGACCTGCGCCGCGAAATCGGTATCGCGATCAAGCGACTGATGGACCTCGGCTGCTATCGCGGTCTGCGTCATCGCCGTGGTCTTCCGCTGCGTGGTCAGCGCACCCGTACCAACGCCCGCACCCGCAAGGGTCCGCGCAAGGCGATCAGGAAGTAA
Lindgreen et al. (2014) Robust identification of noncoding RNA from transcriptomes requires
phylogenetically-informed sampling. PLOS Computational Biology.
Paul Gardner RNA-seq intro
10. Some open questions
How much transcription is ”functional”?
What’s a good negative control for transcriptome
experiments?
What causes variation in [protein]:[mRNA] ratios?
Lu, Vogel et al. (2007) Absolute protein expression profiling estimates the relative contributions of transcriptional
and translational regulation. Nature Biotechnology
Paul Gardner RNA-seq intro