1. DNA methylation coverage in two
tissues of the Pacific Oyster
Claire Ellis
Bioinformatics Terminal Product
3/14/13
2. DNA methylation patterns in
Crassostrea gigas
Epigenetics describes DNA
modifications that change gene
expression without altering
nucleotide sequence.
DNA methylation in organisms
is extremely diverse, variable
among species, and can change
genome function under CH3
external influences.
DNA methylation
Source:
http://www.nist.gov/pml/div689/dna
3. Sequencing Approaches
Bisulfite sequencing was used
Bisulfite sequencing
to examine DNA methylation
Cm= methylated cytosine
in gonad tissue
C= unmethylated cytosine
MBD-Seq was used to 5’ ACmGTTCGCTTGAG 3’
examine DNA methylation in 3’ TGCmAAGCGAACTC 5’
gill tissue (Mackenzie)
Bisulfite Treatment
5’ ACmGTTUGUTTGAG 3’
3’ TGCmAAGUGAAUTU 5’
4. Approach
Bisulfite converted reads aligned to genome and %
methylation value per base calculated by processing
alignments
methylKit is an R package for DNA methylation
analysis and annotation from high-throughput
bisulfite sequencing
5. methylKit
Goal: obtain methylation coverage and
examine differential methylation between
gonad and gill tissues
Read annotation files and perform basic statistical
analyses for differentially methylated regions or
bases
6.
7. Methylation Statistics
Histogram of % CpG methylation Histogram of % CpG methylation
test test
Gonad 94.7
Gill 16.8
40000
200000
12.5
30000
150000
9.8
Frequency
Frequency
8.1
20000
100000
7.1
6.4
5.4
4.5
4 4.2
10000
3.8
50000
2.4 2.6 2.7
1.8 1.7 1.6 2
1.4
4.4 1
0 0 0 0 0 0.3 0 0 0 0 0 0.3 0 0 0 0 0
0
0
20 40 60 80 100 0 20 40 60 80 100
% methylation per base % methylation per base
8. Coverage Statistics
Histogram of CpG coverage Histogram of CpG coverage
test test
91.6 Gonad 16.9 Gill
40000
200000
13.6
12.7
12.5
30000
150000
9.4
Frequency
Frequency 8.5
100000
20000
7.6
5.5
50000
10000
3.9
2.9
2
7.2 1.5
1
0.70.5
0 0 0.7 0 0.2 0 0 0 0 0 0 0.1 0 0 0 0 0 0 0 0.30.20.10.1 0 0 0
0 0
0
0
0.0 0.5 1.0 1.5 2.0 1.0 1.5 2.0 2.5 3.0
log10 of read coverage per base log10 of read coverage per base
12. Conclusions
Additional analyses included examining type of differential
methylation (hypo and hyper)
Extracted bases with a q-value <0.01 and % methylation
difference >25%
The methylKit package was successfully used to
characterize DNA methylation
Differences between gonad and gill methylation profiles
may be due to library prep
Will use R script for future analyses comparing different
samples’ methylation profiles
The objective of this study is to use C. gigasas a model organism to characterize the distribution and identify potential functions of DNA methylation
Input=% methylation value
% methylation per CytosineGonad- most bases were 100% methylatedGill- variation in % methylation, most were 100% methylated however a small peak at low % methylation**Differences due to library prep (gill enriched for methylation first so much higher % meth)
Read coverage distribution- Histogram of read coverage per cytosine
Pairwise correlation score between the % methylation profiles.Scatterplots of % methylation scores.
PCA of two oyster tissue profiles, shows principal component 1 and 2 for each sample. Samples closer to each other in principal component space are similar in their methylation profiles.
Hypo=under the level or parHyper= over the limit or above normal level