SlideShare une entreprise Scribd logo
1  sur  35
Télécharger pour lire hors ligne
Jared Simpson
!
Ontario Institute for Cancer Research
&
Department of Computer Science
University ofToronto
Error correction, assembly
and consensus algorithms
for MinION data
London Calling, May 14th, 2015
Our collaboration
2
An overview of NGS assembly
• Illumina data: short reads, very accurate, very deep	

• nearly all Illumina assembly is based on exact matching algorithms	

• fragmented assemblies	

!
• Algorithms for Illumina data do not work for long, noisy reads	

• PacBio developed a pipeline (“HGAP”) to assemble their data	

• We used this recipe as a starting point but with custom components
3
Long read assembly pipeline
4
Error correction
Celera Assembler
Consensus
Input reads
Genome Assembly
Input Data
• First challenge is finding overlaps for reads with 15-20% errors
5
Overlap Detection
6
we use github.com/thegenemyers/daligner to compute overlaps
Partial Order Graphs
7
add read GCTACGAT that we want to correct to graph
Partial Order Graphs
8
add sequence GCTCGAT to graph
Partial Order Graphs
9
add sequence GCTCGATT to graph
Partial Order Graphs
10
maximum weight path GCTCGAT is the corrected read
Error Correction
11
Contig Assembly
12
Celera Assembler produces one contig at 98.5% identity
Assembly Polishing
• Consensus problem is viewed as choosing a sequence C’ that maximizes
the probability of the event data
13
C0
= arg max
S2C
P(D|S)
P(D|S) =
rY
k=1
P(ei,k, ei+1,k, ..., ej,k|S, ⇥)
where
Selecting a Consensus
14
Mutate
ACTACGATCGACTTACGA
CCTACGATCGACTTACGA
TCTACGATCGACTTACGA
...
-CTACGATCGACTTACGA
G-TACGATCGACTTACGA
GC-ACGATCGACTTACGA
GCT-CGATCGACTTACGA
...
GACTACGATCGACTTACGA
GCCTACGATCGACTTACGA
GGCTACGATCGACTTACGA
GTCTACGATCGACTTACGA
P(D|S)
-190
-187
-192
-176
-191
-193
-168
-198
-191
-195
-181
GCTACGATCGACTTACGA
Selecting a Consensus
15
GCT-CGATCGACTTACGA
Mutate
A
C
T
...
-
G
GC
GCT
...
G
G
G
G
P(D|S)
-190
-187
-192
-176
-191
-193
-168
-198
-191
-195
-181
Select new consensus
GCT-CGATCGACTTACGA -168
Pore Models
16
Generating Events
• What do we expect events from a given sequence to look like?
17
GCTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Generating Events
• What do we expect events from a given sequence to look like?
18
GCTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Generating Events
• What do we expect events from a given sequence to look like?
19
GCTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Generating Events
• What do we expect events from a given sequence to look like?
20
GCTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●●
●
●
●●●
●
●
●●●
●●●
●
●●●
●
●
●
●●●
●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●●
●●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Generating Events
• What do we expect events from a given sequence to look like?
21
CTACGATT
Sample Current ●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●●
●
●
●●●
●
●
●●●
●●●
●
●●●
●
●
●
●●●
●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●●●
●●●
●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●●●●●●
●
●
●
●
●●
●●●●
●
●●●
●●
●
●
●
●
●●●
●●
●
●
●
●●●
●
●
●
●●
●●
●
●
●
●
●●
●
●●
●●●●
●●●
●
●
●
●
●●
●●
●●●
●
●
●●
●●●
●
●
●
●●●
●●●●●
●
●
●
●
●
●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Event Detection
22
●●●
●
●●●●●
●●
●
●
●●
●
●●●
●
●●
●
●●●●●
●
●
●
●●●●
●
●●
●●
●●●●●
●
●
●●●●
●
●●
●●
●
●
●
●●●
●●●
●●●●
●●
●
●
●●●●
●
●
●
●
●●●
●●
●●●●●●●
●
●
●
●
●
●●●
●
●
●●●●●●●●
●
●
●
●●●●
●
●●●
●●●
●
●●
●●●
●●
●●
●●●
●
●●
●●●●●●●●
●
●●●●
●
●●●●●●
●●●●
●●●
●
●●●●
●
●
●●
●
●
●●●●●●●
●●●
●●●
●●●
●
●●●
●●
●●
●
●
●●●
●●
●●●
●●●●●●●
●
●
●
●●●●
●
●
●●●●●
●
●●●●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●●
●●
●●
●
●
●●
●
●
●●●●●
●
●
●●
●●●●
●
●
●●●
●
●
●●
●●
●●
●
●●
●
●●●●●●●●●●●
●●●
●
●●
●
●●●
●
●●●
●
●
●●●●●●●
●
●●
●
●●
●●●●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●●
●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●●●●●●●●●●●●●
●
●
●●●●●●
●●
●●
●
●●
●●●
●●●
●●●
●
●●
●
●
●●
●●
●●●●●
●●
●●●
●●●●
●
●
●●●
●
●●
●●●
●
●●●●●●
●
●
●●
●●●
●●●●
●
●
●●●
●
●
●
●●●●
●
●●●●●
●●
●
●
●●
●
●●●●
●●●
●
●●
●●
●●
●
●●●●●
●●●●●
●●
●
●●
●●
●●●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●●●●●●
●
●●
●
●●●●
●●
●
●
●
●
●
●●
●
●
●
●●●●●
●
●●●
●
●
●
●●
●
●
●●●●●●
●●
●
●●●●
●●
●
●
●●●
●●●●
●
●
●●●●●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●●
●●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●●
●
●
●●●
●
●
●●●
●●●
●
●●●
●
●
●
●●●
●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●●●●
●●●
●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●●●●●●
●
●
●
●
●●
●●●●
●
●●●
●●
●
●
●
●
●●●
●●
●
●
●
●●●
●
●
●
●●
●●
●
●
●
●
●●
●
●●
●●●●
●●●
●
●
●
●
●●
●●
●●●
●
●
●●
●●●
●
●
●
●●●
●●●●●
●
●
●
●
●
●
0
25
50
75
100
0.00 0.25 0.50 0.75 1.00 1.25
time (s)Current(pA)
Event mean current (pA) current stdv duration (s)
1 60.3 0.7 0.521
2 40.6 1.0 0.112
3 52.2 2.0 0.356
4 54.1 1.2 0.291
5 49.5 1.5 0.141
A simple model
• What is the probability of observing events E given a sequence S?	

• Assuming for the moment there are no missing or extra events:
23
P(e1, e2, ..., en|s1, s2, ..., sn, ⇥) =
nY
i=1
P(ei|si, µsi , si )
P(ei|k, µk, k) = N(µk, 2
k)
Complications
24
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA)
Complications
25
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA)
Is this an event ?
Complications
26
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA) One event or two ?
Nanopore HMM
• must consider:	

• over segmentation	

• under segmentation	

• missed short events	

• HMM:	

• M states: match event to 5-mers	

• E states: extra obs. of an event	

• K states: no event obs. for 5-mer
27
P(D|S)
P(⇡, e1, e2, ..., en|S, ⇥) =
nY
i=1
P(ei|⇡i, µsi , si )P(⇡i|⇡i 1, S)
P(e1, e2, ..., en|S, ⇥) =
X
⇡
P(⇡, e1, e2, ..., en|S, ⇥)
Transition Probabilities
• Probability of not observing an
event is a function of absolute
difference between (expected)
current
28
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA)
Transition Probabilities
• Probability of not observing an
event is a function of absolute
difference between (expected)
current
29
●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●●
●●
●
●●●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●●
●
●
●●
●
●
●
●
●
●●
●●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●●
●●
●
●
●●●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
30
40
50
60
70
0.0 0.2 0.4 0.6
time (s)
Current(pA)
Transition Probabilities
30
●
●
●
●
●
●
●
●
●
●
●
● ● ● ●
●
●
●
● ● ● ● ●
● ●
● ● ●
●
●
●
●
● ●
●
●
●
●
● ●
0.1
0.2
0.3
0.4
0.5
0 5 10 15 20
absolute difference (pA)
SkipProbability
Assembly Accuracy
31
0
5000
10000
0 5000 10000
5 mer count in reference
5mercountindraftassembly
0
3000
6000
9000
12000
TTTTT AAAAA TTTTG CAAAA CTTTT AAAAG CCCCC GGGGG
kmer
count
draft
reference
y
12000
A
C
B
D
0
5000
0 5000 10000
5 mer count in reference
5mercou
0
5000
10000
0 5000 10000
5 mer count in reference
5mercountinpolishedassembly
C D
Draft: 98.5% accuracy Polished: 99.5% accuracy
Assembly Accuracy
32
0
3000
6000
9000
12000
TTTTT AAAAA TTTTG CAAAA CTTTT AAAAG CCCCC GGGGG
kmer
count
draft
reference
9000
12000
B
D
0
0 5000 10000
5 mer count in reference
5mer
0
3000
TTTTT AAAAA TTTTG CAAAA CTTTT AAAAG CCCCC GGGGG
kmer
0
5000
10000
0 5000 10000
5 mer count in reference
5mercountinpolishedassembly
0
3000
6000
9000
12000
TTTTT AAAAA TTTTG CAAAA CTTTT AAAAG CCCCC GGGGG
kmer
count
polished
reference
C D
Aligning Events to a Reference
• HMM can also align events to a reference genome



!
!
!
!
!
• Read about it here:	

• http://simpsonlab.github.io/2015/04/08/eventalign/
33
Planned Improvements
• Model dwell duration to better call homopolymers	

!
!
!
• SNP calling/genotyping	

!
!
!
• Improve scalability to handle larger genomes	

• Use signal data during error correction
34
CTAAAAAAAAAAAAGTACA
P(gi|D) =
P(D|gi)P(gi)
P(D)
Acknowledgements & Code
• Collaborators:	

• Nick Loman, Josh Quick (Birmingham)	

• Jonathan Dursi (OICR)	

!
!
• Code:	

• github.com/jts/nanocorrect (error correction)	

• github.com/jts/nanopolish (signal-level algorithms)	

• github.com/jts/nanopore-paper-analysis (reproduce our paper)
35

Contenu connexe

Similaire à 150514 jts london_calling

Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesWesley De Neve
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdfFrangoCamila
 
A Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert HodgesA Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert HodgesAltinity Ltd
 
Proportional-integral genetic algorithm controller for stability of TCP network
Proportional-integral genetic algorithm controller for stability of TCP network Proportional-integral genetic algorithm controller for stability of TCP network
Proportional-integral genetic algorithm controller for stability of TCP network IJECEIAES
 
Overlap Layout Consensus assembly
Overlap Layout Consensus assemblyOverlap Layout Consensus assembly
Overlap Layout Consensus assemblyZhuyi Xue
 
Automated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAutomated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAijun Zhang
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceKyong-Ha Lee
 
PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...
PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...
PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...PROIDEA
 
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...ijfcstjournal
 
Clustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofClustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofijfcstjournal
 
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...Mokhtar SELLAMI
 
Improving go-git performance
Improving go-git performanceImproving go-git performance
Improving go-git performancesource{d}
 
Langmead bosc2010 cloud-genomics
Langmead bosc2010 cloud-genomicsLangmead bosc2010 cloud-genomics
Langmead bosc2010 cloud-genomicsBOSC 2010
 

Similaire à 150514 jts london_calling (20)

Tree building 2
Tree building 2Tree building 2
Tree building 2
 
Metrics with Ganglia
Metrics with GangliaMetrics with Ganglia
Metrics with Ganglia
 
Towards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniquesTowards reading genomic data using deep learning-driven NLP techniques
Towards reading genomic data using deep learning-driven NLP techniques
 
design-compiler.pdf
design-compiler.pdfdesign-compiler.pdf
design-compiler.pdf
 
A Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert HodgesA Fast Intro to Fast Query with ClickHouse, by Robert Hodges
A Fast Intro to Fast Query with ClickHouse, by Robert Hodges
 
Proportional-integral genetic algorithm controller for stability of TCP network
Proportional-integral genetic algorithm controller for stability of TCP network Proportional-integral genetic algorithm controller for stability of TCP network
Proportional-integral genetic algorithm controller for stability of TCP network
 
Overlap Layout Consensus assembly
Overlap Layout Consensus assemblyOverlap Layout Consensus assembly
Overlap Layout Consensus assembly
 
Python lec 1004_ch02_excercies
Python lec 1004_ch02_excerciesPython lec 1004_ch02_excercies
Python lec 1004_ch02_excercies
 
Automated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAutomated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform Designs
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduce
 
Stress your DUT
Stress your DUTStress your DUT
Stress your DUT
 
PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...
PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...
PLNOG20 - Paweł Małachowski - Stress your DUT–wykorzystanie narzędzi open sou...
 
TiReX: Tiled Regular eXpression matching architecture
TiReX: Tiled Regular eXpression matching architectureTiReX: Tiled Regular eXpression matching architecture
TiReX: Tiled Regular eXpression matching architecture
 
Blast fasta 4
Blast fasta 4Blast fasta 4
Blast fasta 4
 
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
CLUSTBIGFIM-FREQUENT ITEMSET MINING OF BIG DATA USING PRE-PROCESSING BASED ON...
 
Clustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofClustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining of
 
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
 
Improving go-git performance
Improving go-git performanceImproving go-git performance
Improving go-git performance
 
Langmead bosc2010 cloud-genomics
Langmead bosc2010 cloud-genomicsLangmead bosc2010 cloud-genomics
Langmead bosc2010 cloud-genomics
 
Thiele
ThieleThiele
Thiele
 

Dernier

Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 

Dernier (20)

Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 

150514 jts london_calling