SlideShare une entreprise Scribd logo
1  sur  2
GENOME ASSEMBLY SNAPSHOT
FLOW-DIAGRAM
•Link the read files from /import/sequence/read-archive/
Link Files
•The goal here is to remove identical reads or reads-pair.
•It also helps in making the dataset size more manageableRedundancy Analysis
•The goal here is to remove the adaptor sequences from the given data
sets.
•The adaptor are identified on the basis of information available from LIMS
Adaptor Trimming
•The goal is to remove all sequences which are below a certain threshold
(15).Low Quality Score Filtering
•The goal here is to remove all sequences that map to chloroplast and
mitochondrial database based on their insert size.Contamination Analysis
•The goal is to calculate the GC-content in the particular lane and plot a
graph of the same.GC-content Analysis
•The goal here is to understand the distribution of k-mer across the given
dataset and thus estimating the genome size from the distribution.K-mer Analysis
•The goal is to join the short overlap reads if present , to produce longer
reads, therefore making it possible to span gaps or repeats in the genome.Join Reads
•The goal here is to align and merge fragments of a longer DNA sequence
to reconstruct the original sequence.
•This is done using command line CLC Bio De novo assembler.
Assembly

Contenu connexe

En vedette

En vedette (6)

Geiger–Müller Counter
Geiger–Müller CounterGeiger–Müller Counter
Geiger–Müller Counter
 
Geiger muller counting system
Geiger muller counting systemGeiger muller counting system
Geiger muller counting system
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Geiger muller counter
Geiger muller counterGeiger muller counter
Geiger muller counter
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 
Metagenomics
MetagenomicsMetagenomics
Metagenomics
 

Similaire à Genome assembly snapshot flow diagram

Big data Clustering Algorithms And Strategies
Big data Clustering Algorithms And StrategiesBig data Clustering Algorithms And Strategies
Big data Clustering Algorithms And StrategiesFarzad Nozarian
 
clustering and distance metrics.pptx
clustering and distance metrics.pptxclustering and distance metrics.pptx
clustering and distance metrics.pptxssuser2e437f
 
Phylogenetic analysis in nutshell
Phylogenetic analysis in nutshellPhylogenetic analysis in nutshell
Phylogenetic analysis in nutshellAvinash Kumar
 
Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)Asiri Wijesinghe
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment DesignYaoyu Wang
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignmentAfra Fathima
 
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
05 Clustering in Data Mining
05 Clustering in Data Mining05 Clustering in Data Mining
05 Clustering in Data MiningValerii Klymchuk
 
computational statistics machine learning unit 5.pptx
computational statistics machine learning unit 5.pptxcomputational statistics machine learning unit 5.pptx
computational statistics machine learning unit 5.pptxAnubhavKushagra
 
sequence alignment
sequence alignmentsequence alignment
sequence alignmentammar kareem
 
Data analysis
Data analysisData analysis
Data analysisamlbinder
 
Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Yan Xu
 
AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfH K Yoon
 
Global and Local Sequence Alignment
Global and Local Sequence AlignmentGlobal and Local Sequence Alignment
Global and Local Sequence AlignmentAjayPatil210
 
Progressive duplicate detection
Progressive duplicate detectionProgressive duplicate detection
Progressive duplicate detectionieeepondy
 
Vchunk join an efficient algorithm for edit similarity joins
Vchunk join an efficient algorithm for edit similarity joinsVchunk join an efficient algorithm for edit similarity joins
Vchunk join an efficient algorithm for edit similarity joinsVijay Koushik
 

Similaire à Genome assembly snapshot flow diagram (20)

Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Big data Clustering Algorithms And Strategies
Big data Clustering Algorithms And StrategiesBig data Clustering Algorithms And Strategies
Big data Clustering Algorithms And Strategies
 
clustering and distance metrics.pptx
clustering and distance metrics.pptxclustering and distance metrics.pptx
clustering and distance metrics.pptx
 
Parwati sihag
Parwati sihagParwati sihag
Parwati sihag
 
Phylogenetic analysis in nutshell
Phylogenetic analysis in nutshellPhylogenetic analysis in nutshell
Phylogenetic analysis in nutshell
 
Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)
 
RNASeq Experiment Design
RNASeq Experiment DesignRNASeq Experiment Design
RNASeq Experiment Design
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
I- Tasser
I- TasserI- Tasser
I- Tasser
 
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
05 Clustering in Data Mining
05 Clustering in Data Mining05 Clustering in Data Mining
05 Clustering in Data Mining
 
computational statistics machine learning unit 5.pptx
computational statistics machine learning unit 5.pptxcomputational statistics machine learning unit 5.pptx
computational statistics machine learning unit 5.pptx
 
Text summarization
Text summarization Text summarization
Text summarization
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
Data analysis
Data analysisData analysis
Data analysis
 
Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering Mean shift and Hierarchical clustering
Mean shift and Hierarchical clustering
 
AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdf
 
Global and Local Sequence Alignment
Global and Local Sequence AlignmentGlobal and Local Sequence Alignment
Global and Local Sequence Alignment
 
Progressive duplicate detection
Progressive duplicate detectionProgressive duplicate detection
Progressive duplicate detection
 
Vchunk join an efficient algorithm for edit similarity joins
Vchunk join an efficient algorithm for edit similarity joinsVchunk join an efficient algorithm for edit similarity joins
Vchunk join an efficient algorithm for edit similarity joins
 

Plus de Ronak Shah

Poster at EMBL: Diagnosis and monitoring of Leptomeningeal Disease using Circ...
Poster at EMBL: Diagnosis and monitoring of Leptomeningeal Disease using Circ...Poster at EMBL: Diagnosis and monitoring of Leptomeningeal Disease using Circ...
Poster at EMBL: Diagnosis and monitoring of Leptomeningeal Disease using Circ...Ronak Shah
 
Comparison of LUMPY vs. DELLY for structural variant detection
Comparison of LUMPY vs. DELLY for structural variant detectionComparison of LUMPY vs. DELLY for structural variant detection
Comparison of LUMPY vs. DELLY for structural variant detectionRonak Shah
 
The IMPACT of INDEL realignment: Detecting insertions and deletions longer th...
The IMPACT of INDEL realignment: Detecting insertions and deletions longer th...The IMPACT of INDEL realignment: Detecting insertions and deletions longer th...
The IMPACT of INDEL realignment: Detecting insertions and deletions longer th...Ronak Shah
 
Detecting clinically actionable somatic structural aberrations from targeted ...
Detecting clinically actionable somatic structural aberrations from targeted ...Detecting clinically actionable somatic structural aberrations from targeted ...
Detecting clinically actionable somatic structural aberrations from targeted ...Ronak Shah
 
Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Ronak Shah
 
Protein-Protein Interaction using SVM based kernel,Jacob Coefficient and Gene...
Protein-Protein Interaction using SVM based kernel,Jacob Coefficient and Gene...Protein-Protein Interaction using SVM based kernel,Jacob Coefficient and Gene...
Protein-Protein Interaction using SVM based kernel,Jacob Coefficient and Gene...Ronak Shah
 

Plus de Ronak Shah (6)

Poster at EMBL: Diagnosis and monitoring of Leptomeningeal Disease using Circ...
Poster at EMBL: Diagnosis and monitoring of Leptomeningeal Disease using Circ...Poster at EMBL: Diagnosis and monitoring of Leptomeningeal Disease using Circ...
Poster at EMBL: Diagnosis and monitoring of Leptomeningeal Disease using Circ...
 
Comparison of LUMPY vs. DELLY for structural variant detection
Comparison of LUMPY vs. DELLY for structural variant detectionComparison of LUMPY vs. DELLY for structural variant detection
Comparison of LUMPY vs. DELLY for structural variant detection
 
The IMPACT of INDEL realignment: Detecting insertions and deletions longer th...
The IMPACT of INDEL realignment: Detecting insertions and deletions longer th...The IMPACT of INDEL realignment: Detecting insertions and deletions longer th...
The IMPACT of INDEL realignment: Detecting insertions and deletions longer th...
 
Detecting clinically actionable somatic structural aberrations from targeted ...
Detecting clinically actionable somatic structural aberrations from targeted ...Detecting clinically actionable somatic structural aberrations from targeted ...
Detecting clinically actionable somatic structural aberrations from targeted ...
 
Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...
 
Protein-Protein Interaction using SVM based kernel,Jacob Coefficient and Gene...
Protein-Protein Interaction using SVM based kernel,Jacob Coefficient and Gene...Protein-Protein Interaction using SVM based kernel,Jacob Coefficient and Gene...
Protein-Protein Interaction using SVM based kernel,Jacob Coefficient and Gene...
 

Genome assembly snapshot flow diagram

  • 2. •Link the read files from /import/sequence/read-archive/ Link Files •The goal here is to remove identical reads or reads-pair. •It also helps in making the dataset size more manageableRedundancy Analysis •The goal here is to remove the adaptor sequences from the given data sets. •The adaptor are identified on the basis of information available from LIMS Adaptor Trimming •The goal is to remove all sequences which are below a certain threshold (15).Low Quality Score Filtering •The goal here is to remove all sequences that map to chloroplast and mitochondrial database based on their insert size.Contamination Analysis •The goal is to calculate the GC-content in the particular lane and plot a graph of the same.GC-content Analysis •The goal here is to understand the distribution of k-mer across the given dataset and thus estimating the genome size from the distribution.K-mer Analysis •The goal is to join the short overlap reads if present , to produce longer reads, therefore making it possible to span gaps or repeats in the genome.Join Reads •The goal here is to align and merge fragments of a longer DNA sequence to reconstruct the original sequence. •This is done using command line CLC Bio De novo assembler. Assembly