SlideShare une entreprise Scribd logo
1  sur  16
Télécharger pour lire hors ligne
Shigehiro Kuraku
Unit Leader
Genome Resource & Analysis Unit, RIKEN CDB
http://www.cdb.riken.jp/gra/skuraku.html
The extended version of this presentation as well as its Japanese version
is available at SlideShare ( http://www.slideshare.net/cdb_gras/ )
aLeaves: web server (http://aleaves.cdb.riken.jp/aleaves/)
for handy phylogenetic analysis
Tutorial movies available
Powered by
“Collecting amino acid sequences and
building a phylogenetic tree on the aLeaves
and MAFFT servers”
https://www.youtube.com/watch?v=0hpp-IqhpyQ
「aLeavesとMAFFTを使って1つのアミノ酸配列
から系統樹を推定する」
https://www.youtube.com/watch?v=N9qPLRhHfIQ
Motivation of aLeaves development
While we have access to various methods for molecular phylogenetic tree
inference and enriched sequence data from large-scale sequencing projects,
phylogenetic tree building is not handy but rather cumbersome for
biologists working in labs.
Launch an online tool which performs comprehensive sequence
searches covering scattered large-scale resources and systematic
data slimming using biologist-friendly cues.
Background
What is hidden paralogy ? ex) zebrafish Emx3
Derobert et al., 2002 etc.
Morita et al., 1995
Reviewed in Kuraku, 2010. Integ. Comp. Biol.
What is hidden paralogy ? ex) zebrafish Emx3
Derobert et al., 2002 etc.
Morita et al., 1995
Reviewed in Kuraku, 2010. Integ. Comp. Biol.
Heuristic collection
B)
A)
Exhaustive search
of homologs
How do you prepare a homolog set?
Using BLAST server at NCBI
“Every BLAST search is an experiment” by
Scattered information prevents our smooth work
EnsemblNCBI Protein
(annotated)
Individual web sites
of genome projects
Your sequences
NCBI Refseq
(annotated)
Ensembl Metazoa
Dataset
Collaborators
GRAS, RIKEN CDB CBRC, AIST
&
iFReC, Osaka Univ.
Christian M. Zmasek
Sanford-Burnham
Medical Research Institute
USA
Kazutaka KatohOsamu Nishimura
aLeaves – http://aleaves.cdb.riken.jp
Output a multi-fasta
sequence file
in several minutes
A single search to cover
diverse species
Enter a query sequence
in a peptide
Taxonomic coverage (1)
Taxonomic coverage (2)
Downstream analysis on MAFFT server
Systematic selection/deletion of seqs based on various criteria
・Sequence length filter
・Delete identical/similar sequences (CD-HIT)
・Delete sequences with large gaps (Max-Align)
・Select only particular species
・Select/delete particular subgroups in a guide-tree
Managed by K. Katoh
Heuristic identification of homologs
(in publications, etc.)
Exhaustive collection of homologs Careful refinement of data set
by deleting unnecessary sequences
Phylogenetic tree inference
Retrieval of limited number of
sequences
(on MAFFT server at CBRC, AIST)
(on aLeaves server at CDB, RIKEN)
Workflow using aLeaves-MAFFT
Warning
・aLeaves is based on sequence resources already made public in other
online databases and does not release original sequence information.
・aLeaves project does not predict and validate protein coding sequences
available at other web sites and just adopt them for integrative searches.
・aLeaves-MAFFT link allows you to perform sequence data set
refinement and preliminary molecular phylogenetic analysis, but
please perform more sophisticated analyses on your local system
by downloading the data set.
Citing aLeaves
http://nar.oxfordjournals.org/content/41/W1/W22.long

Contenu connexe

En vedette

Designing Communities101507
Designing Communities101507Designing Communities101507
Designing Communities101507Christina Wodtke
 
Brief introduction of aLeaves (mainly in Japanese)
Brief introduction of aLeaves (mainly in Japanese)Brief introduction of aLeaves (mainly in Japanese)
Brief introduction of aLeaves (mainly in Japanese)cdb_gras
 
Evaluation of music magazine- Media portfolio J.O.F.A
Evaluation of music magazine- Media portfolio J.O.F.AEvaluation of music magazine- Media portfolio J.O.F.A
Evaluation of music magazine- Media portfolio J.O.F.AJoy_Favour
 
Quelques reponses de base sur des strategies faciles de cheminee Pinterest d'...
Quelques reponses de base sur des strategies faciles de cheminee Pinterest d'...Quelques reponses de base sur des strategies faciles de cheminee Pinterest d'...
Quelques reponses de base sur des strategies faciles de cheminee Pinterest d'...3chemineesans-conduit4
 
Joy's School magazine
Joy's School magazineJoy's School magazine
Joy's School magazineJoy_Favour
 
フリーソフトではじめるがん体細胞変異解析入門 第33回勉強会資料
フリーソフトではじめるがん体細胞変異解析入門 第33回勉強会資料フリーソフトではじめるがん体細胞変異解析入門 第33回勉強会資料
フリーソフトではじめるがん体細胞変異解析入門 第33回勉強会資料Amelieff
 
Mpk senarai semak permohonan pelan bangunan (kuatkuasa 010112)
Mpk senarai semak permohonan pelan bangunan (kuatkuasa 010112)Mpk senarai semak permohonan pelan bangunan (kuatkuasa 010112)
Mpk senarai semak permohonan pelan bangunan (kuatkuasa 010112)Hafiz Eswan
 
How to get into open source and contribute
How to get into open source and contributeHow to get into open source and contribute
How to get into open source and contributeShubham Chaudhary
 

En vedette (14)

Designing Communities101507
Designing Communities101507Designing Communities101507
Designing Communities101507
 
Brief introduction of aLeaves (mainly in Japanese)
Brief introduction of aLeaves (mainly in Japanese)Brief introduction of aLeaves (mainly in Japanese)
Brief introduction of aLeaves (mainly in Japanese)
 
Evaluation of music magazine- Media portfolio J.O.F.A
Evaluation of music magazine- Media portfolio J.O.F.AEvaluation of music magazine- Media portfolio J.O.F.A
Evaluation of music magazine- Media portfolio J.O.F.A
 
Quelques reponses de base sur des strategies faciles de cheminee Pinterest d'...
Quelques reponses de base sur des strategies faciles de cheminee Pinterest d'...Quelques reponses de base sur des strategies faciles de cheminee Pinterest d'...
Quelques reponses de base sur des strategies faciles de cheminee Pinterest d'...
 
Filtros
FiltrosFiltros
Filtros
 
Joy's School magazine
Joy's School magazineJoy's School magazine
Joy's School magazine
 
フリーソフトではじめるがん体細胞変異解析入門 第33回勉強会資料
フリーソフトではじめるがん体細胞変異解析入門 第33回勉強会資料フリーソフトではじめるがん体細胞変異解析入門 第33回勉強会資料
フリーソフトではじめるがん体細胞変異解析入門 第33回勉強会資料
 
Mpk senarai semak permohonan pelan bangunan (kuatkuasa 010112)
Mpk senarai semak permohonan pelan bangunan (kuatkuasa 010112)Mpk senarai semak permohonan pelan bangunan (kuatkuasa 010112)
Mpk senarai semak permohonan pelan bangunan (kuatkuasa 010112)
 
Michael graves
Michael gravesMichael graves
Michael graves
 
Git
GitGit
Git
 
How to get into open source and contribute
How to get into open source and contributeHow to get into open source and contribute
How to get into open source and contribute
 
Gsoc 2013 presentation
Gsoc 2013 presentationGsoc 2013 presentation
Gsoc 2013 presentation
 
Desktop Alternatives
Desktop AlternativesDesktop Alternatives
Desktop Alternatives
 
Sfd 2013 gnome_opw
Sfd 2013 gnome_opwSfd 2013 gnome_opw
Sfd 2013 gnome_opw
 

Similaire à Brief introduction of aLeaves

100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databasesMeetika Gupta
 
Making Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and AnnotationsMaking Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and AnnotationsJoão André Carriço
 
Sequencedatabases
SequencedatabasesSequencedatabases
SequencedatabasesAbhik Seal
 
Computational Resources In Infectious Disease
Computational Resources In Infectious DiseaseComputational Resources In Infectious Disease
Computational Resources In Infectious DiseaseJoão André Carriço
 
Phylogeny-driven approaches to microbial & microbiome studies: talk by Jonath...
Phylogeny-driven approaches to microbial & microbiome studies: talk by Jonath...Phylogeny-driven approaches to microbial & microbiome studies: talk by Jonath...
Phylogeny-driven approaches to microbial & microbiome studies: talk by Jonath...Jonathan Eisen
 
Tools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisTools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisSANJANA PANDEY
 
Talk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meetingTalk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meetingJonathan Eisen
 
Imgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialImgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialDeanna Church
 
Hands on training_biological_databases.ppt
Hands on training_biological_databases.pptHands on training_biological_databases.ppt
Hands on training_biological_databases.pptSoumen Barman
 
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Ben Busby
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Alejandra Gonzalez-Beltran
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRONPrabin Shakya
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Prof. Wim Van Criekinge
 
BioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS TutorialBioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS TutorialRothamsted Research, UK
 

Similaire à Brief introduction of aLeaves (20)

100505 koenig biological_databases
100505 koenig biological_databases100505 koenig biological_databases
100505 koenig biological_databases
 
Making Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and AnnotationsMaking Use of NGS Data: From Reads to Trees and Annotations
Making Use of NGS Data: From Reads to Trees and Annotations
 
Sequencedatabases
SequencedatabasesSequencedatabases
Sequencedatabases
 
Parkinson mibbi
Parkinson mibbiParkinson mibbi
Parkinson mibbi
 
Understanding Genome
Understanding Genome Understanding Genome
Understanding Genome
 
Computational Resources In Infectious Disease
Computational Resources In Infectious DiseaseComputational Resources In Infectious Disease
Computational Resources In Infectious Disease
 
Phylogeny-driven approaches to microbial & microbiome studies: talk by Jonath...
Phylogeny-driven approaches to microbial & microbiome studies: talk by Jonath...Phylogeny-driven approaches to microbial & microbiome studies: talk by Jonath...
Phylogeny-driven approaches to microbial & microbiome studies: talk by Jonath...
 
Tools for Transcriptome Data Analysis
Tools for Transcriptome Data AnalysisTools for Transcriptome Data Analysis
Tools for Transcriptome Data Analysis
 
Talk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meetingTalk by J. Eisen for NZ Computational Genomics meeting
Talk by J. Eisen for NZ Computational Genomics meeting
 
Imgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorialImgc2011 bioinformatics tutorial
Imgc2011 bioinformatics tutorial
 
Hands on training_biological_databases.ppt
Hands on training_biological_databases.pptHands on training_biological_databases.ppt
Hands on training_biological_databases.ppt
 
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
 
BioSD Tutorial 2014 Editition
BioSD Tutorial 2014 EdititionBioSD Tutorial 2014 Editition
BioSD Tutorial 2014 Editition
 
NCBI
NCBINCBI
NCBI
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
Bioinformatics MiRON
Bioinformatics MiRONBioinformatics MiRON
Bioinformatics MiRON
 
Article
ArticleArticle
Article
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
Galaxy dna-seq-variant calling-presentationandpractical_gent_april-2016
 
BioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS TutorialBioSamples Database Linked Data, SWAT4LS Tutorial
BioSamples Database Linked Data, SWAT4LS Tutorial
 

Dernier

High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptxAlMamun560346
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Joonhun Lee
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 

Dernier (20)

High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 

Brief introduction of aLeaves

  • 1. Shigehiro Kuraku Unit Leader Genome Resource & Analysis Unit, RIKEN CDB http://www.cdb.riken.jp/gra/skuraku.html The extended version of this presentation as well as its Japanese version is available at SlideShare ( http://www.slideshare.net/cdb_gras/ ) aLeaves: web server (http://aleaves.cdb.riken.jp/aleaves/) for handy phylogenetic analysis
  • 2. Tutorial movies available Powered by “Collecting amino acid sequences and building a phylogenetic tree on the aLeaves and MAFFT servers” https://www.youtube.com/watch?v=0hpp-IqhpyQ 「aLeavesとMAFFTを使って1つのアミノ酸配列 から系統樹を推定する」 https://www.youtube.com/watch?v=N9qPLRhHfIQ
  • 3. Motivation of aLeaves development While we have access to various methods for molecular phylogenetic tree inference and enriched sequence data from large-scale sequencing projects, phylogenetic tree building is not handy but rather cumbersome for biologists working in labs. Launch an online tool which performs comprehensive sequence searches covering scattered large-scale resources and systematic data slimming using biologist-friendly cues. Background
  • 4. What is hidden paralogy ? ex) zebrafish Emx3 Derobert et al., 2002 etc. Morita et al., 1995 Reviewed in Kuraku, 2010. Integ. Comp. Biol.
  • 5. What is hidden paralogy ? ex) zebrafish Emx3 Derobert et al., 2002 etc. Morita et al., 1995 Reviewed in Kuraku, 2010. Integ. Comp. Biol.
  • 6. Heuristic collection B) A) Exhaustive search of homologs How do you prepare a homolog set?
  • 7. Using BLAST server at NCBI “Every BLAST search is an experiment” by
  • 8. Scattered information prevents our smooth work EnsemblNCBI Protein (annotated) Individual web sites of genome projects Your sequences NCBI Refseq (annotated) Ensembl Metazoa Dataset
  • 9. Collaborators GRAS, RIKEN CDB CBRC, AIST & iFReC, Osaka Univ. Christian M. Zmasek Sanford-Burnham Medical Research Institute USA Kazutaka KatohOsamu Nishimura
  • 10. aLeaves – http://aleaves.cdb.riken.jp Output a multi-fasta sequence file in several minutes A single search to cover diverse species Enter a query sequence in a peptide
  • 13. Downstream analysis on MAFFT server Systematic selection/deletion of seqs based on various criteria ・Sequence length filter ・Delete identical/similar sequences (CD-HIT) ・Delete sequences with large gaps (Max-Align) ・Select only particular species ・Select/delete particular subgroups in a guide-tree Managed by K. Katoh
  • 14. Heuristic identification of homologs (in publications, etc.) Exhaustive collection of homologs Careful refinement of data set by deleting unnecessary sequences Phylogenetic tree inference Retrieval of limited number of sequences (on MAFFT server at CBRC, AIST) (on aLeaves server at CDB, RIKEN) Workflow using aLeaves-MAFFT
  • 15. Warning ・aLeaves is based on sequence resources already made public in other online databases and does not release original sequence information. ・aLeaves project does not predict and validate protein coding sequences available at other web sites and just adopt them for integrative searches. ・aLeaves-MAFFT link allows you to perform sequence data set refinement and preliminary molecular phylogenetic analysis, but please perform more sophisticated analyses on your local system by downloading the data set.