PHYLOGENETICS
An introduction to the concepts and analysis
using MEGA 6.0
Today’s Objectives
• To introduce the basic concepts involved in phylogenetic
analysis.
• To learn the usage of the phylogenetic package MEGA 6.0.
• To discuss the manner in which you can apply
phylogenetic analysis in your research approach, thesis
and publications.
Why use Phylogenetics?
• The human mind is naturally inclined to classify
information.
• Classification facilitates logical understanding as well as
the detection of heuristic patterns within data sets.
• Logical understanding of a process facilitates the process
of discovery.
Where will it be of use to me?
• Classifying my sequence data within a global
perspective.
• Finding unique regions within my sequence data by
comparison with a global data set.
• Identification of genes which have not yet been widely
characterized.
• Infinitely many possibilities
Traditional Classification schemes
• Based on Phenotypic traits (Phenetic) and taxonomic
classifiers (TU)
• Low level of resolution
• Not applicable to molecular data
• Difficult to resolve taxonomic ambiguities at higher
levels.
From TUs to Genomic databases
• DNA technology prompted a quantum shift in the
resolving power of phylogenetics.
• TU: < 100 classifiers
• Amino Acids: Millions of combinations of AAs
• Genomic level: Billions of bp of nucleotide data
Does more information solve the problem?
[Figure: bar chart of RESOLUTION (axis 0 to 1,000,000) for three data types: taxonomic unit, protein, nucleic acid]
Species trees
• A species tree establishes the hierarchy of a species
within a globally accepted framework of classification.
• ITS: 16S
• ITS: rDNA
• ITS: chloroplast and mitochondria
• Genes: rbcL, ADH, cytC, Ig(SC)
Crab rRNA sequence data used to construct a UPGMA tree. Note the out-group
species that has been added to establish a perspective scale.
Gene trees
• Gene trees facilitate the understanding of evolutionary
processes occurring within genes across taxa or within a
species.
• The rates of evolution offer insights into the manner in
which genes evolve as a family.
• Gene trees can be transformed into species trees if they
conform to evolutionary criteria.
Species vs. Gene trees
• Which one do we select?
The choice is determined by what we intend to characterize:
Is it the organism within a genus / species? OR
Is it a gene which is distributed across taxa?
Molecular taxonomy based on genes
• Prokaryotes: 16S rDNA
• Higher organisms: ITS rDNA, Cp, Mt
• Do you want an evolutionary tree?
• Does your “molecular tree” corroborate your “taxonomic
tree”?
[Figure: UPGMA tree, scale 0.00 to 0.25. Taxa: D. affinidisjuncta, D. heteroneura, D. mimica, D. adiastola, D. nigra, S. albovittata, D. crassifemur, S. lebanonensis, D. mulleri, D. melanogaster, D. pseudoobscura]
Gene tree constructed using the Alcohol Dehydrogenase (ADH) gene from
Drosophila spp. (UPGMA)
The molecular clock
• A digital clock displays time as the cumulative function
of the frequency of a quartz crystal.
• A molecular clock graphically depicts evolution as the
function of changing nucleotide / amino acid
frequency versus time.
A highly simplified and idealized molecular clock. The red bar is a
gene; the colored bars represent nucleotide positions which change as
a function of time.
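The clock idea can be made concrete with a small calculation. The following is a minimal sketch, assuming a strict (constant-rate) clock in which two lineages each accumulate substitutions at the same rate after splitting; the function name and the example rate are illustrative and do not come from MEGA.

```python
def divergence_time(distance, rate):
    """Estimate time since two lineages split under a strict molecular clock.

    distance: substitutions per site separating the two sequences
    rate: substitutions per site per year along ONE lineage (assumed constant)
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    # Both lineages accumulate changes, so distance = 2 * rate * time.
    return distance / (2.0 * rate)


# Example: 2% divergence at an assumed rate of 1e-9 per site per year.
years = divergence_time(0.02, 1e-9)
```

Real genes rarely keep a perfectly constant rate, which is why clock tests such as Tajima's are applied before trusting such an estimate.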
Phylogenetic trees
• Distance based methods: inclusive
• Maximum parsimony methods: assumptive
NJT
• Constructed purely on the basis of pairwise genetic
distance.
• No prior assumptions are made pertaining to tree
topology and branch lengths
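To show how a tree can be grown from pairwise distances alone, here is a minimal sketch of the pair-selection step of neighbor joining (the Q-criterion of Saitou and Nei's method). It assumes the distance matrix is a symmetric list of lists; the function name is hypothetical and this is not MEGA's own code.

```python
def nj_pick_pair(dist):
    """Return the index pair (i, j) that neighbor joining would merge first.

    dist: symmetric square matrix (list of lists) of pairwise distances.
    """
    n = len(dist)
    if n < 3:
        raise ValueError("need at least three taxa")
    # Net divergence of each taxon: sum of its distances to all others.
    totals = [sum(row) for row in dist]
    best, best_q = None, float("inf")
    for i in range(n):
        for j in range(i + 1, n):
            # Q-criterion: low values mark pairs that are close to each
            # other but far from everything else.
            q = (n - 2) * dist[i][j] - totals[i] - totals[j]
            if q < best_q:
                best, best_q = (i, j), q
    return best


# Illustrative five-taxon matrix (values made up for the example).
example = [
    [0, 5, 9, 9, 8],
    [5, 0, 10, 10, 9],
    [9, 10, 0, 8, 7],
    [9, 10, 8, 0, 3],
    [8, 9, 7, 3, 0],
]
first_pair = nj_pick_pair(example)
```

Note that the pair with the smallest raw distance (taxa 3 and 4 here) is not necessarily the pair chosen; correcting for each taxon's overall divergence is what lets neighbor joining cope with unequal rates.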
[Figure: neighbor joining tree, scale bar 0.01. Populations: Japanese, Korean, Southern Chinese, Australian, Papuan, North Amerind, South Amerind, Finn, Italian, German, English, San, Bantu, Pygmy, Nigerian]
Neighbor Joining Tree (NJT) based on human genetic distance matrix:
compares Pairwise Genetic Distances only
UPGMA
• Originally developed for Phenogram construction (Sokal &
Michener, 1958)
• Adapted for Dendrogram construction
• Can be used when there is a correlation between the distance
measure used and the evolutionary timescale.
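The clustering procedure can be sketched in a few lines. This is a minimal illustration of UPGMA, assuming a symmetric distance matrix given as a list of lists; it returns only the topology as nested tuples (no branch lengths), and the function name is hypothetical.

```python
def upgma(names, dist):
    """Cluster taxa with UPGMA and return the topology as nested tuples.

    names: list of taxon labels
    dist: symmetric matrix (list of lists) in the same order as names
    """
    # Each cluster id maps to (subtree, number of leaves it contains).
    clusters = {i: (name, 1) for i, name in enumerate(names)}
    d = {frozenset((i, j)): dist[i][j]
         for i in range(len(names)) for j in range(i + 1, len(names))}
    next_id = len(names)
    while len(clusters) > 1:
        # Merge the two closest clusters.
        pair = min(d, key=d.get)
        a, b = sorted(pair)
        (tree_a, size_a), (tree_b, size_b) = clusters.pop(a), clusters.pop(b)
        del d[pair]
        for c in clusters:
            # Size-weighted mean keeps every original taxon equally weighted.
            dac = d.pop(frozenset((a, c)))
            dbc = d.pop(frozenset((b, c)))
            d[frozenset((next_id, c))] = (
                (size_a * dac + size_b * dbc) / (size_a + size_b)
            )
        clusters[next_id] = ((tree_a, tree_b), size_a + size_b)
        next_id += 1
    return next(iter(clusters.values()))[0]


tree = upgma(["A", "B", "C"], [[0, 2, 6], [2, 0, 6], [6, 6, 0]])
```

Because clusters are always joined at the average distance, the resulting tree is ultrametric, which is why the method is only appropriate when distances track time.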
[Figure: UPGMA tree, scale 0.00 to 0.05. Populations: Japanese, Korean, Southern Chinese, North Amerind, South Amerind, Italian, Finn, German, English, Australian, Papuan, San, Pygmy, Nigerian, Bantu]
UPGMA tree based on human genetic distance matrix:
Assumes a constant rate molecular clock
VALIDATION:
Bootstrapping
• The concept of parsimony.
• This is a re-sampling method: sites (columns) of the same data
matrix are drawn at random with replacement.
• It allows calculation of standard deviations and variances.
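The resampling step can be sketched as follows. This is a minimal illustration, assuming the alignment is held as a dict of equal-length strings; the function name is hypothetical and MEGA performs this step internally.

```python
import random


def bootstrap_alignment(seqs, rng=random):
    """Return one bootstrap pseudo-replicate of an alignment.

    seqs: dict mapping taxon name to aligned sequence (all the same length).
    rng: object with a randrange method (the random module or random.Random).
    """
    length = len(next(iter(seqs.values())))
    if any(len(s) != length for s in seqs.values()):
        raise ValueError("sequences must be aligned to equal length")
    # Draw column indices with replacement: some sites repeat, others drop out.
    cols = [rng.randrange(length) for _ in range(length)]
    # Apply the same columns to every taxon so sites stay intact.
    return {name: "".join(s[c] for c in cols) for name, s in seqs.items()}


replicate = bootstrap_alignment(
    {"taxon1": "ACGT", "taxon2": "ACGA"}, random.Random(1)
)
```

A tree is then built from each replicate, and the percentage of replicates that recover a given grouping becomes the bootstrap value shown at that node.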
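[Figure: neighbor joining bootstrap consensus tree, scale bar 0.05. Taxa: Zea, Oryza, Nicotiana, Pinus, Marchantia, Odontella, Porphyra, Synechocys, Cyanophora, Euglena. Bootstrap values at the nodes: 100, 91, 100, 100, 100, 100, 100]
Bootstrap consensus tree constructed using the NJT algorithm.
Based on chloroplast DNA protein coding regions.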
[Figure: UPGMA bootstrap consensus tree, scale 0.00 to 0.20. Taxa: Zea, Oryza, Nicotiana, Marchantia, Pinus, Odontella, Synechocys, Porphyra, Cyanophora, Euglena. Bootstrap values at the nodes: all 100]
Bootstrap consensus tree constructed using the UPGMA algorithm.
Based on chloroplast DNA protein coding regions.
Why use MEGA 6.0?
• Single platform, combines the functions of BIOEDIT, CLUSTALW,
PAUP and TREEDIST
• Imports FASTA files directly from GenBank: No editing!
• Publication quality output / statistical corroboration.
• Executes on your laptop / desktop.
• User friendly GUI
• Versatile / Flexible
• Highest number of citations
• Open source / Freeware
• No codes to memorize
What can MEGA 6.0 do for you?
• Download data from a Database / File / Sequencer
• Align data using CLUSTAL W
• Perform phylogenetic analysis using various Algorithms
• Graphically depict phylogenetic trees
• Perform evolutionary tests: Tajima’s Molecular Clock,
Tajima’s neutrality, Z-test, Fisher’s exact test,
Nei-Gojobori distance
Getting started with MEGA
• Input file
• Processing commands
• Output file
THE INPUT FILE
• FASTA format
• ABI format
• Distance matrix files
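For readers unfamiliar with FASTA, the format is simply a header line starting with ">" followed by one or more sequence lines. A minimal parsing sketch, assuming the file contents are already read into a string; the function name is hypothetical and MEGA reads these files without any scripting.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict of {header: sequence}."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Header line: everything after '>' names the record.
            name = line[1:]
            records[name] = []
        elif name is not None:
            # Sequence data may be wrapped over several lines.
            records[name].append(line)
    return {n: "".join(parts) for n, parts in records.items()}


records = parse_fasta(">seq1 example\nACG\nTT\n>seq2\nGG\n")
```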
THE ALIGNMENT COMMAND
• This step requires discretion. After sequences have been
aligned using CLUSTALW, 5’ and 3’ ends must be
trimmed to develop a blunt composite set.
• Save your output as an XXXXX.MAS file
• Before exiting, save as an XXXXX.MEG file
The ends of the composite sequence should be trimmed after
CLUSTALW alignment, as they can contribute significantly to error
in determining true evolutionary divergence / sequence similarity.
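One way to picture the trimming is as cutting the alignment back to the first and last columns in which every sequence has a residue. The sketch below assumes that rule and a dict of equal-length aligned strings with "-" as the gap character; the function name is hypothetical, and in MEGA the trimming is done by hand in the alignment editor.

```python
def trim_alignment_ends(seqs, gap="-"):
    """Trim leading and trailing columns until every sequence has a residue.

    seqs: dict mapping name to aligned sequence (equal lengths assumed).
    Internal gaps are kept; only the ragged 5' and 3' ends are removed.
    """
    rows = list(seqs.values())
    length = len(rows[0])
    # A column is "complete" when no sequence has a gap there.
    complete = [all(r[i] != gap for r in rows) for i in range(length)]
    if True not in complete:
        return {name: "" for name in seqs}
    start = complete.index(True)
    end = length - complete[::-1].index(True)
    return {name: s[start:end] for name, s in seqs.items()}


trimmed = trim_alignment_ends({"a": "--ACGT-A-", "b": "TTAC-TGAC"})
```

Whether to cut at the first fully occupied column or to tolerate a few end gaps is a judgment call, which is the discretion the slide refers to.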
DEFINING YOUR OUTPUT
• Distance Matrix File
• Phylogenies: NJT / UPGMA / MP / ME
• Parsimony trees
• Evolutionary parameters
• Molecular clocks
Some concepts to think about:
• Gene clusters
• Genes across geographical boundaries
• Why does genetic evolution transcend species
boundaries?
• Why do some genes evolve faster than others?
• Why do some genes evolve concurrently?
Some concepts to think about:
• RNA families: clustering of ESTs
• Comparative genomics within a supra genome
• Evolutionary linkages within human genes
CITATION
MEGA should be cited as:
Tamura K, Dudley J, Nei M & Kumar S (2007) MEGA4: Molecular
Evolutionary Genetics Analysis (MEGA) software version 4.0.
Molecular Biology and Evolution 24:1596-1599. (Publication PDF
at http://www.kumarlab.net/publications)
BIOINFORMATICS SESSION
Follow the instructions on the screen and obtain your tree.
If you have Wi-Fi access to NCBI, you can develop your
own unique alignments.
THANK YOU
“In the greater scheme of things, all systems tend to unity… all of
human understanding and logic is based on this underlying
principle… and the genome is no exception…”
