SEMINAR
ON
Genomics
Genome Comparison Techniques, Advanced
& Classical Approaches
Presented By
Jajati Keshari Nayak
Dept. Molecular Biology
GB PANT UNIVERSITY
PhD 1st yr
ID-55493
GENOMICS
Field of biology that attempts to understand the content,
organization, function and evolution of genetic information
contained in the whole genome
Three Levels of Genome Research
Structural Genomics
[Figure: gene model showing EXON1, EXON2, EXON3]
Structural genomics seeks to describe the structural features of
genes & the 3-dimensional structure of every protein encoded by a
given genome
• Sequence and size in bp
• Map/ position
• Repeats
• Motifs/ domains etc.
Functional genomics
Use of the vast wealth of data produced by genomic projects
(such as genome sequencing projects) to
describe gene (and protein) functions and interactions.
Study of the functional transcripts and protein products encoded by
the genes of a genome & their functional characterization
• Role of the gene
• Time- and tissue-specific expression
E.g., the gene encoding the florigen hormone in plants
What is Comparative Genomics?
Analyzing & comparing different genomes to
study the gene content, function, organization &
evolution of different organisms
Not a core technique in itself, but an application of various techniques
(genome sequencing, mapping, bioinformatics etc.) with the sole
objective of comparing genomes
What to compare?
• Size of the genome: total number of base pairs
• Genome organization: circular, linear, ss/ ds genome, extra
chromosomal elements etc.
• Percentage of the genome that is coding
• Total number of predicted ORFs
• Average length of ORF
• Repetitive DNA/ junk DNA
• Functional assignment
What to compare (contd.): units of comparison
• Paralogs & orthologs
• Genomic organization & gene location/order
• Gene structure
– Exon number
– Exon lengths
– Intron lengths
– Sequence similarity
• Gene characteristics
– Splice sites
– Codon usage
– Conserved synteny
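Several of the quantities listed above (genome size, number of ORFs, average ORF length, percentage coding) can be computed directly from a sequence. The following is a minimal sketch, not a real gene finder: it assumes an ORF is simply an ATG-to-stop stretch in one of the three forward reading frames, ignores the reverse strand and introns, and uses the hypothetical helper names `find_orfs` and `genome_summary`.

```python
from typing import Dict, List

STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str, min_codons: int = 2) -> List[str]:
    """Return ATG...stop stretches from the three forward reading frames."""
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None  # index of the currently open ATG, if any
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                orf = seq[start:i + 3]
                if len(orf) // 3 >= min_codons:
                    orfs.append(orf)
                start = None
    return orfs


def genome_summary(seq: str) -> Dict[str, float]:
    """Summarize a sequence with a few of the comparison units listed above."""
    orfs = find_orfs(seq)
    total = len(seq)
    coding = sum(len(orf) for orf in orfs)
    return {
        "size_bp": total,
        "orf_count": len(orfs),
        "mean_orf_length_bp": coding / len(orfs) if orfs else 0.0,
        # Simplification: ORFs that overlap in different frames are counted twice.
        "percent_coding": 100.0 * coding / total if total else 0.0,
    }
```

Running `genome_summary` on two genomes and comparing the resulting dictionaries side by side gives a first, coarse comparison before any alignment is attempted.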
Basic points of consideration in comparative
genomics
• All existing genomes share a common ancestor; each
organism reflects that ancestry combined with the action of
evolution.
• New genes are derived from existing sequences
• Information gained in one organism can be applied to
other, even distantly related, organisms.
• The existing variation among current forms of living organisms
is due to
– Selection
– Speciation
– Divergence
– Gene mutations/ duplications etc
New genes are derived from existing
sequences
Purpose or Goals of Comparative Genomics
• Understanding the similarities & differences between
genomes that lead to special phenotypes or diseases.
• Identifying genes and discovering their functions by studying
their counterparts in other organisms.
• Revealing the evolutionary relationships between different
organisms.
“Nothing in biology makes sense except in the light of evolution”
Theodosius Dobzhansky (1900 – 1975)
All modern biological processes evolved from related
processes.
Every modern gene evolved from other genes.
Most genes have orthologs in related species, and
many have paralogs in the same species.
Patterns of Gene Evolution
Case I: gene orthologs
Case II: gene paralogs
Both are homologous genes
Case III:
Homologous genes arise from a common ancestor
and show structural (sequence) similarity
G1A & G2A are paralogs
G1A & G1B, or G2A & G2B, are orthologs
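The labelling in Case III can be written down as a small rule. The sketch below assumes the naming convention suggested by the slide, in which a name such as `G1A` means gene copy 1 in species A; the function name `classify_pair` and the wording of the third category are illustrative assumptions, not part of the original material.

```python
def classify_pair(gene_a: str, gene_b: str) -> str:
    """Classify two homologous genes named like 'G1A' (copy 1, species A)."""
    copy_a, species_a = gene_a[1:-1], gene_a[-1]
    copy_b, species_b = gene_b[1:-1], gene_b[-1]
    if species_a == species_b and copy_a != copy_b:
        # Different copies within one species arose by duplication.
        return "paralogs"
    if species_a != species_b and copy_a == copy_b:
        # The same copy in two species was separated by speciation.
        return "orthologs"
    if species_a == species_b and copy_a == copy_b:
        return "same gene"
    # Different copy and different species: related, but through both events.
    return "homologs (duplication and speciation)"
```

For example, `classify_pair("G1A", "G2A")` gives `"paralogs"` and `classify_pair("G1A", "G1B")` gives `"orthologs"`, matching the two statements above.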
How did it all start?
The concept of model organisms or model genomes
• A great deal of information on an organism's genes can be extracted by
examining their counterparts in simpler model organisms
• Comparative genomics is conducted using model organisms
• Model organisms offer a cost-effective way to follow the inheritance
of genes through many generations in a relatively short time.
Mammals: Homo sapiens, mouse
Insects: Drosophila melanogaster
Roundworms: C. elegans
Fungi: Saccharomyces cerevisiae
Bacteria: Escherichia coli
Fish: Zebrafish
• Arabidopsis thaliana: Model plant genome
• Rice: Model cereal genome
• Medicago truncatula: Model legume genome
Genomes sequenced
1995: First bacterial genomes sequenced (H. influenzae and M. genitalium)
1996: The yeast genome
1997: E. coli K12
1998: C. elegans
1999: Full sequence of chr. 22
2000: D. melanogaster genome & chr. 21; A. thaliana
2001: Human draft
2002: Mouse, Ciona, Rice, Fugu, Anopheles
2003–2005: Human finished, Rat, Chicken, Chimpanzee, Xenopus, Zebrafish
Techniques of Comparative Genomics
• Use of molecular markers in plant genome analysis
• Comparative genome maps
• Studying Synteny
• Whole genome sequencing
• Bioinformatics tools:
– Homology search in public databases (BLASTn, BLASTp etc)
– Sequence alignment tools
• ClustalW
• ClustalX etc
Molecular markers in plant genome analysis
Comparative analysis of Aromatic rice varieties
Comparative maps for genome comparison
• Involves the use of molecular markers to map the genomes of
two species for a common set of markers (loci)
• Used to study genome evolution (how the genome has been
rearranged through time) and to make inferences about gene
organization, repeated sequences, etc.
Overview of steps
• A map is constructed for a species using a set of markers
• The map is aligned to a reference map/ published map of a related
species
• Common loci/ regions are worked out
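The last step, working out common loci, amounts to intersecting the marker sets of the two maps. A minimal sketch follows, assuming each map is a dictionary from marker name to a (linkage group, position in cM) pair; the function name `common_loci` and this data layout are illustrative assumptions.

```python
from typing import Dict, List, Tuple

Locus = Tuple[str, float]  # (chromosome or linkage group, position in cM)


def common_loci(map_a: Dict[str, Locus],
                map_b: Dict[str, Locus]) -> List[Tuple[str, Locus, Locus]]:
    """Return markers mapped in both species with their position in each map."""
    shared = sorted(set(map_a) & set(map_b))
    return [(marker, map_a[marker], map_b[marker]) for marker in shared]
```

Each returned tuple pairs one marker with its location in both species, which is the raw material for drawing a comparative map.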
Comparative Genome Mapping of Sorghum and Maize
[Figure: sorghum linkage map produced from maize RFLP genomic probes, shown beside the map location of the RFLP loci in maize (comparative map, maize vs. sorghum)]
Gramene: A versatile tool for comparative mapping
Synteny Maps for Comparative Genomics
Synteny: defined as the preservation of the order of genes on a
chromosome
A synteny map shows syntenic regions and homologous loci from another
species aligned against a map of a target organism.
It can give us information about shared ancestry and evolutionary history, or
a key to functional relationships between genes.
Genes present in one species are likely to be present in closely related
species.
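Using the definition above, conserved gene order can be detected by scanning one species' gene order for runs that are also consecutive in the other. This is a simplified sketch: it assumes each gene name occurs once per genome, treats the genome as a single ordered list, and only finds blocks in the same orientation (inverted blocks are not detected). The name `syntenic_blocks` is hypothetical.

```python
from typing import List


def syntenic_blocks(order_a: List[str], order_b: List[str],
                    min_genes: int = 2) -> List[List[str]]:
    """Find runs of genes that are consecutive and in the same order in both lists."""
    pos_b = {gene: i for i, gene in enumerate(order_b)}
    blocks: List[List[str]] = []
    current: List[str] = []

    def close_block() -> None:
        # Keep the run only if it is long enough to count as a block.
        if len(current) >= min_genes:
            blocks.append(list(current))
        current.clear()

    for gene in order_a:
        if gene not in pos_b:
            close_block()  # gene missing in species B interrupts the run
        elif current and pos_b[gene] == pos_b[current[-1]] + 1:
            current.append(gene)  # run continues in species B as well
        else:
            close_block()
            current.append(gene)
    close_block()
    return blocks
```

Two genomes that differ by a single translocation, for instance, would be reported as two blocks rather than one.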
Rice-maize comparative synteny map
Goff et al. (2002, Science 296: 92-100)
• Rice shows great synteny with other cereals,
i.e. genes present in one
cereal will almost certainly be
present in the same order in
another
• Rice and maize share regions of
greater than 80% homology.
• Virtually every part of the
maize genome finds a
homologue in rice
Conservation of synteny between mouse
and human genomes
• Approximately 99% of mouse genes have a homolog in the human
genome.
• For 96% of them, the homolog lies within a similar conserved syntenic interval
in the human genome.
Mural et al., Science, 2002, 296:1661
Mouse chromosome 16 is syntenic with:
• Chr. 3, 8, 12, 16, 21, 22 of human
• Chr. 10, 11 of rat
Comparison of mouse chromosome 16
and the human genome
Q: Why more breakpoints in mouse-
human than in mouse-rat?
Q: Why more conserved genes in
human than in rat?
• The longer the divergence time between two
species, the more chromosomal rearrangements have accumulated
• 100 million years since human-mouse
divergence
• 40 million years since rat-mouse divergence
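The idea of a breakpoint can be made concrete by counting neighbouring gene pairs in a reference order that are no longer neighbours in another species. The sketch below is an illustration under simplifying assumptions (one linear gene list per genome, each gene once, orientation ignored); `count_breakpoints` is a hypothetical name.

```python
from typing import List


def count_breakpoints(reference: List[str], other: List[str]) -> int:
    """Count adjacent gene pairs in `reference` that are not adjacent in `other`."""
    shared = set(reference) & set(other)
    ref = [g for g in reference if g in shared]
    # Positions are taken after dropping genes absent from the reference,
    # so unshared genes do not create artificial breakpoints.
    pos = {g: i for i, g in enumerate(g for g in other if g in shared)}
    return sum(1 for x, y in zip(ref, ref[1:]) if abs(pos[x] - pos[y]) != 1)
```

Under this measure, a closely related genome with one small inversion scores fewer breakpoints than a distantly related, heavily reshuffled one, which is the pattern the two questions above point to.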
Bioinformatics Approaches:
Homology search in public databases
• Case: we have a gene/ protein sequence but no idea of its
function
• Subject the sequence to a homology/ sequence similarity search
across a database (BLAST search)
• Look for genes showing significant similarity
• A putative function can then be inferred
In bioinformatics, homology is inferred from sequence similarity (expressed
as % similarity/ identity)
BLAST: Basic Local Alignment Search Tool
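The search-and-infer workflow above can be illustrated with a toy database lookup. This sketch is not BLAST: it swaps in a much simpler measure (the fraction of shared 3-letter words, i.e. Jaccard similarity of k-mers) purely to show the idea of ranking database entries by similarity to a query. The function names and any database contents passed in are hypothetical.

```python
from typing import Dict, Set, Tuple


def kmers(seq: str, k: int = 3) -> Set[str]:
    """Return the set of overlapping words of length k in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity of the k-mer sets of two sequences (0.0 to 1.0)."""
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def best_hit(query: str, database: Dict[str, str], k: int = 3) -> Tuple[str, float]:
    """Return the name and score of the database entry most similar to the query."""
    scored = [(similarity(query, seq, k), name) for name, seq in database.items()]
    score, name = max(scored)
    return name, score
```

The annotation attached to the best-scoring entry would then serve as the putative function of the query, to be confirmed experimentally.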
An Example
• 5 rice varieties grown under heat stress
• Isolate protein from individual plants
• Analyze on SDS-PAGE (gel lanes: M 1 2 3 4 5)
• Isolate the fragment
• Get it sequenced
• Subject the sequence to BLAST (www.ncbi.nlm.nih.gov)
• Get an idea of the role/ function of the novel protein
• Correlate with phenotype and confirm the findings
Use of Sequence Alignment tools in comparative
genomics
Sequence alignment is a way of arranging the sequences to identify
regions of similarity that may be a consequence of functional, structural,
or evolutionary relationships between the sequences
[Figure: SEQ1 and SEQ2 shown as a pairwise alignment and as part of a multiple sequence alignment]
Progressive multiple alignment techniques build a guide tree, which can be
used to work out evolutionary relationships
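Pairwise alignment can be illustrated with the classic Needleman-Wunsch global alignment algorithm. This is a minimal sketch with a simple scoring scheme (match +1, mismatch -1, gap -1, all assumed values); tools such as ClustalW use more elaborate scoring and build multiple alignments on top of pairwise comparisons.

```python
from typing import Tuple


def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1,
                     gap: int = -1) -> Tuple[str, str, int]:
    """Globally align two sequences; return both gapped strings and the score."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]
```

The gapped strings returned have equal length, so matching positions can be read column by column to identify regions of similarity.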
What if the target sequence shows no similarity with
existing sequences?
Dead end?
No! This is the beginning of other advanced
comparative genomics approaches
Advanced Techniques of Comparative Genomics
Prerequisites:
• Enough data & tools for processing large amounts of data
• Development of new computational methods
• Advanced statistical tools
• Knowledge of algorithms
• “Informatics” techniques from applied maths, computer science and
statistics adapted to biological sequences
Prediction of functions via the ‘guilt by association’
principle
• Genes of related function are associated in various ways,
e.g. enzymes in a pathway, proteins in a complex
• Groups of genes having similar biochemical functions tend to
remain localized
E.g. genes required for synthesis of tryptophan (trp genes)
in E. coli and other prokaryotes
• Whatever a gene’s associates do, the gene probably has a
similar function
Proverbial principle: ‘Show me your friends and I’ll tell you
who you are’
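The ‘guilt by association’ principle can be sketched as a simple vote: an uncharacterized gene is assigned the function most common among its annotated associates. The example below is an illustration only; the function name `predict_function` and the dictionary-based data layout are assumptions, and real analyses weigh different kinds of evidence rather than counting them equally.

```python
from collections import Counter
from typing import Dict, List, Optional


def predict_function(gene: str,
                     associations: Dict[str, List[str]],
                     annotations: Dict[str, str]) -> Optional[str]:
    """Predict a gene's function as the most common function of its associates."""
    partners = associations.get(gene, [])
    # Only associates with a known annotation can cast a vote.
    votes = Counter(annotations[p] for p in partners if p in annotations)
    if not votes:
        return None  # no annotated associates, so no prediction
    return votes.most_common(1)[0][0]
```

For a gene clustered with two annotated trp genes and one unrelated gene, the sketch would return the tryptophan-related annotation, mirroring the trp example above.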
Plant association studies
• Evidence comes mainly from post-genomic studies
• Some plant post-genomic resources:
– Microarray analysis
– Organellar targeting prediction
– Proteomics & phenomics databases
[Figure: types of association evidence]
Genomic evidence: gene clustering, gene fusion, shared regulatory sites, phylogenetic occurrence
Post-genomic evidence: protein-protein interactions, organelle proteomes, co-expression, structures, essentiality & other phenome data
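Phylogenetic occurrence, one of the genomic evidence types in the figure, can be sketched as a comparison of presence/absence patterns: genes that occur in the same set of genomes are candidates for a shared function. The sketch assumes each gene has a tuple of 1 (present) or 0 (absent) values, one per genome, and only reports exact matches; `profile_matches` is a hypothetical name.

```python
from typing import Dict, List, Tuple


def profile_matches(profiles: Dict[str, Tuple[int, ...]], query: str) -> List[str]:
    """Return genes whose presence/absence pattern is identical to the query's."""
    target = profiles[query]
    return sorted(gene for gene, pattern in profiles.items()
                  if gene != query and pattern == target)
```

In practice a tolerance for a few mismatches is often useful, since gene loss or incomplete annotation can blur an otherwise shared pattern.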
Genome OnLine Database
Useful Websites
• COMPARATIVE GENOMICS VISUALIZATION TOOL
VISTA (www-gsd.lbl.gov/vista)
PipMaker (http://pipmaker.bx.psu.edu/pipmaker/)
• WHOLE GENOME ANNOTATION BROWSER
NCBI Map Viewer (www.ncbi.nlm.nih.gov/projects/mapview)
UCSC Genome Browser (genome.ucsc.edu)
Ensembl (www.ensembl.org)
• WHOLE GENOME COMPARATIVE GENOME BROWSER
UCSC Genome Browser (genome.ucsc.edu)
VISTA Genome Browser (www-gsd.lbl.gov/vista)
PipMaker (http://pipmaker.bx.psu.edu/pipmaker/)
• CUSTOM COMPARISON TO WHOLE GENOME
Genome Vista (AVID) (http://pipline.lbl.gov-cgi-bgenomevista)
UCSC Genome Browser (genome.ucsc.edu)
Ensembl (SSAHA) (www.ensembl.org)
NCBI (BLAST) (www.ncbi.nlm.nih.gov/BLAST)
General Databases Useful for Comparative
Genomics
• Locus Link/RefSeq
http://www.ncbi.nih.gov/LocusLink/
• PEDANT - Protein Extraction Description ANalysis Tool
http://pedant.gsf.de/
• MIPS - Mammalian Protein Interaction Database
http://mips.gsf.de/
• COGs - Clusters of Orthologous Groups (of proteins)
http://www.ncbi.nih.gov/COG/
• KEGG - Kyoto Encyclopedia of Genes and Genomes
http://www.genome.ad.jp/kegg/
• MBGD - Microbial Genome Database
http://mbgd.genome.ad.jp/
• GOLD - Genome OnLine Database
http://www.genomesonline.org/
• TOGA - TIGR Orthologous Gene Alignment
http://www.tigr.org/tdb/toga/toga.shtml
What we have learned by comparing genomes
• Comparing genomes helps us to understand the genetic basis of diversity in
organisms, both speciation & variation, & important aspects of
evolutionary biology.
• It provides “first pass” information on the function of a
putative gene based on the existence of conserved protein
sequences.
• Comparative genomics provides a powerful way to
analyze sequence data.
THANK YOU

Comparative genomics

  • 1. SEMINAR ON Genomics Genome Comparison Techniques, Advanced & Classical Approaches Presented By Jajati Keshari Nayak Dept. Molecular Biology GB PANT UNIVERSITY PhD 1st yr ID-55493
  • 2. GENOMICS Field of biology that attempts to understand the content, organization, function and evolution of genetic information contained in the whole genome Three Levels of Genome Research
  • 3. Structural Genomics EXON1 EXON2 EXON3 Structural genomics seeks to describe the structural features of genes & 3-dimensional structure of every protein encoded by a given genome •Sequence and size in bp •Map/ position •Repeats •Motifs/ domains etc
  • 4. Functional genomics Use of the vast wealth of data produced by genomic projects (such as genome sequencing projects) to describe gene (and protein) functions and interactions. Study of Functional transcript and protein product encoded by genes of a genome & their functional characterization Role of gene Time and tissue specific expression Eg.- Gene encoding Florigen hormone in plants
  • 5. What is Comparative Genomics? Analyzing & comparing different genomes for studying the gene content , function, organization & evolution of different organism Not a core technique but application of various techniques of genome sequencing, mapping, bioinformatics etc with the sole objective of comparing genomes
  • 6. What to compare? • Size of the genome: total number of base pairs • Genome organization: circular, linear, ss/ ds genome, extra chromosomal elements etc. •Percentage of the genome (coding) • Total number of predicted ORFs • Average length of ORF • Repetitive DNA/ junk DNA • Functional assignment
  • 7. • Paralogs & orthologs • Genomic organization & gene location/order • Gene structure – Exon number – Exon lengths – Intron lengths – Sequence similarity • Gene characteristics – Splice sites – Codon usage – Conserved synteny What to compare contd..Units of comparison
  • 8. Basic points of consideration in comparative genomics • All existing genomes had a common ancestor and that each organism is a combination of ancestor and the action of evolution. • New genes are derived from existing sequences • Information gained in one organism can have application in other even distantly related organisms. • The existing variation among current forms of living organisms is due to – Selection – Speciation – Divergence – Gene mutations/ duplications etc
  • 9. New genes are derived from existing sequences
  • 10. • Understanding the similarity & difference between the genomes that lead to special phenotypes or diseases. • Identifying genes and discovering their functions by studying their counterparts in other organisms. • Revealing the evolutionary relationships between different organisms. Purpose or Goals of Comparative Genomics
  • 11. “Nothing in biology makes sense except in the light of evolution” Theodosius Dobzhansky (1900 – 1975) All modern biological processes evolved from related processes. Every modern gene evolved from other genes Every gene has an ortholog in related species most genes have paralogs in the same species.
  • 12. Patterns of Gene Evolution CASE I CASE II Gene Orthologs Gene paralogs Homologous genes
  • 13. CASE III Homologous genes arise from common ancestral organism and show structural similarity (sequence) G1A & G2A are paralogs G1A & G1B OR G2A G2B are orthologs
  • 14.  Great deal of information on an organism can be extracted by examining their counterparts in simpler model organisms  Is conducted using model organisms  Model organisms offer a cost-effective way to follow the inheritance of genes through many generations in a relatively short time. Mammals: Homo sapiens, mouse Insects: Drosophila melanogaster Roundworms: C. elegans Fungi: Saccharomyces cerevisiae Bacteria: Escherichia coli Fish: Zebrafish  Arabidopsis thaliana: Model plant genome  Rice: Model cereal genome  Medicago trancatuala: Model tree genome How did it all start? The Concept of Model organisms OR Model genomes
  • 15. Genomes sequenced First bacterial genomes sequenced H.influenzae and M.genitalium The yeast genome 1995 1996 1997 E.coli K12 1998 C.elegans 1999 Full sequence of chr. 22 2000 D.melanogaster Genome & Chr. 21 Human draft 2001 A.thaliana •Mouse •Ciona •Rice •Fugu •Anopheles 2002 2003 Chimpanzee 2004 2005 •Human finished •Rat •Chicken Xenopus Zebrafish
  • 16. Techniques of Comparative Genomics • Use of molecular markers in plant genome analysis • Comparative genome maps • Studying Synteny • Whole genome sequencing • Bioinformatics tools: – Homology search in public databases (BLASTn, BLASTp etc) – Sequence alignment tools • ClustalW • ClustalX etc
  • 17. Molecular markers in plant genome analysis Comparative analysis of Aromatic rice varieties
  • 18. Comparative maps for genome comparison • Involves the use of molecular markers to map the genomes of two species for a common set of markers (loci) • Used to study genome evolution (how the genome has been rearranged through time) and to make inferences about gene organization, repeated sequences, etc. Overview of steps • A map is constructed for a species using a set of markers • The map is aligned to a reference/ published map of a related species • Common loci/ regions are worked out
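The last step, working out common loci, can be sketched as a set intersection over marker names. This is a minimal illustration assuming each map is a dictionary from marker name to (chromosome, position in cM); the marker names, positions, and the `common_loci` helper are hypothetical and not taken from any real map.

```python
def common_loci(target_map, reference_map):
    """Return markers placed on both maps, ordered by position on the target map.

    Each map is assumed to be {marker: (chromosome, position_cM)}.
    """
    shared = sorted(set(target_map) & set(reference_map),
                    key=lambda marker: target_map[marker])
    return [(m, target_map[m], reference_map[m]) for m in shared]


# Hypothetical maps for two related species.
target = {"m1": ("1", 10.0), "m2": ("1", 25.5), "m3": ("2", 5.0), "m4": ("2", 40.0)}
reference = {"m2": ("3", 12.0), "m3": ("6", 8.0), "m5": ("6", 30.0), "m1": ("3", 2.0)}

for marker, target_pos, reference_pos in common_loci(target, reference):
    print(marker, target_pos, reference_pos)
```

In this toy data, m1 and m2 lie on target chromosome 1 and both fall on reference chromosome 3, which is the kind of shared region a comparative map is meant to expose.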
  • 19. Comparative Genome Mapping of Sorghum and Maize (figure panels: sorghum linkage map produced from maize RFLP genomic probes; map location of RFLP loci in maize; comparative map, maize vs. sorghum)
  • 20. Gramene: A versatile tool for comparative mapping
  • 21. Synteny Maps for Comparative Genomics • Synteny: defined as the preservation of the order of genes on a chromosome. • A synteny map shows syntenic regions and homologous loci from another species aligned against a map of a target organism. • It can give information about shared ancestry, evolutionary history, or functional relationships between genes. • Genes present in one species are likely to be present in closely related species.
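Conserved gene order can be detected with a simple scan. The sketch below is a simplification under stated assumptions: each chromosome is a list of unique ortholog IDs, only same-orientation runs are reported, and a run ends as soon as the genes stop being adjacent in the second species. The function name `syntenic_blocks` and the gene IDs are hypothetical.

```python
def syntenic_blocks(order_a, order_b, min_len=2):
    """Find runs of genes that are consecutive and in the same order in both lists."""
    pos_b = {gene: i for i, gene in enumerate(order_b)}
    # Keep only genes present in both species, in species A order.
    shared = [gene for gene in order_a if gene in pos_b]
    blocks, current = [], []
    for gene in shared:
        if current and pos_b[gene] == pos_b[current[-1]] + 1:
            current.append(gene)          # extends the current collinear run
        else:
            if len(current) >= min_len:
                blocks.append(current)    # close the previous run
            current = [gene]
    if len(current) >= min_len:
        blocks.append(current)
    return blocks


species_a = ["g1", "g2", "g3", "g4", "g5", "g6"]
species_b = ["g4", "g5", "g6", "x", "g1", "g2", "g3"]
print(syntenic_blocks(species_a, species_b))
```

Each boundary between reported blocks corresponds to a break in conserved order; real synteny tools also handle inversions, gaps, and duplicated genes, which this sketch does not.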
  • 22. Rice-Maize comparative synteny map (Goff et al., 2002, Science 296: 92-100) • Rice shows great synteny with other cereals, i.e. genes present in one cereal will almost certainly be present in the same order in another. • The figure shows regions of homology between rice and maize of greater than 80%. • Virtually every part of the maize genome finds a homologue in rice.
  • 23. Conservation of synteny between mouse and human genomes (Mural et al., Science, 2002, 296:1661) • Approximately 99% of mouse genes have a homolog in the human genome. • For 96%, the homolog lies within a similar conserved syntenic interval in the human genome. • Mouse chromosome 16 is syntenic with: Chr. 3, 8, 12, 16, 21, 22 of human; Chr. 10, 11 of rat.
  • 24. Comparison of mouse chromosome 16 and the human genome • Q: Why are there more breakpoints in mouse-human than in mouse-rat? • Q: Why are there more conserved genes in human than in rat? • The longer the divergence time between two species, the more recombination has occurred. • 100 million years since human-mouse divergence • 40 million years since rat-mouse divergence
  • 25. Bioinformatics Approaches: Homology search in public databases • Case: We have a gene/ protein sequence with no idea of its function • Subject the sequence to a homology/ sequence similarity search across a database (BLAST search) • Look for genes showing significant similarity • A putative function can then be inferred • Homology in bioinformatics is inferred from sequence similarity (expressed as % similarity/ identity) • BLAST: Basic Local Alignment Search Tool
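The "% identity" figure mentioned above can be illustrated with a short calculation over two already-aligned sequences. This is a sketch, not how BLAST itself is implemented: it assumes the two strings have equal length with "-" marking gaps, and it divides matches by the number of alignment columns. The helper name `percent_identity` is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over the columns of a pairwise alignment ('-' = gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Ignore columns where both sequences carry a gap.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


print(percent_identity("ACGTACGT", "ACGTTCGT"))
```

A high percent identity over a long alignment is what makes a database hit "significant" enough to suggest a putative function; BLAST additionally reports statistical measures such as the E-value, which this sketch does not compute.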
  • 26. An Example • 5 rice varieties grown under heat stress • Isolate protein from individual plants • Analyze on SDS-PAGE (lanes: M, 1, 2, 3, 4, 5) • Isolate the fragment • Get it sequenced • Subject the sequence to BLAST • Get an idea of the role/ function of the novel protein • Correlate with phenotype and confirm the findings
  • 29. Use of Sequence Alignment tools in comparative genomics • Sequence alignment is a way of arranging sequences to identify regions of similarity that may be a consequence of functional, structural, or evolutionary relationships between the sequences. • Pairwise alignment: SEQ1 vs SEQ2. • Multiple sequence alignment: three or more sequences aligned together. • Progressive multiple alignment techniques produce a phylogenetic (guide) tree that can be used to work out evolutionary relationships.
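Pairwise alignment can be illustrated with the classic Needleman-Wunsch global alignment algorithm. The slides do not name a specific algorithm, so this choice is an assumption, as are the scoring values (match +1, mismatch -1, gap -2) and the function name.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of two sequences; returns (score, aligned_a, aligned_b)."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,   # align a[i-1] with b[j-1]
                              score[i - 1][j] + gap,       # gap in b
                              score[i][j - 1] + gap)       # gap in a
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


best, top, bottom = needleman_wunsch("GATTACA", "GATACA")
print(best)
print(top)
print(bottom)
```

Tools such as ClustalW build multiple alignments progressively from pairwise comparisons of this general kind, using more elaborate scoring schemes than the fixed values assumed here.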
  • 30. What if the target sequence shows no similarity with existing sequences? Dead end? No! This is the beginning of other, advanced comparative genomics approaches.
  • 31. Advanced Techniques of Comparative Genomics • Prerequisites: • Enough data & tools for processing large amounts of data • Development of new computational methods • Advanced statistical tools • Knowledge of algorithms • “Informatics” techniques from applied maths, computer science and statistics adapted to biological sequences
  • 32. Prediction of functions via the ‘guilt by association’ principle • Proverbial principle: ‘Show me your friends and I’ll tell you who you are’ • Genes of related function are associated in various ways, e.g. enzymes in a pathway, proteins in a complex • Groups of genes having similar biochemical function tend to remain localized, e.g. genes required for synthesis of tryptophan (trp genes) in E. coli and other prokaryotes • Whatever a gene’s associates do, the gene probably has a similar function
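The principle can be sketched as a majority vote over the known functions of a gene's associates. This is only an illustration: the association list, the annotation table, the gene name `orfX`, and the helper `predict_function` are hypothetical, and real methods weight the different kinds of evidence rather than counting them equally.

```python
from collections import Counter


def predict_function(gene, associations, annotations):
    """Guess a gene's function from the most common function among its associates."""
    votes = Counter(annotations[partner]
                    for partner in associations.get(gene, ())
                    if partner in annotations)
    if not votes:
        return None  # no annotated associates, so no prediction
    return votes.most_common(1)[0][0]


# Hypothetical example: an unannotated ORF clustered with trp genes.
associations = {"orfX": ["trpA", "trpB", "lacZ"]}
annotations = {
    "trpA": "tryptophan biosynthesis",
    "trpB": "tryptophan biosynthesis",
    "lacZ": "lactose metabolism",
}
print(predict_function("orfX", associations, annotations))
```

A prediction of this kind is only a hypothesis for follow-up experiments; a tie between functions is resolved here by whichever was seen first, which a real pipeline would handle more carefully.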
  • 33. Plant association studies • Evidence comes mainly from post-genomic studies • Some plant post-genomic resources: • Microarray analysis • Organellar targeting prediction • Proteomics & phenomics databases • Figure: genomic evidence (gene clustering, gene fusion, shared regulatory sites, phylogenetic occurrence) and post-genomic evidence (protein-protein interactions, organelle proteomes, co-expression, structures, essentiality & other phenome data)
  • 37. Useful Websites • Comparative genomics visualization tools: VISTA (www-gsd.lbl.gov/vista), PipMaker (http://pipmaker.bx.psu.edu/pipmaker/) • Whole genome annotation browsers: NCBI Map Viewer (www.ncbi.nlm.nih.gov/projects/mapview), UCSC Genome Browser (genome.ucsc.edu), Ensembl (www.ensembl.org) • Whole genome comparative genome browsers: UCSC Genome Browser (genome.ucsc.edu), VISTA genome browser (www-gsd.lbl.gov/vista), PipMaker (http://pipmaker.bx.psu.edu/pipmaker/) • Custom comparison to whole genome: Genome VISTA (AVID) (http://pipline.lbl.gov-cgi-bgenomevista), UCSC Genome Browser (genome.ucsc.edu), Ensembl (SSAHA) (www.ensembl.org), NCBI (BLAST) (www.ncbi.nlm.nih.gov/BLAST)
  • 38. General Databases Useful for Comparative Genomics • LocusLink/RefSeq: http://www.ncbi.nih.gov/LocusLink/ • PEDANT - Protein Extraction Description ANalysis Tool: http://pedant.gsf.de/ • MIPS - mammalian Protein Interaction Database: http://mips.gsf.de/ • COGs - Cluster of Orthologous Groups (of proteins): http://www.ncbi.nih.gov/COG/ • KEGG - Kyoto Encyclopedia of Genes and Genomes: http://www.genome.ad.jp/kegg/ • MBGD - Microbial Genome Database: http://mbgd.genome.ad.jp/ • GOLD - Genome OnLine Database: http://www.genomesonline.org/ • TOGA - TIGR Orthologous Gene Alignment: http://www.tigr.org/tdb/toga/toga.shtml
  • 39. What we have learned by comparing genomes • Comparing genomes helps us understand the genetic basis of diversity in organisms (both speciation & variation) and important aspects of evolutionary biology. • It provides “first pass” information on the function of a putative gene, based on the existence of conserved protein sequences. • Comparative genomics provides a powerful way to analyze sequence data.