SlideShare une entreprise Scribd logo
1  sur  46
FUNCTIONAL GENOMICS
PRESENTED BY:
AJAY
M.Sc. Biotechnology (P)
Roll no. 4
INTRODUCTION
• Genomics - It is the study of genomes.
• The field of genomics comprises of two main
areas:
1.Structural genomics
2. Functional genomics
• Structural genomics - deals with genome
structures with a focus on the study of
genome mapping and assembly as well as
genome annotation and comparison.
Functional genomics
• It is largely experiment based with a focus on
gene functions at the whole genome level
using high throughput approaches.
• The high throughput analysis of all expressed
genes is also termed transcriptome analysis
• Transcriptome analysis can be conducted by
two approaches:
1) sequence based approaches
2) microarray based approaches
Sequence based approaches
• Expressed sequence tags :- ESTs are short
sequences of cDNA typically 200-400
nucleotides in length.
• Obtained from either 5’ end or 3’ end of cDNA
inserts of cDNA library.
ADVANTAGES OF E.S.T
• Provide a rough estimate of genes that are
actively expressed in a genome under a
particular physiological condition.
• Help in discovering new genes, due to
random sequencing of cDNA clones.
• EST libraries can be easily generated.
DRAWBACKS OF USING E.S.Ts
• Automatically generated without verification
thus contain high error rates.
• There is often contamination by vector
sequence , introns, ribosomal RNA,
mitochondrial RNA.
• Weakly expressed genes are hardly found in
EST sequencing survey.
• ESTs represent only partial sequences of
genes.
E.S.Ts DATABASE
• dbEST is a EST database provided by GenBank.
That contains EST collections for a large
number of organisms.
• The database id regularly updated.
EST INDEX CONSTRUCTION
• Goal of EST database – organize and combine the
EST data to improve the quality of sequence
information.
“A collection of nonredundant and annotated
EST sequence is known as gene index construction”
• main advantage – reduces redundancy.
STEPS INVOLVED
Remove vector contaminants and masks repeats using Vecscreen
Clustering – associates EST sequences with unique genes
Derive consensus sequences by fusing redundant , overlapping ESTs to
correct errors
Results in longer contigs
Coding region is defined by excluding introns and 3’- untranslated
sequences.
Contd…
coding sequence translated into protein sequence
for db similarity searching.
Alignment of these complied ESTs with genomic
sequence, performed by program SIM4
Major EST index sequence databases are:
• Unigene – is an NCBI EST cluster database.
• TIGR Gene indices
SAGE
• Serial analysis of gene expression
• Another high throughput, Sequence-based
approach for gene expression profile analysis.
• More quantitative in determining mRNA
expression in cell.
• Short fragments (taqs) of DNA, excised from cDNA
sequences , act as unique markers of gene
transcript
• SAGE invented at Johns Hopkins University in USA
(Oncology Center) by Dr. Victor in 1995.
Principle Underlining SAGE methodology
• A short sequence tag (10-14bp) contains sufficient
information to uniquely identify a transcript.
• Sequence tag can be linked together to form long
serial molecules that can be cloned and
sequenced.
• the number of times a particular tag is observed
provides the expression level of the
corresponding transcript.
Steps In Brief…
1. Trapping of RNA with beads
Coating of microscopic magnetic beads with “TTTTT” tails is done.
2. cDNA synthesis
ds cDNA is synthesized from the extracted
mRNA by means of biotinylated oligo (dT)
primer.
• cDNA synthesis is immobilized to streptavidin
beads.
3. Enzymatic cleavage of cDNA
• Type II restriction enzyme used (E.g. NlaIII.)
4. Ligation of Linkers to bound cDNA
• Captured cDNA are then ligated to linkers at
their ends.
5. Cleaving with tagging enzyme
• Tagging enzyme, (usually BsmF1) cleave DNA,
releasing the linker-adapted SAGE tag from
each cDNA.
6. Formation of Ditags
• Two groups of cDNAs are ligated to each
other, to create a “ditag” with linkers on either
end.
• Two tags are linked together using T4 DNA
ligase.
7. PCR amplification of Ditags
• linker-ditag-linker constructs are amplified by
PCR
PCR Amplification
8. Isolation of Ditags
• The cDNA is again digested by the Anchoring
enzyme (AE)
• Breaking the linker off right where it was added in
beginning.
9. Concatamerization of Ditags
• Tags are combined into much longer
molecules, called concatamers
Cloning Concatamers and Sequencing…
• Concatamers are inserted into bacteria, which
act like living “copy machines” to create
millions of copies from original.
• Copies are then sequenced, using machines
that can read the nucleotides in DNA.
• Software tools for SAGE analysis:
SAGE map
SAGE xprofiler
SAGE Genie
• Advantages over EST analysis:
a. Detect weakly expressed genes
b. It uses a short nucleotide tag and allows
sequencing of multiple tags in a single clone
MICROARRAY-BASED APPROACHES
A microarray is a pattern of ssDNA probes which are
immobilized on a surface called a chip or a slide.
• Microarrays use hybridization to detect a specific
DNA or RNA in a sample.
• DNA microarray uses a million different probes,
fixed on a solid surface.
• Microarray technology evolved from Southern
blotting.
WHY?
• To analyze the expression of thousands of
genes in single reaction, very quickly and in an
efficient manner.
• To understand the genetic causes for the
abnormal functioning of the human body.
• To understand which genes are active and
which genes are inactive in different cell
types.
OLIGONUCLEOTIDE DESIGN
• Fixed oligonucleotide onto a solid support such as glass
slide
• Length of oligonucleotides is in range of 25-70 bases
long
• Oligonucleotides are called probes (should be specific
to minimize cross-hybridization)
• Oligonucleotide should not form stable internal
secondary structure, A program M fold helps to detect
secondary str.
• All probes should have approx. equal Tm.
• OligoWiz & OligoArray are 2 programs used in
designing probe for microarray spotting.
STEPS
Sample
preparation
and
labeling
Hybridisation Washing
Image processing
and
Data analysis
1 .Sample preparation and labeling
• Isolate a total RNA containing mRNA that
ideally represents genes, that are expressed at
the time of sample collection.
• Preparation of cDNA from mRNA using a
reverse-transcriptase enzyme.
• Short primer is required to initiate cDNA
synthesis.
• Each cDNA (Sample and Control) is labelled with
fluorescent cyanine dyes (i.e. Cy3 and Cy5).
2. Hybridization
• labelled cDNA is competitively hybridized against
cDNA molecules spotted on a glass slide.
3. Image acquisition and data analysis
Scanning hybridization signals
Image analysis
Transformation and normalization of data
Analyzing data
Diagram
Image processing
• To locate and quantify hybridization spots
and to separate true hybridization signals from
background noise ( may be non specific hybridization,
unevenness of the slide surface, presence of
contaminants such as dust on surface of slide).
1. Computer programs are used to correctly locate the
boundaries of spots.
2. Arrayed signals are converted into numbers and
observed as ratio between cy5 and cy3 for each spot.
3. data are often presented as false colour.
4. If ratios value are above 1 the false colour appears as
red if it is below 1 then appear as green.
• Software programs to perform microarray image
analysis:
• ArrayDB
• TIGR Spotfinder
Data transformation and normalization
• Digitized gene expression data need to be further
processed before differentially expressed gene can
be identified.
• When raw fluorescence intensity cy5 is plotted
against cy3, most of data are clustered near bottom
left of plot.
• -It shows non- normal distribution of raw data.
• -It represents imbalance of red & green intensities.
Plot of raw fluorescence signal
intensities of Cy5 versus Cy3
Improvement in data quality
1). Transform raw cy5 & cy3 values by taking log to the
base of 2.
It produces uniform distribution of data.
2). Further normalization of data is done by ploting the
data points horizontally
plot the log ratios(cy5/cy3) against the average log
intensities (log2 cy5 + log2 cy3)/2
• This form is known as “intensity- ratio plot” In which
differentially expressed genes are more easily visible
about a horizontal axis.
• In all these types linear regression is used.
B. Plot of data after log transformation
C. Plot of mean log intensity v/s log ratio of two fluorescence intensities
• Software programs for microarray data
normalization:
Arrayplot
SNOMAD
Statistical Analysis to identify
Differentially Expressed Genes
• Differentially expressed genes are separated, using
normalization cutoff of generally two fold.
• But it is arbitrary cutoff value depending on data
variability.
• Ways to ensure that gene is truly differentially
expressed :
1)Multiple replicate experiments
2)Statistical testing
1. Multiple replicate experiment:
• Repeat experiments provide replicate data points
that provide information about variability of
expression data.
• Helps to identify differentially expressed genes.
2. Statistical Tests:
• t-Test
&
• ANOVA (Analysis of variance).
• Other Statistical Programs
1)MA-ANOVA
2)Cyber-T
• Advantage of microarray based approach:
To study the expression of many genes in
parallel & identify groups of genes that exhibit
similar expression patterns.
Comparison of SAGE &Microarray
based approach
SAGE
1)Doesn’t require prior
knowledge of transcript
sequence.
2)SAGE measures absolute
mRNA expression level.
3)A minute quantity of sample
mRNA is required.
4)It is expensive & gene
identification is difficult.
1)Detect only the genes
spotted on microarray.
2)It indicates the relative
expression level.
3)It requires larger quantity of
mRNA sample.
4)It is less expensive &
identity of probes are
already known.
DNA Microarray
THANK YOU

Contenu connexe

Tendances

Analysis of gene expression
Analysis of gene expressionAnalysis of gene expression
Analysis of gene expression
Tapeshwar Yadav
 

Tendances (20)

NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
 
SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)SAGE (Serial analysis of Gene Expression)
SAGE (Serial analysis of Gene Expression)
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Maxam–Gilbert sequencing
Maxam–Gilbert sequencingMaxam–Gilbert sequencing
Maxam–Gilbert sequencing
 
Molecular mapping
Molecular mappingMolecular mapping
Molecular mapping
 
Types of genomics ppt
Types of genomics pptTypes of genomics ppt
Types of genomics ppt
 
Genomics types
Genomics typesGenomics types
Genomics types
 
Sts
StsSts
Sts
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Analysis of gene expression
Analysis of gene expressionAnalysis of gene expression
Analysis of gene expression
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
 
PHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICSPHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICS
 
Applications of genomics and proteomics ppt
Applications of genomics and  proteomics pptApplications of genomics and  proteomics ppt
Applications of genomics and proteomics ppt
 
Gene mapping methods
Gene mapping methodsGene mapping methods
Gene mapping methods
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
 
DNA microarray
DNA microarrayDNA microarray
DNA microarray
 

En vedette

Transcription and Translation PowerPoint
Transcription and Translation PowerPointTranscription and Translation PowerPoint
Transcription and Translation PowerPoint
BiologyIB
 
Nuclear medicine
Nuclear medicineNuclear medicine
Nuclear medicine
Rad Tech
 
Central dogma of molecular biology
Central dogma of molecular biologyCentral dogma of molecular biology
Central dogma of molecular biology
Omar Jacalne
 

En vedette (20)

Difference between prokaryotic and eukaryotic transcription
Difference between prokaryotic and eukaryotic transcriptionDifference between prokaryotic and eukaryotic transcription
Difference between prokaryotic and eukaryotic transcription
 
Endoplasmic reticulum
Endoplasmic reticulumEndoplasmic reticulum
Endoplasmic reticulum
 
Central dogma
Central dogmaCentral dogma
Central dogma
 
Central dogma of molecular biology 20 11-2015
Central dogma of molecular biology 20 11-2015Central dogma of molecular biology 20 11-2015
Central dogma of molecular biology 20 11-2015
 
Transcription and Translation PowerPoint
Transcription and Translation PowerPointTranscription and Translation PowerPoint
Transcription and Translation PowerPoint
 
Central Dogma and Protein Synthesis
Central Dogma and Protein SynthesisCentral Dogma and Protein Synthesis
Central Dogma and Protein Synthesis
 
PCR
PCRPCR
PCR
 
Pcr
PcrPcr
Pcr
 
Gene therapy- An introduction & Concept
Gene therapy- An introduction & ConceptGene therapy- An introduction & Concept
Gene therapy- An introduction & Concept
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Pcr ii
Pcr iiPcr ii
Pcr ii
 
Nuclear medicine
Nuclear medicineNuclear medicine
Nuclear medicine
 
Pcr 29 07-2011 final
Pcr 29 07-2011 finalPcr 29 07-2011 final
Pcr 29 07-2011 final
 
Concepts in molecular biology
Concepts in molecular biologyConcepts in molecular biology
Concepts in molecular biology
 
PCR,polymerase chain reaction.Basic concept of PCR.
PCR,polymerase chain reaction.Basic concept of PCR.PCR,polymerase chain reaction.Basic concept of PCR.
PCR,polymerase chain reaction.Basic concept of PCR.
 
Central dogma of molecular biology
Central dogma of molecular biologyCentral dogma of molecular biology
Central dogma of molecular biology
 
Stem cell therapy
Stem cell therapyStem cell therapy
Stem cell therapy
 
Concept and basics of genetics
Concept and basics of geneticsConcept and basics of genetics
Concept and basics of genetics
 
Genomics
GenomicsGenomics
Genomics
 
BASICS OF MOLECULAR BIOLOGY
BASICS OF MOLECULAR BIOLOGYBASICS OF MOLECULAR BIOLOGY
BASICS OF MOLECULAR BIOLOGY
 

Similaire à Functional genomics

20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
Computer Science Club
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
Bia Khan
 

Similaire à Functional genomics (20)

31931 31941
31931 3194131931 31941
31931 31941
 
Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomics
 
DNA MICROARRAY TECHNOLOGY FOR PRINCIPLE OF DRUG DISCOVERY
DNA  MICROARRAY  TECHNOLOGY FOR  PRINCIPLE OF DRUG DISCOVERYDNA  MICROARRAY  TECHNOLOGY FOR  PRINCIPLE OF DRUG DISCOVERY
DNA MICROARRAY TECHNOLOGY FOR PRINCIPLE OF DRUG DISCOVERY
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
 
Microarray
MicroarrayMicroarray
Microarray
 
genomeannotation-160822182432.pdf
genomeannotation-160822182432.pdfgenomeannotation-160822182432.pdf
genomeannotation-160822182432.pdf
 
Genome annotation
Genome annotationGenome annotation
Genome annotation
 
12 arrays
12 arrays12 arrays
12 arrays
 
12 arrays
12 arrays12 arrays
12 arrays
 
Microarray technique
Microarray techniqueMicroarray technique
Microarray technique
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarray
 
Microbial physiology in genomic era
Microbial physiology in genomic eraMicrobial physiology in genomic era
Microbial physiology in genomic era
 
SAGE- Serial Analysis of Gene Expression
SAGE- Serial Analysis of Gene ExpressionSAGE- Serial Analysis of Gene Expression
SAGE- Serial Analysis of Gene Expression
 
Genomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptxGenomics_Aishwarya Teli.pptx
Genomics_Aishwarya Teli.pptx
 
Microarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome studyMicroarray and dna chips for transcriptome study
Microarray and dna chips for transcriptome study
 
DNA Microarray introdution and application
DNA Microarray introdution and applicationDNA Microarray introdution and application
DNA Microarray introdution and application
 
Seq 301116
Seq 301116Seq 301116
Seq 301116
 
artificial neural network-gene prediction
artificial neural network-gene predictionartificial neural network-gene prediction
artificial neural network-gene prediction
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
 

Dernier

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
AlMamun560346
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
Bhagirath Gogikar
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 

Dernier (20)

9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 

Functional genomics

  • 1. FUNCTIONAL GENOMICS PRESENTED BY: AJAY M.Sc. Biotechnology (P) Roll no. 4
  • 2. INTRODUCTION • Genomics - It is the study of genomes. • The field of genomics comprises of two main areas: 1.Structural genomics 2. Functional genomics • Structural genomics - deals with genome structures with a focus on the study of genome mapping and assembly as well as genome annotation and comparison.
  • 3. Functional genomics • It is largely experiment based with a focus on gene functions at the whole genome level using high throughput approaches. • The high throughput analysis of all expressed genes is also termed transcriptome analysis • Transcriptome analysis can be conducted by two approaches: 1) sequence based approaches 2) microarray based approaches
  • 4. Sequence based approaches • Expressed sequence tags :- ESTs are short sequences of cDNA typically 200-400 nucleotides in length. • Obtained from either 5’ end or 3’ end of cDNA inserts of cDNA library.
  • 5. ADVANTAGES OF E.S.T • Provide a rough estimate of genes that are actively expressed in a genome under a particular physiological condition. • Help in discovering new genes, due to random sequencing of cDNA clones. • EST libraries can be easily generated.
  • 6. DRAWBACKS OF USING E.S.Ts • Automatically generated without verification thus contain high error rates. • There is often contamination by vector sequence , introns, ribosomal RNA, mitochondrial RNA. • Weakly expressed genes are hardly found in EST sequencing survey. • ESTs represent only partial sequences of genes.
  • 7. E.S.Ts DATABASE • dbEST is a EST database provided by GenBank. That contains EST collections for a large number of organisms. • The database id regularly updated.
  • 8. EST INDEX CONSTRUCTION • Goal of EST database – organize and combine the EST data to improve the quality of sequence information. “A collection of nonredundant and annotated EST sequence is known as gene index construction” • main advantage – reduces redundancy.
  • 9. STEPS INVOLVED Remove vector contaminants and masks repeats using Vecscreen Clustering – associates EST sequences with unique genes Derive consensus sequences by fusing redundant , overlapping ESTs to correct errors Results in longer contigs Coding region is defined by excluding introns and 3’- untranslated sequences.
  • 10. Contd… coding sequence translated into protein sequence for db similarity searching. Alignment of these complied ESTs with genomic sequence, performed by program SIM4
  • 11. Major EST index sequence databases are: • Unigene – is an NCBI EST cluster database. • TIGR Gene indices
  • 12. SAGE • Serial analysis of gene expression • Another high throughput, Sequence-based approach for gene expression profile analysis. • More quantitative in determining mRNA expression in cell. • Short fragments (taqs) of DNA, excised from cDNA sequences , act as unique markers of gene transcript • SAGE invented at Johns Hopkins University in USA (Oncology Center) by Dr. Victor in 1995.
  • 13. Principle Underlining SAGE methodology • A short sequence tag (10-14bp) contains sufficient information to uniquely identify a transcript. • Sequence tag can be linked together to form long serial molecules that can be cloned and sequenced. • the number of times a particular tag is observed provides the expression level of the corresponding transcript.
  • 14. Steps In Brief… 1. Trapping of RNA with beads Coating of microscopic magnetic beads with “TTTTT” tails is done.
  • 15.
  • 16. 2. cDNA synthesis ds cDNA is synthesized from the extracted mRNA by means of biotinylated oligo (dT) primer. • cDNA synthesis is immobilized to streptavidin beads.
  • 17.
  • 18. 3. Enzymatic cleavage of cDNA • Type II restriction enzyme used (E.g. NlaIII.)
  • 19. 4. Ligation of Linkers to bound cDNA • Captured cDNA are then ligated to linkers at their ends.
  • 20. 5. Cleaving with tagging enzyme • Tagging enzyme, (usually BsmF1) cleave DNA, releasing the linker-adapted SAGE tag from each cDNA.
  • 21. 6. Formation of Ditags • Two groups of cDNAs are ligated to each other, to create a “ditag” with linkers on either end. • Two tags are linked together using T4 DNA ligase.
  • 22. 7. PCR amplification of Ditags • linker-ditag-linker constructs are amplified by PCR PCR Amplification
  • 23. 8. Isolation of Ditags • The cDNA is again digested by the Anchoring enzyme (AE) • Breaking the linker off right where it was added in beginning.
  • 24. 9. Concatamerization of Ditags • Tags are combined into much longer molecules, called concatamers
  • 25. Cloning Concatamers and Sequencing… • Concatamers are inserted into bacteria, which act like living “copy machines” to create millions of copies from original. • Copies are then sequenced, using machines that can read the nucleotides in DNA.
  • 26. • Software tools for SAGE analysis: SAGE map SAGE xprofiler SAGE Genie • Advantages over EST analysis: a. Detect weakly expressed genes b. It uses a short nucleotide tag and allows sequencing of multiple tags in a single clone
  • 27. MICROARRAY-BASED APPROACHES A microarray is a pattern of ssDNA probes which are immobilized on a surface called a chip or a slide. • Microarrays use hybridization to detect a specific DNA or RNA in a sample. • DNA microarray uses a million different probes, fixed on a solid surface. • Microarray technology evolved from Southern blotting.
  • 28. WHY? • To analyze the expression of thousands of genes in single reaction, very quickly and in an efficient manner. • To understand the genetic causes for the abnormal functioning of the human body. • To understand which genes are active and which genes are inactive in different cell types.
  • 29. OLIGONUCLEOTIDE DESIGN • Fixed oligonucleotide onto a solid support such as glass slide • Length of oligonucleotides is in range of 25-70 bases long • Oligonucleotides are called probes (should be specific to minimize cross-hybridization) • Oligonucleotide should not form stable internal secondary structure, A program M fold helps to detect secondary str. • All probes should have approx. equal Tm. • OligoWiz & OligoArray are 2 programs used in designing probe for microarray spotting.
  • 31. 1 .Sample preparation and labeling • Isolate a total RNA containing mRNA that ideally represents genes, that are expressed at the time of sample collection. • Preparation of cDNA from mRNA using a reverse-transcriptase enzyme. • Short primer is required to initiate cDNA synthesis. • Each cDNA (Sample and Control) is labelled with fluorescent cyanine dyes (i.e. Cy3 and Cy5).
  • 32. 2. Hybridization • labelled cDNA is competitively hybridized against cDNA molecules spotted on a glass slide.
  • 33. 3. Image acquisition and data analysis Scanning hybridization signals Image analysis Transformation and normalization of data Analyzing data
  • 35. Image processing • To locate and quantify hybridization spots and to separate true hybridization signals from background noise ( may be non specific hybridization, unevenness of the slide surface, presence of contaminants such as dust on surface of slide). 1. Computer programs are used to correctly locate the boundaries of spots. 2. Arrayed signals are converted into numbers and observed as ratio between cy5 and cy3 for each spot. 3. data are often presented as false colour. 4. If ratios value are above 1 the false colour appears as red if it is below 1 then appear as green.
  • 36. • Software programs to perform microarray image analysis: • ArrayDB • TIGR Spotfinder
  • 37. Data transformation and normalization • Digitized gene expression data need to be further processed before differentially expressed gene can be identified. • When raw fluorescence intensity cy5 is plotted against cy3, most of data are clustered near bottom left of plot. • -It shows non- normal distribution of raw data. • -It represents imbalance of red & green intensities.
  • 38. Plot of raw fluorescence signal intensities of Cy5 versus Cy3
  • 39. Improvement in data quality 1). Transform raw cy5 & cy3 values by taking log to the base of 2. It produces uniform distribution of data. 2). Further normalization of data is done by ploting the data points horizontally plot the log ratios(cy5/cy3) against the average log intensities (log2 cy5 + log2 cy3)/2 • This form is known as “intensity- ratio plot” In which differentially expressed genes are more easily visible about a horizontal axis. • In all these types linear regression is used.
  • 40. B. Plot of data after log transformation C. Plot of mean log intensity v/s log ratio of two fluorescence intensities
  • 41. • Software programs for microarray data normalization: Arrayplot SNOMAD
  • 42. Statistical Analysis to identify Differentially Expressed Genes • Differentially expressed genes are separated, using normalization cutoff of generally two fold. • But it is arbitrary cutoff value depending on data variability. • Ways to ensure that gene is truly differentially expressed : 1)Multiple replicate experiments 2)Statistical testing
  • 43. 1. Multiple replicate experiment: • Repeat experiments provide replicate data points that provide information about variability of expression data. • Helps to identify differentially expressed genes. 2. Statistical Tests: • t-Test & • ANOVA (Analysis of variance).
  • 44. • Other Statistical Programs 1)MA-ANOVA 2)Cyber-T • Advantage of microarray based approach: To study the expression of many genes in parallel & identify groups of genes that exhibit similar expression patterns.
  • 45. Comparison of SAGE &Microarray based approach SAGE 1)Doesn’t require prior knowledge of transcript sequence. 2)SAGE measures absolute mRNA expression level. 3)A minute quantity of sample mRNA is required. 4)It is expensive & gene identification is difficult. 1)Detect only the genes spotted on microarray. 2)It indicates the relative expression level. 3)It requires larger quantity of mRNA sample. 4)It is less expensive & identity of probes are already known. DNA Microarray