SlideShare une entreprise Scribd logo
1  sur  29
Télécharger pour lire hors ligne
Functional Annotation
      and the Gene Ontology


          Brett Tyler
Virginia Bioinformatics Institute
What is Annotation

• comments, notes, explanations, or other
  types of external remarks that can be
  attached to a document……
• For genomics
 functional annotation means attaching
 biological information to sequences
Functional Annotation
                        Structural Annotation


                                Searches


Nucleotide/Protein Databases                     Domain/Motifs



                                Assignments



       EC Number                Metabolic        Automated GO
                                Pathways



                               Manual curation
Functional Annotation
                     Structural Annotation


                          Searches


Nucleotide/Protein Databases                 Domain/Motifs



                           Assignments



    EC Number              Metabolic         Automated GO
                           Pathways



                         Manual curation
Automated Searches

• Search programs can be downloaded
  and run internally on unix system
• Graphic user interfaces but normally
  takes limited sequences
Homology or similarity based
               searches
•   Local pairwise alignment tools : look for any regions of
    similarity within the proteins that score well.
     – BLAST
         • fast

•   Global pairwise alignment tools take two sequences and
    attempt to find an alignment of the two over their full lengths.
     – Needleman-Wunsch
         • finds best out of all possible alignments

•   Multiple alignments tools try to align 3 or more proteins so that
    the maximal number of amino acids from each protein are
    matched in the alignment - this may or may not include the full
    length of some or all of the proteins
     – clustalW
BLAST Programs
• Blastn: Search a nucleotide database using
  a nucleotide query
• BlastP: Search protein database using a
  protein query
• Blastx: Search protein database using a
  translated nucleotide query
• Tblastn: Search translated nucleotide
  database using a protein query
• Tblastx: Search translated nucleotide
  database using a translated nucleotide
  query
Example of BLAST output




top row is the search protein (query) and the bottom row is the match protein
(subject).
Middle row is consensus
+ indicates similar amino acids
numbers indicate amino acid position in the sequence
Functional Annotation
                     Structural Annotation


                          Searches


Nucleotide/Protein Databases                 Domain/Motifs



                           Assignments


                               Metabolic
    EC Number                                  Automated GO
                               Pathways



                         Manual curation
Domain Search

Hidden Markov Models
• Stastistical models of the primary
  structure consensus of a sequence
  family
Pfam
         http://pfam.sanger.ac.uk/

• Large collection of protein families
  represented by multiple sequence
  alignments and HMMs
• Analyze protein sequences for Pfam match
• Look at multiple alignments of members of
  the gene family
INTERPRO
         http://www.ebi.ac.uk/interpro/

• Database of protein families, domains and
  sites with identified in known proteins
  which can be applied to new protein
  sequences
• Collects protein families from other
  databases such as Pfam, UniProtKb and
  TIGRFAMs
• Sequence search is done with InterProScan
      Downloadable (rans faster on own
  server, large set)
      GUI (limited number of sequences)
Subcellular localization

• Signal P:Predicts the presence and location
  of signal peptide and cleavage sites in
  organism
• TMHMM: Predicts transmembrane
• TargetP:Predicts subcellular location based
  on chlroplast transit peptide and
  mitochondrial targeting sequence
Signal P Search
http://www.cbs.dtu.dk/services/SignalP/
Sample SignalP Output
CRN2…confirmed with proteomics
Sample SignalP Output
CRN2…confirmed with proteomics
Search EC numbers
http://ca.expasy.org/enzyme/
Functional Annotation
                     Structural Annotation


                          Searches


Nucleotide/Protein Databases                 Domain/Motifs



                           Assignments



    EC Number              Metabolic         Automated GO
                           Pathways



                         Manual curation
Metabolic Pathways

•Help improve annotation by showing missing
genes in essentail pathways
•Useful for comparative genomics
KEGG:
http://www.genome.jp/kegg/pathway.html
Reactome: http://www.reactome.org
Metacyc:http://www.metacyc.org

Add lots of others
KEGG: Kyoto Encyclopedia of Genes and
               Genomes
http://www.genome.jp/kegg/pathway.html
Functional Annotation
                     Structural Annotation


                          Searches


Nucleotide/Protein Databases                 Domain/Motifs



                           Assignments



    EC Number              Metabolic         Automated GO
                           Pathways



                         Manual curation
Some initial PAMGO Biological Process Terms
      Included in initial 35 terms added Jan 2005

           First set of terms




 These processes are general to all associations
GO: 0052048 interaction with host via secreted substance
    GO: 0052044 induction by symbiont of host programmed cell death

bacterium




oomycete
GO: 0052048 interaction with host via secreted substance
    GO: 0052044 induction by symbiont of host programmed cell death

bacterium




oomycete



     GO: 0052048 interaction with host via secreted substance
     GO: 0052044 induction by symbiont of host programmed cell death
     GO: 0009405 pathogenesis
Functional Annotation
                     Structural Annotation


                          Searches


Nucleotide/Protein Databases                 Domain/Motifs



                           Assignments



    EC Number              Metabolic                  Automated GO
                           Pathways



                         Manual curation
Why manual Annotation
Combine all search information and evidence
Manually look through all information
Add experimental data from literature when available
Approach conservatively

Setback
Time-consuming and more expensive.

Contenu connexe

Tendances

Tendances (20)

Primary and secondary database
Primary and secondary databasePrimary and secondary database
Primary and secondary database
 
Sequence file formats
Sequence file formatsSequence file formats
Sequence file formats
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Structural genomics
Structural genomicsStructural genomics
Structural genomics
 
Biological databases
Biological databases Biological databases
Biological databases
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Ppt of genome annotatioon 2
Ppt of genome annotatioon 2Ppt of genome annotatioon 2
Ppt of genome annotatioon 2
 
Biological data bioinformatics
Biological data bioinformatics Biological data bioinformatics
Biological data bioinformatics
 
Computational Genomics - Bioinformatics - IK
Computational Genomics - Bioinformatics - IKComputational Genomics - Bioinformatics - IK
Computational Genomics - Bioinformatics - IK
 
Functional annotation
Functional annotationFunctional annotation
Functional annotation
 
Chromosome walking jumping transposon tagging map based cloning
Chromosome walking jumping transposon tagging map based cloningChromosome walking jumping transposon tagging map based cloning
Chromosome walking jumping transposon tagging map based cloning
 
Biological Database
Biological DatabaseBiological Database
Biological Database
 
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICSSTRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
 
genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Functional annotation.pptx
Functional annotation.pptxFunctional annotation.pptx
Functional annotation.pptx
 
Est database
Est databaseEst database
Est database
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Bioinformatics on internet
Bioinformatics on internetBioinformatics on internet
Bioinformatics on internet
 
Biological Databases
Biological DatabasesBiological Databases
Biological Databases
 

En vedette

BT631-6-structural_motifs
BT631-6-structural_motifsBT631-6-structural_motifs
BT631-6-structural_motifsRajesh G
 
Combining sequence motifs and protein interactions to unravel complex phospho...
Combining sequence motifs and protein interactions to unravel complex phospho...Combining sequence motifs and protein interactions to unravel complex phospho...
Combining sequence motifs and protein interactions to unravel complex phospho...Lars Juhl Jensen
 
Systematic discovery of phosphorylation networks - Combining linear motifs an...
Systematic discovery of phosphorylation networks - Combining linear motifs an...Systematic discovery of phosphorylation networks - Combining linear motifs an...
Systematic discovery of phosphorylation networks - Combining linear motifs an...Lars Juhl Jensen
 
Dna binding protein(motif)
Dna binding protein(motif)Dna binding protein(motif)
Dna binding protein(motif)mamad416
 
Motif presentation
Motif presentationMotif presentation
Motif presentationAmir Razmjou
 
HELIX-LOOP-HELIX, HELIX-TURN-HELIX
HELIX-LOOP-HELIX, HELIX-TURN-HELIXHELIX-LOOP-HELIX, HELIX-TURN-HELIX
HELIX-LOOP-HELIX, HELIX-TURN-HELIXnaren
 
Protein structure: details
Protein structure: detailsProtein structure: details
Protein structure: detailsdamarisb
 

En vedette (9)

Discovery Of Functional Protein Linear Motifs Using a Greaddy Algorithm and I...
Discovery Of Functional Protein Linear Motifs Using a Greaddy Algorithm and I...Discovery Of Functional Protein Linear Motifs Using a Greaddy Algorithm and I...
Discovery Of Functional Protein Linear Motifs Using a Greaddy Algorithm and I...
 
BT631-6-structural_motifs
BT631-6-structural_motifsBT631-6-structural_motifs
BT631-6-structural_motifs
 
Combining sequence motifs and protein interactions to unravel complex phospho...
Combining sequence motifs and protein interactions to unravel complex phospho...Combining sequence motifs and protein interactions to unravel complex phospho...
Combining sequence motifs and protein interactions to unravel complex phospho...
 
Systematic discovery of phosphorylation networks - Combining linear motifs an...
Systematic discovery of phosphorylation networks - Combining linear motifs an...Systematic discovery of phosphorylation networks - Combining linear motifs an...
Systematic discovery of phosphorylation networks - Combining linear motifs an...
 
Dna binding protein(motif)
Dna binding protein(motif)Dna binding protein(motif)
Dna binding protein(motif)
 
Motif presentation
Motif presentationMotif presentation
Motif presentation
 
Proteins biochem
Proteins biochemProteins biochem
Proteins biochem
 
HELIX-LOOP-HELIX, HELIX-TURN-HELIX
HELIX-LOOP-HELIX, HELIX-TURN-HELIXHELIX-LOOP-HELIX, HELIX-TURN-HELIX
HELIX-LOOP-HELIX, HELIX-TURN-HELIX
 
Protein structure: details
Protein structure: detailsProtein structure: details
Protein structure: details
 

Similaire à Tyler functional annotation thurs 1120

Mapping protein to function
Mapping protein to functionMapping protein to function
Mapping protein to functionAbhik Seal
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformaticsAtai Rabby
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS
 
GLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics WorkshopGLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics WorkshopMorgan Langille
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsAyeshaYousaf20
 
Curation Introduction - Apollo Workshop
Curation Introduction - Apollo WorkshopCuration Introduction - Apollo Workshop
Curation Introduction - Apollo WorkshopMonica Munoz-Torres
 
proteomic and Genomics and the available proteomic technologies and the data ...
proteomic and Genomics and the available proteomic technologies and the data ...proteomic and Genomics and the available proteomic technologies and the data ...
proteomic and Genomics and the available proteomic technologies and the data ...SamiMohamed28
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Sijo A
 
PomBase conventions for improving annotation depth, breadth, consistency and ...
PomBase conventions for improving annotation depth, breadth, consistency and ...PomBase conventions for improving annotation depth, breadth, consistency and ...
PomBase conventions for improving annotation depth, breadth, consistency and ...Valerie Wood
 
Sequencedatabases
SequencedatabasesSequencedatabases
SequencedatabasesAbhik Seal
 
Apollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citriApollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citriMonica Munoz-Torres
 
Lecture__on__Proteomics_Introduction.ppt
Lecture__on__Proteomics_Introduction.pptLecture__on__Proteomics_Introduction.ppt
Lecture__on__Proteomics_Introduction.pptSachin Teotia
 
Protein Chemistry-Proteomics-Lec1_Intro.ppt
Protein Chemistry-Proteomics-Lec1_Intro.pptProtein Chemistry-Proteomics-Lec1_Intro.ppt
Protein Chemistry-Proteomics-Lec1_Intro.pptSachin Teotia
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfkigaruantony
 
Protein function and bioinformatics
Protein function and bioinformaticsProtein function and bioinformatics
Protein function and bioinformaticsNeil Saunders
 
UniProt-GOA
UniProt-GOAUniProt-GOA
UniProt-GOAEBI
 

Similaire à Tyler functional annotation thurs 1120 (20)

Mapping protein to function
Mapping protein to functionMapping protein to function
Mapping protein to function
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
BITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequencesBITS: Overview of important biological databases beyond sequences
BITS: Overview of important biological databases beyond sequences
 
GLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics WorkshopGLBIO/CCBC Metagenomics Workshop
GLBIO/CCBC Metagenomics Workshop
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomics
 
Curation Introduction - Apollo Workshop
Curation Introduction - Apollo WorkshopCuration Introduction - Apollo Workshop
Curation Introduction - Apollo Workshop
 
proteomic and Genomics and the available proteomic technologies and the data ...
proteomic and Genomics and the available proteomic technologies and the data ...proteomic and Genomics and the available proteomic technologies and the data ...
proteomic and Genomics and the available proteomic technologies and the data ...
 
Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)Bioinformatics (Exam point of view)
Bioinformatics (Exam point of view)
 
PomBase conventions for improving annotation depth, breadth, consistency and ...
PomBase conventions for improving annotation depth, breadth, consistency and ...PomBase conventions for improving annotation depth, breadth, consistency and ...
PomBase conventions for improving annotation depth, breadth, consistency and ...
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Sequencedatabases
SequencedatabasesSequencedatabases
Sequencedatabases
 
Apollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citriApollo annotation guidelines for i5k projects Diaphorina citri
Apollo annotation guidelines for i5k projects Diaphorina citri
 
Lecture__on__Proteomics_Introduction.ppt
Lecture__on__Proteomics_Introduction.pptLecture__on__Proteomics_Introduction.ppt
Lecture__on__Proteomics_Introduction.ppt
 
Protein Chemistry-Proteomics-Lec1_Intro.ppt
Protein Chemistry-Proteomics-Lec1_Intro.pptProtein Chemistry-Proteomics-Lec1_Intro.ppt
Protein Chemistry-Proteomics-Lec1_Intro.ppt
 
Databases_L2.pptx
Databases_L2.pptxDatabases_L2.pptx
Databases_L2.pptx
 
Introduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdfIntroduction to Bioinformatics-1.pdf
Introduction to Bioinformatics-1.pdf
 
Protein function and bioinformatics
Protein function and bioinformaticsProtein function and bioinformatics
Protein function and bioinformatics
 
Important protein databases and proteomics softwares
Important protein databases and proteomics softwaresImportant protein databases and proteomics softwares
Important protein databases and proteomics softwares
 
UniProt-GOA
UniProt-GOAUniProt-GOA
UniProt-GOA
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 

Plus de Sucheta Tripathy (20)

Gal
GalGal
Gal
 
Ramorum2016 final
Ramorum2016 finalRamorum2016 final
Ramorum2016 final
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Motif andpatterndatabase
Motif andpatterndatabaseMotif andpatterndatabase
Motif andpatterndatabase
 
Databases ii
Databases iiDatabases ii
Databases ii
 
Snps and microarray
Snps and microarraySnps and microarray
Snps and microarray
 
Stat2013
Stat2013Stat2013
Stat2013
 
26 nov2013seminar
26 nov2013seminar26 nov2013seminar
26 nov2013seminar
 
Stat2013
Stat2013Stat2013
Stat2013
 
Presentation2013
Presentation2013Presentation2013
Presentation2013
 
Lecture7,8
Lecture7,8Lecture7,8
Lecture7,8
 
Lecture5,6
Lecture5,6Lecture5,6
Lecture5,6
 
Primer designgeneprediction
Primer designgenepredictionPrimer designgeneprediction
Primer designgeneprediction
 
Lecture 3,4
Lecture 3,4Lecture 3,4
Lecture 3,4
 
Lecture 1,2
Lecture 1,2Lecture 1,2
Lecture 1,2
 
Sequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSASequence Alignment,Blast, Fasta, MSA
Sequence Alignment,Blast, Fasta, MSA
 
Databases Part II
Databases Part IIDatabases Part II
Databases Part II
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Genome sequencingprojects
Genome sequencingprojectsGenome sequencingprojects
Genome sequencingprojects
 
Human encodeproject
Human encodeprojectHuman encodeproject
Human encodeproject
 

Dernier

A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 

Dernier (20)

A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 

Tyler functional annotation thurs 1120

  • 1. Functional Annotation and the Gene Ontology Brett Tyler Virginia Bioinformatics Institute
  • 2. What is Annotation • comments, notes, explanations, or other types of external remarks that can be attached to a document…… • For genomics functional annotation means attaching biological information to sequences
  • 3. Functional Annotation Structural Annotation Searches Nucleotide/Protein Databases Domain/Motifs Assignments EC Number Metabolic Automated GO Pathways Manual curation
  • 4. Functional Annotation Structural Annotation Searches Nucleotide/Protein Databases Domain/Motifs Assignments EC Number Metabolic Automated GO Pathways Manual curation
  • 5. Automated Searches • Search programs can be downloaded and run internally on unix system • Graphic user interfaces but normally takes limited sequences
  • 6. Homology or similarity based searches • Local pairwise alignment tools : look for any regions of similarity within the proteins that score well. – BLAST • fast • Global pairwise alignment tools take two sequences and attempt to find an alignment of the two over their full lengths. – Needleman-Wunsch • finds best out of all possible alignments • Multiple alignments tools try to align 3 or more proteins so that the maximal number of amino acids from each protein are matched in the alignment - this may or may not include the full length of some or all of the proteins – clustalW
  • 7. BLAST Programs • Blastn: Search a nucleotide database using a nucleotide query • BlastP: Search protein database using a protein query • Blastx: Search protein database using a translated nucleotide query • Tblastn: Search translated nucleotide database using a protein query • Tblastx: Search translated nucleotide database using a translated nucleotide query
  • 8. Example of BLAST output top row is the search protein (query) and the bottom row is the match protein (subject). Middle row is consensus + indicates similar amino acids numbers indicate amino acid position in the sequence
  • 9. Functional Annotation Structural Annotation Searches Nucleotide/Protein Databases Domain/Motifs Assignments Metabolic EC Number Automated GO Pathways Manual curation
  • 10. Domain Search Hidden Markov Models • Stastistical models of the primary structure consensus of a sequence family
  • 11. Pfam http://pfam.sanger.ac.uk/ • Large collection of protein families represented by multiple sequence alignments and HMMs • Analyze protein sequences for Pfam match • Look at multiple alignments of members of the gene family
  • 12. INTERPRO http://www.ebi.ac.uk/interpro/ • Database of protein families, domains and sites with identified in known proteins which can be applied to new protein sequences • Collects protein families from other databases such as Pfam, UniProtKb and TIGRFAMs • Sequence search is done with InterProScan Downloadable (rans faster on own server, large set) GUI (limited number of sequences)
  • 13. Subcellular localization • Signal P:Predicts the presence and location of signal peptide and cleavage sites in organism • TMHMM: Predicts transmembrane • TargetP:Predicts subcellular location based on chlroplast transit peptide and mitochondrial targeting sequence
  • 18. Functional Annotation Structural Annotation Searches Nucleotide/Protein Databases Domain/Motifs Assignments EC Number Metabolic Automated GO Pathways Manual curation
  • 19. Metabolic Pathways •Help improve annotation by showing missing genes in essentail pathways •Useful for comparative genomics KEGG: http://www.genome.jp/kegg/pathway.html Reactome: http://www.reactome.org Metacyc:http://www.metacyc.org Add lots of others
  • 20. KEGG: Kyoto Encyclopedia of Genes and Genomes http://www.genome.jp/kegg/pathway.html
  • 21. Functional Annotation Structural Annotation Searches Nucleotide/Protein Databases Domain/Motifs Assignments EC Number Metabolic Automated GO Pathways Manual curation
  • 22.
  • 23.
  • 24.
  • 25. Some initial PAMGO Biological Process Terms Included in initial 35 terms added Jan 2005 First set of terms These processes are general to all associations
  • 26. GO: 0052048 interaction with host via secreted substance GO: 0052044 induction by symbiont of host programmed cell death bacterium oomycete
  • 27. GO: 0052048 interaction with host via secreted substance GO: 0052044 induction by symbiont of host programmed cell death bacterium oomycete GO: 0052048 interaction with host via secreted substance GO: 0052044 induction by symbiont of host programmed cell death GO: 0009405 pathogenesis
  • 28. Functional Annotation Structural Annotation Searches Nucleotide/Protein Databases Domain/Motifs Assignments EC Number Metabolic Automated GO Pathways Manual curation
  • 29. Why manual Annotation Combine all search information and evidence Manually look through all information Add experimental data from literature when available Approach conservatively Setback Time-consuming and more expensive.