SlideShare une entreprise Scribd logo
1  sur  20
Télécharger pour lire hors ligne
Pathema Website Functionality
      Enhancements:
    Pathema gets its Sexy Back



       Tanja Davidsen, Ph.D.
       J. Craig Venter Institute
Improved Front Page
Improved Front Page
Improved Searches using Lucene

• Improved speed and functionality of the search queries
  on Pathema
• An open source information retrieval library supported by
  Apache
• At the core of the Lucene logical architecture is a
  document containing fields of text, independent of file
  format
• Prevents us from hitting the database for searches,
  especially helpful for inexact searches
• Used by Wikipedia, Monster, SourceForge, UniProt and
  EBI
Improved Searches using Lucene

• Improves our search speed from 30+s to 1-3s
• Filters will allow us to let the users build even
  more complex queries:
       Search for all genes in organism B.anthracis starting
   

       with the “dna” and assigned GO ID GO:0003677
GBrowse (GMOD)

• The most popular GMOD viewer
• Used to replace and/or accompany our in
  house genome viewers
• Order and appearance of tracks are
  customizable by administrator and end-user
• Supports third party annotation using GFF
  formats
• Third-party feature loading
• Customizable plug-in architecture (e.g. run
  BLAST, find oligonucleotides, design
  primers)
Gbrowse
ClosTox: The Clostridum Toxin DB

• The Clostridium community is primarily
  interested in the toxin genes
• We created a specialty toxin and neurotoxin
  associated proteins (NAPs) database for
  browsing on the Clostridium site
• Data for the database provided by Clostridium
  researchers/community
• Very successful debut at the last Botulism
  meeting
ClosTox
Sybil: Comparative Genomic Region


• Compares a reference to selected
  comparison genomes by protein clusters
• Specify how many clustered genes a non-
  reference sequence region must have in
  common to with the reference
Sybil: Comparative Genomic Region
Sybil: Synteny gradient display
• A color-coded display of conserved synteny
  between two or more sequences
• Select a reference sequence (bottom of the
  display) with the genes color-coded from the 5’
  end to the 3’ end
• Orthologs in the comparison genomes are
  shown in the color of the ortholog from the
  reference genome
• As a result one can see large and small-scale
  rearrangements at a glance, in addition to
  regions that may be inserted in one sequence
  relative to another
Sybil: Synteny Gradient Display
Pathway Tools
Pathway Tools
Abundance Profiler
New Data Types

• Virulence Factors
• Epitopes
• Experimentally characterized
  genes/proteins
• Multidrug transporters
• Genomic islands
• Community requested databases
  (ClosTox)
Acknowledgements

•   PI: Granger Sutton (JCVI)
•   Subcontract: Owen White (University of Maryland, Baltimore, IGS)
•   Project Manager: Lauren Brinkac

        JCVI Informatics Engineers   Analysts
    


        Tanja Davidsen (manager)     Scott Durkin (manager)
    


        Erin Beck                    Ramana Madupu
    


        Alex Richter                 Susmita Shrivastava
    


        Kevin Galinsky               Bob Dodson
    


        Jay Sundaram                 Derek Harkins
    


        Seth Schobel                 Lis Caler
    




        IGS Informatics Engineers
    


        Anu Ganapathy                YongMei Zhao
    


        Josh Orvis                   Aaron Gussman
    


        Kevin Galens                 Jonathon Crabtree
    

Contenu connexe

Similaire à Pathema Website Functionality Enhancements

Mar2013 Performance Metrics Working Group
Mar2013 Performance Metrics Working GroupMar2013 Performance Metrics Working Group
Mar2013 Performance Metrics Working Group
GenomeInABottle
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
Atai Rabby
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
David Ruau
 
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Monica Munoz-Torres
 

Similaire à Pathema Website Functionality Enhancements (20)

Ensembl annotation
Ensembl annotationEnsembl annotation
Ensembl annotation
 
Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015Kim Pruitt trainingbiocuration2015
Kim Pruitt trainingbiocuration2015
 
Making powerful science: an introduction to NGS data analysis
Making powerful science: an introduction to NGS data analysisMaking powerful science: an introduction to NGS data analysis
Making powerful science: an introduction to NGS data analysis
 
KnetMiner Overview Oct 2017
KnetMiner Overview Oct 2017KnetMiner Overview Oct 2017
KnetMiner Overview Oct 2017
 
Cloud bioinformatics 2
Cloud bioinformatics 2Cloud bioinformatics 2
Cloud bioinformatics 2
 
Arraygen_Brochure
Arraygen_BrochureArraygen_Brochure
Arraygen_Brochure
 
Implementation of GPU-based bioinformatic tools at the ENCODE DCC
Implementation of GPU-based bioinformatic tools at the ENCODE DCCImplementation of GPU-based bioinformatic tools at the ENCODE DCC
Implementation of GPU-based bioinformatic tools at the ENCODE DCC
 
Mar2013 Performance Metrics Working Group
Mar2013 Performance Metrics Working GroupMar2013 Performance Metrics Working Group
Mar2013 Performance Metrics Working Group
 
Final Acb All Hands 26 11 07.Key
Final Acb All Hands 26 11 07.KeyFinal Acb All Hands 26 11 07.Key
Final Acb All Hands 26 11 07.Key
 
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
Genome annotation with open source software: Apollo, Jbrowse and the GO in Ga...
 
Día 19 - Noel Chen - Introducción a Novogene
Día 19 - Noel Chen - Introducción a Novogene Día 19 - Noel Chen - Introducción a Novogene
Día 19 - Noel Chen - Introducción a Novogene
 
TGAC Browser bosc 2014
TGAC Browser bosc 2014TGAC Browser bosc 2014
TGAC Browser bosc 2014
 
Informal presentation on bioinformatics
Informal presentation on bioinformaticsInformal presentation on bioinformatics
Informal presentation on bioinformatics
 
Reproducible research - to infinity
Reproducible research - to infinityReproducible research - to infinity
Reproducible research - to infinity
 
Giab for jax long read 190917
Giab for jax long read 190917Giab for jax long read 190917
Giab for jax long read 190917
 
Arraygen brochure
Arraygen brochureArraygen brochure
Arraygen brochure
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
Eln Jisc Mrc 18dec07 Nsb
Eln Jisc Mrc 18dec07 NsbEln Jisc Mrc 18dec07 Nsb
Eln Jisc Mrc 18dec07 Nsb
 
Next generation sequencing & microarray-- Genotypic Technology
Next generation sequencing & microarray-- Genotypic TechnologyNext generation sequencing & microarray-- Genotypic Technology
Next generation sequencing & microarray-- Genotypic Technology
 
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
Apollo: Scalable & collaborative curation of genomes - Biocuration 2015
 

Plus de Pathema

Plus de Pathema (9)

Pathema-Clostridium A NIAID Bioinformatics Resource Center (BRC)
Pathema-Clostridium A NIAID Bioinformatics Resource Center (BRC)Pathema-Clostridium A NIAID Bioinformatics Resource Center (BRC)
Pathema-Clostridium A NIAID Bioinformatics Resource Center (BRC)
 
Pathema Burkholderia Annotation Jamboree: A Guide to MANATEE
Pathema Burkholderia Annotation Jamboree: A Guide to MANATEEPathema Burkholderia Annotation Jamboree: A Guide to MANATEE
Pathema Burkholderia Annotation Jamboree: A Guide to MANATEE
 
Clostox: A Clostridium Toxin Database and Phylogeny Viewer for Pathema-Clostr...
Clostox: A Clostridium Toxin Database and Phylogeny Viewer for Pathema-Clostr...Clostox: A Clostridium Toxin Database and Phylogeny Viewer for Pathema-Clostr...
Clostox: A Clostridium Toxin Database and Phylogeny Viewer for Pathema-Clostr...
 
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation OverviewPathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
Pathema Burkholderia Annotation Jamboree: Prokaryotic Annotation Overview
 
Pathema Burkholderia Annotation Jamboree: Introduction to Annotation Jamboree
Pathema Burkholderia Annotation Jamboree: Introduction to Annotation JamboreePathema Burkholderia Annotation Jamboree: Introduction to Annotation Jamboree
Pathema Burkholderia Annotation Jamboree: Introduction to Annotation Jamboree
 
Pathema: A Bioinformatics Resource Center
Pathema: A Bioinformatics Resource CenterPathema: A Bioinformatics Resource Center
Pathema: A Bioinformatics Resource Center
 
Pathema: A Clade Specific Bioinformatics Resource Center
Pathema: A Clade Specific Bioinformatics Resource CenterPathema: A Clade Specific Bioinformatics Resource Center
Pathema: A Clade Specific Bioinformatics Resource Center
 
Automated Prokaryotic Annotation at JCVI
Automated Prokaryotic Annotation at JCVIAutomated Prokaryotic Annotation at JCVI
Automated Prokaryotic Annotation at JCVI
 
Pathema Community Outreach
Pathema Community OutreachPathema Community Outreach
Pathema Community Outreach
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 

Pathema Website Functionality Enhancements

  • 1. Pathema Website Functionality Enhancements: Pathema gets its Sexy Back Tanja Davidsen, Ph.D. J. Craig Venter Institute
  • 4. Improved Searches using Lucene • Improved speed and functionality of the search queries on Pathema • An open source information retrieval library supported by Apache • At the core of the Lucene logical architecture is a document containing fields of text, independent of file format • Prevents us from hitting the database for searches, especially helpful for inexact searches • Used by Wikipedia, Monster, SourceForge, UniProt and EBI
  • 5. Improved Searches using Lucene • Improves our search speed from 30+s to 1-3s • Filters will allow us to let the users build even more complex queries: Search for all genes in organism B.anthracis starting  with the “dna” and assigned GO ID GO:0003677
  • 6. GBrowse (GMOD) • The most popular GMOD viewer • Used to replace and/or accompany our in house genome viewers • Order and appearance of tracks are customizable by administrator and end-user • Supports third party annotation using GFF formats • Third-party feature loading • Customizable plug-in architecture (e.g. run BLAST, find oligonucleotides, design primers)
  • 8. ClosTox: The Clostridum Toxin DB • The Clostridium community is primarily interested in the toxin genes • We created a specialty toxin and neurotoxin associated proteins (NAPs) database for browsing on the Clostridium site • Data for the database provided by Clostridium researchers/community • Very successful debut at the last Botulism meeting
  • 10.
  • 11. Sybil: Comparative Genomic Region • Compares a reference to selected comparison genomes by protein clusters • Specify how many clustered genes a non- reference sequence region must have in common to with the reference
  • 13. Sybil: Synteny gradient display • A color-coded display of conserved synteny between two or more sequences • Select a reference sequence (bottom of the display) with the genes color-coded from the 5’ end to the 3’ end • Orthologs in the comparison genomes are shown in the color of the ortholog from the reference genome • As a result one can see large and small-scale rearrangements at a glance, in addition to regions that may be inserted in one sequence relative to another
  • 17.
  • 19. New Data Types • Virulence Factors • Epitopes • Experimentally characterized genes/proteins • Multidrug transporters • Genomic islands • Community requested databases (ClosTox)
  • 20. Acknowledgements • PI: Granger Sutton (JCVI) • Subcontract: Owen White (University of Maryland, Baltimore, IGS) • Project Manager: Lauren Brinkac JCVI Informatics Engineers Analysts  Tanja Davidsen (manager) Scott Durkin (manager)  Erin Beck Ramana Madupu  Alex Richter Susmita Shrivastava  Kevin Galinsky Bob Dodson  Jay Sundaram Derek Harkins  Seth Schobel Lis Caler  IGS Informatics Engineers  Anu Ganapathy YongMei Zhao  Josh Orvis Aaron Gussman  Kevin Galens Jonathon Crabtree 