SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
Life as a scientific database curator


          Sandra Orchard




                EBI is an Outstation of the European Molecular Biology Laboratory.
What is a database curator

       Curator – OED

            - a keeper of a museum or other collection

            - from LATIN curare – take care of




2/17
What is a database curator

       The job
       • Creating a structure for unstructured biological data
       • Generating order from chaos
       • Combining literature and automated processes to provide
         biomolecules with correct sequence/structure,
         nomenclature, function and contextual information
       • Give biological context to large experimental datasets
       The qualification
       • Need an attention to detail which would annoy even the
         best of housemates
       • Passion for reading and understanding literature

3/17
What is a database curator

       The Pros

       • Read about and gain understanding of all areas of
         biology

       The Cons

       • No specialisation
       • Persuading biologists that there are benefits to this.




4/17
What is a database curator

• The International Society for Biocuration (ISB) definition:
...integration of information relevant to biology into a
    database or resource that enables integration of the
    scientific literature...and large experimental data sets.
• Goals are
...accurate and comprehensive representation...
...to facilitate access to data for scientists...as a resource for
    computational analysis
What does a database curator do?
Collects, annotates, and validates information (in a
database).


Extracts & organizes data from literature


Describes data using standards, protocols and
vocabularies (enabling computational queries and data
exchange).

Communicates with researchers to ensure the accuracy
of curated information and to foster good practice in data
exchange.
What does a database curator do?

            Takes part in the development of shared
            biomedical data standards and ontologies
            and (ideally) enforces their use.

            Trains users in effectively accessing and
            using the data in the databases

            Promotes database usage through talks,
            conference attendance/posters,
            publications etc…..



7/17
What do I do?

       • Curate the molecular interaction database




8/17
What do I do?




       Custom curation tools designed by the curation team


9/17
What do I do?

                        Controlled vocabulary maintenance




10/17
Qualifications for the job

        • A biology B.Sc./M.Sc./PhD + lab experience

              or

        • A bioinformatics M.Sc

        Plus – an enquiring mind, ability to write good English and
          the right attitude

        Training – largely database specific and will be given ‘on-
          the-job’



11/17
Qualifications for the job

        • Do I need to be able to do programming?

        • Answer – no. It is often helpful to have some database
          query ability but it is perfectly possible to do the job
          without (in most databases)




12/17
Career Progression

        Within the EBI
        • Progress as a curator – senior curator, curation
          coordinator

        • Project management – grant coordinator, project leader

        Post –EBI
        • Curation/project leadership positions at many other
          institutes
        • Related areas – academic research, research project
          management, lectureships, journal publishing

13/17
Will I still be allowed to publish?

        Curation
        The annotation of both human and mouse kinomes in
          UniProtKB/Swiss-Prot - (MCP)
        Data Standards
        The Minimum Information required for reporting a Molecular
          Interaction Experiment (MIMIx) – (NBT)
        Data Formats
        The HUPO PSI's molecular interaction format--a community
          standard for the representation of protein interaction data.
          – (NBT)



14/17
Will I still be allowed to publish?

        Tool development
          Rintact: enabling computational analysis of molecular
          interaction data from the IntAct repository.
          (Bioinformatics)
        Ontologies
        The use of common ontologies and controlled vocabularies
          to enable data exchange and deposition for complex
          proteomic experiments (Pac Symp Biocomput)
        Training
        Submit your interaction data the IMEx way - a step by step
          guide to trouble-free deposition (Proteomics)


15/17
Curation as a profession




16/17
Curation as a profession

        • Biocuration conference every 12 months – 2102 in
          Cambridge, UK

        • Opportunities for further training – bioinformatic tools,
          programming, career development/management

        • Attendance at biological/computational biology
          conferences encouraged – the EBI often provides
          speakers




17/17
Summary

        • Curation is not for everyone – it does require a certain
          mindset

        • Exposes you to all areas of biology (and chemistry)

        •   Now a recognised profession and our numbers are
            growing

        • Many opportunities to be become involved in “extra-
          curriculum” activities – its not all reading papers



18/17

Contenu connexe

Tendances

Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBIgeetikaJethra
 
Ncbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuNcbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuBen Busby
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databasesSangeeta Das
 
Major resources of bioinformatics 2
Major resources of bioinformatics 2Major resources of bioinformatics 2
Major resources of bioinformatics 2Mohd Affan
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission ToolsRishikaMaji
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in BioinformaticsArindam Ghosh
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological databaseKAUSHAL SAHU
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid databaseEsakkiammal S
 
Biological databases
Biological databasesBiological databases
Biological databasesAfra Fathima
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES nadeem akhter
 
Databases in Bioinformatics
Databases in BioinformaticsDatabases in Bioinformatics
Databases in BioinformaticsMeghaj Mallick
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 

Tendances (20)

Introduction to NCBI
Introduction to NCBIIntroduction to NCBI
Introduction to NCBI
 
Ncbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osuNcbi basic intro_v_pitt_kent_osu
Ncbi basic intro_v_pitt_kent_osu
 
Applications of bioinformatics
Applications of bioinformaticsApplications of bioinformatics
Applications of bioinformatics
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databases
 
Data base in detail
Data base in detailData base in detail
Data base in detail
 
Major resources of bioinformatics 2
Major resources of bioinformatics 2Major resources of bioinformatics 2
Major resources of bioinformatics 2
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission Tools
 
Publicly available tools and open resources in Bioinformatics
Publicly available  tools and open resources in BioinformaticsPublicly available  tools and open resources in Bioinformatics
Publicly available tools and open resources in Bioinformatics
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological database
 
Major databases in bioinformatics
Major databases in bioinformaticsMajor databases in bioinformatics
Major databases in bioinformatics
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Nucleic acid database
Nucleic acid databaseNucleic acid database
Nucleic acid database
 
European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)European molecular biology laboratory (EMBL)
European molecular biology laboratory (EMBL)
 
Biological databases
Biological databasesBiological databases
Biological databases
 
BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES BIOLOGICAL SEQUENCE DATABASES
BIOLOGICAL SEQUENCE DATABASES
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
Databases in Bioinformatics
Databases in BioinformaticsDatabases in Bioinformatics
Databases in Bioinformatics
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Proteins databases
Proteins databasesProteins databases
Proteins databases
 

En vedette

P2 training and_life_as_a_postdoc_(kota_miura)
P2 training and_life_as_a_postdoc_(kota_miura)P2 training and_life_as_a_postdoc_(kota_miura)
P2 training and_life_as_a_postdoc_(kota_miura)phdcareers
 
P3 training and_life_as_a_postdoc_(felix_klein)
P3 training and_life_as_a_postdoc_(felix_klein)P3 training and_life_as_a_postdoc_(felix_klein)
P3 training and_life_as_a_postdoc_(felix_klein)phdcareers
 
Career Paths in the Life Sciences. Janssens, Summer 2012
Career Paths in the Life Sciences. Janssens, Summer 2012Career Paths in the Life Sciences. Janssens, Summer 2012
Career Paths in the Life Sciences. Janssens, Summer 2012phdcareers
 
E1 life as_an_outreach_project_leader_(giulietta_spudich)
E1 life as_an_outreach_project_leader_(giulietta_spudich)E1 life as_an_outreach_project_leader_(giulietta_spudich)
E1 life as_an_outreach_project_leader_(giulietta_spudich)phdcareers
 
E3 life as a ux analyst (jenny_cham)
E3 life as a ux analyst (jenny_cham)E3 life as a ux analyst (jenny_cham)
E3 life as a ux analyst (jenny_cham)phdcareers
 
Publishing Career Day Presentation AM
Publishing Career Day Presentation AMPublishing Career Day Presentation AM
Publishing Career Day Presentation AMphdcareers
 

En vedette (7)

P2 training and_life_as_a_postdoc_(kota_miura)
P2 training and_life_as_a_postdoc_(kota_miura)P2 training and_life_as_a_postdoc_(kota_miura)
P2 training and_life_as_a_postdoc_(kota_miura)
 
P3 training and_life_as_a_postdoc_(felix_klein)
P3 training and_life_as_a_postdoc_(felix_klein)P3 training and_life_as_a_postdoc_(felix_klein)
P3 training and_life_as_a_postdoc_(felix_klein)
 
Career Paths in the Life Sciences. Janssens, Summer 2012
Career Paths in the Life Sciences. Janssens, Summer 2012Career Paths in the Life Sciences. Janssens, Summer 2012
Career Paths in the Life Sciences. Janssens, Summer 2012
 
E1 life as_an_outreach_project_leader_(giulietta_spudich)
E1 life as_an_outreach_project_leader_(giulietta_spudich)E1 life as_an_outreach_project_leader_(giulietta_spudich)
E1 life as_an_outreach_project_leader_(giulietta_spudich)
 
E3 life as a ux analyst (jenny_cham)
E3 life as a ux analyst (jenny_cham)E3 life as a ux analyst (jenny_cham)
E3 life as a ux analyst (jenny_cham)
 
Publishing Career Day Presentation AM
Publishing Career Day Presentation AMPublishing Career Day Presentation AM
Publishing Career Day Presentation AM
 
PhDretreat
PhDretreat PhDretreat
PhDretreat
 

Similaire à E2 life as_a_scientific_database_curator_(sandra_orchard)

Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Susanna-Assunta Sansone
 
Teaching Case Studies
Teaching Case StudiesTeaching Case Studies
Teaching Case StudiesJulie Goldman
 
Scally The Library's Role in Research Data Management. OCLC partnership meeti...
Scally The Library's Role in Research Data Management. OCLC partnership meeti...Scally The Library's Role in Research Data Management. OCLC partnership meeti...
Scally The Library's Role in Research Data Management. OCLC partnership meeti...John Scally
 
"Perfection is the enemy of the good "Supporting research data management: A ...
"Perfection is the enemy of the good "Supporting research data management: A ..."Perfection is the enemy of the good "Supporting research data management: A ...
"Perfection is the enemy of the good "Supporting research data management: A ...Incremental Project
 
Exercising creativity to implement an institutional repository with limited r...
Exercising creativity to implement an institutional repository with limited r...Exercising creativity to implement an institutional repository with limited r...
Exercising creativity to implement an institutional repository with limited r...NASIG
 
E-Science: New Roles for Libraries
E-Science: New Roles for LibrariesE-Science: New Roles for Libraries
E-Science: New Roles for LibrariesElaine Martin
 
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)phdcareers
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminarseanb
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
LIBER Webinar: Supporting Data Literacy
LIBER Webinar: Supporting Data LiteracyLIBER Webinar: Supporting Data Literacy
LIBER Webinar: Supporting Data LiteracyLIBER Europe
 
Designing Biological Databases
Designing Biological DatabasesDesigning Biological Databases
Designing Biological DatabasesArjei Balandra
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...EDINA, University of Edinburgh
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation HeidornBryan Heidorn
 
LIBRARY ASSESSMENT
LIBRARY ASSESSMENTLIBRARY ASSESSMENT
LIBRARY ASSESSMENTJen Rutner
 
Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen ARDC
 
Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Robin Rice
 

Similaire à E2 life as_a_scientific_database_curator_(sandra_orchard) (20)

Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015Big Data Standards - Workshop, ExpBio, Boston, 2015
Big Data Standards - Workshop, ExpBio, Boston, 2015
 
Teaching Case Studies
Teaching Case StudiesTeaching Case Studies
Teaching Case Studies
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
 
Scally The Library's Role in Research Data Management. OCLC partnership meeti...
Scally The Library's Role in Research Data Management. OCLC partnership meeti...Scally The Library's Role in Research Data Management. OCLC partnership meeti...
Scally The Library's Role in Research Data Management. OCLC partnership meeti...
 
"Perfection is the enemy of the good "Supporting research data management: A ...
"Perfection is the enemy of the good "Supporting research data management: A ..."Perfection is the enemy of the good "Supporting research data management: A ...
"Perfection is the enemy of the good "Supporting research data management: A ...
 
Exercising creativity to implement an institutional repository with limited r...
Exercising creativity to implement an institutional repository with limited r...Exercising creativity to implement an institutional repository with limited r...
Exercising creativity to implement an institutional repository with limited r...
 
Pine education-platform
Pine education-platformPine education-platform
Pine education-platform
 
E-Science: New Roles for Libraries
E-Science: New Roles for LibrariesE-Science: New Roles for Libraries
E-Science: New Roles for Libraries
 
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
P4 training and_life_as_a_postdoc_(shinichi_sunagawa)
 
B4OS-2012
B4OS-2012B4OS-2012
B4OS-2012
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminar
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
Library Linkages
Library LinkagesLibrary Linkages
Library Linkages
 
LIBER Webinar: Supporting Data Literacy
LIBER Webinar: Supporting Data LiteracyLIBER Webinar: Supporting Data Literacy
LIBER Webinar: Supporting Data Literacy
 
Designing Biological Databases
Designing Biological DatabasesDesigning Biological Databases
Designing Biological Databases
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation Heidorn
 
LIBRARY ASSESSMENT
LIBRARY ASSESSMENTLIBRARY ASSESSMENT
LIBRARY ASSESSMENT
 
Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen Supporting researchers in the molecular life sciences Jeff Christiansen
Supporting researchers in the molecular life sciences Jeff Christiansen
 
Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...Designing and delivering an international MOOC on Research Data Management an...
Designing and delivering an international MOOC on Research Data Management an...
 

Dernier

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Dernier (20)

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

E2 life as_a_scientific_database_curator_(sandra_orchard)

  • 1. Life as a scientific database curator Sandra Orchard EBI is an Outstation of the European Molecular Biology Laboratory.
  • 2. What is a database curator Curator – OED - a keeper of a museum or other collection - from LATIN curare – take care of 2/17
  • 3. What is a database curator The job • Creating a structure for unstructured biological data • Generating order from chaos • Combining literature and automated processes to provide biomolecules with correct sequence/structure, nomenclature, function and contextual information • Give biological context to large experimental datasets The qualification • Need an attention to detail which would annoy even the best of housemates • Passion for reading and understanding literature 3/17
  • 4. What is a database curator The Pros • Read about and gain understanding of all areas of biology The Cons • No specialisation • Persuading biologists that there are benefits to this. 4/17
  • 5. What is a database curator • The International Society for Biocuration (ISB) definition: ...integration of information relevant to biology into a database or resource that enables integration of the scientific literature...and large experimental data sets. • Goals are ...accurate and comprehensive representation... ...to facilitate access to data for scientists...as a resource for computational analysis
  • 6. What does a database curator do? Collects, annotates, and validates information (in a database). Extracts & organizes data from literature Describes data using standards, protocols and vocabularies (enabling computational queries and data exchange). Communicates with researchers to ensure the accuracy of curated information and to foster good practice in data exchange.
  • 7. What does a database curator do? Takes part in the development of shared biomedical data standards and ontologies and (ideally) enforces their use. Trains users in effectively accessing and using the data in the databases Promotes database usage through talks, conference attendance/posters, publications etc….. 7/17
  • 8. What do I do? • Curate the molecular interaction database 8/17
  • 9. What do I do? Custom curation tools designed by the curation team 9/17
  • 10. What do I do? Controlled vocabulary maintenance 10/17
  • 11. Qualifications for the job • A biology B.Sc./M.Sc./PhD + lab experience or • A bioinformatics M.Sc Plus – an enquiring mind, ability to write good English and the right attitude Training – largely database specific and will be given ‘on- the-job’ 11/17
  • 12. Qualifications for the job • Do I need to be able to do programming? • Answer – no. It is often helpful to have some database query ability but it is perfectly possible to do the job without (in most databases) 12/17
  • 13. Career Progression Within the EBI • Progress as a curator – senior curator, curation coordinator • Project management – grant coordinator, project leader Post –EBI • Curation/project leadership positions at many other institutes • Related areas – academic research, research project management, lectureships, journal publishing 13/17
  • 14. Will I still be allowed to publish? Curation The annotation of both human and mouse kinomes in UniProtKB/Swiss-Prot - (MCP) Data Standards The Minimum Information required for reporting a Molecular Interaction Experiment (MIMIx) – (NBT) Data Formats The HUPO PSI's molecular interaction format--a community standard for the representation of protein interaction data. – (NBT) 14/17
  • 15. Will I still be allowed to publish? Tool development Rintact: enabling computational analysis of molecular interaction data from the IntAct repository. (Bioinformatics) Ontologies The use of common ontologies and controlled vocabularies to enable data exchange and deposition for complex proteomic experiments (Pac Symp Biocomput) Training Submit your interaction data the IMEx way - a step by step guide to trouble-free deposition (Proteomics) 15/17
  • 16. Curation as a profession 16/17
  • 17. Curation as a profession • Biocuration conference every 12 months – 2102 in Cambridge, UK • Opportunities for further training – bioinformatic tools, programming, career development/management • Attendance at biological/computational biology conferences encouraged – the EBI often provides speakers 17/17
  • 18. Summary • Curation is not for everyone – it does require a certain mindset • Exposes you to all areas of biology (and chemistry) • Now a recognised profession and our numbers are growing • Many opportunities to be become involved in “extra- curriculum” activities – its not all reading papers 18/17