IPNI PhytoKeys integration

This document discusses the integration between IPNI and PhytoKeys. IPNI holds nomenclatural data for vascular plants as a collaboration between RBG Kew, Harvard University Herbaria, and the Australian National Botanic Garden. Data is entered from literature scanning and user reports, and around 7400 names are entered annually. PhytoKeys currently emails name details to IPNI, but future integration could involve PhytoKeys submitting structured data to IPNI services, which would create records and return IDs to embed in publications automatically. This would resolve nomenclatural issues pre-publication and seed IPNI identifiers into literature more efficiently.

IPNI & PhytoKeys
Integration
Nicky Nicolson (RBG Kew)

What is IPNI?
Nomenclator for vascular plants.
Collaboration between RBG Kew (UK), Harvard
University Herbaria (US) and the Australian
National Botanic Garden, Canberra (AU)
Composed of three parts:
• Data
• Expertise
• Services

What data does IPNI hold?
• What data types:
– ICN governed nomenclatural acts
– Standardised author list
– Publications
• Which groups:
– Vascular plants
• Which ranks:
– Family and below

How is data entered?
• Data entry:
– From literature scanning, journals received by
library at Kew, Harvard, Canberra
– User reports of missing nomenclatural acts,
usually accompanied by a link to digitised
literature page (BHL)
• How many?
– About 7400 names entered in an average year
– About 6100 nomenclatural acts published / year
– … of these, about 2800 are new taxa (tax. nov.)

Curation - after data entry
• Full audit history on core objects – names /
authors / publications.
• Average 300,000 edits on name records / year
• Standardisation effort ongoing:
– Assessment of nomenclatural status
– Epithet
– Author citation
– Publication title
– Collation
– Year
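
One way to picture a full audit history on a name record is an append-only list of edits kept alongside the current values. The sketch below is illustrative only: the class names, field names and editor label are assumptions, not a description of how IPNI stores its audit data.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class Edit:
    """One audited change to a single field (hypothetical structure)."""
    field_name: str
    old_value: str
    new_value: str
    editor: str
    timestamp: datetime


@dataclass
class AuditedRecord:
    """Current field values plus the history of every change made to them."""
    values: dict
    history: list = field(default_factory=list)

    def update(self, field_name, new_value, editor):
        old_value = self.values.get(field_name, "")
        if old_value == new_value:
            return  # nothing changed, so nothing is logged
        self.history.append(
            Edit(field_name, old_value, new_value, editor,
                 datetime.now(timezone.utc))
        )
        self.values[field_name] = new_value


# Standardising the year on an illustrative record.
record = AuditedRecord({"year": "1899"})
record.update("year", "1900", editor="editor-1")
record.update("year", "1900", editor="editor-1")  # no-op, not logged
```

Because each edit keeps the old and new value, any earlier state of the record can be reconstructed by replaying the history backwards.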

Current PhytoKeys “integration”
• PhytoKeys staff email details to IPNI
• IPNI editor creates record and returns IDs to
PhytoKeys
• ID embedded in publication

email != integration

…but it is an opportunity to converse about the
content of the nomenclatural act, and an
opportunity to correct if necessary

Future PhytoKeys integration
• PhytoKeys submits structured (XML) message
to IPNI service
• IPNI service creates record “on-demand” and
returns ID to PhytoKeys in structured response
• ID embedded in publication
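
The exchange above could be sketched roughly as follows. The slides do not specify a message schema, so every element name here (nameSubmission, genus, epithet, id and so on) is an assumption made for illustration, as are the example name and the placeholder identifier; no network call is made.

```python
import xml.etree.ElementTree as ET


def build_submission(genus, epithet, authors, publication, year):
    """Serialise one nomenclatural act as XML (hypothetical schema)."""
    root = ET.Element("nameSubmission")
    ET.SubElement(root, "genus").text = genus
    ET.SubElement(root, "epithet").text = epithet
    ET.SubElement(root, "authors").text = authors
    ET.SubElement(root, "publication").text = publication
    ET.SubElement(root, "year").text = str(year)
    return ET.tostring(root, encoding="unicode")


def parse_response(xml_text):
    """Pull the returned identifier out of a structured response (hypothetical schema)."""
    root = ET.fromstring(xml_text)
    ipni_id = root.findtext("id")
    if ipni_id is None:
        raise ValueError("response did not contain an <id> element")
    return ipni_id


# Made-up name, used only to show the shape of the message.
message = build_submission("Examplea", "hypothetica", "A.Author", "PhytoKeys", 2013)

# Placeholder standing in for whatever the service would actually return.
sample_response = "<nameSubmissionResponse><id>EXAMPLE-ID-1</id></nameSubmissionResponse>"
returned_id = parse_response(sample_response)
```

The publisher would then embed the returned identifier in the article before publication, replacing the manual email round trip.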

IPNI retains control of un-suppression

No human communication – but we still need
the opportunity to correct
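
One reading of "IPNI retains control of un-suppression" is that a record created on demand starts out suppressed and only an IPNI editor can release it, which preserves a checkpoint for corrections. The sketch below assumes exactly that; the class, the role names and the default state are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class NameRecord:
    record_id: str
    suppressed: bool = True  # assumption: on-demand records start suppressed


def unsuppress(record, actor_role):
    """Release a record; only an IPNI editor may do so (assumed rule)."""
    if actor_role != "ipni_editor":
        raise PermissionError("only an IPNI editor may un-suppress a record")
    record.suppressed = False
    return record


record = NameRecord("EXAMPLE-ID-1")

# A publisher-side attempt is rejected and the record stays suppressed.
try:
    unsuppress(record, "publisher")
    publisher_blocked = False
except PermissionError:
    publisher_blocked = True

still_suppressed = record.suppressed

# After checking the act, an editor releases the record.
unsuppress(record, "ipni_editor")
```

Under this assumption the identifier can be issued automatically while the record itself stays hidden until it has been checked.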

Evaluating it
Benefits
• Nomenclatural problems resolved pre-publication
(workflow slower, but quality higher)
• IPNI editorial role switched from keying to
checking
• IPNI identifiers seeded into literature
• Published data more usable
• Useful (automated) route into IPNI
Costs (some, but far smaller):
• Development / testing time

Future
• Extend this model to work with other
publishers
• A step towards registration? This changes the
game:
– Currently: a name missed is to IPNI's detriment -
our dataset is deficient
– With registration: a name missed will not be valid
under the Code

Editor's Notes

Provider of objective nomenclatural facts – the basis for taxonomic work. Scope (vascular plants) is important – the botanical code is wider, and PhytoKeys' scope is wider. IPNI is not just a dataset – it is actively and expertly curated.
Standardised author. Standardised publication. Distribution form type. Details about the type and where it is held. Links to associated records – e.g. this name is a validation of an earlier name; the earlier invalid record is annotated with the relevant code article. Full record history on all names. Data available in a structured format.
Stats derived from 2004 onwards. Most names aren’t entered until the hard copy arrives at the K / HUH library – we estimate at most a 2-year time lag between publication date and entry to IPNI.
We now have 10 years' worth of audit log data.
Question: will pre-publication resolution of nomenclatural problems be maintained once the process is automated?
