Visit to the NI Vavilov Institute for Plant Industry (VIR) in April 2010. Installation of the GBIF IPT toolkit for data publishing as a test upgrade for the EURISCO data infrastructure of European genebanks.
EURISCO and GBIF IPT, at the Vavilov Institute in St Petersburg (27 April 2010)
1. Web service demo for EURISCO GBIF Tools and Darwin Core extension for germplasm N.I. Vavilov Research Institute of Plant Industry (VIR), April 26th – 29th 2010, St Petersburg, Russian Federation Dag Endresen, Jonas Nordling, Nordic Genetic Resources Center (NordGen)
27. Potential of the GBIF technology Using GBIF technology (and contributing to its development), the PGR community can easily establish specific PGR networks without duplicating GBIF's work. The compatibility of data standards between PGR and biodiversity collections made it possible to integrate the worldwide germplasm collections into the biodiversity community (TDWG, GBIF). http://data.gbif.org/datasets/network/2 7
28.
29. a flexible framework to maximize re-usability The Darwin Core can be extended by adding new terms to share additional information. Approved as TDWG standard 2009 “The Darwin Core is primarily based on taxa, their occurrence in nature as documented by observations, specimens, and samples, and related information.” http://rs.tdwg.org/dwc/ 8
32. Additional terms to describe germplasm samples
33. Includes the new terms for crop trait experiments developed as part of the European EPGRIS3 project
34. Includes a few additional terms for new international crop treaty regulationshttp://code.google.com/p/darwincore-germplasmhttp://rs.nordgen.org/dwc 9
35. Mapping of DwC-G terms to the MCPD descriptors (EURISCO data exchange format) 10
37. MCPD -> ABCD 2.06 (2004) for BioCASE National Inventory Code Institute Code AccessionNumber CollectingNumber Collecting Institute Code Genus Species SpeciesAuthority „Subtaxa“ „Subtaxa“ Authority Common Crop Name Accession Name Acquisition Date Donor Institute Code DonorAccessionNumber OtherIdentification (Number) associatedwiththeaccession Location of SafetyDuplicates Type of Germplasm Storage Remarks DecodedCollecting Institute DecodedBreeding Institute DecodedDonor Institute DecodedSafetyDuplicationLocation Accession URL Country of Origin Location of Collection Site Latitude of CS Longitude of CS Elevation of CS Collecting Date of Sample Breeding Institute Code Biological Status of Accession Ancestral Data Collecting/AcquisitionSource Helmut Knüpffer IPK Gatersleben http://www.ecpgr.cgiar.org/epgris/Tech_papers/EURISCO_Descriptors.pdf Walter Berendsohn BGBM 12
40. Integrated Publishing Toolkit (IPT) A tool for data publishers. A simple mechanism to share primary biodiversity data following the Darwin Core standard. Open source, Java based web application. Provides a local tool for data quality assessment, etc. 14
50. European ECPGR Crop Databases European EURISCO Catalog VIR (RUS001) Passport data Global Crop Registries VIR (RUS001) Crop departments 18
51. Same dataset available from multiple information systems... ?! VIR Crop dataset ECPGR Crop Databases VIR (RUS001) Passport data EURISCO Global Crop Registries 19
52. Resolvable persistent identifiers can direct the user to the publisher of the primary dataset (official original dataset) VIR Crop dataset ECPGR Crop Databases VIR (RUS001) Passport data EURISCO Global Crop Registries 20
53. Persistent Identifier The Persistent Identifier (PI) is a digital name tag Also called Global Unique Identifiers (GUID) Life Science Identifiers (LSID) is one example Digital Object Identifier (doi) is another example The Persistent Identifier concept for to naming and identification of data resources stored in multiple, distributed data stores. Effective identification of data objects is essential for linking the world’s biodiversity data. 21
54. Moving towards…global integration of information Genebank datasets Spatial data Threatened species Crop standards Migratory species Legislation and regulations etc. Crop collections in Europe Global crop system 22 Global crop collections