DataCite - services and support for opening up research data
Presentation at International Workshop on Open Research Data, Valencia, October 2014

  1. DataCite - services and support for opening up research data. Herbert Grüttemeier, Inist-CNRS. 1st International Workshop on Open Research Data, Valencia, 21 October 2014.
  2. Science Paradigms. A thousand years ago: science was empirical, describing natural phenomena. Last few hundred years: theoretical branch, using models and generalizations, e.g. $\left(\frac{\dot{a}}{a}\right)^2 = \frac{4\pi G\rho}{3} - K\frac{c^2}{a^2}$. Last few decades: a computational branch, simulating complex phenomena. Today: data exploration (eScience), unifying theory, experiment, and simulation. (Jim Gray, eScience Group, Microsoft Research)
  3. Consequences for Libraries: • Scientific information is more than a journal article or a book • Libraries should open their catalogues to any kind of information • The catalogue of the future is NOT ONLY a window to the library's holdings, but a portal in a net of trusted providers of scientific content
  4. Including non-classical publications: Simulation, Scientific Films, 3D Objects, Grey Literature, Research Data, Software, Images
  5. DOI - what is it for? DOI (Digital Object Identifier): a persistent identifier enabling citation and providing a stable link to digital resources, such as research data sets. It consists of two parts, a prefix and a suffix, separated by a slash: 10.5072/datacenter.123xy (prefix: 10.5072, suffix: datacenter.123xy)
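The prefix/suffix structure on this slide can be illustrated with a minimal sketch. The helper name `split_doi` is hypothetical, and the check is deliberately loose (prefix starts with "10.", a slash, and a non-empty suffix); it is not a full validation of the DOI syntax.

```python
def split_doi(doi: str) -> tuple[str, str]:
    """Split a DOI name into (prefix, suffix) at the first slash.

    Hypothetical helper for illustration only.
    """
    name = doi.strip()
    # Drop an optional "doi:" label, as used in the citations on the slides.
    if name.lower().startswith("doi:"):
        name = name[4:]
    # Only the first slash separates prefix and suffix; the suffix may
    # itself contain further slashes.
    prefix, sep, suffix = name.partition("/")
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError(f"not a DOI name: {doi!r}")
    return prefix, suffix


print(split_doi("10.5072/datacenter.123xy"))
print(split_doi("doi:10.1594/PANGAEA.587840"))
```

Splitting at the first slash only matters for suffixes that contain slashes themselves, such as 10.1594/WDCC/dphase_mpeps.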
  6. DOI names for access and citations. Digital Object Identifiers (DOI names) offer a solution: • Most widely used identifier for scientific articles • Researchers, authors, publishers know how to use them • Put datasets on the same playing field as articles. Dataset example: Yancheva et al. (2007). Analyses on sediment of Lake Maar. PANGAEA. doi:10.1594/PANGAEA.587840. URLs are not persistent (e.g. Wren JD: URL decay in MEDLINE - a 4-year follow-up study. Bioinformatics. 2008 Jun 1;24(11):1381-5).
  7. http://www.doi.org
  8. At the infrastructure level, DOI names are handles. http://www.handle.net
  9. From KE workshop presentation, The Hague, June 2011 (L. Lannom)
  10. From KE workshop presentation, The Hague, June 2011 (L. Lannom)
  11. From KE workshop presentation, The Hague, June 2011 (N. Paskin)
  12. “The European Commission’s vision is that information already paid for by the public purse should not be paid for again each time it is accessed or used, and that it should benefit European companies and citizens to the full.” Openly accessible research data can typically be accessed, mined, exploited, reproduced and disseminated, free of charge for the user.
  13. Data publication improves access and sharing, and…
  14. nevertheless:
  15. DataCite • Global consortium carried by local institutions • Focused on improving the scholarly infrastructure around datasets and other non-textual information • Focused on working with data centres and organisations that hold data • Providing standards, workflows and best practice • Initially, but not exclusively, based on the DOI system • Memorandum of Understanding, Paris, February 2009 • Officially founded 1 December 2009 in London
  16. DataCite Members: • Technische Informationsbibliothek (TIB) • Canada Institute for Scientific and Technical Information (CISTI) • California Digital Library, USA • Purdue University, USA • Office of Scientific and Technical Information (OSTI), USA • Library of TU Delft, The Netherlands • Technical Information Center of Denmark • The British Library • ZBMed, Germany • ZBW, Germany • GESIS, Germany • Library of ETH Zürich • Institut de l’Information Scientifique et Technique (INIST-CNRS), France • Swedish National Data Service (SND) • Australian National Data Service (ANDS) • Conferenza dei Rettori delle Università Italiane (CRUI) • National Research Council of Thailand (NRCT) • MTA KIK - Hungarian Academy of Sciences • University of Tartu, Estonia • Japan Link Center (JaLC) • South African Environmental Observation Network (SAEON) • European Organisation for Nuclear Research (CERN). Affiliated members: • Digital Curation Centre, UK • Microsoft Research • Interuniversity Consortium for Political and Social Research (ICPSR) • Korea Institute of Science and Technology Information (KISTI) • Beijing Genomics Institute (BGI) • IEEE • Harvard University Library • World Data System (ICSU-WDS) • GWDG, Germany. Currently no member from Spain!
  17. DataCite Structure (diagram): International DOI Foundation; DataCite, with Managing Agent (TIB); Member Institutions, each working with several Data Centres. Diagram labels: Works with, Member, Associate Stakeholder.
  18. DataCite - the different roles. The DataCite registration agency: • Maintains the resolution infrastructure • Maintains a searchable database of metadata • Manages the identifiers over the long term • Establishes and shares best practice. Publishing agents (data centres, research institutes, repositories, data publishers) are responsible for: • Quality assurance • Content storage and access • Creating the identifiers • Creating and updating metadata
  19. Bridging the gap between publishers and data centres. DOIs in Use: DataCite. "CrossRef has registered more than 51 million DOIs on behalf of scholarly publishers. But CrossRef DOIs are not the only DOIs available in the scholarly community. DOIs for datasets associated with scholarly research are being registered by institutions in the DataCite network. DataCite and CrossRef have committed to the interoperability of their DOIs. Ideally, scholarly content like journals will cite related data by the appropriate DataCite DOI, and in return, the data record will cite the relevant article’s CrossRef DOI." (from CrossRef Quarterly, January 2012)
  20. Bridging the gap
  21. Publishers’ data policies?
  22. Data citation. Connecting article and underlying data via DOI. The dataset: Storz, D et al. (2009): Planktic foraminiferal flux and faunal composition of sediment trap L1_K276 in the northeastern Atlantic. http://dx.doi.org/10.1594/PANGAEA.724325 is supplement to the article: Storz, David; Schulz, Hartmut; Waniek, Joanna J; Schulz-Bull, Detlef; Kucera, Michal (2009): Seasonal and interannual variability of the planktic foraminiferal flux in the vicinity of the Azores Current. Deep-Sea Research Part I-Oceanographic Research Papers, 56(1), 107-124, http://dx.doi.org/10.1016/j.dsr.2008.08.009
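The "is supplement to" link between dataset and article can be recorded in the dataset's metadata. The sketch below builds a single `relatedIdentifier` element with `relationType="IsSupplementTo"`, as defined in the DataCite Metadata Schema; the function name is hypothetical, and the XML namespace and the enclosing `relatedIdentifiers` and `resource` elements of a full record are omitted here.

```python
import xml.etree.ElementTree as ET


def related_identifier(target_doi: str,
                       relation: str = "IsSupplementTo") -> ET.Element:
    """Build one relatedIdentifier element pointing at another DOI.

    Hypothetical helper; a complete DataCite record needs the schema
    namespace and the surrounding elements, which are left out.
    """
    elem = ET.Element(
        "relatedIdentifier",
        {"relatedIdentifierType": "DOI", "relationType": relation},
    )
    elem.text = target_doi
    return elem


# The PANGAEA dataset from the slide supplements this article DOI.
link = related_identifier("10.1016/j.dsr.2008.08.009")
print(ET.tostring(link, encoding="unicode"))
```

Ideally the article side carries the reverse link, so that the relation can be followed in both directions.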
  23. What type of data are we talking about? [Figure residue: sediment core plots of IRD (grav/10 cm3), Sand (%), CaCO3 (%), TOC (%), Radio (%/sand) and Smect (%/clay) against Age (kyr) for cores PS 1389-3, PS 1390-3, PS 1431-1, PS 1640-1 and PS 1648-1; map with grain size classes and geochemistry, source: Baltic Sea Research Institute, Warnemünde.] Examples: Earthquake events => doi:10.1594/GFZ.GEOFON.gfz2009kciu; Climate models => doi:10.1594/WDCC/dphase_mpeps; Sea bed photos => doi:10.1594/PANGAEA.757741; Digitized ancient documents => doi:10.12763/L401-06; Medical case studies => doi:10.1594/eaacinet2007/CR/5-270407; Computational model => doi:10.4225/02/4E9F69C011BC8; Audio record => doi:10.1594/PANGAEA.339110; Grey Literature => doi:10.2314/GBV:489185967; Videos => doi:10.3207/2959859860
  24. DataCite resource types (resourceTypeGeneral property). Anything that is the foundation of further research is research data. Data is evidence. • Dataset • Text • Collection • Event • Audiovisual • Image • InteractiveResource • Model • PhysicalObject • Service • Software • Sound • Workflow • Other. Most frequent on the MDS platform: Dataset (by far) > Text > Image > Collection
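Because resourceTypeGeneral is a controlled list, a client can check a value before submitting metadata. A minimal sketch, assuming the list is exactly the fourteen values shown on this slide (later schema versions may differ); the names `RESOURCE_TYPES` and `is_valid_resource_type` are hypothetical.

```python
# Controlled values as listed on the slide; treat this set as a snapshot.
RESOURCE_TYPES = frozenset({
    "Dataset", "Text", "Collection", "Event", "Audiovisual", "Image",
    "InteractiveResource", "Model", "PhysicalObject", "Service",
    "Software", "Sound", "Workflow", "Other",
})


def is_valid_resource_type(value: str) -> bool:
    """Return True if value is one of the listed types (case-sensitive)."""
    return value in RESOURCE_TYPES


print(is_valid_resource_type("Dataset"))
print(is_valid_resource_type("dataset"))
```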
  25. DataCite services • DataCite Metadata Store (MDS): DOI minting and metadata registration, https://mds.datacite.org • DataCite Metadata Search: metadata search for datasets in MDS, http://search.datacite.org • DataCite OAI Provider: exposure of metadata for harvesting (OAI-PMH), http://oai.datacite.org • DataCite Statistics: DOI registration and resolution statistics, http://stats.datacite.org
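For the OAI provider, harvesting follows the standard OAI-PMH request pattern: a `verb` plus arguments in the query string. The sketch below only builds a `ListRecords` URL and sends nothing. The endpoint path `/oai` is an assumption (the slide gives only the host name), and `oai_dc` is used because OAI-PMH requires every repository to support it.

```python
from urllib.parse import urlencode

# Assumption: the slide names only the host; the "/oai" path is a guess.
OAI_ENDPOINT = "http://oai.datacite.org/oai"


def list_records_url(metadata_prefix="oai_dc", set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL (hypothetical helper)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec      # optional selective harvesting by set
    if from_date:
        params["from"] = from_date    # optional lower date bound, YYYY-MM-DD
    return f"{OAI_ENDPOINT}?{urlencode(params)}"


print(list_records_url())
print(list_records_url(from_date="2014-10-01"))
```

A real harvester would also follow the `resumptionToken` that OAI-PMH returns when a result list is split across several responses.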
  26. DataCite services • DOI Citation Formatter: creation of different citation formats (for DataCite and CrossRef DOIs), http://crosscite.org/citeproc • Content Negotiation: metadata display in multiple formats, and direct access to content in specific formats defined by data centres, http://data.datacite.org • DataCite Metadata Schema: http://schema.datacite.org • DataCite Test Environment: all services for testing purposes on a test machine, http://test.datacite.org
  27. Metadata fields
  28. Metadata fields
  29. Searchterm: relatedIdentifier:*
  30. Searchterm: uploaded:[NOW-7DAY TO NOW]
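The two search terms above use a field:value query syntax with a wildcard and a date range. When such a term is sent over HTTP it has to be URL-encoded, since it contains colons, brackets and spaces. A minimal sketch; the endpoint `http://search.datacite.org/api` and the `q` and `rows` parameter names are assumptions, not taken from the slides.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Assumption: API path and parameter names are illustrative only.
SEARCH_ENDPOINT = "http://search.datacite.org/api"


def search_url(query: str, rows: int = 10) -> str:
    """Build a search URL with the query term safely URL-encoded."""
    return f"{SEARCH_ENDPOINT}?{urlencode({'q': query, 'rows': rows})}"


for term in ("relatedIdentifier:*", "uploaded:[NOW-7DAY TO NOW]"):
    url = search_url(term)
    # Decoding the query string gives back the original search term.
    decoded = parse_qs(urlsplit(url).query)["q"][0]
    print(url, "->", decoded)
```

The first term matches records that have any related identifier; the second matches records uploaded during the last seven days.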
  31. http://stats.datacite.org
  32. http://oai.datacite.org
  33. DataCite Content Service. Service for displaying DataCite metadata in different formats (BibTeX, RIS, RDF, etc.). URL pattern: http://data.datacite.org/MIME_TYPE/DOI
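The MIME_TYPE/DOI pattern can be filled in mechanically. A minimal sketch with a hypothetical helper `content_url`; the two example MIME types and the DOI are the ones shown later in the deck for the Emperor penguin dataset.

```python
def content_url(mime_type: str, doi: str,
                base: str = "http://data.datacite.org") -> str:
    """Fill the BASE/MIME_TYPE/DOI pattern of the content service."""
    return f"{base}/{mime_type}/{doi}"


# Formatted citation and RDF metadata for the same DOI.
print(content_url("application/x-datacite+text", "10.5524/100005"))
print(content_url("application/rdf+xml", "10.5524/100005"))
```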
  34. DataCite Content Service. Content Negotiation (through MIME type) • Access through DOI proxy (http://dx.doi.org) • First implemented by CNRI and CrossRef. Optimized for machine-to-machine (m2m) communication using the Accept header of the HTTP protocol: curl -L -H "Accept: MIME_TYPE" http://dx.doi.org/DOI Documentation: http://www.crosscite.org/cn/
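The curl call on this slide translates directly to Python's standard library. This is a sketch rather than a tested client: `build_cn_request` and `fetch` are hypothetical names, and `fetch` needs network access, so it is defined but not called here. `urllib.request.urlopen` follows redirects by default, which plays the role of curl's `-L`.

```python
import urllib.request


def build_cn_request(doi: str, mime_type: str,
                     proxy: str = "http://dx.doi.org") -> urllib.request.Request:
    """Prepare a content-negotiation request for a DOI (nothing is sent)."""
    return urllib.request.Request(f"{proxy}/{doi}",
                                  headers={"Accept": mime_type})


def fetch(doi: str, mime_type: str) -> str:
    """Send the request and return the response body (needs network)."""
    with urllib.request.urlopen(build_cn_request(doi, mime_type)) as resp:
        return resp.read().decode("utf-8")


req = build_cn_request("10.5524/100005", "application/rdf+xml")
print(req.full_url, req.get_header("Accept"))
```

Calling `fetch("10.5524/100005", "application/rdf+xml")` would then request the RDF metadata instead of the landing page, provided the service still supports that MIME type.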
  35. Resolving to the resource location (landing page): http://dx.doi.org/10.5524/100005
  36. Resolving to the citation: http://data.datacite.org/application/x-datacite+text/10.5524/100005 gives "Li, J; Zhang, G; Lambert, D; Wang, J (2011): Genomic data from Emperor penguin. GigaScience. http://dx.doi.org/10.5524/100005". Resolving to the RDF metadata: http://data.datacite.org/application/rdf+xml/10.5524/100005
  37. Research data repositories
  38. http://databib.org
  39. Databib & re3data.org: JOINING FORCES. 5 principles of agreement: 1) Openness 2) Optimal quality assurance 3) Development of innovative functionalities 4) Shared leadership 5) Sustainability. From presentation M. Kindling and M. Witt at DataCite Annual Conference 2014
  40. Related initiatives • Thomson-Reuters Data Citation Index • European Persistent Identifier Consortium (EPIC) • ODIN European project (ORCID and DataCite Interoperability Network) • CODATA/ICSTI Working Group on Data Citation • FORCE 11 / Data Citation Synthesis Group • OpenAIREplus project • Research Data Alliance • World Data System (ICSU-WDS)
  41. Related initiatives • Thomson-Reuters Data Citation Index • European Persistent Identifier Consortium (EPIC) • ODIN European project (ORCID and DataCite Interoperability Network) • CODATA/ICSTI Working Group on Data Citation • FORCE 11 / Data Citation Synthesis Group • OpenAIREplus project → Zenodo • Research Data Alliance • World Data System (ICSU-WDS)
  42. Measures of data citation and use
  43. DATA CITATION INDEX (© 2010 Thomson Reuters). Launched October 2012; 4M data records. • Enable the discovery of data repositories, data studies and data sets in the context of traditional literature • Link data to research publications • Help researchers find data sets and studies and track the full impact of their research output • Provide expanded measurement of researcher and institutional research output and assessment • Facilitate more accurate and comprehensive bibliometric analyses. From presentation N. Robinson at DataCite Annual Conference 2014
  44. METADATA PROCESSING (© 2010 Thomson Reuters). Repository provides metadata feed: • Collaboration on metadata handling. Normalisation and enhancement of metadata: • Controlled vocabularies • Indexing. Loading to DCI as data object records: • Citations from repository • Citations from literature. Metrics: • Citation counts. From presentation N. Robinson at DataCite Annual Conference 2014
  45. Partnership with DataCite (diagram): Repository 1, Repository 2, Repository 3 → Data Citation Index; with DataCite: Repository 1, Repository 2, Repository 3 → DataCite → Data Citation Index
  46. Agreement between DataCite and EPIC: special DOI prefix
  47. http://odin-project.eu
  48. ORCID/DataCite claim tool: http://datacite.labs.orcid-eu.org/
  49. http://www.codata.org/taskgroups/TGdatacitation/index.html
  50. http://rd-alliance.org
  51. http://www.icsu-wds.org
  52. http://datacite.inist.fr
  53. Thank you!
