SlideShare une entreprise Scribd logo
1  sur  41
Télécharger pour lire hors ligne
DataCite – Bridging the gap and
helping to find, access and reuse data

Herbert Gruttemeier
OpenAIREplus workshop
February 8th, 2013
Braga
Publishers’ data policies
Publishers’
data policies

extract from
Nature Publishing Group,
Editorial Policies,
Availability of data and
materials

H. GRUTTEMEIER
http://www.doi.org
At the infrastructure level, DOI names are handles.
http://www.handle.net
From KE workshop presentation, The Hague, June 2011 (L. Lannom)
From KE workshop presentation, The Hague, June 2011 (L. Lannom)
From KE workshop presentation, The Hague, June 2011 (N. Paskin)
plutôt: identifiant numérique d’objet

« The objects identified by DOI names may be of any form digital, physical, or abstract - as all these forms may be
necessary parts of a content management system. The DOI
system is an abstract framework which does not specify a
particular context of its application, but is designed with the
aim of working over the Internet. »
Norman Paskin, « Digital Object Identifier (DOI®) System »
DataCite

•
•

•
•

Global consortium carried by local institutions
Focused on improving the scholarly infrastructure around
datasets and other non-textual information
Focused on working with data centres and organisations that
hold data
Providing standards, workflows and best-practice
Initially, but not exclusively based on the DOI system

•
•

Memorandum of Understanding, Paris, February 2009
Officially founded December 1st 2009 in London

•
DataCite Members
• Technische Informationsbibliothek (TIB), Germany
• Canada Institute for Scientific and Technical Information (CISTI)
• California Digital Library, USA
• Purdue University, USA
• Office of Scientific and Technical
Information (OSTI), USA
• The British Library
• Technical Information Center
of Denmark (DTU)
• Library of TU Delft, The Netherlands
• ZBMed, Germany
• ZBW, Germany
• GESIS, Germany
• Library of ETH Zürich, Switzerland
• Institut de l’Information Scientifique et
Technique (INIST-CNRS), France
• Swedish National Data Service (SND)
• Australian National Data Service (ANDS)
• Conferenza dei Rettori delle
Università Italiane (CRUI)
• National Research Council of Thailand
(NRCT)

Affiliated members:
• Digital Curation Center, UK
• Microsoft Research
• Interuniversity Consortium for Political and Social Research (ICPSR), USA
• Korea Institute of Science and Technology Information (KISTI)
• Bejiing Genomic Institute (BGI)
DataCite
The DataCite registration agency
–
–
–
–

Maintains the resolution infrastructure
Maintains a searchable database of metadata
Manages the identifiers over the long term
Establishes and shares best practice

Publishing agents (data centres, research institutes, data
publishers) are responsible for
–
–
–
–

Quality assurance
Content storage and access
Creating the identifiers
Creating and updating metadata
What type of data are we talking about?
PS1389-3

PS1390-3

IRD

Sand

(grav/10 cm3)
0

CaCO3

(%)
20

0

TOC

(%)
100

0

Radio

(%)
15

0

Smect

(%/sand)
0.5

0

0

PS1431-1

IRD

(%/clay)
50

Sand

(grav/10 cm3)
100

0

CaCO3

(%)
20

0

TOC

(%)
100

0

Radio

(%)
15

0

Smect

(%/sand)
0.5

0

0

PS1640-1

IRD

(%/clay)
50

Sand

(grav/10 cm3)
100

0

CaCO3

(%)
20

0

TOC

(%)
100

0

Radio

(%)
15

0

Smect

(%/sand)
0.5

0

0

PS1648-1

IRD

(%/clay)
50

Sand

(grav/10 cm3)
100

0

CaCO3

(%)
20

0

TOC

(%)
100

0

Radio

(%)
15

0

Smect

(%/sand)
0.5

0

IRD

(%/clay)
50

0

Sand

(grav/10 cm3)
100

0

CaCO3

(%)
20

0

TOC

(%)
100

0

Radio

(%)
15

0

Smect

(%/sand)
0.5

0

(%/clay)
50

0

100

0.0

•

Earth quake events =>
doi:10.1594/GFZ.GEOFON.gfz2009kciu
Climate models => doi:10.1594/WDCC/dphase_mpeps
Sea bed photos => doi:10.1594/PANGAEA.757741
Distributes samples => doi:10.1594/PANGAEA.51749
Medical case studies => doi:10.1594/eaacinet2007/CR/5270407
Computational model => doi:10.4225/02/4E9F69C011BC8
Audio record => doi:10.1594/PANGAEA.339110
Grey Literature => doi:10.2314/GBV:489185967
Videos => doi:10.3207/2959859860
100.0

•
•
•
•
•
•
•
•

Anything that is the foundation
of further research
is research data
200.0

Age (kyr) max. : 233.55 kyr

Data is evidence

11°

12°

PS1389-3ff

13°

14°

15°

55°30'

55°30'

55° 0'

55° 0'

54°30'

54°30'

54° 0'

11°

12°

54° 0'

13°

14°

15°

Scale: 1:2695194 at Latitude 0°
Source: Baltic Sea Research Institute, Warnemünde.

World vector shore line
Grain size class KOLP A
Grain size class KOEHN2
Grain size class KOEHN
Geochemistry
Grain size class KOLP B
G i i
l
KOLP DIN
DataCite Structure
International DOI
Foundation
Member
DataCite

Managing Agent
(TIB)

Member
Institution

Member
Institution

Works
with

…
Data Centre
Data Centre
Data Centre

Associate
Stakeholder

Data Centre
Data Centre
Data Centre
Bridging the gap

DOIs in Use: DataCite
CrossRef has registered more than 51 million DOIs on behalf of scholarly publishers.
But CrossRef DOIs are not the only DOIs available in the scholarly community. DOIs
Publishers
Data centres
for datasets associated with scholarly research are being registered by institutions in
the DataCite network. DataCite and CrossRef have committed to the interoperability
of their DOIs. Ideally, scholarly content like journals will cite related data by the
appropriate DataCite DOI, and in return, the data record will cite the relevant article’s
(from CrossRef Quarterly, January 2012)
CrossRef DOI.
Bridging the gap
Data citation
Connecting article and underlying data via DOI:
The dataset:
Storz, D et al. (2009):
Planktic foraminiferal flux and faunal composition of sediment trap
L1_K276 in the northeastern Atlantic.
http://dx.doi.org/10.1594/PANGAEA.724325
Is supplement to the article:
Storz, David; Schulz, Hartmut; Waniek, Joanna J; Schulz-Bull,
Detlef; Kucera, Michal (2009): Seasonal and interannual
variability of the planktic foraminiferal flux in the vicinity of the
Azores Current.
Deep-Sea Research Part I-Oceanographic Research Papers, 56(1),
107-124,
http://dx.doi.org/10.1016/j.dsr.2008.08.009
Bridging the gap
•

DataCite supports researchers by enabling them to locate,
identify, and cite research datasets with confidence

•

DataCite supports data centres by providing workflows and
standards for data publication

•

DataCite supports publishers by enabling linking from articles
to the underlying data
http://www.datacite.org
http://schema.datacite.org
https://mds.datacite.org
http://search.datacite.org
http://oai.datacite.org
http://data.datacite.org
http://stats.datacite.org
Working Groups
•
•
•
•
•
•
•

Business Practices
Criteria for Data Centers
Identifier Syntax
Metadata
Services
Special Datasets
Technical Infrastructure
MDS: Central portal allowing
access to the metadata from
all registered objects (OAI)
DataCite Metadata 2.2 XML Schema
• Service for displaying DataCite metadata
• Different formats (BibTeX, RIS, RDF, etc.)
• Content Negotation (through MIME-Typ)
– Access through DOI proxy (http://dx.doi.org)
– First implemented by CNRI and CrossRef:
• Service for displaying DataCite metadata in different formats
•(BibTeX, RIS, RDF, etc.)
Documentation:
• A particular representation of the metadata can be requested via
content negotiation or by using DOI proxy (the "http://dx.doi.org"
formulation as a URL) and MIME-type

• http://www.crosscite.org/cn/

• Documentation: http://www.crosscite.org/cn/
Resolution - Current Status
Landing Page
with catalog
metadata
(human-readable)

Client (Web‐Browser) 
requesting PID

Persistent
Identifier
(DOI, URN, …)

Resolver
(DataCite, …)
Mapping Table
PID - URL

Problem
Not machine‐
actionable

Data
Details on
Data
(Rich
Metadata)
Details on
(human-readable)
Data
(Rich
Structured
Metadata)
(machine-
Content Negotiation - Based on the Solution
of CrossRef/DataCite
Web Page on Data
with catalog
metadata
(human-readable)

Client requesting PID
Persistent
Identifier
(DOI, URN, …)

Resolver
(DataCite, …)
Mapping Table
PID - URL

Different Accept Headers
in addition to URL
requesting different 
representations of PID

Details on
Data
(Rich
Metadata)
Details on
(human-readable)
Data
(Rich
Structured
Metadata)
(machineactionable)

Data
List of
repositories
for
research data
Some recent related developments
•
•
•
•

Thomson-Reuters Data Citation Index
ORCID official launch
ODIN European project
CODATA/ICSTI Working Group on Data
Citation
• Creation of the Research Data Alliance
ORCID and DataCite Interoperability Network

« ODIN will build on the ORCID
and DataCite initiatives to
uniquely identify scientists and
data sets and connect this
information across multiple
services and infrastructures for
scholarly communication.
It will address some of the
critical open questions in the
area: Referencing a data
object; Tracking of use and reuse; Links between a data
object, subsets, articles, rights
statements and every person
involved in its life-cycle. »
http://www.codata.org/taskgroups/TGdatacitation/index.html

http://www.codata.org/taskgroups/TGdatacitation/index.html
Thank you

Contenu connexe

Tendances

Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014
Jisc
 
Now we are six: Integrating Edinburgh DataShare into local and internet in...
Now we are six: Integrating Edinburgh DataShare into local and internet in...Now we are six: Integrating Edinburgh DataShare into local and internet in...
Now we are six: Integrating Edinburgh DataShare into local and internet in...
Robin Rice
 

Tendances (17)

Think Big about Data: Archaeology and the Big Data Challenge
Think Big about Data: Archaeology and the Big Data ChallengeThink Big about Data: Archaeology and the Big Data Challenge
Think Big about Data: Archaeology and the Big Data Challenge
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014Authority files - Jisc Digital Festival 2014
Authority files - Jisc Digital Festival 2014
 
EPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to knowEPSRC Policy Compliance: What researchers need to know
EPSRC Policy Compliance: What researchers need to know
 
Now we are six: Integrating Edinburgh DataShare into local and internet in...
Now we are six: Integrating Edinburgh DataShare into local and internet in...Now we are six: Integrating Edinburgh DataShare into local and internet in...
Now we are six: Integrating Edinburgh DataShare into local and internet in...
 
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
 
RDM for trainee physicians
RDM for trainee physiciansRDM for trainee physicians
RDM for trainee physicians
 
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
 
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
 
EPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasetsEPSRC research data expectations and PURE for datasets
EPSRC research data expectations and PURE for datasets
 
Certifying CISER! A Data Seal of Approval Case Study
Certifying CISER! A Data Seal of Approval Case StudyCertifying CISER! A Data Seal of Approval Case Study
Certifying CISER! A Data Seal of Approval Case Study
 
Connecting Museums with Linked Data
Connecting Museums with Linked DataConnecting Museums with Linked Data
Connecting Museums with Linked Data
 
Dataverse in the Universe of Data by Christine L. Borgman
Dataverse in the Universe of Data by Christine L. BorgmanDataverse in the Universe of Data by Christine L. Borgman
Dataverse in the Universe of Data by Christine L. Borgman
 
Enabling Precise Identification and Citability of Dynamic Data: Recommendatio...
Enabling Precise Identification and Citability of Dynamic Data: Recommendatio...Enabling Precise Identification and Citability of Dynamic Data: Recommendatio...
Enabling Precise Identification and Citability of Dynamic Data: Recommendatio...
 
Your Research Data Management with the support of 3TU.Datacentrum
Your Research Data Management with the support of 3TU.DatacentrumYour Research Data Management with the support of 3TU.Datacentrum
Your Research Data Management with the support of 3TU.Datacentrum
 
Going for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial MetadataGoing for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial Metadata
 

En vedette

Rainer Kuhlen: A commons-based foundation of open access and other open models
Rainer Kuhlen: A commons-based foundation of open access and other open models Rainer Kuhlen: A commons-based foundation of open access and other open models
Rainer Kuhlen: A commons-based foundation of open access and other open models
"Open Access - Open Data" conference, 13th/14th December, 2010
 

En vedette (8)

Rainer Kuhlen: A commons-based foundation of open access and other open models
Rainer Kuhlen: A commons-based foundation of open access and other open models Rainer Kuhlen: A commons-based foundation of open access and other open models
Rainer Kuhlen: A commons-based foundation of open access and other open models
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Guido F. Herrmann:
Guido F. Herrmann:Guido F. Herrmann:
Guido F. Herrmann:
 
New OpenAIRE data providers: some of the most recent from September to Decemb...
New OpenAIRE data providers: some of the most recent from September to Decemb...New OpenAIRE data providers: some of the most recent from September to Decemb...
New OpenAIRE data providers: some of the most recent from September to Decemb...
 
OpenAIRE@info day_amsterdam_jan_2016
OpenAIRE@info day_amsterdam_jan_2016OpenAIRE@info day_amsterdam_jan_2016
OpenAIRE@info day_amsterdam_jan_2016
 
Derk Haank: Open Access publishing at Springer
Derk Haank: Open Access publishing at SpringerDerk Haank: Open Access publishing at Springer
Derk Haank: Open Access publishing at Springer
 
Jan Velterop: Science publishing: the different interests of record keeping a...
Jan Velterop: Science publishing: the different interests of record keeping a...Jan Velterop: Science publishing: the different interests of record keeping a...
Jan Velterop: Science publishing: the different interests of record keeping a...
 
Connecting the dots - e-Infra services for open science
Connecting the dots - e-Infra services for open scienceConnecting the dots - e-Infra services for open science
Connecting the dots - e-Infra services for open science
 

Similaire à DataCite – Bridging the gap and helping to find, access and reuse data – Herbert Gruttemeier

PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013
Frauke Ziedorn
 
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
Frauke Ziedorn
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
Karlsruhe Institute of Technology (KIT)
 

Similaire à DataCite – Bridging the gap and helping to find, access and reuse data – Herbert Gruttemeier (20)

RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 
PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
DOI registration with DataCite - COOPEUS, ENVRI, EUDAT workshop 2013
 
EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan Broeder
 
The Experimental Project of DOI Registration for Research Data at Japan Link...
The Experimental Project of DOI Registration for Research Data at Japan Link...The Experimental Project of DOI Registration for Research Data at Japan Link...
The Experimental Project of DOI Registration for Research Data at Japan Link...
 
Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information access
 
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repositoryEdinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repository
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
Toward universal information access on the digital object cloud
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloud
 
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...The Materials Data Facility: A Distributed Model for the Materials Data Commu...
The Materials Data Facility: A Distributed Model for the Materials Data Commu...
 
Data management
Data management Data management
Data management
 
Dataset Metadata, Tools and Approaches for Access and Preservation
Dataset Metadata, Tools and Approaches for Access and PreservationDataset Metadata, Tools and Approaches for Access and Preservation
Dataset Metadata, Tools and Approaches for Access and Preservation
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Data Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDsData Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDs
 
The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...
 
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...
Research Data-DOI Experiment in Japanese DOI Registration Agency (Japan Link ...
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 

Plus de OpenAIRE

Plus de OpenAIRE (20)

10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call
 
9th Content Providers Community Call\
9th Content Providers Community Call\9th Content Providers Community Call\
9th Content Providers Community Call\
 
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)
 
8th Content Providers Community Call
8th Content Providers Community Call8th Content Providers Community Call
8th Content Providers Community Call
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community Call
 
OpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managers
 
What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community Call
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in Greece
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community Call
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community Call
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Dernier (20)

A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

DataCite – Bridging the gap and helping to find, access and reuse data – Herbert Gruttemeier

  • 1. DataCite – Bridging the gap and helping to find, access and reuse data Herbert Gruttemeier OpenAIREplus workshop February 8th, 2013 Braga
  • 2.
  • 3.
  • 5. Publishers’ data policies extract from Nature Publishing Group, Editorial Policies, Availability of data and materials H. GRUTTEMEIER
  • 6.
  • 7.
  • 9. At the infrastructure level, DOI names are handles. http://www.handle.net
  • 10. From KE workshop presentation, The Hague, June 2011 (L. Lannom)
  • 11. From KE workshop presentation, The Hague, June 2011 (L. Lannom)
  • 12. From KE workshop presentation, The Hague, June 2011 (N. Paskin)
  • 13. plutôt: identifiant numérique d’objet « The objects identified by DOI names may be of any form digital, physical, or abstract - as all these forms may be necessary parts of a content management system. The DOI system is an abstract framework which does not specify a particular context of its application, but is designed with the aim of working over the Internet. » Norman Paskin, « Digital Object Identifier (DOI®) System »
  • 14. DataCite • • • • Global consortium carried by local institutions Focused on improving the scholarly infrastructure around datasets and other non-textual information Focused on working with data centres and organisations that hold data Providing standards, workflows and best-practice Initially, but not exclusively based on the DOI system • • Memorandum of Understanding, Paris, February 2009 Officially founded December 1st 2009 in London •
  • 15. DataCite Members • Technische Informationsbibliothek (TIB), Germany • Canada Institute for Scientific and Technical Information (CISTI) • California Digital Library, USA • Purdue University, USA • Office of Scientific and Technical Information (OSTI), USA • The British Library • Technical Information Center of Denmark (DTU) • Library of TU Delft, The Netherlands • ZBMed, Germany • ZBW, Germany • GESIS, Germany • Library of ETH Zürich, Switzerland • Institut de l’Information Scientifique et Technique (INIST-CNRS), France • Swedish National Data Service (SND) • Australian National Data Service (ANDS) • Conferenza dei Rettori delle Università Italiane (CRUI) • National Research Council of Thailand (NRCT) Affiliated members: • Digital Curation Center, UK • Microsoft Research • Interuniversity Consortium for Political and Social Research (ICPSR), USA • Korea Institute of Science and Technology Information (KISTI) • Bejiing Genomic Institute (BGI)
  • 16. DataCite The DataCite registration agency – – – – Maintains the resolution infrastructure Maintains a searchable database of metadata Manages the identifiers over the long term Establishes and shares best practice Publishing agents (data centres, research institutes, data publishers) are responsible for – – – – Quality assurance Content storage and access Creating the identifiers Creating and updating metadata
  • 17. What type of data are we talking about? PS1389-3 PS1390-3 IRD Sand (grav/10 cm3) 0 CaCO3 (%) 20 0 TOC (%) 100 0 Radio (%) 15 0 Smect (%/sand) 0.5 0 0 PS1431-1 IRD (%/clay) 50 Sand (grav/10 cm3) 100 0 CaCO3 (%) 20 0 TOC (%) 100 0 Radio (%) 15 0 Smect (%/sand) 0.5 0 0 PS1640-1 IRD (%/clay) 50 Sand (grav/10 cm3) 100 0 CaCO3 (%) 20 0 TOC (%) 100 0 Radio (%) 15 0 Smect (%/sand) 0.5 0 0 PS1648-1 IRD (%/clay) 50 Sand (grav/10 cm3) 100 0 CaCO3 (%) 20 0 TOC (%) 100 0 Radio (%) 15 0 Smect (%/sand) 0.5 0 IRD (%/clay) 50 0 Sand (grav/10 cm3) 100 0 CaCO3 (%) 20 0 TOC (%) 100 0 Radio (%) 15 0 Smect (%/sand) 0.5 0 (%/clay) 50 0 100 0.0 • Earth quake events => doi:10.1594/GFZ.GEOFON.gfz2009kciu Climate models => doi:10.1594/WDCC/dphase_mpeps Sea bed photos => doi:10.1594/PANGAEA.757741 Distributes samples => doi:10.1594/PANGAEA.51749 Medical case studies => doi:10.1594/eaacinet2007/CR/5270407 Computational model => doi:10.4225/02/4E9F69C011BC8 Audio record => doi:10.1594/PANGAEA.339110 Grey Literature => doi:10.2314/GBV:489185967 Videos => doi:10.3207/2959859860 100.0 • • • • • • • • Anything that is the foundation of further research is research data 200.0 Age (kyr) max. : 233.55 kyr Data is evidence 11° 12° PS1389-3ff 13° 14° 15° 55°30' 55°30' 55° 0' 55° 0' 54°30' 54°30' 54° 0' 11° 12° 54° 0' 13° 14° 15° Scale: 1:2695194 at Latitude 0° Source: Baltic Sea Research Institute, Warnemünde. World vector shore line Grain size class KOLP A Grain size class KOEHN2 Grain size class KOEHN Geochemistry Grain size class KOLP B G i i l KOLP DIN
  • 18. DataCite Structure International DOI Foundation Member DataCite Managing Agent (TIB) Member Institution Member Institution Works with … Data Centre Data Centre Data Centre Associate Stakeholder Data Centre Data Centre Data Centre
  • 19. Bridging the gap DOIs in Use: DataCite CrossRef has registered more than 51 million DOIs on behalf of scholarly publishers. But CrossRef DOIs are not the only DOIs available in the scholarly community. DOIs Publishers Data centres for datasets associated with scholarly research are being registered by institutions in the DataCite network. DataCite and CrossRef have committed to the interoperability of their DOIs. Ideally, scholarly content like journals will cite related data by the appropriate DataCite DOI, and in return, the data record will cite the relevant article’s (from CrossRef Quarterly, January 2012) CrossRef DOI.
  • 21. Data citation Connecting article and underlying data via DOI: The dataset: Storz, D et al. (2009): Planktic foraminiferal flux and faunal composition of sediment trap L1_K276 in the northeastern Atlantic. http://dx.doi.org/10.1594/PANGAEA.724325 Is supplement to the article: Storz, David; Schulz, Hartmut; Waniek, Joanna J; Schulz-Bull, Detlef; Kucera, Michal (2009): Seasonal and interannual variability of the planktic foraminiferal flux in the vicinity of the Azores Current. Deep-Sea Research Part I-Oceanographic Research Papers, 56(1), 107-124, http://dx.doi.org/10.1016/j.dsr.2008.08.009
  • 22.
  • 23.
  • 24. Bridging the gap • DataCite supports researchers by enabling them to locate, identify, and cite research datasets with confidence • DataCite supports data centres by providing workflows and standards for data publication • DataCite supports publishers by enabling linking from articles to the underlying data http://www.datacite.org http://schema.datacite.org https://mds.datacite.org http://search.datacite.org http://oai.datacite.org http://data.datacite.org http://stats.datacite.org
  • 25. Working Groups • • • • • • • Business Practices Criteria for Data Centers Identifier Syntax Metadata Services Special Datasets Technical Infrastructure
  • 26. MDS: Central portal allowing access to the metadata from all registered objects (OAI)
  • 27.
  • 28. DataCite Metadata 2.2 XML Schema
  • 29.
  • 30.
  • 31.
  • 32. • Service for displaying DataCite metadata • Different formats (BibTeX, RIS, RDF, etc.) • Content Negotation (through MIME-Typ) – Access through DOI proxy (http://dx.doi.org) – First implemented by CNRI and CrossRef: • Service for displaying DataCite metadata in different formats •(BibTeX, RIS, RDF, etc.) Documentation: • A particular representation of the metadata can be requested via content negotiation or by using DOI proxy (the "http://dx.doi.org" formulation as a URL) and MIME-type • http://www.crosscite.org/cn/ • Documentation: http://www.crosscite.org/cn/
  • 33. Resolution - Current Status Landing Page with catalog metadata (human-readable) Client (Web‐Browser)  requesting PID Persistent Identifier (DOI, URN, …) Resolver (DataCite, …) Mapping Table PID - URL Problem Not machine‐ actionable Data Details on Data (Rich Metadata) Details on (human-readable) Data (Rich Structured Metadata) (machine-
  • 34. Content Negotiation - Based on the Solution of CrossRef/DataCite Web Page on Data with catalog metadata (human-readable) Client requesting PID Persistent Identifier (DOI, URN, …) Resolver (DataCite, …) Mapping Table PID - URL Different Accept Headers in addition to URL requesting different  representations of PID Details on Data (Rich Metadata) Details on (human-readable) Data (Rich Structured Metadata) (machineactionable) Data
  • 36. Some recent related developments • • • • Thomson-Reuters Data Citation Index ORCID official launch ODIN European project CODATA/ICSTI Working Group on Data Citation • Creation of the Research Data Alliance
  • 37.
  • 38.
  • 39. ORCID and DataCite Interoperability Network « ODIN will build on the ORCID and DataCite initiatives to uniquely identify scientists and data sets and connect this information across multiple services and infrastructures for scholarly communication. It will address some of the critical open questions in the area: Referencing a data object; Tracking of use and reuse; Links between a data object, subsets, articles, rights statements and every person involved in its life-cycle. »