Open if Possible, Protected if Needed: Services and tools for the sharing of research data in the Netherlands and Europe, by Peter Doorn, DANS (8th OpenAIRE workshop)
Similaire à Open if Possible, Protected if Needed: Services and tools for the sharing of research data in the Netherlands and Europe, by Peter Doorn, DANS (8th OpenAIRE workshop)
Exploration, visualization and querying of linked open data sourcesLaura Po
Similaire à Open if Possible, Protected if Needed: Services and tools for the sharing of research data in the Netherlands and Europe, by Peter Doorn, DANS (8th OpenAIRE workshop) (20)
Formation of low mass protostars and their circumstellar disks
Open if Possible, Protected if Needed: Services and tools for the sharing of research data in the Netherlands and Europe, by Peter Doorn, DANS (8th OpenAIRE workshop)
1. www.dans.knaw.nl
DANS is an institute of KNAW and NWO
Open if Possible, Protected if Needed:
Services and tools for the sharing of research data
in the Netherlands and Europe
Peter Doorn – peter.doorn@dans.knaw.nl
Director DANS
Chair, Science Europe Working Group on Research Data
OpenAIRE workshop - Legal issues in Open Research Data
RDA 9th Plenary, Barcelona – 4 April 2017
@pkdoorn
Panel: The Infrastructures: services and tools
2. Topics
1. The changing European environment for data protection
1. Open Science trends
2. General Data Protection Regulation
2. Sharing research data at DANS
1. Which restrictions? DANS access licenses
2. A special case: private digital library for TDM
3. DataTags to establish data sharing opportunities
3. Science Europe Working Group on Research Data
1. Report and workshop on TDM
2. Report on Funding Research Infrastructures
3. Data Glossary
4. RDM Protocols
3. 1. The changing European environment
for data protection
Open Science
GDPR
4. Dutch Presidency of EU 2016
• April 2016: Open Science on the
political agenda
• May 2016: Council conclusions (joint
declaration of EU Ministers)
• Call for Action on Open Science
• Open Data default for publicly funded
research
• Introduce incentives for FAIR data
sharing by valuing data stewardship
• Require data to be cited
• Improve insight into IPR and privacy
• Develop and set standards on privacy
by design
• National Plan for Open Science
https://goo.gl/4nlm4K
https://www.openscience.nl/
5. General Data Protection Regulation (GDPR)
• European GDPR is to replace the current Data
Protection Directive (1995). Official publication:
http://goo.gl/u3nsy8
• Strong lobby by research community, especially
medical/health science and social sciences
– Main worry was that GDPR would restrict research on
individual data
– Exceptions claimed for scientific research
– Special attention for data and text mining
http://goo.gl/Yl3ea0
6. GDPR (99 articles, 88 pages)
Passed European Parliament on April 27, 2016
• Preparation 2012-2016
• Transition/implementation 2016-2018
• Shall apply from May 25, 2018 in all EU member states
Some differences compared to current situation:
• European Law, unlike the former Directive from 1995
• A data subject should have the right to have personal data
concerning him or her rectified and a ‘right to be forgotten’
• Principle of “data minimisation” (period, amount, purpose)
• Informed consent as an important basis
• Data Protection Officer obligatory
• Personal data breaches have to be reported within 72 hours
(“Right to Know”); high penalties apply
7. What stays the same (probably)
• Supervisory Authorities (Privacy watchdogs) remain nationally
organized
• Data transfers to third countries or international organizations
under strict terms (Articles 44-46); safeguards with regard to
the rights of data subjects.
• Member States to formulate derogations for scientific research:
– “Processing for archiving purposes in the public interest, scientific
or historical research purposes or statistical purposes, shall be
subject to appropriate safeguards, in accordance with this
Regulation, for the rights and freedoms of the data subject. Those
safeguards shall ensure that technical and organisational measures
are in place in particular in order to ensure respect for the
principle of data minimisation.
– Those measures may include pseudonymisation.”
8. 2. Sharing research data at DANS
1. Which restrictions? DANS access licenses
2. A special case: private digital library for TDM
3. DataTags to establish data sharing opportunities
9. Data Archiving & Networked Services
Institute of
Dutch Academy
and Research
Funding
Organisation
(KNAW & NWO)
since 2005
First predecessor
dates back to
1964 (Steinmetz
Foundation),
Historical Data
Archive 1989
Mission: promote
and provide
permanent
access to digital
research
resources
10. DANS motto
Open als het kan,
Beschermd als het moet
Open if possible,
Protected if necessary
14. Ewoud Sanders private digital library
- Dutch historian of language and
journalist
- Private library of 40 bookcases,
mainly on literature and linguistics
- Now: collection of 160,000
digitized books and journals
- Digitization allowed for
“private/home use” according to
Dutch copyright law (1912)
- Not allowed to share with
researchers for scholarly use
- Not even allowed to deposit in
digital repository
15. Reports on Text and Data Mining
Past years: several reports on Tekst & Data Mining require
legislators to reform the EU’s legal framework to facilitate TDM
for research purposes
16. • Use the Sanders Digital Collection as a test case
• Archive it in DANS Trustworthy Digital Archive EASY
• Make it available for TDM to researchers
• Users will have to guarantee:
• Usage for TDM only
• No further distribution of the base material
• Publishing restricted to snippets of the base texts
17. “Dutch/European GDPR” DataTags
• Many researchers have limited awareness of legal privacy
requirements (both today and under GDPR)
• Data archive staff have limited possibilities to check
deposited data on potential privacy infringements
• Data tagging system very useful:
- raise the awareness
- structured Q&A system to establish whether protection is
required (and to what level)
- Dutch and EU (“GDPR-compliant”) DataTags version desirable
(even with national variations because of derogations)
18. Personal Data and the DANS archive
• Researcher uploading data is primarily responsible
• DANS can only check marginally
• Tool needed to support decisions on required data
protection – compliant with GDPR and national legislation
• Use Harvard’s DataTags as starting point
The table on the left shows
a partial ordering of access
credentials associated with
an exemplary model set of
datatags. As levels change,
so does the confidence that
a recipient of the file can be
identified and contacted. At
the least enforced level, the
Blue datatag requires no
access credentials. The
Green datatag requires that
a requester’s online contact
be verified, such as by
sending a link in an email
message to which the
requester must respond
(Open Authorization, OAuth). From the Yellow datatag onwards, the requester must submit an application
and receives access only after approval. As the levels advance, so do the required steps by which the
requester accepts the terms of a data use agreement. The Blue and Green datatags may have a minimal
http://datatags.org/
19. How the Data-tagging Tool Works
• Step 1: Questionnaire. The person tagging the data answers
a series of questions from a dynamic interview application
designed to elicit the key properties of a given dataset while
minimizing the number of questions presented to the user.
• Step 2: Assessment. Based on the user’s responses, these
kinds of data-tagging tools apply inference rules to
determine which handling requirements are relevant to the
dataset.
• Step 3: Assignment. The data-tagging tools assign simple,
iconic DataTags and a custom policy that indicate how the
dataset can be stored, transmitted, or used based on its
properties and the applicable restrictions.
20. 3. Science Europe Working Group on
Research Data
1. Report and workshop on TDM
2. Report on Funding Research Infrastructures
3. Data Glossary
4. RDM Protocols
21. Results of First Mandate, 2014-2016
http://www.scienceeurope.org/policy/working-groups/research-data/
Short: http://goo.gl/6gC6Bb
http://sedataglossary.shoutwiki.com/
Science Europe Data Glossary
22. Results of Second Mandate, 2016-2017
Involving communities in Research Data Management
Science Europe M.O.’s (and others) to
set Data Protocols Framework (Terms of
Reference for Domain Protocols)
Domain Data Protocols (DDPs) to be
openly published
Session at RDA & Draft Report: Domain Repositories Interest Group
Wednesday, April 5, 11:30-13:00, Room MR6
https://www.rd-alliance.org/ig-domain-repositories-rda-9th-plenary-meeting
Short: http://goo.gl/nMTrhI
23. Questions?
Watch our videos on YouTube
https://www.youtube.com/user/DANSDataArchiving
peter.doorn@dans.knaw.nl