GEOTEC UJI and FOSTER project organized a training seminar in the context of GEO-C ESR titled “Open Science and European Open Access policies in H2020”.
The seminar took place in Castellon (Spain), Feb 12th from 9.30 to 14.00.
Pests of soyabean_Binomics_IdentificationDr.UPR.pdf
OpenAIRE services and tools for researchers/authors and projects (FOSTER workshop at University Jaume I)
1. OpenAIRE services and tools
for researchers/authors and projects
Pedro Príncipe
pedroprincipe@sdum.uminho.pt
2. OPEN ACCESS
OpenAIRE implements the
EC requirements
& SUPPORTS THE OPEN DATA PILOT
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016
3. 3
OpenAIRE
Dec. 2009 - Nov. 2012
OpenAIREplus
Dec. 2011 – Dec. 2014
OpenAIRE2020
Jan. 2015 a Jun. 2018
From a pilot in FP7
to a key service in H2020
4. Guided by OA funder
policies (European
Commission)
Participatory
approach (human &
e-infrastructure)
Integrated Scientific
Information System
Project & Infrastruture…
4
5. OpenAIRE’s e-infrastructure Commons
Publications
repositories
Research Data
repositories
CRIS
systems
Registries
(e.g. projects)
OA
Journals
Software
Repositories
Validation
Cleaning De-duplication
Enrichment
By inference
Funders, research admins,
research communities
• Research impact
• Research trends
• Open Access trends
Content providers
• Repository validation
• Repository notification broker
• Repository analytics and usage stats
Researchers
• Claim publications, datasets, software
• Deposit publications, datasets, software
• Search & browse: interlinked publications, datasets,
projects
• Open Access & DMP Helpdesk
• End-User feedback
Content Providers
Info Space Services
End-User Services
Project initiative
FunderFunding
Result
Publication Data Software
Organization
GUIDE
LINES
TERMS
OF USE
21. 21
18 WEBINARS in 2015
802 participants
4
for NOADS
(2 helpdesk,
1 Zenodo, 1 FP7
Post-Grant OA
Publishing
Funds)
142
participants
6
National
by NOADs: 3
Repositories, 2
FP7 Post-Grant
OA Pub. Funds,
1 H2020 Open
Data Pilot)
143
participants
3
H2020 OA
policy
168
participants
2
H2020 Open
Data Pilot &
ZENODO
103
participants
2
Repository
compliance
Guidelines for
Data
Providers
146
participants
1
FP7 Post-
Grant OA
Publishing
Funds
100
participants
23. Workshop
• European Workshop on Open Access and Data Sharing
• Ghent, November 2015
• Attended by more than 100 key stakeholders (project
coordinators, research administrators, NPRs, NCPs, NOADs)
23
www.vimeo.com/channels/openaireworkshop5
27. OpenAIRE PORTAL
TOOLS FOR COMPLIANCE
Are you supposed to deposit?
What to deposit >
Where to deposit >
When to deposit >
When should Open Access be provided
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 27
2
28. Regardless of where you publish,
deposit the final manuscript or publisher’s PDF
in a repository, either institutional or disciplinary.
NB: It is not enough to list publications via a project website. They will go unnoticed!
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 28
29. 1 2 3INSTITUTIONAL
REPOSITORY
of the research
institution with which
they are affiliated
SUBJECT/THEMATIC
REPOSITORY
ZENODO REPOSITORY
Centralised option set
up by the OpenAIRE
project and CERN
Depositing
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 29
Through OpenAIRE they can be directed to: Publication repositories (OpenDOAR), Research Data
repositories (RE3DATA). If no repository is available: Zenodo at CERN (sponsored by OpenAIRE)
OpenAIRE harvests directly from a number of OpenAIRE compliant OA publishers and journal
aggregators.
30. LOCATE AN APPROPRIATED REPOSITORY
Deposit once!
Via OpenAIRE
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 30
31. Find where to deposit
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 31
32. Find where to deposit - DEMO
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 32
41. USE OUR LINKING SERVICES AFTERWARDS AND
ASSOCIATE YOUR RESEARCH RESULTS
When you publish or deposit in OA
make sure you use a fully OpenAIRE
compatible repository. If you don't…
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 41
5
42. LINK RESEARCH RESULTS TOOL
https://www.openaire.eu/participate/claim
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016
Link publication or datasets
to projets.
Identify the project, select
publications or datasets and
set the access rights.
43. LINK RESEARCH RESULTS - Demo
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016
44. STREAMLINES PROJECT REPORTING
OpenAIRE
We maintain a page for every EC (H2020, FP7) project,
featuring project information, related project
publications and datasets, and a statistics section.
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 44
6
49. CAN'T FIND OR SEE ALL OF YOUR
PROJECT'S PUBLICATIONS IN
OPENAIRE AT REPORTING TIME?
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 49
7
50. Linking research results
• 3 easy steps
• Identify projects (EC +)
• Find publications/data
• Set access rights
Publications and data to projects
53. YOUR PUBLICATIONS WILL BE REPORTED
AUTOMATICALLY TO THE EC'S PARTICIPANT
PORTAL AT REPORTING TIME.
Once you deposit in a fully
OpenAIRE compliant repository
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 53
55. Discover and reuse
Research in context. Navigate and find related research entities.
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 55
9
57. APPLY CLEANING, TRANSFORMATION,
DISAMBIGUATION PROCESSES, AND IDENTIFY
RELATIONSHIPS AMONG ALL RESEARCH ENTITIES
As we aggregate content from many
different sources, OpenAIRE can
Based on this enriched and linked information we are
able to produce aggregated statistics tailored to the
needs of any level research manager.
57
10
58. Statistics and monitoring
• Individual statistics
• Project, publication/data (usage), author (see demo)
• Aggregated statistics for
• Funding agency, programme, scientific area, country, institution
• OA evaluation
• Advanced statistics based on content classification and
clustering
• We welcome special requests – work in progress
Universitat Jaume I :: GEO-C / FOSTER workshop :: 12 Feb. 2016 58
62. Infrastructure for Open Knowledge
• Foster and facilitate the
shift of scholarly
communication towards
making science Open and
Reproducible
• Collaborative and
participatory approach at
European and Global level
Research
communities
Research
admins
Researchers
Funders
SMEsContent providers in
scholarly communication
Neworking &
e-Infrastructure
63.
64. Research data in OpenAIRE…
OPEN RESEARCH
DATA PILOT
in Horizon 2020
64
65. Open Research Data Pilot
• HORIZON 2020 GUIDELINES
• INFORMATION ABOUT THE PILOT AT THE OPENAIRE WEBSITE
• OPEN RESEARCH DATA PILOT
• CREATE A DATA MANAGEMENT PLAN
• SELECT A DATA REPOSITORY
• OPENAIRE BRIEFING PAPER
• OPENAIRE FACTSHEETS
• WEBINAR OPEN RESEARCH DATA PILOT AND ZENODO
• WORKSHOP PRESENTATION ABOUT RDM
• POSTCARD
• FAQS
Overview of the OpenAIRE support info
65
66. Open Research Data Pilot
• Open Research Data Pilot: https://www.openaire.eu/opendatapilot
• Create a Data Management Plan: https://www.openaire.eu/opendatapilot-dmp
• Select a Data Repository: https://www.openaire.eu/opendatapilot-repository
• Factsheet: https://www.openaire.eu/or-data-pilot-factsheet
Useful links to the support information available on the OpenAIRE portal:
66
68. Linking data and publications
• From a data-centric perspective publications are part of a
dataset’s context. However, there is no need to include
publications in the replication package:
• A lot of data repositories also accept publications, and allow
linking between publications and their underpinning data.
• By means of smart, persistent identifiers – consistently used –
linking is also possible across repositories.
68
69. In several domains there are indications that publications WITH links to data
receive more citations than papers WITHOUT data links.
This study in astrophysics is a recent example.
78. FP7 projects: publications + datasets
HYPOX > https://www.openaire.eu/search/project?projectId=corda_______::abb5725eaf2617c39ae240b4ce1cce3e
78
Open Access funded Publications
aggregated from repositories & journals
Datasets from Data
Repositories
79. Requirements Open Data Pilot
79
Data Management Plan required within six
months after project grant
Deposit your data in a research data repository
Open data is data that is free to access and
reuse: Creative Commons Licence CC-BY or CC0
80. How to write a DMP
• Template available from https://dmponline.dcc.ac.uk/
•
• And from a few national DMPonline sites, e.g. in Spain and Belgium
See https://www.openaire.eu/opendatapilot-dmp - Spain: http://pgd.consorciomadrono.es/ - Belgium - forthcoming 80
1
84. 84
Briefly specify
• how data will be captured/created
• how it will be documented
• according to what standards
• who will be able to access it
• where it will be stored
• how it will be backed up, and
• where and how it will be shared and
preserved long-term
87. Deliver the DMP
• Send the initial DMP version to the Commission within six months.
• EC: “Since DMPs are expected to mature during the project, more
developed versions of the plan can be included as additional deliverables
at later stages. (…) New versions of the DMP should be created whenever
important changes to the project occur due to inclusion of new data sets,
changes in consortium policies or external factors.”
87
88. Template mid-term review DMP
Broad notions: the data and associated metadata should be managed in a way
that allows for future reuse
88
90. Let’s recall the goal:
• Open access to research data refers to the right to
access and re-use digital research data. Openly
accessible research data can typically be accessed,
mined, exploited, reproduced and disseminated
free of charge for the user.
• The use of a Data Management Plan (DMP) is
required for projects participating in the Open
Research Data Pilot, detailing what data the project
will generate, whether and how they will be
exploited or made accessible for verification and re-
use, and how they will be curated and preserved.
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf 90
91. Negative intermezzo
• Stored data is not in itself “curated and preserved”
• Preserved (or: archived) data is not in itself findable
• Findable data is not in itself accessible
• Accessible data is not in itself understandable
• Understandable data is not in itself usable
91
What should be archived for
long-term reuse is a package
of data + context:
92. What should be deposited?
• The data needed to validate results in scientific publications (minimally!).
• The associated metadata: the dataset’s creator, title, year of publication, repository,
identifier etc.
• Follow a metadata standard in your line of work, or a generic standard, e.g. Dublin Core or
DataCite. Standards are important for discovering and exchanging data.
• The repository will assign a persistent ID to the dataset: important for discovering and citing...
• Documentation like code books, lab journals, informed consent forms – domain-dependent, and
important for understanding the data and combining them with other data sources.
• Software, hardware, tools, syntax queries, machine configurations – domain-dependent, and
important for really using the data. (Alternative: information about the software etc.)
Basically, everything that is needed to replicate a study should be available for others.
92
93. Repository, archive...
• A pilot requirement is to “deposit your data in a research data
repository”: a digital archive collecting and displaying datasets and their
metadata.
• Select a data repository that will preserve your data, metadata and
possibly tools in the long term.
• It is advisable to contact the repository of your choice when writing the first
version of your DMP.
• Repositories may offer guidelines for sustainable data formats and metadata
standards, as well as support for dealing with sensitive data and licensing.
But how to find a repository?
93
96. Main criteria for choosing a data repository:
• Certification as a ‘Trustworthy Digital Repository’, with an explicit
ambition to keep the data available in the long term.
• Matches your particular data needs: e.g. formats accepted; mixture of
Open and Restricted Access.
• Gives your submitted dataset a persistent and globally unique identifier:
for sustainable citations – both for data and publications – and to link
back to particular researchers and grants.
• Provides guidance on how to cite the data that has been deposited.
How to select a repository?
https://www.openaire.eu/opendatapilot-repository 96
97. Storage and Trust
• Local storage facilities during the research
• Network of trustworthy digital repositories for long-term preservation of (a
selection of) the data after the research is finished
• Certification of digital repositories in order to establish trust
• 4 certification standards available
98. Where to find a repository?
1. an external data archive or repository in your research domain
2. an institutional research data repository, or your research group’s
established data management facilities
1. Zenodo.org
2. or search for other data repositories at re3data.org
98
99. • re3data.org is a global registry of research data
repositories
• different academic disciplines
• It presents repositories for the permanent storage and
access of data sets
• Funded by the German Research Foundation (DFG)
• 2015: 1,368 reviewed repositories
99
I am going to guide you briefly through the main services that the OpenAIRE project offers for projects. This presentation will focus on the tools that will be of importance to you as researchers, research managers, project coordinators, so mainly tools that will make compliance with the OA publications.
I am going to guide you briefly through the main services that the OpenAIRE project offers for projects. This presentation will focus on the tools that will be of importance to you as ncps, research managers, project coordinators, so mainly tools that will make compliance with the OA publications.
I will highligth ten services/tools on how openaire can help research managers, ncps, project coordinator... Researchs that are participating in a H2020 project
OpenAIRE supports the various Open Access policies in the European Research Area. OpenAIRE offers a variety of resources if you want to know more about the EC’s Open Access Mandate and Open Data Pilot.
OpenAIRE supports the various Open Access policies in the European Research Area. OpenAIRE offers a variety of resources and support info materials, and if you still have questions – helpdesk.
STILL HAVE QUESTIONS? Ask our experts
If you cannot find an answer to your question, please contact us through our helpdesk.
STILL HAVE QUESTIONS? Ask our experts
Approach: deposit once – easy for researchers/research managers – and openaire will offer services.
If a researcher isn’t sure where to deposit , OPenAIRE can help.
If a researcher isn’t sure where to deposit , OPenAIRE can help.
Find also OpenAIRE validated repositories in the list of data providers – almost 700 providers.
Easy to use, easy to identify the project info in the metadata record.
This is stated in the AO H2020 mandate, OpenAIRE is helping to make it easy.
Example: zenodo
Example: Uminho repositor.
Alternative: claim a Publications and link it to the project info.
OpenAIRE portal have a specific page for each funded project!
You can also incorporate this publication list on your project website (better than adding pdfs)
A final useful tool resulting from the linking, is the facilitation of project reporting.
Once the research is deposited, it should be linked to the project information . This happens automatically, but it’s better if you check manually!
You can link data, publications and author IDs to each other , thus creating a mini CRIS system.
The linking of publications and data to project information is very useful during reporting time. You can do this manually by identifying your project and claim the related research output.
Three ways of linked:
- automatic: metadata (DC)
Automatic/experimental TDM procedures (FP7, wellcome, H2020, FCT)
Manual: Claiming
Mistakes are possible !
Claiming is not automatically done (there is also a delay).
Also to be used by participants portal reporting tool For publications accessible via OpenAIRE, references to these publications (with link to the project ID) are displayed automatically in the reporting section of the Participant Portal for the project. Simply check if the references are directly linked to the work performed within the project. If so, tick these publications as relevant and they will then be included in the table of publications when the report is generated.links and federates existing repositories. As such it provides access to publications that have been deposited in all repositories that are technically interoperable with OpenAIRE ("OpenAIRE compliant")
http://ec.europa.eu/research/participants/docs/h2020-funding-guide/grants/grant-management/reports/periodic-reports_en.htm
A final feature from the portal is the statistics and monitoring, on individual and aggregated levels.
We welcome special requests !
Very useful to check OA numbers etc when meeitng with project coordinators
Question about opting out projects?
Question about opting out projects?
“consistently used”: as in many situations this is to a small extent a technical matter: to a larger extent it depends on organising, agreements, and people citing reponsibly. So, if you are in a position to stimulate this, please do so.
On top of this, the OpenAIRE project investigates and improves automatic linking, also often via the PID.
392 datasets from PANGAEA AND 52 Publications from:
Unknown Repository (30)Biogeosciences (BG) (11)Biogeosciences (7)Biogeosciences Disc... (6)OceanRep (6)Ocean Science (OS) (4)Europe PubMed Central (3)Open Repository and... (2)Electronic Publicat... (2)NERC Open Research ... (2)Ocean Science Discu... (1)PLoS ONE (1)Ocean Science (OS) (1)e-Prints Soton (1)University of South... (1)Research@StAndrews:... (1)ArchiMer - Institut... (1)Earth-prints Reposi... (1)Archive ouverte UNIGE (1)Ghent University Ac... (1)
392 datasets
Second, you can select your organisation, but no problem if it’s not on the list. Note that ou may also find projects here, such as ELIXIR for life sciences.
You may want to include the guidance provided by the DCC. This is a good addition to the guidance that the EC provides on the questions of the template.
Next, click CREATE.
You’re asked to provide some basic information. Please note that the ID here is one that you enter yourself, for your convenience. I’ll show you in a second where I did this.
This page summarises that the DMP is a deliverable to be submitted within 6 months into the project. Below the orange bar it lists the topics of the initial DMP.
You’re asked to provide some basic information. Please note that the ID here is one that you enter yourself, for your convenience and that of your collaborators.
In this way the researcher proceeds to write the plan – more details follow in a second, but let’s first look ahead:
…and select an export format; for the EC PDF is fine.
The DMP is a deliverable/milestone to be delivered in the first 6 months AFTER the start of the project. The project officer and reviewers will ask for it, will evaluate it and give it a mark like any other deliverable (excellent, good, needs revision, rejected). This usually happens at the first review, unless the Project Officer is quite meticulous.
In subsequent reviews (or any time they feel like) the PO and reviewers may check to see if the DMP is followed (e.g., data files deposited, access status, metadata format, ...).
And make sure that you know what will be asked of you for the mid-term and the final review: the focus here is on enabling reuse of your data – by your future self and others.
In a couple of minutes I’ll tell you why this is a bit underspecified.
Okay, this is the easy part: there is a template. What’s really at stake of course is: what to write in the plan, and who should be involved?
The process of planning is also a process of communication, increasingly important in interdisciplinary / multi-partner research. Collaboration will be more harmonious if project partners (in industry, other universities, other countries…) are in accord.
Open Science encourages – and indeed requires – heterogeneous stakeholder groups to work together for a shared societal goal.
It’s worth bearing in mind that RDM and DMP are similarly hybrid activities, involving multiple stakeholder types…
The principal investigator (usually ultimately responsible for data)
Research assistants (may be more involved in day-to-day data management)
Ideally, they have a FO in the institute and/or in the domain: Library/IT/Legal/Funding office (The library may issue PIDs, or liaise with an external service who do this, e.g. DataCite. The funding office may have a compliance role)
And the FO ideally relies on back-office services, such as long-term archives and high-capacity data transfer.
Partners based in other institutions
Commercial partners
Publishers
etc
Many of us have a role in the FO or the BO: hand raising!!
Remember, we are still in the early stage of a project,
Re Software etc: in many cases copyright will prevent the archiving of software and tools. The alternative is a sensible description.
More about this in the break-out session after the lunch break…
http://www.veryicon.com/icons/object/package-icons/packageicon-zip.html
Proper data management won’t prevent theft of your laptop, but will help you to keep your data safe – even if they are not meant to go public.
Certification as a ‘trustworthy digital repository’, with an explicit ambition to keep the data available in the long term. We know of course that several domains have longstanding archives that are not certified as TRD, because they are unsure how much effort a certification process entails. We think that’s a pity. … three-tiered proces... And the Open Science, Open Access, Open Data effort should really encourage the willing repositories to apply for certification.
Matches your particular data needs (e.g. formats accepted; access, back-up and recovery, and sustainability of the service). Most of this information should be contained within the data repository’s policy pages.
Gives your submitted dataset a persistent and unique identifier: for sustainable citations – both for data and publications – and to link back to particular researchers and grants.
Lands visitors at the dataset or its metadata.
Helps to track how the data has been used by providing access and download statistics.
Offers clear terms and conditions that meet legal requirements (e.g. for data protection) and allow reuse without unnecessary licensing conditions.
Provides guidance on how to cite the data that has been deposited.
Elly will tell more about trustworthy repositories and also say a few words about storing data safely DURING the project, because that’s also part of data management.
mission to provide reliable, long-term access to managed digital resources to its designated community, now and into the future
constant monitoring, planning, and maintenance
understand threats to and risks within its systems
regular cycle of audit and/or certification
DIN 31644 / ISO 16363
Council for Science – World Data System (ICSU-WDS). Met deze certificering bevestigt WDS dat DANS betrouwbaar is als het gaat om: authenticiteit, integriteit, vertrouwenswaardigheid en beschikbaarheid van data en datadiensten.
Use an external data archive or repository already established for your research domain to preserve the data according to recognised standards in your discipline.
If available, use an institutional research data repository, or your research group’s established data management facilities.
Use a well-known data repository in your own country.
Use a cost-free (data) repository such as Zenodo.
Search for other data repositories here: re3data.org
If you dont have an appropriated repository use Zenodo - the catch-all repository hosted by CERN, is linked to OpenIARE and can be used to deposit research publications and data. Zenodo is of course harvested by the openaire portal
Zenodo is developed by CERN under the EU FP7 project OpenAIREplus