Presented at the Preforma Open Source Workshop 8 April 2016
As a library membership organization, LIBER works on addressing Open Science barriers. Standardisation of file formats can really help in overcoming some of these barriers: it enables us to process and preserve data in a controlled way, it helps ensure that outputs are really open and accessible in the long term and it improves interoperability of new tools and services. Making sure data is stored in a controlled way and can be (re) used today and in the future is an important element in Open Science. We see this as not only a technical challenge but also a social one: awareness, trust and community building is needed in order to ensure uptake of these standards. Libraries therefore have a valuable role to play in the development of good research data management throughout all phases of the Open Data lifecycle.
7. LIBER Europe
A central part of LIBER’s mission is to provide an
information infrastructure that enables research in
LIBER institutions to be world class.
For this infrastructure to thrive, it must be part
of an ecosystem that can accommodate and
nurture the changing nature of research and
innovation in the digital age.
9. Open Science Definition
“The conduction of science in a way that
others can collaborate and contribute,
where research data, lab notes and other
research processes are freely available,
with terms that allow reuse, redistribution
and reproduction of the research”
https://www.fosteropenscience.eu/foster-taxonomy/open-science-definition
11. LIBER and Open Science
So, we need alternative mechanisms for the recognition of
excellence in Open Science, e.g. ranking systems, to
Open up Science.
•From Publish or Perish to Open Science
•Scientific tools used need to be cited, and, in order to
make scientific experiments recreatable, there need to be
incentives to create open and sustainable software
13. Science Code Manifesto
•Code
source code written specifically to process data for a
published paper must be available to the reviewers and
readers of the paper.
18. LIBER and Open Science
Curation
‘Making sure data is stored in a controlled
way and can be (re) used today and in the
future is an important element in Open
Science’.
19. LIBER and Open Science
Standardisation of file formats will ensure (re-)
usability today and in the future, as it:
• enables processing and preservation of data in a
controlled way
• ensures outputs that are really open and
accessible in the long term
• improves interoperability of new tools and services
20. Workshop Nov 2015:
“Text and Data Mining in Europe:
Challenges and Action”
Participants: content providers (publishers,
data centers, museums and libraries)
Technical challenges identified:
•Quality of datasets
•Lack of a secure infrastructure
21. Solutions:
•Develop and use open standards
•Develop templates for metadata and content
•Allow for peer review of data quality, develop
validation tools,
•Appraise good quality data
•Organisations should invest resources to
improve the quality of their data
Workshop Nov 2015:
“Text and Data Mining in Europe:
Challenges and Action”
22. Workshop Feb 2016:
European Open Science Cloud
Opening paragraph of The European Open Science Cloud for Research Rome
Workshop Report:
‘The creation of a trusted environment for hosting and
processing research data (..) will help overcome many key
challenges currently facing scientific disciplines. These
challenges include a huge lack of awareness of the value of
data and the incentives for data sharing, a continued lack of
and urgent need for common standards to ensure
interoperability of data…’
24. Libraries enabling Open Science
Data Issue Libraries and data centres opportunities
Availability - Lower barriers to researchers to make their data available
- Integrate data sets into retrieval services
Findability - Support of persistent identifiers
- Engage in developing common meta description schemas
and common citation practices
- Promote use of common standards and tools among
researchers
Interpretability - Support crosslinks between publications and datasets
- Provide and help researchers understand meta-descriptions
of datasets
- Establish and maintain a knowledge base about data and
their context
.
25. Libraries enabling Open Science
Data Issue Libraries and data centres opportunities
Re-usability -Curate and preserve datasets
-Archive software needed for re-analysis of data
-Be transparent about conditions under which data sets can be
re-used (expert knowledge needed, software needed)
Citability -Engage in establishing uniform data citation standards
-Support and promote persistent identifiers
Curation/ -Transparency about curation of submitted data
Preservation -Promote good data management practice
-Collaborate with data creators
-Instruct researchers on discipline specific best practices in
data creation (preservation formats, documentation of
experiment,…)
26. Libraries enabling Open Science
Focus on Research Data Management:
• Growing variety of data types and volume
• Curation of data from the planning stage of research
projects
27. Libraries enabling Open Science
Awareness, trust and community building
•Institutions - develop policies and roadmaps
•Researchers - highlight benefits of open science
•(Other) Stakeholders at institutional level and
internationally
28. Libraries enabling Open Science
•Stay in control!
•Unite!
•Be active in projects like Preforma
•Advocate & Engage
29. Open Science
What can you do?
•Release data under CC0
•media components and arrangements of data
under CC BY
•Work from what is already working
•Use what is really open: freely available, can be
freely adopted, implemented and extended (no
license fees)
•Sign The Hague Declaration!
30. Elsevier TDM Policy
• Access through API only
• Text only- no images, tables
• Research must register details
• Click-through licence
• Terms can change any time
• Reproducibility of results
31. Thank you!
• The Hague Declaration: http://thehaguedeclaration.com
• LERU Roadmap for Research Data
http://www.leru.org/index.php/public/news/press-release-leru-
roadmap-for-research-data
• http://sciencecodemanifesto.org
• Research Data Alliance https://rd-alliance.org
• LIBER 10 Recommendations on Getting Started in RDM
http://libereurope.eu/wp-content/uploads/The%20research%20data
%20group%202012%20v7%20final.pdf
• OpenAire https://www.openaire.eu
• San Francisco Declaration
• http://www.ascb.org/dora-old/files/SFDeclarationFINAL.pdf
Notes de l'éditeur
First a few words about LIBER
LIBER is the Association of European Research Libraries, the main network for research libraries in Europe. Founded in 1971, the association now includes over 400 national, university and other special libraries.
We bring our members together through our annual conference.
This year in Helsinki,
Next year in Patras
And we bring our members together through participation in EU Projects, together with about 50 of our members.
The LIBER office is involved in nine EU projects, all to do with addressing barriers on the path towards Open Science.
We have projects on Text and Data mining, Open Access training, Open Access Policies, Research Data Management etcetera
Steering committees, working groups, three strategic directions
What is happening in Research Libraries nowadays is not « daily routine ». It’s about a revolution.
We have to re-invent the library for the future!
As a library membership organization, LIBER works on addressing Open Science barriers.
What is the Open Science ecosystem?
First of all: Open Science is more than just open access to publications. It also means open access to data, API’s, licenses, policies etcetera
It should be an ecosystem of sharing and collaboration, of re use and redistribution.
Open Science is difficult to define, but here is a definition from the FOSTER project: a project on open science training.
Shared as early as possible
Open unless…
In this eco system, the researcher is key. Researchers should contribute and benefit at the same time.
Incentives are needed for researchers to share and (re) use their data. In the end Open Science will change work habits and business models.
What we need is institutional recognition of alternative metrics. The publishing of software should rewarded, not the publication of the article describing the software research project. Why would researchers put effort in, for instance, to focus on an element of Open Science connected to the workshop of today; sustainable and open software if this is not rewarded at all?
Not new; an example: Out for a few years now, 1140 endorsements
This Science Code Manifesto speaks about Curation of open source software, but curation is needed in every area of the open science eco system
A few examples of my own: some Open Science events we co-organised as LIBER where standardisation of file formats was mentioned as part of OS:
In November, I co-hosted a Workshop aimed at content providers: publishers, data centers, museums and libraries, that are open to the use of Tekst and Data Mining on their data.
Another workshop co-organised by LIBER: The European Open Science Cloud workshop.
The workshop report begins with this statement:
And sligthly different, but just as interting for this crowd:
just this week, we co-organised four Open science cafes where we discussed all kinds of aspects of open science in an open discussion: one of the discussion statements was:
“libraries should spend money on preserving software in order to keep data available for re-use”
So, how do we see libraries in this? How can libraries help to enable Open Science?
Open Data: There are lots of opportunities to address issues with open data
Libraries have always played an important role in the curation of data
And, of course, all of this should be as open as possible.
Next to projects like Preforma, libraries can be active in Research Data Management
Slide 26: no mention of Digital Preservation yet: part of research data management from the start of projects on
We see not only technical challenges on the road towards open science, but also social challenges: awareness, trust and community building is needed in order to ensure uptake of eg. standards.
People say that we need registries of open source software, e.g. for TDM, but what about Github? Work from what is already working
Focus on really Open standards Definition in the line of Open Science: freely available, can be freely adopted, implemented and extended. ( no license fees)
As a library membership organization, LIBER brought global experts together to draft a collective statement, The Hague Declaration on knowledge discovery in the digital age, to show policy makers the strength of support in the research community for better access to facts, data and ideas.
The Hague Declaration has over 700 signatories - and counting.