More Related Content Similar to 11.20.18 DSpace for Research Data Management Webinar (20) 11.20.18 DSpace for Research Data Management Webinar1. DSpace for
Research Data Management:
Fordatis - a DSpace Solution
Andrea Wuchner & Dirk Eisengräber-Pabst,
Fraunhofer
Pascal Becker,
The Library Code GmbH
November 20, 2018
2. © Fraunhofer-Verbund
Innovationsforschung
R R
Page 2
FRAUNHOFER FACTS & FIGURES
Fraunhofer Lines: absorption lines in
the solar atmosphere caused by chemical
elements
The "missing colors" of the sun
■ Researcher
Discovery of “Fraunhofer
Lines“
fundamentals in optics &
spectral analysis
■ Inventor
New methods of lens
processing
■ Entrepreneur
Head of royal glass factory
Josef von Fraunhofer (1787 – 1826)
3. © Fraunhofer-Verbund
Innovationsforschung
Page 3
■ Point of departure
■ DSpace decision
■ Requirements and use cases authority data, linking
entities, descriptive metadata, identifier, integration
■ DSpace customization
■ Lesson learned
■ Still en route
AGENDA
4. © Fraunhofer-Verbund
Innovationsforschung
Page 4
Fraunhofer today at a glance
More than 70%
is derived from contracts
with industry and from
publicly financed
research projects.
Almost 30%
is contributed by the
German federal and states
Governments
Financevolume
€2.3 billion
2017
ContractResearch
€2.0 billion
Major infrastructure capital
expenditure and defense
research
25,327 staff
72 institutes
and research
units
FRAUNHOFER FACTS & FIGURES
5. © Fraunhofer-Verbund
Innovationsforschung
Page 5
Academic research structures
FRAUNHOFER‘s ACADEMIC SIDE
■ Extensive interrelations with universities
■ 80 Fraunhofer directors hold chairs at universities
■ Direct involvement into academic lecture content
■ Fraunhofer institutes offer PhD 2016 > 2053
dissertations, 2853 ongoing
■ x72: all institutes have own library and personel (scientific
information manager)
■ x1: central institutional repository, library and publication support
services (at IRB)
72 ∞64 Universities
48 Univ. of Applied Science
6. © Fraunhofer-Verbund
Innovationsforschung
Page 6
Open Access
OPEN SCIENCE JOURNEY
■ 2003: among first signatories of Berlin Declaration
■ 2006: Fraunhofer-Publica: open access view »Fraunhofer-ePrints«
[5]
■ 2009: open access policy, introduction of a central support team
■ 2012: extended publication support, deposit workflows,
community engangement
■ 2015: open access strategy and goals: 2020->50% OA, OA rate is
reported to ministry
■ 2018: consolidation of OA- management, own OA-research
projects
7. © Fraunhofer-Verbund
Innovationsforschung
Page 7
Open Access
OPEN SCIENCE JOURNEY
■ 2003: among first signatories of Berlin Declaration
■ 2006: Fraunhofer-Publica: open access view »Fraunhofer-ePrints«
[5]
■ 2009: open access policy, introduction of a central support team
■ 2012: extended publication support, deposit workflows,
community engangement
■ 2015: open access strategy and goals: 2020->50% OA, OA rate is
reported to ministry
■ 2018: consolidation of OA- management, own OA-research
projects
9. © Fraunhofer-Verbund
Innovationsforschung
Page 9
Open Data
OPEN SCIENCE JOURNEY
■ 2014: H2020 Open data pilot | DFG Guidelines on the handling of
Research Data
■ 2015: evaluation project research data management
■ 2016: FORDATIS project gets a go along with H2020 Research
Project JERRI
■ 2017: extended publication support, deposit workflows,
community engagement
■ 2018: information workshops and training on research data
management
10. © Fraunhofer-Verbund
Innovationsforschung
Page 10
Patenting, IP and industry relations
FRAUNHOFER‘s RTO SIDE
* Portfolio of active rights (patents and utility models) and patent applications at year end.
2015 2016 2017
Active patent families at year
end*
6573 6762 7036
Invention disclosures reports per
year
670 798 756
Patent applications per year 506 608 600
*2
Clarivate Analytics, 2017 [2]
Video encoding
White LEDmp
3
Fat free
saussageLupine protein■ RTOs are bridging the gap between basic research and market
2017: 11,000 projects of applied research with industry partners (7,500
with public funders)
■ RTO Mission is about adapting inventions to industry use and to
innovations
Among »Top 100 Global Innovators« (3 other German companies: BASF,
Bayer, Merck)*2
■ Collaborations with industry mostly confidential, exploitation interests
■ Different stakeholders and their requirements: industry, science, society,
politics
■ Tension field between open and closing –
high obligation to succeed economically and innovate with industry,
vs. transfering knowledge to society, politics and science
11. © Fraunhofer-Verbund
Innovationsforschung
Page 11
Benefits of opening research for RTOs and it‘s
stakeholders
■ SCIENCE
Contributing to scientific communities, accelerating scientific progress,
claiming scientific results
■ INDUSTRY
Input for open innovation processes – shorten innovation cycles
■ SOCIETY
Transfer of scientific results to society, create participation opportunities,
societal engagement
■ POLITICS
Support agenda setting processes through scientific knowledge
■ FUNDERS
Compliance with open strategies and requirements
■ ECONOMY
Fostering economic progress
FRAUNHOFER‘s RTO SIDE
13. © Fraunhofer-Verbund
Innovationsforschung
Page 13
WHY DO WE CARE FOR RESEARCH DATA
■ Publications are based on research results
■ Today, data as research results can easily be copied
■ We commited ourself to safeguard research data that were created at
Fraunhofer Institutes
■ As of today, many questions regarding research data are still open and
unclear
■ Our approach: do what we can do today as good as we can and improve
whenever possible
■ Create a solution as fast as we can and improve it instead of never ending
discussions without any solution
14. © Fraunhofer-Verbund
Innovationsforschung
Page 14
■ All repositories store metadata and files
■ The difference between a repository for research data and one for
publications are the type of the files
■ Research data repositores stores
■ Tables
■ Source code
■ Statistics
■ …
■ -> Any kind of file a researcher considers to have a scholarly value
REPOSITORIES FOR RESEARCH DATA
VS. REPOSITORIES FOR PUBLICATIONS
15. © Fraunhofer-Verbund
Innovationsforschung
Page 15
THINGS TO CONSIDER
■ Work together with researchers while planning, building and running the
repository
■ Work together with researchers to identify file formats that can be
considered to be lasting and adequate for reuse
■ Find out as good as possible what researchers in the organizations needs
■ Include as much of the whole organization as possible while planning the
workflows
16. © Fraunhofer-Verbund
Innovationsforschung
Page 16
OUR APPROACH
■ Prepration period of our project FORDATIS
■ Project FORDATIS to work together with all the stakeholders, to identify
the needs and to build a first version of our repository Fordatis
■ Survey among our researchers about their data and needs
■ Builded the repository in a very short time (June – November 2018,
together with The Library Code GmbH)
■ Now: period of testing, gaining experience and making enhancements
17. © Fraunhofer-Verbund
Innovationsforschung
Page 17
THING TO CONSIDER II
■ A repository is an infrastructure in an existing ecosystem
■ There are a lot of important conventions for repositories:
■ Use Persistent Identifiers
■ Be Open, be FAIR
■ Create and publish policies regarding longetivity and reuse of data
■ Use common protocols (sitemaps, OAI-PMH)
■ Typical workflows of repositories (self-submission)
■ Be part of a community
20. © Fraunhofer-Verbund
Innovationsforschung
Page 20
Requirements - Authority Data and Entities
Authority data
■ Projects as entities → new Publica
■ Persons as entities → new Publica
■ Organizations as entities → new Publica
■ DDC for subjects → Fordatis
Linking Entities
■ Linking between research data and publications → related identifier in
Fordatis
■ Linking between projects and research data → new Publica
■ Linking between persons and research data → new Publica
21. © Fraunhofer-Verbund
Innovationsforschung
Page 21
Requirements - FAIR-Principles
■ Access authorization: no access
to embargoed content planned.
■ Metadata longevity: A link to a
longevity concept for the
metadata will be available.
■ Usage of knowledge
representation language: for
metadata standards are used,
the specification will get
published.
■ Detailed provenance: fields
giving information on
provenance, field “description
provenance”
■ Meets community standards →
Standards are work in progress
22. © Fraunhofer-Verbund
Innovationsforschung
Page 22
Requirements - Organizational Embedding
■ Submission workflows: research data needs to get from researcher
quality assured into the Fordatis repository
■ Data management workflows: Research data management from cradle to
bare → in SAP within Fraunhofer Digital Project
■ Planned connection with Fraunhofer-Dataspace: Sparql-Endpoint for
linked data
23. © Fraunhofer-Verbund
Innovationsforschung
Page 23
DSpace customizations
■ DSpace JSPUI contains a modern ressumable upload mechanism already
■ Enhanced the metadata schema and the submission form
■ Hide the community & collection list and added browse indices for
Fraunhofer Institutes and groups and clusters instead
■ Changed the layout to achieve the style of Fraunhofer publications
■ Changed the item view
■ Don’t show the metadata in a table, put the content in the center
■ Format an item title as an title and an affiliation as such
■ Use badges for DOIs, dc.type and dc.date.issued
■ Collapse long metadata fields like abstract or file list
■ Move file list up to be directly reachable when Item view gets loaded
■ Added a PDF-Preview with PDF.js
25. © Fraunhofer-Verbund
Innovationsforschung
Page 25
Lessons Learned
■ Working on the FORDATIS project required different skills, knowledge and
roles, and different people were involved in success in different ways.
■ The project FORDATIS has interfaces and dependencies to other projects
and developments, for example the Publica and the project Fraunhofer
Dataspace, which have to be considered.
■ Research data management is a new development theme that will evolve
over the coming years. There are still many unanswered questions in this
context. Do now, what’s possible today.
■ In addition to the technical implementation, the organizational
embedding of a repository must be considered as well. For this, all
relevant actors and networks must be brought on board and integrated.
■ We’re highly awaiting DSpace 7
26. © Fraunhofer-Verbund
Innovationsforschung
Page 26
Still en route
■ Implementation
■ Beta-Version
■ Testing
■ Adjustment
■ Documentation
■ Go-Live
■ Beta-Version becomes Live-System
■ Organizational Embedding
■ Establishing of submission
workflows at the Institutes and
Competence Center Research
Services & Open Science
■ Communication: communication to
institutes, Fraunhofer-Reseachers,
world
28. © Fraunhofer-Verbund
Innovationsforschung
Page 28
Publication & Data – to mix or not to mix
■ Pro & contra
No reason not to mix – exception: special formats
■ Existing IT/Publication infrastructure
Redesign of the existing Fraunhofer IT infrastructure
Integrate Fraunhofer Publica and Fordatis with each other
■ Project “Fraunhofer Digital”
To make diverse Fraunhofer data sources available in Fraunhofer Data
Space
Still on departure
29. © Fraunhofer-Verbund
Innovationsforschung
Page 29
CC
Research
Services &
Open
Science
Dirk Eisengräber-Pabst
Dirk.eisengraeber-pabst@irb.fraun
hofer.de
Tel: +49 (0)711 / 970-2979
Competence Center Research Services & Open Science
Fraunhofer-Informationszentrum Raum und Bau (IRB)
www.irb.fraunhofer.de
Nobelstraße 12
D-70569 Stuttgart
Germany
Andrea Wuchner
Andrea.wuchner@irb.fraunhofer.de
Tel: +49 (0)711 / 970-2714
Expertise:
■ Open
Science
@
Fraunhofer
■ Indicators
■ Disseminatio
n-strategies
■ Publication
Support
■ Repository
Managment
■ Community
Engagement
CONTACT
Fraunhofer IRB
Fraunhofer-Inform
ationszentrum
Raum und Bau IRB