Research data as the main product of research can be unique and is often the result of a complex and cost-intensive research process. Reuse and reinterpretation of such material is envisioned, not only to maintain research integrity, but also to accelerate the advancement of science by sharing results in an early stage.
Generally speaking, there is little general experience with preservation, provision and publishing of research data. Thus so far little research has been done when it comes to researching data publishing models. In history, this has partly been due to the limited existing infrastructures, but with current information technologies, modern and tailored research data provision and publishing are facilitated.
Why tailored? Characteristics of research data vary across and within disciplines. This results in more complex prerequisites/specification when compared to the process of paper publication which is very similar across disciplines. Thus, tailored models are necessary to match the individual characteristics of research data across disciplines. Within this presentation three different approaches are distinguished: object centric, text centric and data centric. Prerequisites and limitations regarding timing and room of the data provision will be discussed and experiences with each of the different models presented.
Regardless of these models, it becomes apparent that due to the individual characteristics of research data, its provision and publication is only possible with the support and knowhow of the research community. This know-how needs to be linked to the competences of infrastructure facilities.
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
Sünje Dallmeier-Tiessen: Research data "publishing": models, roles and responsibilities
1. Research data “publishing” –
models, roles and responsibilities
Sünje Dallmeier-Tiessen
CERN | Scientific Information Service
Humboldt Universität, Berlin
2. Outline
Research data “publishing”
• Introduction: Research data
• Models
– Text-centered occurrence
– Data-centered occurrence
– Object-centric principle
• Roles
– Motivation, Competences, Responsibilities
• Conclusions
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
4. What is research data?
Research data How does it look like?
How can this be integrated
into modern scholarly
communication?
Foto: AWI Bremerhaven, Gutt
5. Research data “publishing” beyond
disciplines?
Discipline specific characteristics of research data,
BUT challenges are similar:
– Non existing data publishing culture
• Missing infrastructure
• Non established workflows
• Missing awareness/motivation/submission
“we need to get this paper out” vs. “we need to publish this
dataset”
– Diverse research datasets
• Complexity: From simple files to complex datasets
• Size: From small to “huge”
• Documentation: varying requirements
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
6. Models
How does the world of research
data publishing look like today?
stand-alone, text centered, data
centered
7. Occurence:
text centered publication model
Research
Data
• Long tradition
• Linked research data in
external data Metadata
Tables
respositories, e.g. DataCite
Text, Results,
• Attached supplemental Interpretation
material: tables, graphs,
audio/video/data files (on
same plattform as text
Graphs, Documen-
publication) Figures tation
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
8. Supplement
Source: Nature 2006
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
9. Occurrence: data centered publication
model
Interpretation,
Article
Services I
Interpretation,
Article
II
Research
Tables
Data
Metadata
Graphs,
Figures Documen-
tation
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
10. Data Journal
Data centered publication
• including data documentation
• data access
• quality assurance – peer review
• open access
Incentive: publication for
enhanced data provision
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
12. With modern information technology,
many opportunities
… to service the different needs in the research
communities:
• When do I want to publish my data?
• After production?
• Before or after the “text” publication?
• And with whom would I like to share?
• With my colleagues?
• With my community?
• With the entire world?
• … or maybe this might change over time?
Different models and responsibilities for involved
parties
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
13. Looking into the future:
Principle – object centered (“mash up”)
Materials and services as individually addressed objects
• Persistence of objects/services,
• Persistent identifier
Results and
Interpretation
Table
Figure
Research Data
Documentation
Services
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
14. Hands-on!?
Information
Science/
Publishers
IT/ Research
Computing Community
after Corral, 2008
Foto by katera, flickr
15. Prerequisites for Research Data Publishing
Cooperation of researchers and service providers
Access
Research data
Quality Infrastructure/
(Documentation,
Workflows
Metadata…)
Persistence,
Trust-
worthiness
Community/Researchers: Infrastructure provider
Workflows
Provision, Offer infrastructure
Documentation, Standards
Quality control Trustworthiness
Responsibilities
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
16. Role of researchers
… as data providers
[data production]
Foto by G. F. Wicke, flickr
• Data documentation
• Data submission (preparation of publishable
dataset)
• Data quality assurance/peer review
… as data users
• Citation of dataset, acknowledgement
Source: Nature 2006
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
17. Role of information scientists
Service
• Management of the research material
remix
– Facilitating new workflows
– Integration of new features: e.g. data
citation
– Preservation, catalogue
• Interface between IT and researcher
– Make publishing platforms usable
• Submission
• Search functionality
foto: flickr
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
18. Conclusions
• Missing data publishing culture in many
communities
• New publishing models allow tailored
services, developed with the community
• Significant role of community in research data
preservation and publishing
– Implications for content recruitment
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
19. QUESTIONS
sunje.dallmeier-tiessen@cern.ch
Funded by BMBF, Gentner Scholarship
Open Access – Open Data 2010 Sunje Dallmeier-Tiessen | CERN
Notes de l'éditeur
Here add the files from gentner day
Plus the advancements that Martin presented already
Basis fuer die textorientierte und die datenorientierte Publikationsform, dadurch auch zeitliche unabhaengigkeit