SlideShare une entreprise Scribd logo
1  sur  46
Télécharger pour lire hors ligne
Connecting GESIS research data and
publication information systems
Katarina Boland
Department Knowledge Technologies for the Social Sciences
GESIS - Leibniz Institute for the Social Sciences, Cologne, Germany

OpenAIRE Interoperability Workshop, Braga, Portugal
08.02.2013
Outline
1

Introduction to GESIS

2

GESIS information systems

3

Linking publications to datasets

4

Connecting research data and publication information systems

Connecting GESIS research data and publication information systems

2/28
GESIS
largest infrastructure institution for the Social Sciences in
Germany
five scientific departments (Mannheim, Cologne, Berlin)

Connecting GESIS research data and publication information systems

2/28
GESIS

Connecting GESIS research data and publication information systems

3/28
GESIS

Connecting GESIS research data and publication information systems

3/28
GESIS

Connecting GESIS research data and publication information systems

3/28
Publications
SSOAR - Social Science Open Access Repository
SOWIPORT - the social science portal

Connecting GESIS research data and publication information systems

4/28
Publications
SSOAR - Social Science Open Access Repository
electronic full texts (Social
Sciences) for free access
mainly pursues the Green
Way of Open Access
http://www.ssoar.info/en.html

SOWIPORT - the social science portal
Connecting GESIS research data and publication information systems

4/28
Publications
SSOAR - Social Science Open Access Repository
SOWIPORT - the social science portal
approximately 7 million
references on publications and
research projects from 18
databases
additional information
on institutions and events
http://www.gesis.org/sowiport/en/

Connecting GESIS research data and publication information systems

4/28
Research Data
detailed information, documentation on variable-level and beyond
ZACAT - Online Study Catalogue
MISSY - Microdata Information System
documentation on study-level
da|ra - Registration agency for social and economic data
DBK - Data Catalogue

Connecting GESIS research data and publication information systems

5/28
Research Data
da|ra - Registration agency for social and economic data
DBK - Data Catalogue

Connecting GESIS research data and publication information systems

6/28
Research Data
da|ra - Registration agency for social and economic data
German DOI registration
service for social science and
economic data
by GESIS and ZBW - German
National Library of Economics,
in cooperation with
DataCite
http://www.da-ra.de/en/home/

DBK - Data Catalogue

Connecting GESIS research data and publication information systems

6/28
Research Data
da|ra - Registration agency for social and economic data
DBK - Data Catalogue
study descriptions from survey
research, historical social
research and texts for content
analyses
documentation of data from
official statistics will be added
successively
http://www.gesis.org/en/services/research/
data-catalogue/
Connecting GESIS research data and publication information systems

6/28
Outline
1

Introduction to GESIS

2

GESIS information systems

3

Linking publications to datasets

4

Connecting research data and publication information systems

Connecting GESIS research data and publication information systems

7/28
Linking publications to
datasets
the InFoLiS project:
Integration of research data and publiations for the Social
Sciences

InFoLiS is funded by the DFG (SU 647/2-1)

Connecting GESIS research data and publication information systems

8/28
InFoLiS project goals

Response

Re

er y

Qu

Catalogue:
Publications
SSOAR (GESIS),
Primo (UB MA),
...

po
ns
e

se

on

sp

Links

Qu

Re
s

ery

Response

Catalogue:
Research Data
da|ra (GESIS),
...

Connecting GESIS research data and publication information systems

Data

9/28
References to datasets
erfolgt die Darstellung und Diskussion der empirischen Ergebnisse. Hierfür werden
die Daten des Sozio-oekonomischen Panels (SOEP) aus den Jahren 1990 und 2003
verwendet und für beide Zeitpunkte werden die Einflussfaktoren mittels linearer
Regressionsmodelle geschätzt.

presentation and discussion of the empirical findings. For this purpose, data
from the Socio-Economic Panel (SOEP) of the years 1990 and 2003 are used
and for both periods, the impact factors are estimated using linear regression
models.

Connecting GESIS research data and publication information systems

10/28
References to datasets
erfolgt die Darstellung und Diskussion der empirischen Ergebnisse. Hierfür werden
die Daten des Sozio-oekonomischen Panels (SOEP) aus den Jahren 1990 und 2003
verwendet und für beide Zeitpunkte werden die Einflussfaktoren mittels linearer
Regressionsmodelle geschätzt.

data from the <title> of the years <year> are used

Connecting GESIS research data and publication information systems

10/28
References to datasets
Tabelle 1: Bevölkerungsvorausberechnung für Deutschland nach Altersgruppen - Anteile in
Prozent
(Datenbasis: 10. Bevölkerungsvorausberechnung des Statistischen Bundesamtes, Variante 5)

Table 1: Population forecast for Germany depending on age cohorts - proportion
in percent.
Data base: 10th Population Forecast of the Federal Statistical Office , version 5.

Connecting GESIS research data and publication information systems

11/28
References to datasets
Tabelle 1: Bevölkerungsvorausberechnung für Deutschland nach Altersgruppen - Anteile in
Prozent
(Datenbasis: 10. Bevölkerungsvorausberechnung des Statistischen Bundesamtes, Variante 5)

(Data base: <number>. <title> of the <data collector>,
version <version>)

Connecting GESIS research data and publication information systems

11/28
References to datasets
1 Herangezogen wurden außerdem Allbus, Allensbacher Erhebungen, Eurobarometer, International
Social Survey Program, International Social Justice Project, Sozio-ökonomisches Panel, World
Values Survey.

Consulted were furthermore ...

Connecting GESIS research data and publication information systems

12/28
References to datasets
1 Herangezogen wurden außerdem Allbus, Allensbacher Erhebungen, Eurobarometer, International
Social Survey Program, International Social Justice Project, Sozio-ökonomisches Panel, World
Values Survey.

Consulted were furthermore <title1>, <title2>, <title3>, ...,
<titleN>.

Connecting GESIS research data and publication information systems

12/28
References to datasets
Tabelle 3: Stichprobe der Untersuchung in den Jahren 2003 und 2004 sowie Größe der Stichprobe, mit gültigen Daten aus beiden Erhebungen
(Quelle: Ditton u.a. 2005a)

Table 3: Sample of the surveys conducted in the years 2003 and 2004 as well
as size of the sample, with valid data from both surveys
(Source: Ditton et al. 2005a)

Connecting GESIS research data and publication information systems

13/28
References to datasets
Tabelle 3: Stichprobe der Untersuchung in den Jahren 2003 und 2004 sowie Größe der Stichprobe, mit gültigen Daten aus beiden Erhebungen
(Quelle: Ditton u.a. 2005a)

(Source: <citation of descriptive publication>)

Connecting GESIS research data and publication information systems

13/28
References to datasets
Grafik 7: Einschätzung der wirtschaftlichen Lage: Einschätzung der eigenen wirtschaftlichen Lage
(in Prozent)
(Quellen: Allbus/Sozialstaatssurvey)

(Sources: Allbus/Sozialstaatssurvey )

Connecting GESIS research data and publication information systems

14/28
References to datasets
Grafik 7: Einschätzung der wirtschaftlichen Lage: Einschätzung der eigenen wirtschaftlichen Lage
(in Prozent)
(Quellen: Allbus/Sozialstaatssurvey)

(Sources: <title1>/<title2>)

Connecting GESIS research data and publication information systems

14/28
Linking publications to
datasets
References to datasets are not standardized!
see also...
Green, Toby (2009). We Need Publishing Standards for
Datasets and Data Tables. OECD Publishing White Paper.
doi: 10.1787/603233448430
Altman, Micah and Gary King (2007). A Proposed Standard
for the Scholarly Citation of Quantitative Data. In: D-Lib
Magazine 13.3.
url: http://www.dlib.org/dlib/march07/altman/03altman.html
Connecting GESIS research data and publication information systems

15/28
Automatic identification of
references
Why not simply search for study titles in publications?
Studies are referenced using abbreviations, alternative
names or literature
Study titles may be common nouns - ambiguous!
there is no complete list of all conducted studies

Connecting GESIS research data and publication information systems

16/28
General idea
How do humans recognize study references?

Source: Estimations based on SOEP, wave 2002.

Connecting GESIS research data and publication information systems

17/28
General idea
How do humans recognize study references?

Source: Estimations based on xyz, wave 2002.

Connecting GESIS research data and publication information systems

17/28
General idea
How do humans recognize study references?

Source: Estimations based on xyz, wave 2002.

→ Learn patterns: typical contexts for study references

Connecting GESIS research data and publication information systems

17/28
General idea
How do humans recognize study references?

Source: Estimations based on xyz, wave 2002.

→ Learn patterns: typical contexts for study references
→ Sparse Data Problem: use iterative bootstrapping approach

Connecting GESIS research data and publication information systems

17/28
Algorithm

Connecting GESIS research data and publication information systems

18/28
Evaluation: Precision &
Estimate of Recall

about 14% of the found references are not study names, but
citations of publications → not counted as incorrect here
subset of SSOAR with keyword “empirisch-quantitativ”
(empirical quantitative)
German, n = 259
conversion pdf → txt with automatic correction
Connecting GESIS research data and publication information systems

19/28
Reference extraction
for details see...
Boland, Katarina, Ritze, Dominique, Eckert, Kai, & Mathiak, Brigitte (2012).
Identifying References to Datasets in Publications. International Conference on
Theory and Practice of Digital Libraries (TPDL) (pp. 150-161). Paphos, Cyprus:
Springer Berlin Heidelberg. doi:10.1007/978-3-642-33290-6 17

Connecting GESIS research data and publication information systems

20/28
Matching to da|ra records

Connecting GESIS research data and publication information systems

21/28
Matching to da|ra records

Connecting GESIS research data and publication information systems

21/28
Matching to da|ra records
→ Precise matching to DOI not always possible!
→ Instead: Matchings to relevant sources

Connecting GESIS research data and publication information systems

21/28
Matching to da|ra records
→ Precise matching to DOI not always possible!
→ Instead: Matchings to relevant sources
→ Definition of relevance depends on application

Connecting GESIS research data and publication information systems

21/28
Matching to da|ra records

ALLBUScompact

...
...

ALLBUScompact - Cumulation 1980-2010

ALLBUScompact 2000

ALLBUScompact 2000
CAPI/PAPI

...

...

ALLBUS

...

...

...

ALLBUS - Cumulation 1980-2006

...

ALLBUScompact 2000
CAPI

...

...

...

ALLBUS 2000

...
...

ALLBUS 1998

...

ALLBUS 2000
CAPI/PAPI

...

...

ALLBUS - Cumulation 1980-2008

...

...

...
...

...

ALLBUS 1996

...

...
...

...

...

...

→ semantic web technologies
Connecting GESIS research data and publication information systems

22/28
Links

Connecting GESIS research data and publication information systems

23/28
Connecting information
systems

Service I: InFoLiS
Services II, III & Architecture:
Dennis Wegener,
Daniel Hienert,
Dimitar Dimitrov
(SOWIPORT, da|ra)

Connecting GESIS research data and publication information systems

24/28
Connecting information
systems

Demo: da|ra test system with SOWIPORT links

Connecting GESIS research data and publication information systems

25/28
Connecting information
systems

Demo: da|ra test system with SOWIPORT links (offline version)

Connecting GESIS research data and publication information systems

26/28
Conclusion: our aim
interlink our own repositories and information systems
provide services for reference extraction and matching to all
interested institutions (free access to webservices)
- domain- and language-independent
link to publications and data stored in external repositories

Connecting GESIS research data and publication information systems

27/28
Thank you for your
attention!

katarina.boland@gesis.org

Connecting GESIS research data and publication information systems

28/28

Contenu connexe

Tendances

Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharingJisc RDM
 
The value of data curation as part of the publishing process
The value of data curation as part of the publishing processThe value of data curation as part of the publishing process
The value of data curation as part of the publishing processVarsha Khodiyar
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries? Robin Rice
 
Manage your online profile: Maximize the visibility of your work and make an ...
Manage your online profile: Maximize the visibility of your work and make an ...Manage your online profile: Maximize the visibility of your work and make an ...
Manage your online profile: Maximize the visibility of your work and make an ...Julia Gelfand
 
HESA data, describing research activity and #REF2021
HESA data, describing research activity and #REF2021HESA data, describing research activity and #REF2021
HESA data, describing research activity and #REF2021Jisc RDM
 
OpenAIRE workshop @ OR2016 - From Repositories, for repositories
OpenAIRE workshop @ OR2016 - From Repositories, for repositoriesOpenAIRE workshop @ OR2016 - From Repositories, for repositories
OpenAIRE workshop @ OR2016 - From Repositories, for repositoriesOpenAIRE
 
20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraphOpenAIRE
 
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...PaNOSC
 
Webinar: Data management and the Open Research Data Pilot in Horizon 2020
Webinar: Data management and the Open Research Data Pilot in Horizon 2020Webinar: Data management and the Open Research Data Pilot in Horizon 2020
Webinar: Data management and the Open Research Data Pilot in Horizon 2020OpenAccessBelgium
 
Why does research data matter to libraries
Why does research data matter to librariesWhy does research data matter to libraries
Why does research data matter to librariesJisc RDM
 
Research Data Management Services at UWA
Research Data Management Services at UWAResearch Data Management Services at UWA
Research Data Management Services at UWAKatina Toufexis
 
Grant Funding Programme
Grant Funding ProgrammeGrant Funding Programme
Grant Funding ProgrammeJisc RDM
 
Publishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecyclePublishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecycleAnita de Waard
 
Collaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and softwareCollaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and softwareAnita de Waard
 
New approaches to data management: supporting FAIR data sharing at Springer N...
New approaches to data management: supporting FAIR data sharing at Springer N...New approaches to data management: supporting FAIR data sharing at Springer N...
New approaches to data management: supporting FAIR data sharing at Springer N...Varsha Khodiyar
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...AKSHAY BHAGAT
 
Research Data Spring - Spring Update 2016
Research Data Spring - Spring Update 2016Research Data Spring - Spring Update 2016
Research Data Spring - Spring Update 2016Jisc RDM
 
20190527_Paolo Manghi_ OpenAIRE monitoring
20190527_Paolo Manghi_ OpenAIRE monitoring20190527_Paolo Manghi_ OpenAIRE monitoring
20190527_Paolo Manghi_ OpenAIRE monitoringOpenAIRE
 

Tendances (20)

Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharing
 
The value of data curation as part of the publishing process
The value of data curation as part of the publishing processThe value of data curation as part of the publishing process
The value of data curation as part of the publishing process
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?
 
Manage your online profile: Maximize the visibility of your work and make an ...
Manage your online profile: Maximize the visibility of your work and make an ...Manage your online profile: Maximize the visibility of your work and make an ...
Manage your online profile: Maximize the visibility of your work and make an ...
 
HESA data, describing research activity and #REF2021
HESA data, describing research activity and #REF2021HESA data, describing research activity and #REF2021
HESA data, describing research activity and #REF2021
 
OpenAIRE workshop @ OR2016 - From Repositories, for repositories
OpenAIRE workshop @ OR2016 - From Repositories, for repositoriesOpenAIRE workshop @ OR2016 - From Repositories, for repositories
OpenAIRE workshop @ OR2016 - From Repositories, for repositories
 
20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph20200130_Mannocci_OpenAIRE_ResearchGraph
20200130_Mannocci_OpenAIRE_ResearchGraph
 
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...
PaNOSC and Research Data Management / Battery2030+ Initiative Workshop / 12 M...
 
Webinar: Data management and the Open Research Data Pilot in Horizon 2020
Webinar: Data management and the Open Research Data Pilot in Horizon 2020Webinar: Data management and the Open Research Data Pilot in Horizon 2020
Webinar: Data management and the Open Research Data Pilot in Horizon 2020
 
Johnston - How to Curate Research Data
Johnston - How to Curate Research DataJohnston - How to Curate Research Data
Johnston - How to Curate Research Data
 
Why does research data matter to libraries
Why does research data matter to librariesWhy does research data matter to libraries
Why does research data matter to libraries
 
Research Data Management Services at UWA
Research Data Management Services at UWAResearch Data Management Services at UWA
Research Data Management Services at UWA
 
Grant Funding Programme
Grant Funding ProgrammeGrant Funding Programme
Grant Funding Programme
 
Publishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecyclePublishing the Full Research Data Lifecycle
Publishing the Full Research Data Lifecycle
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
Collaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and softwareCollaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and software
 
New approaches to data management: supporting FAIR data sharing at Springer N...
New approaches to data management: supporting FAIR data sharing at Springer N...New approaches to data management: supporting FAIR data sharing at Springer N...
New approaches to data management: supporting FAIR data sharing at Springer N...
 
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...Big Data (SOCIOMETRIC METHODS FOR  RELEVANCY ANALYSIS OF LONG TAIL  SCIENCE D...
Big Data (SOCIOMETRIC METHODS FOR RELEVANCY ANALYSIS OF LONG TAIL SCIENCE D...
 
Research Data Spring - Spring Update 2016
Research Data Spring - Spring Update 2016Research Data Spring - Spring Update 2016
Research Data Spring - Spring Update 2016
 
20190527_Paolo Manghi_ OpenAIRE monitoring
20190527_Paolo Manghi_ OpenAIRE monitoring20190527_Paolo Manghi_ OpenAIRE monitoring
20190527_Paolo Manghi_ OpenAIRE monitoring
 

Similaire à Connecting GESIS research data and publication information systems – Katarina Boland

Integration of research literature and data (InFoLiS)
Integration of research literature and data (InFoLiS)Integration of research literature and data (InFoLiS)
Integration of research literature and data (InFoLiS)Philipp Zumstein
 
Thinking About the Making of Data
Thinking About the Making of DataThinking About the Making of Data
Thinking About the Making of DataPaul Groth
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reusevoginip
 
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” ResearchDecomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” ResearchHan Woo PARK
 
Open Science: Research Data Management
Open Science: Research Data ManagementOpen Science: Research Data Management
Open Science: Research Data ManagementLibrary_Connect
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesStefan Dietze
 
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GrahamSmith646206
 
WWW2013 Tutorial: Linked Data & Education
WWW2013 Tutorial: Linked Data & EducationWWW2013 Tutorial: Linked Data & Education
WWW2013 Tutorial: Linked Data & EducationStefan Dietze
 
ischools future of data managemente dec2017
ischools future of data managemente dec2017ischools future of data managemente dec2017
ischools future of data managemente dec2017ARDC
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersIncisive_Events
 
Tools für das Management von Forschungsdaten
Tools für das Management von ForschungsdatenTools für das Management von Forschungsdaten
Tools für das Management von ForschungsdatenHeinz Pampel
 
OpenAIREplus Data Survey WriteUp2
OpenAIREplus Data Survey WriteUp2OpenAIREplus Data Survey WriteUp2
OpenAIREplus Data Survey WriteUp2OpenAIRE
 
A metadata scheme of the software-data relationship: A proposal
A metadata scheme of the software-data relationship: A proposalA metadata scheme of the software-data relationship: A proposal
A metadata scheme of the software-data relationship: A proposalKai Li
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...African Open Science Platform
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotMartin Donnelly
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarMartin Donnelly
 

Similaire à Connecting GESIS research data and publication information systems – Katarina Boland (20)

Integration of research literature and data (InFoLiS)
Integration of research literature and data (InFoLiS)Integration of research literature and data (InFoLiS)
Integration of research literature and data (InFoLiS)
 
Thinking About the Making of Data
Thinking About the Making of DataThinking About the Making of Data
Thinking About the Making of Data
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reuse
 
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” ResearchDecomposing Social and Semantic Networks in Emerging “Big Data” Research
Decomposing Social and Semantic Networks in Emerging “Big Data” Research
 
Open Science: Research Data Management
Open Science: Research Data ManagementOpen Science: Research Data Management
Open Science: Research Data Management
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital Libraries
 
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
 
WWW2013 Tutorial: Linked Data & Education
WWW2013 Tutorial: Linked Data & EducationWWW2013 Tutorial: Linked Data & Education
WWW2013 Tutorial: Linked Data & Education
 
ischools future of data managemente dec2017
ischools future of data managemente dec2017ischools future of data managemente dec2017
ischools future of data managemente dec2017
 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
 
OPEN DATA. The researcher perspective
OPEN DATA.  The researcher perspectiveOPEN DATA.  The researcher perspective
OPEN DATA. The researcher perspective
 
20080719 Esof Open Data Voegler
20080719 Esof Open Data Voegler20080719 Esof Open Data Voegler
20080719 Esof Open Data Voegler
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producers
 
Tools für das Management von Forschungsdaten
Tools für das Management von ForschungsdatenTools für das Management von Forschungsdaten
Tools für das Management von Forschungsdaten
 
OpenAIREplus Data Survey WriteUp2
OpenAIREplus Data Survey WriteUp2OpenAIREplus Data Survey WriteUp2
OpenAIREplus Data Survey WriteUp2
 
A metadata scheme of the software-data relationship: A proposal
A metadata scheme of the software-data relationship: A proposalA metadata scheme of the software-data relationship: A proposal
A metadata scheme of the software-data relationship: A proposal
 
20070919 Bkt Padua Esf Dfg Workshop Intro
20070919 Bkt Padua Esf Dfg Workshop Intro20070919 Bkt Padua Esf Dfg Workshop Intro
20070919 Bkt Padua Esf Dfg Workshop Intro
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data Pilot
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
 

Plus de OpenAIRE

10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community CallOpenAIRE
 
9th Content Providers Community Call\
9th Content Providers Community Call\9th Content Providers Community Call\
9th Content Providers Community Call\OpenAIRE
 
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE
 
8th Content Providers Community Call
8th Content Providers Community Call8th Content Providers Community Call
8th Content Providers Community CallOpenAIRE
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community CallOpenAIRE
 
OpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE
 
What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)OpenAIRE
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community CallOpenAIRE
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing DataOpenAIRE
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?OpenAIRE
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open ScienceOpenAIRE
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)OpenAIRE
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open ScienceOpenAIRE
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing DataOpenAIRE
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in GreeceOpenAIRE
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community CallOpenAIRE
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community CallOpenAIRE
 

Plus de OpenAIRE (20)

10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call
 
9th Content Providers Community Call\
9th Content Providers Community Call\9th Content Providers Community Call\
9th Content Providers Community Call\
 
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)
 
8th Content Providers Community Call
8th Content Providers Community Call8th Content Providers Community Call
8th Content Providers Community Call
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community Call
 
OpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managers
 
What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community Call
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in Greece
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community Call
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community Call
 

Dernier

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 

Dernier (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Connecting GESIS research data and publication information systems – Katarina Boland

  • 1. Connecting GESIS research data and publication information systems Katarina Boland Department Knowledge Technologies for the Social Sciences GESIS - Leibniz Institute for the Social Sciences, Cologne, Germany OpenAIRE Interoperability Workshop, Braga, Portugal 08.02.2013
  • 2. Outline 1 Introduction to GESIS 2 GESIS information systems 3 Linking publications to datasets 4 Connecting research data and publication information systems Connecting GESIS research data and publication information systems 2/28
  • 3. GESIS largest infrastructure institution for the Social Sciences in Germany five scientific departments (Mannheim, Cologne, Berlin) Connecting GESIS research data and publication information systems 2/28
  • 4. GESIS Connecting GESIS research data and publication information systems 3/28
  • 5. GESIS Connecting GESIS research data and publication information systems 3/28
  • 6. GESIS Connecting GESIS research data and publication information systems 3/28
  • 7. Publications SSOAR - Social Science Open Access Repository SOWIPORT - the social science portal Connecting GESIS research data and publication information systems 4/28
  • 8. Publications SSOAR - Social Science Open Access Repository electronic full texts (Social Sciences) for free access mainly pursues the Green Way of Open Access http://www.ssoar.info/en.html SOWIPORT - the social science portal Connecting GESIS research data and publication information systems 4/28
  • 9. Publications SSOAR - Social Science Open Access Repository SOWIPORT - the social science portal approximately 7 million references on publications and research projects from 18 databases additional information on institutions and events http://www.gesis.org/sowiport/en/ Connecting GESIS research data and publication information systems 4/28
  • 10. Research Data detailed information, documentation on variable-level and beyond ZACAT - Online Study Catalogue MISSY - Microdata Information System documentation on study-level da|ra - Registration agency for social and economic data DBK - Data Catalogue Connecting GESIS research data and publication information systems 5/28
  • 11. Research Data da|ra - Registration agency for social and economic data DBK - Data Catalogue Connecting GESIS research data and publication information systems 6/28
  • 12. Research Data da|ra - Registration agency for social and economic data German DOI registration service for social science and economic data by GESIS and ZBW - German National Library of Economics, in cooperation with DataCite http://www.da-ra.de/en/home/ DBK - Data Catalogue Connecting GESIS research data and publication information systems 6/28
  • 13. Research Data da|ra - Registration agency for social and economic data DBK - Data Catalogue study descriptions from survey research, historical social research and texts for content analyses documentation of data from official statistics will be added successively http://www.gesis.org/en/services/research/ data-catalogue/ Connecting GESIS research data and publication information systems 6/28
  • 14. Outline 1 Introduction to GESIS 2 GESIS information systems 3 Linking publications to datasets 4 Connecting research data and publication information systems Connecting GESIS research data and publication information systems 7/28
  • 15. Linking publications to datasets the InFoLiS project: Integration of research data and publiations for the Social Sciences InFoLiS is funded by the DFG (SU 647/2-1) Connecting GESIS research data and publication information systems 8/28
  • 16. InFoLiS project goals Response Re er y Qu Catalogue: Publications SSOAR (GESIS), Primo (UB MA), ... po ns e se on sp Links Qu Re s ery Response Catalogue: Research Data da|ra (GESIS), ... Connecting GESIS research data and publication information systems Data 9/28
  • 17. References to datasets erfolgt die Darstellung und Diskussion der empirischen Ergebnisse. Hierfür werden die Daten des Sozio-oekonomischen Panels (SOEP) aus den Jahren 1990 und 2003 verwendet und für beide Zeitpunkte werden die Einflussfaktoren mittels linearer Regressionsmodelle geschätzt. presentation and discussion of the empirical findings. For this purpose, data from the Socio-Economic Panel (SOEP) of the years 1990 and 2003 are used and for both periods, the impact factors are estimated using linear regression models. Connecting GESIS research data and publication information systems 10/28
  • 18. References to datasets erfolgt die Darstellung und Diskussion der empirischen Ergebnisse. Hierfür werden die Daten des Sozio-oekonomischen Panels (SOEP) aus den Jahren 1990 und 2003 verwendet und für beide Zeitpunkte werden die Einflussfaktoren mittels linearer Regressionsmodelle geschätzt. data from the <title> of the years <year> are used Connecting GESIS research data and publication information systems 10/28
  • 19. References to datasets Tabelle 1: Bevölkerungsvorausberechnung für Deutschland nach Altersgruppen - Anteile in Prozent (Datenbasis: 10. Bevölkerungsvorausberechnung des Statistischen Bundesamtes, Variante 5) Table 1: Population forecast for Germany depending on age cohorts - proportion in percent. Data base: 10th Population Forecast of the Federal Statistical Office , version 5. Connecting GESIS research data and publication information systems 11/28
  • 20. References to datasets Tabelle 1: Bevölkerungsvorausberechnung für Deutschland nach Altersgruppen - Anteile in Prozent (Datenbasis: 10. Bevölkerungsvorausberechnung des Statistischen Bundesamtes, Variante 5) (Data base: <number>. <title> of the <data collector>, version <version>) Connecting GESIS research data and publication information systems 11/28
  • 21. References to datasets 1 Herangezogen wurden außerdem Allbus, Allensbacher Erhebungen, Eurobarometer, International Social Survey Program, International Social Justice Project, Sozio-ökonomisches Panel, World Values Survey. Consulted were furthermore ... Connecting GESIS research data and publication information systems 12/28
  • 22. References to datasets 1 Herangezogen wurden außerdem Allbus, Allensbacher Erhebungen, Eurobarometer, International Social Survey Program, International Social Justice Project, Sozio-ökonomisches Panel, World Values Survey. Consulted were furthermore <title1>, <title2>, <title3>, ..., <titleN>. Connecting GESIS research data and publication information systems 12/28
  • 23. References to datasets Tabelle 3: Stichprobe der Untersuchung in den Jahren 2003 und 2004 sowie Größe der Stichprobe, mit gültigen Daten aus beiden Erhebungen (Quelle: Ditton u.a. 2005a) Table 3: Sample of the surveys conducted in the years 2003 and 2004 as well as size of the sample, with valid data from both surveys (Source: Ditton et al. 2005a) Connecting GESIS research data and publication information systems 13/28
  • 24. References to datasets Tabelle 3: Stichprobe der Untersuchung in den Jahren 2003 und 2004 sowie Größe der Stichprobe, mit gültigen Daten aus beiden Erhebungen (Quelle: Ditton u.a. 2005a) (Source: <citation of descriptive publication>) Connecting GESIS research data and publication information systems 13/28
  • 25. References to datasets Grafik 7: Einschätzung der wirtschaftlichen Lage: Einschätzung der eigenen wirtschaftlichen Lage (in Prozent) (Quellen: Allbus/Sozialstaatssurvey) (Sources: Allbus/Sozialstaatssurvey ) Connecting GESIS research data and publication information systems 14/28
  • 26. References to datasets Grafik 7: Einschätzung der wirtschaftlichen Lage: Einschätzung der eigenen wirtschaftlichen Lage (in Prozent) (Quellen: Allbus/Sozialstaatssurvey) (Sources: <title1>/<title2>) Connecting GESIS research data and publication information systems 14/28
  • 27. Linking publications to datasets References to datasets are not standardized! see also... Green, Toby (2009). We Need Publishing Standards for Datasets and Data Tables. OECD Publishing White Paper. doi: 10.1787/603233448430 Altman, Micah and Gary King (2007). A Proposed Standard for the Scholarly Citation of Quantitative Data. In: D-Lib Magazine 13.3. url: http://www.dlib.org/dlib/march07/altman/03altman.html Connecting GESIS research data and publication information systems 15/28
  • 28. Automatic identification of references Why not simply search for study titles in publications? Studies are referenced using abbreviations, alternative names or literature Study titles may be common nouns - ambiguous! there is no complete list of all conducted studies Connecting GESIS research data and publication information systems 16/28
  • 29. General idea How do humans recognize study references? Source: Estimations based on SOEP, wave 2002. Connecting GESIS research data and publication information systems 17/28
  • 30. General idea How do humans recognize study references? Source: Estimations based on xyz, wave 2002. Connecting GESIS research data and publication information systems 17/28
  • 31. General idea How do humans recognize study references? Source: Estimations based on xyz, wave 2002. → Learn patterns: typical contexts for study references Connecting GESIS research data and publication information systems 17/28
  • 32. General idea How do humans recognize study references? Source: Estimations based on xyz, wave 2002. → Learn patterns: typical contexts for study references → Sparse Data Problem: use iterative bootstrapping approach Connecting GESIS research data and publication information systems 17/28
  • 33. Algorithm Connecting GESIS research data and publication information systems 18/28
  • 34. Evaluation: Precision & Estimate of Recall about 14% of the found references are not study names, but citations of publications → not counted as incorrect here subset of SSOAR with keyword “empirisch-quantitativ” (empirical quantitative) German, n = 259 conversion pdf → txt with automatic correction Connecting GESIS research data and publication information systems 19/28
  • 35. Reference extraction for details see... Boland, Katarina, Ritze, Dominique, Eckert, Kai, & Mathiak, Brigitte (2012). Identifying References to Datasets in Publications. International Conference on Theory and Practice of Digital Libraries (TPDL) (pp. 150-161). Paphos, Cyprus: Springer Berlin Heidelberg. doi:10.1007/978-3-642-33290-6 17 Connecting GESIS research data and publication information systems 20/28
  • 36. Matching to da|ra records Connecting GESIS research data and publication information systems 21/28
  • 37. Matching to da|ra records Connecting GESIS research data and publication information systems 21/28
  • 38. Matching to da|ra records → Precise matching to DOI not always possible! → Instead: Matchings to relevant sources Connecting GESIS research data and publication information systems 21/28
  • 39. Matching to da|ra records → Precise matching to DOI not always possible! → Instead: Matchings to relevant sources → Definition of relevance depends on application Connecting GESIS research data and publication information systems 21/28
  • 40. Matching to da|ra records ALLBUScompact ... ... ALLBUScompact - Cumulation 1980-2010 ALLBUScompact 2000 ALLBUScompact 2000 CAPI/PAPI ... ... ALLBUS ... ... ... ALLBUS - Cumulation 1980-2006 ... ALLBUScompact 2000 CAPI ... ... ... ALLBUS 2000 ... ... ALLBUS 1998 ... ALLBUS 2000 CAPI/PAPI ... ... ALLBUS - Cumulation 1980-2008 ... ... ... ... ... ALLBUS 1996 ... ... ... ... ... ... → semantic web technologies Connecting GESIS research data and publication information systems 22/28
  • 41. Links Connecting GESIS research data and publication information systems 23/28
  • 42. Connecting information systems Service I: InFoLiS Services II, III & Architecture: Dennis Wegener, Daniel Hienert, Dimitar Dimitrov (SOWIPORT, da|ra) Connecting GESIS research data and publication information systems 24/28
  • 43. Connecting information systems Demo: da|ra test system with SOWIPORT links Connecting GESIS research data and publication information systems 25/28
  • 44. Connecting information systems Demo: da|ra test system with SOWIPORT links (offline version) Connecting GESIS research data and publication information systems 26/28
  • 45. Conclusion: our aim interlink our own repositories and information systems provide services for reference extraction and matching to all interested institutions (free access to webservices) - domain- and language-independent link to publications and data stored in external repositories Connecting GESIS research data and publication information systems 27/28
  • 46. Thank you for your attention! katarina.boland@gesis.org Connecting GESIS research data and publication information systems 28/28