Between information retrieval services and
bibliometrics research
–
new ways of semantic browsing and visual analytics
Rob Koopman, Shenghui Wang
OCLC Research
Andrea Scharnhorst
DANS-KNAW
November 7, 2015
ASIST, sigmetrics workshop
Content
- New approach to find structure in bibliographic information – ARIADNE (2 Method)
- Applications:
- Data curation – author disambiguation (1 Motivation)
- Illustration of topics – the case of digital humanities
Topical browsing – DEMO (3)
- Excursion into bibliometrics – the Berlin group challenge (4)
- Wrapping up (5)
Data curation – author disambiguation
Mapping topics, communities, research fronts, …
Bibliometrics
Documents are similar because they:
- Cite each other
- Are cited together
- Use the same references
- Use the same vocabulary
- Have the same authors
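The citation-based criteria in this list can be written as matrix products. The following is a minimal sketch on a hypothetical four-document citation matrix (the data and variable names are illustrative and not from the slides): "use the same references" corresponds to bibliographic coupling, and "are cited together" to co-citation.

```python
import numpy as np

# Hypothetical toy citation matrix: A[i, j] = 1 if document i cites document j.
A = np.array([
    [0, 0, 1, 1],  # doc 0 cites docs 2 and 3
    [0, 0, 1, 1],  # doc 1 cites docs 2 and 3
    [0, 0, 0, 1],  # doc 2 cites doc 3
    [0, 0, 0, 0],  # doc 3 cites nothing
])

# "Use the same references": bibliographic coupling counts shared references.
coupling = A @ A.T

# "Are cited together": co-citation counts documents that cite both.
cocitation = A.T @ A

print("references shared by doc 0 and doc 1:", coupling[0, 1])
print("documents citing both doc 2 and doc 3:", cocitation[2, 3])
```

Vocabulary or author overlap can be computed the same way by replacing A with a document-by-term or document-by-author matrix.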
Information retrieval
Documents are similar because they:
- Use the same vocabulary
- …
ARIADNE is about similarity of entities!
Document/work, Record and Entity
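[Figure: a document/work is described by a record with the fields Authors, Title, Journal, …, Reference, Subject. Entities are derived from these fields: author names, topical terms, references, journals. Example entities shown: Glänzel, W.; Glanzel, W.; bibliometrics; citations; Casimir effect. N = SUM(doc)]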
A MARC record
title
authors
issn
dewey
publisher
Demo examples
• http://thoth.pica.nl/demo/relate WorldCat
• http://thoth.pica.nl/relate ArticleFirst
• http://thoth.pica.nl/astro/relate Astrophysics data Berlin group
Dataset
● WorldCat, 300+ million records
● Selected 13 million items (topical terms, authors, ISSNs, Dewey decimal codes, publishers, subject headings)
● Represented by 6 million topical terms
But a matrix of 13M x 6M is too big to process
C: a co-occurrence matrix
R: a random matrix of +/-1
C’: approximation of C after random projection (the semantic matrix)
Koopman, R., Wang, S., Scharnhorst, A., Englebienne, G.: Ariadne’s thread: Interactive navigation in a world of networked information. In: CHI’15 Extended Abstracts. (2015)
Step 1: Building the semantic matrix, with dimension reduction based on Random Projection
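The random projection named in Step 1 can be illustrated on toy data. This is a sketch under stated assumptions, not the ARIADNE implementation: the matrix sizes, the target dimension k, the 1/sqrt(k) scaling and the synthetic Poisson counts are all hypothetical, and a real run on 13M x 6M would need sparse storage. Only the roles of C, R and C' come from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sizes (hypothetical); the slides mention 13M items x 6M topical terms.
n_items, n_terms, k = 200, 5000, 256

# C: co-occurrence counts of items with topical terms (synthetic here).
C = rng.poisson(0.05, size=(n_items, n_terms)).astype(float)
# Make item 1 a noisy copy of item 0 so that the two are clearly related.
C[1] = C[0] + rng.poisson(0.005, size=n_terms)

# R: random matrix with entries +1 or -1.
R = rng.choice([-1.0, 1.0], size=(n_terms, k))

# C': approximation of C after random projection (assumed 1/sqrt(k) scaling,
# which keeps vector lengths unchanged in expectation).
C_prime = C @ R / np.sqrt(k)

def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

cos_before = cosine(C[0], C[1])
cos_after = cosine(C_prime[0], C_prime[1])
print(f"cosine in full space:      {cos_before:.3f}")
print(f"cosine in projected space: {cos_after:.3f}")
```

The two printed values should be close, which is the property that makes the reduced matrix usable as a stand-in for C when comparing entities.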
Step 2: Interactive exploration
- Provide a simple search/text box
- Calculate the top 500 most related candidates
- Find mutually related items
- Convert distances to probabilities
- Project to 2D
- Enhance interface with links to other spaces
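Some of the exploration steps listed above can be sketched as follows. The slides do not say which similarity measure, probability conversion or 2D projection is used, so this sketch substitutes common choices as assumptions: cosine similarity for ranking, a softmax over negative distances for the probabilities, and PCA (via SVD) for the 2D layout. The function name `explore` and the random input vectors are hypothetical, and the "mutually related items" step is not covered.

```python
import numpy as np

def explore(vectors, query_idx, top_n=500):
    """Rank candidates for one query entity and lay them out in 2D."""
    # Normalise rows so that dot products are cosine similarities.
    unit = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)

    # Top-n most related candidates (the query itself ranks first).
    sims = unit @ unit[query_idx]
    order = np.argsort(-sims)[:top_n]
    cand = unit[order]

    # Pairwise cosine distances among the candidates.
    dist = 1.0 - cand @ cand.T

    # Assumed conversion: row-wise softmax over negative distances.
    weights = np.exp(-dist)
    np.fill_diagonal(weights, 0.0)
    probs = weights / weights.sum(axis=1, keepdims=True)

    # Assumed projection: first two principal components.
    centred = cand - cand.mean(axis=0)
    _, _, vt = np.linalg.svd(centred, full_matrices=False)
    coords = centred @ vt[:2].T
    return order, probs, coords

# Usage on random stand-in vectors (hypothetical data).
rng = np.random.default_rng(1)
vectors = rng.normal(size=(50, 16))
order, probs, coords = explore(vectors, query_idx=3, top_n=10)
print(order)
print(coords.shape)
```

In an interactive tool the 2D coordinates would feed the visual display, with each point linking back to the entity it represents.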
Exploration of a topic
http://thoth.pica.nl/relate?input=hirsch%20index&fsize=100&ncluster=
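A query URL of this shape can be assembled with the Python standard library. This is a sketch only: the parameter names `input`, `fsize` and `ncluster` are copied from the URL above, their meaning is not documented in the slides, and the value of `ncluster` is elided in the source, so it is left empty here. The helper name `relate_url` is hypothetical.

```python
from urllib.parse import quote, urlencode

BASE = "http://thoth.pica.nl/relate"

def relate_url(query, fsize=100, ncluster=""):
    """Build a demo query URL; ncluster stays empty because the source elides it."""
    params = {"input": query, "fsize": fsize, "ncluster": ncluster}
    # quote_via=quote encodes spaces as %20, matching the URL on the slide.
    return BASE + "?" + urlencode(params, quote_via=quote)

url = relate_url("hirsch index")
print(url)
```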
Digital libraries
Science, Computer Science, ontologies
Many different humanities fields
Prominently language & literary studies
Illustration of context around a topic/field – journal view
As visual exploration of any dataset – astrophysics case
Wrapping up – future work
● Compare the algorithm to other existing algorithms – benchmarking
● More metadata fields (publisher, subject, identifiers) – ongoing
● Identify further problems to which Ariadne can be applied:
  - Curation (e.g. author name disambiguation)
  - Knowledge discovery (e.g. matching chemical molecules)
  - Information science – population of libraries, subject areas, …
● Feedback from users – prepare user scenarios for usability testing and set up an evaluation project – tbd
● Improve visualisation
● More functionality (timeline, history)
● Extend the implementation to other databases
Thank you
rob.koopman@oclc.org
shenghui.wang@oclc.org
Andrea.scharnhorst@dans.knaw.nl
http://thoth.pica.nl/relate (ArticleFirst)
http://thoth.pica.nl/astro/relate (Astrophysics articles)
http://thoth.pica.nl/demo/relate (WorldCat)
References
Koopman, R., Wang, S., Scharnhorst, A., Englebienne, G.: Ariadne's thread: Interactive
navigation in a world of networked information. In: B. Begole, J. Kim, K. Inkpen, W. Woo
(eds.) Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human
Factors in Computing Systems, CHI 2015 Extended Abstracts, Seoul, Republic of Korea,
April 18-23, 2015, pp. 1833-1838. ACM (2015). DOI 10.1145/2702613.2732781. URL
http://doi.acm.org/10.1145/2702613.2732781 (Preprint Arxiv.org)
Koopman, R., Wang, S., Scharnhorst, A.: Contextualization of Topics - Browsing through
Terms, Authors, Journals and Cluster Allocations. In: A.A. Salah, Y. Tonta, A.A.A.
Salah, C. Sugimoto, U. Al (eds.) Proceedings of ISSI 2015 Istanbul. 15th International
Society of Scientometrics and Informetrics Conference, Istanbul, Turkey, 29th June to 4th
July 2015, pp. 1042-1053. Bogazici University Printhouse, Istanbul (2015). URL
http://www.issi2015.org/en/Proceedings-of-ISSI-2015.html

DARIAH Contributions 2019
 
Data curation and data archiving at different stages of the research process
Data curation and data archiving at different stages of the research processData curation and data archiving at different stages of the research process
Data curation and data archiving at different stages of the research process
 
SUSTAINABILITY BEYOND GUIDELINES
SUSTAINABILITY BEYOND GUIDELINESSUSTAINABILITY BEYOND GUIDELINES
SUSTAINABILITY BEYOND GUIDELINES
 
Information science in practice - research at a Trusted Digital Archive
Information science in practice - research at a Trusted Digital ArchiveInformation science in practice - research at a Trusted Digital Archive
Information science in practice - research at a Trusted Digital Archive
 
How to use science maps to navigate large information spaces? What is the lin...
How to use science maps to navigate large information spaces? What is the lin...How to use science maps to navigate large information spaces? What is the lin...
How to use science maps to navigate large information spaces? What is the lin...
 
Bibliometrics, Webometrics, Altmetrics, Alternative metrics.
Bibliometrics, Webometrics, Altmetrics, Alternative metrics.Bibliometrics, Webometrics, Altmetrics, Alternative metrics.
Bibliometrics, Webometrics, Altmetrics, Alternative metrics.
 
Humanities and ICT
Humanities and ICTHumanities and ICT
Humanities and ICT
 
Digital Humanities in The Netherlands DARIAH, CLARIN, CLARIAH, … DHx.0 A pers...
Digital Humanities in The Netherlands DARIAH, CLARIN, CLARIAH, … DHx.0 A pers...Digital Humanities in The Netherlands DARIAH, CLARIN, CLARIAH, … DHx.0 A pers...
Digital Humanities in The Netherlands DARIAH, CLARIN, CLARIAH, … DHx.0 A pers...
 
Rare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studiesRare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studies
 
Digital Humanities as Innovation: ‘constant revolution’ or ‘moving to the su...
Digital Humanities as Innovation:  ‘constant revolution’ or ‘moving to the su...Digital Humanities as Innovation:  ‘constant revolution’ or ‘moving to the su...
Digital Humanities as Innovation: ‘constant revolution’ or ‘moving to the su...
 
KnoweScape - means and meaning of knowledge maps
KnoweScape - means and meaning of knowledge maps KnoweScape - means and meaning of knowledge maps
KnoweScape - means and meaning of knowledge maps
 
Models and Maps of Science
Models and Maps of ScienceModels and Maps of Science
Models and Maps of Science
 
Cross domain knowledge discovery, complex system theory and semantic web
Cross domain knowledge discovery, complex system theory and semantic webCross domain knowledge discovery, complex system theory and semantic web
Cross domain knowledge discovery, complex system theory and semantic web
 
UDC_in_Action
UDC_in_ActionUDC_in_Action
UDC_in_Action
 

Dernier

What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Food processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsFood processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsManeerUddin
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 

Dernier (20)

What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Food processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsFood processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture hons
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 

Between information retrieval services and bibliometrics research. New ways of semantic browsing and visual analytics

  • 1. Between information retrieval services and bibliometrics research: new ways of semantic browsing and visual analytics. Rob Koopman, Shenghui Wang (OCLC Research); Andrea Scharnhorst (DANS-KNAW). November 7, 2015, ASIST, sigmetrics workshop.
  • 2. Content: New approach to find structure in bibliographic information: ARIADNE (2, Method). Applications: data curation, author disambiguation (1, Motivation); illustration of topics, the case of digital humanities; topical browsing, demo (3). Excursion into bibliometrics: the Berlin group challenge (4). Wrapping up (5).
  • 3. Data curation – author disambiguation
  • 4. Mapping topics, communities, research fronts, … Bibliometrics: documents are similar because they cite each other, are cited together, use the same references, use the same vocabulary, or have the same authors. Information retrieval: documents are similar because they use the same vocabulary, … ARIADNE is about similarity of entities!
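  • 5. Document/work, Record and Entity. [Diagram: a bibliographic record (authors, title, journal, …, references, subject) is broken down into entities (author names such as "Glänzel, W." and "Glanzel, W.", journals, references), each counted against topical terms (bibliometrics, citations, Casimir effect, …), with N = SUM(doc).]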
  • 7. Demo examples: http://thoth.pica.nl/demo/relate (WorldCat); http://thoth.pica.nl/relate (ArticleFirst); http://thoth.pica.nl/astro/relate (astrophysics data, Berlin group).
  • 8. Dataset: WorldCat, 300+ million records. Selected 13 million items (topical terms, authors, ISSNs, Dewey decimal codes, publishers, subject headings), represented by 6 million topical terms. But a matrix of 13M x 6M is too big to process.
  • 9. Step 1: Building the semantic matrix, with dimension reduction based on random projection. C: a co-occurrence matrix; R: a random matrix of +/-1; C': approximation of C after random projection (the semantic matrix). Source: Koopman, R., Wang, S., Scharnhorst, A., Englebienne, G.: Ariadne's thread: Interactive navigation in a world of networked information. In: CHI'15 Extended Abstracts.
  • 10. Step 2: Interactive exploration: provide a simple search/text box; calculate the top 500 most related candidates; find mutually related items; convert distances to probabilities; project to 2D; enhance the interface with links to other spaces.
  • 11. Exploration of a topic http://thoth.pica.nl/relate?input=hirsch%20index&fsize=100&ncluster=
  • 14. Illustration of context around a topic/field, journal view. Labelled regions: digital libraries; science, computer science, ontologies; many different humanities fields, prominently language and literary studies. Source: Koopman, R., Wang, S., Scharnhorst, A., Englebienne, G.: Ariadne's thread: Interactive navigation in a world of networked information. In: CHI'15 Extended Abstracts. (2015)
  • 15. As visual exploration of any dataset – astrophysics case
  • 16. Wrapping up, future work: compare the algorithm to other existing algorithms (benchmarking); more metadata fields (publisher, subject, identifiers), ongoing; identify further problems to which Ariadne can be applied: curation (e.g. author name disambiguation), knowledge discovery (e.g. matching chemical molecules), information science (population of libraries, subject areas, …); feedback from users: prepare user scenarios for usability testing and set up an evaluation project (tbd); improve visualisation; more functionality (timeline, history); extend the implementation to other databases.
  • 18. References: Koopman, R., Wang, S., Scharnhorst, A., Englebienne, G.: Ariadne's thread: Interactive navigation in a world of networked information. In: B. Begole, J. Kim, K. Inkpen, W. Woo (eds.) Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, CHI 2015 Extended Abstracts, Seoul, Republic of Korea, April 18-23, 2015, pp. 1833-1838. ACM (2015). DOI 10.1145/2702613.2732781. URL http://doi.acm.org/10.1145/2702613.2732781 (preprint on arXiv.org). Koopman, R., Wang, S., Scharnhorst, A.: Contextualization of Topics - Browsing through Terms, Authors, Journals and Cluster Allocations. In: A.A. Salah, Y. Tonta, A.A.A. Salah, C. Sugimoto, U. Al (eds.) Proceedings of ISSI 2015 Istanbul. 15th International Society of Scientometrics and Informetrics Conference, Istanbul, Turkey, 29th June to 4th July 2015, pp. 1042-1053. Boğaziçi University Printhouse, Istanbul (2015). URL http://www.issi2015.org/en/Proceedings-of-ISSI-2015.html
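
The random projection on slide 9 can be illustrated with a minimal sketch. It assumes a tiny dense co-occurrence matrix and the common 1/sqrt(k) scaling; the function names, the toy counts, and the target dimension K are illustrative assumptions, not taken from the ARIADNE implementation.

```python
import math
import random

def random_sign_matrix(n_terms, k, seed=0):
    """R: an n_terms x k matrix with entries +1 or -1 (assumed equally likely)."""
    rng = random.Random(seed)
    return [[rng.choice((-1, 1)) for _ in range(k)] for _ in range(n_terms)]

def project(C, R):
    """C' = C R / sqrt(k): maps each row of C from n_terms to k dimensions."""
    k = len(R[0])
    scale = 1.0 / math.sqrt(k)
    return [
        [scale * sum(count * R[j][d] for j, count in enumerate(row) if count)
         for d in range(k)]
        for row in C
    ]

def cosine(u, v):
    """Cosine similarity of two dense vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

# Toy co-occurrence matrix C: 3 entities (rows) x 8 topical terms (columns).
# Rows 0 and 1 stand for two spellings of one author with identical counts.
C = [
    [2, 1, 0, 3, 0, 0, 1, 0],
    [2, 1, 0, 3, 0, 0, 1, 0],
    [0, 0, 4, 0, 1, 2, 0, 0],
]
K = 4  # hypothetical reduced dimension, far smaller than in a real setting
R = random_sign_matrix(n_terms=8, k=K)
C_reduced = project(C, R)

# Identical rows of C stay identical after projection, so their cosine is 1.
print(cosine(C_reduced[0], C_reduced[1]))
```

For rows that differ, the cosine in the reduced space only approximates the cosine in the original space, and the approximation improves as K grows.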

Editor's notes

  1. [Snapshot around an author such as Loet Leydesdorff or Wolfgang Glaenzel] The idea at the beginning was: would one and the same author not have a similar 'semantic' fingerprint in his or her scholarly communication if we look into an article database, or in book production if we look at WorldCat? The latter is of course a more complex problem, because the signal is weaker: authors usually publish fewer books than articles. This network shows nodes representing author names, other words, journals, …; the links between them represent some similarity in terms of their lexical profile. Let me explain this in more detail.
  2. In the end we get document-document matrices: symmetric matrices derived from asymmetric matrices such as documents-references, documents-authors, and documents-words. In all those cases the unit of analysis is the document as represented by the bibliographic record, and the counterpart is an element of this record or additional information in the document, such as the references.
  3. In different bibliographic systems we find descriptions of works (articles, journals, objects, …) in the form of a (classical) bibliographic record, often with additional information. In a first step we deconstruct the bibliographic record (plus any additional information) and extract categories of entities such as author names, journal names, subject headings, and Dewey and other classifications. In a second step we ask how often these entities appear with topical terms. In other words, we now construct a word space not for the documents, but for the entities extracted from the document record. Documents are still relevant: to calculate the co-occurrence of an entity and a topical term, we go through all documents and count how often a certain author name and a topical word appear together. We call the resulting vector the semantic representation of an entity. Returning to our motivation: if an author is the same but spelled differently, we would assume that, in a large corpus of documents, the semantic representations of the two spellings would be very similar. What we construct is a co-occurrence matrix between entities and topical terms. From this matrix we can derive a similarity matrix between entities, taking the cosine of the vectors as the measure. This similarity matrix can be visualized as a network in which entities are nodes, and in any visual representation of this similarity the two 'authors' would be near each other.
  4. We can of course do this kind of analysis only because we have standardized and digitized information. Ariadne has been developed around MARC records in different information services that OCLC provides: ArticleFirst, where the demonstrator runs now, and WorldCat, which we have explored. But in principle it can be applied to any database or dataset. You have seen an example in Theresa's presentation.
  5. If you have never heard of the Hirsch index: where does it belong? What other terms are around it? What are the different aspects of this topic? Are there related aspects missing in my search terms? Who are the most prominent authors on this topic? Which journals publish most about this topic? How have others, e.g. librarians, described and classified this topic?
  6. In the case of the h-index there is a Wikipedia entry which is much more detailed; still, ARIADNE gives you a first orientation.
  7. An Ariadne search in ArticleFirst, a database from OCLC, indicates which journals are involved and, based on this, which fields are involved. The result is not so surprising.
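
The entity-by-term counting and cosine comparison described in note 3 can be sketched as follows. This is a minimal illustration under stated assumptions: the three toy records, the author "Doe, J.", and the function names are hypothetical, and only author entities are handled, whereas the notes also mention journals, subject headings, and classifications.

```python
import math
from collections import Counter, defaultdict

# Hypothetical, heavily simplified bibliographic records.
records = [
    {"authors": ["Glänzel, W."], "terms": ["bibliometrics", "citations"]},
    {"authors": ["Glanzel, W."], "terms": ["bibliometrics", "citations", "h-index"]},
    {"authors": ["Doe, J."], "terms": ["casimir effect"]},
]

def build_semantic_vectors(records):
    """Count, over all records, how often each author co-occurs with each term."""
    vectors = defaultdict(Counter)
    for record in records:
        for author in record["authors"]:
            vectors[author].update(record["terms"])
    return vectors

def cosine(u, v):
    """Cosine similarity of two sparse count vectors stored as Counters."""
    dot = sum(count * v[term] for term, count in u.items())
    norm = (math.sqrt(sum(c * c for c in u.values()))
            * math.sqrt(sum(c * c for c in v.values())))
    return dot / norm if norm else 0.0

def most_related(entity, vectors, top_n=500):
    """Rank all other entities by cosine similarity to the given entity."""
    scores = [(other, cosine(vectors[entity], vectors[other]))
              for other in vectors if other != entity]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_n]

vectors = build_semantic_vectors(records)
ranking = most_related("Glänzel, W.", vectors)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In this toy data the two spellings of the same author share most of their topical terms and therefore rank closest to each other, which is the effect the disambiguation use case relies on. The default `top_n=500` mirrors the "top 500 most related candidates" on slide 10.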