SlideShare a Scribd company logo
1 of 33
Download to read offline
Large-scale analysis of bibliometric
data sources
Nees Jan van Eck
Centre for Science and Technology Studies (CWTS), Leiden University
8th LCDS Meeting: Statistics & Data Science
Leiden, November 13, 2015
About myself
• Master in computer science
• PhD thesis on bibliometric
mapping of science
• Researcher at CWTS since 2009
• Research focus on analysis and
visualization of bibliometric
networks
1
Centre for Science and Technology
Studies (CWTS)
• Research center at Leiden University
focusing on science and technology
studies
• About 30 staff members
• History of more than 25 years in
bibliometric and scientometric
research
• Contract research
• Full access to large bibliographic
database (Web of Science and
Scopus)
2
Bibliographic databases: ‘Big data’
3
Web of Science Scopus
Journals 12,000 20,000
Publications 45 million 35 million
Citations 1 billion 0.9 billion
Bibliometric networks
4
Web of
Science
Scopus
Citation network
of publications
Co-authorship network
of authors / organizations
Co-citation network
of pubs / authors / journals
Co-occurrence network
of terms
Bibliographic coupling network
of pubs / authors / journals
Bibliographic
database
Outline
• Software tools
• Network analysis techniques
• Analysis of data science
5
Software tools
6
Software tools
• VOSviewer (www.vosviewer.com)
– Tool for constructing and visualizing bibliometric networks
• CitNetExplorer (www.citnetexplorer.nl)
– Tool for visualizing and analyzing citation networks of
publications
• Both tools have been developed together
with my colleague Ludo Waltman 7
VOSviewer
8
Map of university co-authorship
network
9
Map of journal citation network
10
CitNetExplorer
11
Network
analysis
techniques
13
Network analysis techniques
14
Layout:
• Visualization of similarities
(VOS)
Community detection:
• Weighted modularity
• Smart local moving algorithm
Smart local moving algorithm
15
Q = 0.4198
Q = 0.3791
Reduced
network
Local moving
heuristic in
subnetworks
Local moving heuristic
Original
network
Algorithmically constructed
classification system of science
• 16.2 million publications from the period 2000–
2014 indexed in Web of Science
• 241.7 million citation relations
• Classification system of 3 hierarchical levels:
– 28 broad disciplines
– 813 fields
– 3,822 subfields
16
17
Breakdown of scientific literature into
813 fields
Social sciences
and humanities
Biomedical and
health sciences
Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
Publications in scientometrics
subfield
18
Time-line map of highly cited
scientometrics publications
19
Analysis of
data science
20
What is data science?
• Empirical operationalization of data science based
on publications with ‘data’ in title or abstract
21
Wikipedia: “Data Science is an interdisciplinary field
about processes and systems to extract knowledge
or insights from data … which is a continuation of
some of the data analysis fields such as statistics,
data mining, and predictive analytics”
LCDS: “Data Science … deals with finding, analyzing
and validating complex patterns in data. Data
Science methods are indispensable for maintaining a
competitive edge in all disciplines in science”
Growth of data-driven research
22
0%
2%
4%
6%
8%
10%
12%
14%
16%
18%
20%
1990 1995 2000 2005 2010 2015
Percentageofpublications
% 'data' publications % 'theory' publications
23
Breakdown of scientific literature into
813 fields
Social sciences
and humanities
Biomedical and
health sciences
Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
24
Data-driven nature of different
scientific fields
Social sciences
and humanities
Biomedical and
health sciences
Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
% pub. with ‘data’ in title or abstract
25
Data-driven nature of different
scientific fields
artificial
intelligence
statistics
bioinformatics
neuroimaging
pattern
recognition astronomy
earth
water
weather
climate
remote
sensing
nutrition
obesity
addiction
% pub. with ‘data’ in title or abstract
Data science fields (at least 20% ‘data’
publications)
26
Social sciences
and humanities
Biomedical and
health sciences
Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
Term map of data science fields
27
28
Leiden University’s publication output
in data science fields
Social sciences
and humanities
Biomedical and
health sciences
Life and earth
sciences
Mathematics and
computer science
Physical
sciences and
engineering
Leiden University’s institutes with most
publications in data science fields
• Leiden Observatory
• LUMC
• Faculty of Archaeology
• Institute of Psychology (FSW)
• Centre for Science and Technology Studies (FSW)
• Mathematical Institute (Science)
• Institute of Biology Leiden (Science)
• Leiden Institute of Advanced Computer Science
(Science)
29
LUMC departments with most
publications in data science fields
• Medical Statistics and Bioinformatics
• Rheumatology
• Psychiatry
• Radiology
• Clinical Epidemiology
• Human Genetics
• Neurosurgery
• Cardiology
• Clinical Oncology
• Endocrinology 30
Term map based on Leiden University’s
publications in data science fields
31
Do it yourself!
32
www.vosviewer.com www.citnetexplorer.nl
Thank you for your attention!
33

More Related Content

What's hot

A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...Nees Jan van Eck
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingNees Jan van Eck
 
Visual exploration of scientific literature using VOSviewer and CitNetExplorer
Visual exploration of scientific literature using VOSviewer and CitNetExplorerVisual exploration of scientific literature using VOSviewer and CitNetExplorer
Visual exploration of scientific literature using VOSviewer and CitNetExplorerNees Jan van Eck
 
VOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer TutorialVOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer TutorialNees Jan van Eck
 
VOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literatureVOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literatureNees Jan van Eck
 
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...Nees Jan van Eck
 
Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Nees Jan van Eck
 
Getting started with CitNetExplorer
Getting started with CitNetExplorerGetting started with CitNetExplorer
Getting started with CitNetExplorerNees Jan van Eck
 
Large-scale visualization of science
Large-scale visualization of scienceLarge-scale visualization of science
Large-scale visualization of scienceNees Jan van Eck
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewerNees Jan van Eck
 
Intermediacy of publications
Intermediacy of publicationsIntermediacy of publications
Intermediacy of publicationsNees Jan van Eck
 
Using full-text data to create improved term maps
Using full-text data to create improved term mapsUsing full-text data to create improved term maps
Using full-text data to create improved term mapsNees Jan van Eck
 
Scientometric approaches to classification
Scientometric approaches to classificationScientometric approaches to classification
Scientometric approaches to classificationNees Jan van Eck
 
Large-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applicationsLarge-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applicationsLudo Waltman
 
Visualizing science based on open data sources
Visualizing science based on open data sourcesVisualizing science based on open data sources
Visualizing science based on open data sourcesNees Jan van Eck
 
Bibliometric visualization using VOSviewer
Bibliometric visualization using VOSviewerBibliometric visualization using VOSviewer
Bibliometric visualization using VOSviewerLudo Waltman
 
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...Nees Jan van Eck
 
The landscape of research on research
The landscape of research on researchThe landscape of research on research
The landscape of research on researchLudo Waltman
 

What's hot (20)

A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...A systematic empirical comparison of different approaches for normalizing cit...
A systematic empirical comparison of different approaches for normalizing cit...
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
 
Visual exploration of scientific literature using VOSviewer and CitNetExplorer
Visual exploration of scientific literature using VOSviewer and CitNetExplorerVisual exploration of scientific literature using VOSviewer and CitNetExplorer
Visual exploration of scientific literature using VOSviewer and CitNetExplorer
 
VOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer TutorialVOSviewer and CitNetExplorer Tutorial
VOSviewer and CitNetExplorer Tutorial
 
VOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literatureVOSviewer: A software tool for analyzing and visualizing scientific literature
VOSviewer: A software tool for analyzing and visualizing scientific literature
 
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
VOSviewer and CitNetExplorer: Software tools for bibliometric analysis of s...
 
Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...Network visualization: Fine-tuning layout techniques for different types of n...
Network visualization: Fine-tuning layout techniques for different types of n...
 
Getting started with CitNetExplorer
Getting started with CitNetExplorerGetting started with CitNetExplorer
Getting started with CitNetExplorer
 
Large-scale visualization of science
Large-scale visualization of scienceLarge-scale visualization of science
Large-scale visualization of science
 
Cluster stability
Cluster stabilityCluster stability
Cluster stability
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
 
Intermediacy of publications
Intermediacy of publicationsIntermediacy of publications
Intermediacy of publications
 
Using full-text data to create improved term maps
Using full-text data to create improved term mapsUsing full-text data to create improved term maps
Using full-text data to create improved term maps
 
On cluster stability
On cluster stabilityOn cluster stability
On cluster stability
 
Scientometric approaches to classification
Scientometric approaches to classificationScientometric approaches to classification
Scientometric approaches to classification
 
Large-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applicationsLarge-scale visualization of science: Methods, tools, and applications
Large-scale visualization of science: Methods, tools, and applications
 
Visualizing science based on open data sources
Visualizing science based on open data sourcesVisualizing science based on open data sources
Visualizing science based on open data sources
 
Bibliometric visualization using VOSviewer
Bibliometric visualization using VOSviewerBibliometric visualization using VOSviewer
Bibliometric visualization using VOSviewer
 
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
Bibliometrische visualisaties voor het bijhouden van wetenschappelijke litera...
 
The landscape of research on research
The landscape of research on researchThe landscape of research on research
The landscape of research on research
 

Viewers also liked

Bibliographic coupling
Bibliographic couplingBibliographic coupling
Bibliographic couplingRitesh Tiwari
 
Interactive topic identification using CitNetExplorer
Interactive topic identification using CitNetExplorerInteractive topic identification using CitNetExplorer
Interactive topic identification using CitNetExplorerNees Jan van Eck
 
Implementing a Scholarly Impact Program for Faculty and Graduate Students
Implementing a Scholarly Impact Program for Faculty and Graduate StudentsImplementing a Scholarly Impact Program for Faculty and Graduate Students
Implementing a Scholarly Impact Program for Faculty and Graduate StudentsBrenna Helmstutler
 
The need for contextualized scientometric analysis
The need for contextualized scientometric analysisThe need for contextualized scientometric analysis
The need for contextualized scientometric analysisLudo Waltman
 
What is your h-index and other measures of impact
What is your h-index and other measures of impactWhat is your h-index and other measures of impact
What is your h-index and other measures of impactBerenika Webster
 
Rodrigo Costas & Stefanie Haustein: Citation theories and their application t...
Rodrigo Costas & Stefanie Haustein: Citation theories and their application t...Rodrigo Costas & Stefanie Haustein: Citation theories and their application t...
Rodrigo Costas & Stefanie Haustein: Citation theories and their application t...Stefanie Haustein
 
Research-only rankings of HEIs: Is it possible to measure scientific performa...
Research-only rankings of HEIs:Is it possible to measure scientific performa...Research-only rankings of HEIs:Is it possible to measure scientific performa...
Research-only rankings of HEIs: Is it possible to measure scientific performa...Ludo Waltman
 
Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Ludo Waltman
 
SSH & the City. A network approach for tracing the societal contribution of t...
SSH & the City. A network approach for tracing the societal contribution of t...SSH & the City. A network approach for tracing the societal contribution of t...
SSH & the City. A network approach for tracing the societal contribution of t...Nicolas Robinson-Garcia
 
Citation analysis: State of the art, good practices, and future developments
Citation analysis: State of the art, good practices, and future developmentsCitation analysis: State of the art, good practices, and future developments
Citation analysis: State of the art, good practices, and future developmentsLudo Waltman
 
How to build your own citation index
How to build your own citation indexHow to build your own citation index
How to build your own citation indexGESIS
 
Bibliometrics and scientometrics
Bibliometrics and scientometricsBibliometrics and scientometrics
Bibliometrics and scientometricsguest633b30
 
Bibliometrics, Scintometrics, Citation analysis, Content analysis
Bibliometrics, Scintometrics, Citation analysis, Content analysisBibliometrics, Scintometrics, Citation analysis, Content analysis
Bibliometrics, Scintometrics, Citation analysis, Content analysisSumit Ranjan
 

Viewers also liked (17)

Bibliographic coupling
Bibliographic couplingBibliographic coupling
Bibliographic coupling
 
Interactive topic identification using CitNetExplorer
Interactive topic identification using CitNetExplorerInteractive topic identification using CitNetExplorer
Interactive topic identification using CitNetExplorer
 
Kevin Swingler: Introduction to Data Mining
Kevin Swingler: Introduction to Data MiningKevin Swingler: Introduction to Data Mining
Kevin Swingler: Introduction to Data Mining
 
Implementing a Scholarly Impact Program for Faculty and Graduate Students
Implementing a Scholarly Impact Program for Faculty and Graduate StudentsImplementing a Scholarly Impact Program for Faculty and Graduate Students
Implementing a Scholarly Impact Program for Faculty and Graduate Students
 
The need for contextualized scientometric analysis
The need for contextualized scientometric analysisThe need for contextualized scientometric analysis
The need for contextualized scientometric analysis
 
Mike Thelwall: Introduction to Webometrics
Mike Thelwall: Introduction to WebometricsMike Thelwall: Introduction to Webometrics
Mike Thelwall: Introduction to Webometrics
 
Webometrics
WebometricsWebometrics
Webometrics
 
What is your h-index and other measures of impact
What is your h-index and other measures of impactWhat is your h-index and other measures of impact
What is your h-index and other measures of impact
 
Rodrigo Costas & Stefanie Haustein: Citation theories and their application t...
Rodrigo Costas & Stefanie Haustein: Citation theories and their application t...Rodrigo Costas & Stefanie Haustein: Citation theories and their application t...
Rodrigo Costas & Stefanie Haustein: Citation theories and their application t...
 
Research-only rankings of HEIs: Is it possible to measure scientific performa...
Research-only rankings of HEIs:Is it possible to measure scientific performa...Research-only rankings of HEIs:Is it possible to measure scientific performa...
Research-only rankings of HEIs: Is it possible to measure scientific performa...
 
Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...
 
SSH & the City. A network approach for tracing the societal contribution of t...
SSH & the City. A network approach for tracing the societal contribution of t...SSH & the City. A network approach for tracing the societal contribution of t...
SSH & the City. A network approach for tracing the societal contribution of t...
 
Citation analysis: State of the art, good practices, and future developments
Citation analysis: State of the art, good practices, and future developmentsCitation analysis: State of the art, good practices, and future developments
Citation analysis: State of the art, good practices, and future developments
 
How to build your own citation index
How to build your own citation indexHow to build your own citation index
How to build your own citation index
 
Bibliometrics and scientometrics
Bibliometrics and scientometricsBibliometrics and scientometrics
Bibliometrics and scientometrics
 
Bibliometrics, Scintometrics, Citation analysis, Content analysis
Bibliometrics, Scintometrics, Citation analysis, Content analysisBibliometrics, Scintometrics, Citation analysis, Content analysis
Bibliometrics, Scintometrics, Citation analysis, Content analysis
 
Bibliometrics
BibliometricsBibliometrics
Bibliometrics
 

Similar to Analysis of bibliometric data sources and data science fields

Scientometrics for research assessment
Scientometrics for research assessmentScientometrics for research assessment
Scientometrics for research assessmentLudo Waltman
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesLudo Waltman
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourKNOWeSCAPE2014
 
Rebecca Grant DAH Research Presentation
Rebecca Grant DAH Research PresentationRebecca Grant DAH Research Presentation
Rebecca Grant DAH Research Presentationdri_ireland
 
Has anyone seen my data? Incentivising #opendata sharing with altmetrics
Has anyone seen my data? Incentivising #opendata sharing with altmetricsHas anyone seen my data? Incentivising #opendata sharing with altmetrics
Has anyone seen my data? Incentivising #opendata sharing with altmetricsNick Sheppard
 
Public engagement while you sleep
Public engagement while you sleepPublic engagement while you sleep
Public engagement while you sleepUoLResearchSupport
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries? Robin Rice
 
Moving from an IR to a CRIS, the why & how
Moving from an IR to a CRIS, the why & howMoving from an IR to a CRIS, the why & how
Moving from an IR to a CRIS, the why & howDavid T Palmer
 
Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...UoLResearchSupport
 
Public engagement while you sleep
Public engagement while you sleep Public engagement while you sleep
Public engagement while you sleep Kirsten Thompson
 
2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum TalkPaul Bracke
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Stella Wisdom
 
TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...Peter Löwe
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014VinothkumaR Ramu
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...CILIP MDG
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Rene Von schomberg
 
An in-depth bibliometric perspective on China’s scientific performance
An in-depth bibliometric perspective on China’s scientific performanceAn in-depth bibliometric perspective on China’s scientific performance
An in-depth bibliometric perspective on China’s scientific performanceLudo Waltman
 

Similar to Analysis of bibliometric data sources and data science fields (20)

Scientometrics for research assessment
Scientometrics for research assessmentScientometrics for research assessment
Scientometrics for research assessment
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunities
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
Rebecca Grant DAH Research Presentation
Rebecca Grant DAH Research PresentationRebecca Grant DAH Research Presentation
Rebecca Grant DAH Research Presentation
 
Has anyone seen my data? Incentivising #opendata sharing with altmetrics
Has anyone seen my data? Incentivising #opendata sharing with altmetricsHas anyone seen my data? Incentivising #opendata sharing with altmetrics
Has anyone seen my data? Incentivising #opendata sharing with altmetrics
 
Public engagement while you sleep
Public engagement while you sleepPublic engagement while you sleep
Public engagement while you sleep
 
Why altmetrics?
Why altmetrics?Why altmetrics?
Why altmetrics?
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?
 
Moving from an IR to a CRIS, the why & how
Moving from an IR to a CRIS, the why & howMoving from an IR to a CRIS, the why & how
Moving from an IR to a CRIS, the why & how
 
Research data management in UK universities: A collaborative venture
Research data management in UK universities: A collaborative ventureResearch data management in UK universities: A collaborative venture
Research data management in UK universities: A collaborative venture
 
Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...Public engagement while you sleep? How altmetrics can help researchers broade...
Public engagement while you sleep? How altmetrics can help researchers broade...
 
Public engagement while you sleep
Public engagement while you sleep Public engagement while you sleep
Public engagement while you sleep
 
50 Years of Data Science
50 Years of Data Science50 Years of Data Science
50 Years of Data Science
 
2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods
 
TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...TIB's action for research data managament as a national library's strategy in...
TIB's action for research data managament as a national library's strategy in...
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
 
Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts'; Presentation on Open Science and its 'Impacts';
Presentation on Open Science and its 'Impacts';
 
An in-depth bibliometric perspective on China’s scientific performance
An in-depth bibliometric perspective on China’s scientific performanceAn in-depth bibliometric perspective on China’s scientific performance
An in-depth bibliometric perspective on China’s scientific performance
 

More from Nees Jan van Eck

Crossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadataCrossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadataNees Jan van Eck
 
Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...Nees Jan van Eck
 
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...Nees Jan van Eck
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university rankingNees Jan van Eck
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewerNees Jan van Eck
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university rankingNees Jan van Eck
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingNees Jan van Eck
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewerNees Jan van Eck
 
Accuracy of citation data in Web of Science and Scopus
Accuracy of citation data in Web of Science and ScopusAccuracy of citation data in Web of Science and Scopus
Accuracy of citation data in Web of Science and ScopusNees Jan van Eck
 
How to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparisonHow to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparisonNees Jan van Eck
 

More from Nees Jan van Eck (10)

Crossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadataCrossref as a source of open bibliographic metadata
Crossref as a source of open bibliographic metadata
 
Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...Community detection using citation relations and textual similarities in a la...
Community detection using citation relations and textual similarities in a la...
 
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
Visualizing science using VOSviewer based on Crossref, Microsoft Academic, an...
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university ranking
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
 
A scientometric perspective on university ranking
A scientometric perspective on university rankingA scientometric perspective on university ranking
A scientometric perspective on university ranking
 
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university rankingCWTS Leiden Ranking: An advanced bibliometric approach to university ranking
CWTS Leiden Ranking: An advanced bibliometric approach to university ranking
 
Open data sources in VOSviewer
Open data sources in VOSviewerOpen data sources in VOSviewer
Open data sources in VOSviewer
 
Accuracy of citation data in Web of Science and Scopus
Accuracy of citation data in Web of Science and ScopusAccuracy of citation data in Web of Science and Scopus
Accuracy of citation data in Web of Science and Scopus
 
How to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparisonHow to design a ranking system: Criteria and opportunities for a comparison
How to design a ranking system: Criteria and opportunities for a comparison
 

Recently uploaded

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 

Recently uploaded (20)

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 

Analysis of bibliometric data sources and data science fields

  • 1. Large-scale analysis of bibliometric data sources Nees Jan van Eck Centre for Science and Technology Studies (CWTS), Leiden University 8th LCDS Meeting: Statistics & Data Science Leiden, November 13, 2015
  • 2. About myself • Master in computer science • PhD thesis on bibliometric mapping of science • Researcher at CWTS since 2009 • Research focus on analysis and visualization of bibliometric networks 1
  • 3. Centre for Science and Technology Studies (CWTS) • Research center at Leiden University focusing on science and technology studies • About 30 staff members • History of more than 25 years in bibliometric and scientometric research • Contract research • Full access to large bibliographic database (Web of Science and Scopus) 2
  • 4. Bibliographic databases: ‘Big data’ 3 Web of Science Scopus Journals 12,000 20,000 Publications 45 million 35 million Citations 1 billion 0.9 billion
  • 5. Bibliometric networks 4 Web of Science Scopus Citation network of publications Co-authorship network of authors / organizations Co-citation network of pubs / authors / journals Co-occurrence network of terms Bibliographic coupling network of pubs / authors / journals Bibliographic database
  • 6. Outline • Software tools • Network analysis techniques • Analysis of data science 5
  • 8. Software tools • VOSviewer (www.vosviewer.com) – Tool for constructing and visualizing bibliometric networks • CitNetExplorer (www.citnetexplorer.nl) – Tool for visualizing and analyzing citation networks of publications • Both tools have been developed together with my colleague Ludo Waltman 7
  • 10. Map of university co-authorship network 9
  • 11. Map of journal citation network 10
  • 14. Network analysis techniques 14 Layout: • Visualization of similarities (VOS) Community detection: • Weighted modularity • Smart local moving algorithm
  • 15. Smart local moving algorithm 15 Q = 0.4198 Q = 0.3791 Reduced network Local moving heuristic in subnetworks Local moving heuristic Original network
  • 16. Algorithmically constructed classification system of science • 16.2 million publications from the period 2000– 2014 indexed in Web of Science • 241.7 million citation relations • Classification system of 3 hierarchical levels: – 28 broad disciplines – 813 fields – 3,822 subfields 16
  • 17. 17 Breakdown of scientific literature into 813 fields Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 19. Time-line map of highly cited scientometrics publications 19
  • 21. What is data science? • Empirical operationalization of data science based on publications with ‘data’ in title or abstract 21 Wikipedia: “Data Science is an interdisciplinary field about processes and systems to extract knowledge or insights from data … which is a continuation of some of the data analysis fields such as statistics, data mining, and predictive analytics” LCDS: “Data Science … deals with finding, analyzing and validating complex patterns in data. Data Science methods are indispensable for maintaining a competitive edge in all disciplines in science”
  • 22. Growth of data-driven research 22 0% 2% 4% 6% 8% 10% 12% 14% 16% 18% 20% 1990 1995 2000 2005 2010 2015 Percentageofpublications % 'data' publications % 'theory' publications
  • 23. 23 Breakdown of scientific literature into 813 fields Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 24. 24 Data-driven nature of different scientific fields Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering % pub. with ‘data’ in title or abstract
  • 25. 25 Data-driven nature of different scientific fields artificial intelligence statistics bioinformatics neuroimaging pattern recognition astronomy earth water weather climate remote sensing nutrition obesity addiction % pub. with ‘data’ in title or abstract
  • 26. Data science fields (at least 20% ‘data’ publications) 26 Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 27. Term map of data science fields 27
  • 28. 28 Leiden University’s publication output in data science fields Social sciences and humanities Biomedical and health sciences Life and earth sciences Mathematics and computer science Physical sciences and engineering
  • 29. Leiden University’s institutes with most publications in data science fields • Leiden Observatory • LUMC • Faculty of Archaeology • Institute of Psychology (FSW) • Centre for Science and Technology Studies (FSW) • Mathematical Institute (Science) • Institute of Biology Leiden (Science) • Leiden Institute of Advanced Computer Science (Science) 29
  • 30. LUMC departments with most publications in data science fields • Medical Statistics and Bioinformatics • Rheumatology • Psychiatry • Radiology • Clinical Epidemiology • Human Genetics • Neurosurgery • Cardiology • Clinical Oncology • Endocrinology 30
  • 31. Term map based on Leiden University’s publications in data science fields 31
  • 32. Do it yourself! 32 www.vosviewer.com www.citnetexplorer.nl
  • 33. Thank you for your attention! 33