Mining Research Publication Networks for Impact
PhD Topic Presentation

Drahomira Herrmannova
Knowledge Media Institute
The Open University

KMi Internal Seminar, November 2013

Table of Contents
1 Research Aim
  Motivation
  Problem statement
2 Literature review
  State of the art
  Limitations
3 Research objectives
  Research questions
  Selected approach
  Tasks and plans
4 Pilot study
5 References

The key question

“How to evaluate the quality of research publications?”

Who needs this anyway?
• Researchers
  • How to select relevant literature for reading?
• Librarians
  • How to select journal subscriptions?
• Universities, funding agencies and other institutions
  • How to aid reviewers of funding and grant proposals, hiring committees, etc.?
• Publishers and editors
  • How can publishers evaluate and promote their journals?
• Society
  • How to evaluate the returns of research to society?

The growth of scholarly literature

Figure: Monthly submission rate (since 1991) for arXiv.org. Source: http://arxiv.org/

The growth of journal subscription costs

Figure: Expenditures in ARL libraries (1986-2009). Source: [1]

What’s being used

• Peer review
  • Qualitative evaluation method
  • Traditionally the main filter for controlling the quality of published research
• Classical quantitative methods
  • Typically based on citations and/or productivity
  • Citation counts
  • JIF
  • h-index
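
As a rough illustration of the citation-based indicators listed above, the sketch below computes an h-index and a two-year journal impact factor using their standard definitions. The function names and the example numbers are hypothetical and are not taken from the slides.

```python
def h_index(citation_counts):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank
        else:
            break
    return h


def two_year_impact_factor(citations_to_prev_two_years, citable_items_prev_two_years):
    """Citations received this year by items from the previous two years,
    divided by the number of citable items published in those two years."""
    if citable_items_prev_two_years <= 0:
        raise ValueError("need at least one citable item")
    return citations_to_prev_two_years / citable_items_prev_two_years


# Hypothetical data: five papers by one author, and one journal.
print(h_index([10, 8, 5, 4, 3]))
print(two_year_impact_factor(150, 60))
```

Both indicators reduce a citation record to a single number, which is the property the later slides question.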

So, what’s the problem?

• Peer review
  • Speed and cost
  • Biased opinion
  • Doesn’t limit the amount of published research
• Classical quantitative methods
  • Quality vs. impact
  • Reasons for citation
  • Citation half-life
  • Manipulation and gaming
  • Author variability
  • Field effects

Bibliometrics today

Two changes that influenced the evolution of bibliometrics:
• Creation of the Web and web-related developments
• Growth of Open Access publishing

Bibliometrics today
Two ideas driving the current research
1 Development of new metrics (improvements and replacements of JIF)
  • h-index
  • Eigenfactor
  • SJR
2 Concerns about the validity of using citations
  • Methods using different data
    • Patent analysis
    • Webometrics
    • Altmetrics
    • Full-text analysis
  • “Fixing” citations (field normalisation of indicators)

Limitations
• Limitations of citation-based metrics
  • Citation bias
  • Incomplete journal coverage
  • Author variability
  • Field effects
  • Uncited publications
  • Manipulation of metrics
  • Using JIF for research evaluation
• Limitations of web-based metrics
  • Gaming web-based and social metrics
  • Problems of data collection
  • Adoption of social media by users
  • Accumulated advantage
• Limitations of text-based metrics
  • Full-text not always available

Research questions

Question 1: What factors influence the quality of a research publication (with regard to the publication type)?
Question 2: What is the relationship (if any) between the impact of a publication, as measured by classical bibliometric methods, and its quality?
Question 3: How can we detect the factors influencing quality in order to evaluate the quality of a research publication?
Question 4: How can this evaluation be used in other disciplines?

Selected approach

• Single number vs. collection of metrics and indicators
• Analysis of full-text
  • Until quite recently not easily available
  • Full-text – the best indicator of publication quality
  • For example
    • Co-word analysis
    • Analysis of citation context
    • Semantic similarity of publications
• Additional indicators
  • Famous author or collaboration with famous authors
  • Cites or is cited by work outside its research area
  • Paper published in a field-specific prestigious journal
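
The slides do not say how semantic similarity of publications would be measured. As a stand-in, the sketch below uses cosine similarity over plain term-frequency vectors, a common baseline; the tokenisation rule and the function names are assumptions made for illustration only.

```python
import math
import re
from collections import Counter


def term_frequencies(text):
    """Lower-case the text and count alphabetic tokens (a deliberately crude tokeniser)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between the term-frequency vectors of two texts."""
    tf_a = term_frequencies(text_a)
    tf_b = term_frequencies(text_b)
    # Only terms present in both texts contribute to the dot product.
    dot = sum(tf_a[term] * tf_b[term] for term in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(count * count for count in tf_a.values()))
    norm_b = math.sqrt(sum(count * count for count in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical two-word "documents" sharing one of their two terms.
print(cosine_similarity("citation analysis", "citation counts"))
```

A real pipeline would likely add stop-word removal and term weighting; this version only shows the shape of the computation on full text.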

Requirements for science evaluation methods
Source: [2]

1 Reliable and accurate, comparable to or better than the peer review system
2 Easy to understand
3 Economical in terms of development and maintenance, time required to understand it, etc.
4 Faster than citations, at least comparable to the speed of peer review
5 Resistant to manipulation and gaming

Tasks and plans
Data collection

Task 1: Identify information sources that may provide relevant publication data
  • Mostly done
Task 2a: Investigate factors that influence the quality of research publications
Task 2b: Using the identified information sources, develop various relevant data structures such as:
  • collaboration networks
  • citation, co-citation and bibliographic coupling networks
  • clusters of semantically related publications
  • clusters of publications corresponding to different topics
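
To make two of the data structures in Task 2b concrete, the sketch below derives co-citation and bibliographic coupling counts from a mapping of papers to their reference lists. The input format, the function names and the toy data are hypothetical; the slides do not prescribe a representation.

```python
from collections import defaultdict
from itertools import combinations


def bibliographic_coupling(references):
    """For each pair of citing papers, count the references they share."""
    coupling = {}
    for paper_a, paper_b in combinations(sorted(references), 2):
        shared = len(set(references[paper_a]) & set(references[paper_b]))
        if shared:
            coupling[(paper_a, paper_b)] = shared
    return coupling


def co_citation(references):
    """For each pair of cited papers, count the papers that cite both."""
    counts = defaultdict(int)
    for cited in references.values():
        for cited_a, cited_b in combinations(sorted(set(cited)), 2):
            counts[(cited_a, cited_b)] += 1
    return dict(counts)


# Toy citation data: keys are citing papers, values are their reference lists.
toy_references = {
    "P1": ["A", "B", "C"],
    "P2": ["B", "C"],
    "P3": ["C", "D"],
}

print(bibliographic_coupling(toy_references))
print(co_citation(toy_references))
```

Each result is a weighted edge list, so it can be loaded into a graph library for the network analysis planned in Task 3b.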

Tasks and plans
Data analysis

Task 3a: Study how NLP can be applied to the evaluation of research publications
Task 3b: Investigate the developed data structures using graph and network theory as well as bibliometric indicators

Tasks and plans
Development of new methods

Task 4a: Analyse the possibilities of combining the studied methods in order to design a set of new methods for estimating quality
Task 4b: Evaluate the proposed methods against current standards
Task 4c: Analyse the use of the new methods in other disciplines

Task 1
Identification of data sources

Source: CSX, MAS, JSTOR, DBLP, CORE, ArXiv, KDD, iSearch, DBLP+C, ACM, OCC

Marks per column (blank cells were not preserved):
MD: X X X X -
API: X X X X -
OAI-PMH: X X X -
dumps: X X X X X X X X X
cit.: X X X X X X X X X
FT: X * * * X X X X -

Table: Stars (*) mark sources that do not store full text but link to it where available. MD stands for multidisciplinary.

References

[1] Kyrillidou, Martha and Morris, Shaneka. ARL Statistics 2008-2009. Association of Research Libraries, Washington, DC, 2011.
[2] Taraborelli, Dario. Soft peer review: Social software and distributed scientific evaluation. Proceedings of the 8th International Conference on the Design of Cooperative Systems (COOP ’08), Carry-le-Rouet, France, 2008.

How many metrics?

Scientometrics: study of science and research
Bibliometrics: study of scientific literature
Informetrics: study of any type of information
Webometrics: informetric studies of the web
Cybermetrics: informetric studies of the whole Internet
Altmetrics: study of science and research using data from social media
