Towards the automatic identification
of the nature of citations
(1) Department of Computer Science and Engineering, University of Bologna, Italy
(2) STLab-ISTC, National Research Council, Italy
30 May 2013
Montpellier, France
ESWC 2013
Motivation
• Bibliographic citations can be seen as tools for:
– linking research: pointing to related works, to sources of experimental data, to methods used, etc.
– disseminating research: conference proceedings, journals, Web platforms (e.g. blogs, wikis), Semantic Publishing platforms and projects (e.g. OpenCitation, OpenBibliography, Lucero)
– exploring research: new ways of browsing articles through networks of citations (e.g. CiteWiz, Citation Sensitive In-browser Summariser)
– evaluating research: measuring the importance of journals (e.g. impact factor) or the scientific productivity of authors (e.g. h-index)
• Assumption: all these activities can be radically improved by exploiting the actual function of citations, i.e. the author’s reason for citing a given paper
Goal
• To design a method able to automatically infer the
author’s reason for citing a scientific article
• To implement a tool that is comparable to humans in the
task of identifying the nature of citations
• Available online at http://wit.istc.cnr.it:8080/tools/citalo
• Input: a sentence containing a reference to a bibliographic entity, indicated by an “X”, e.g. “It extends the research outlined in earlier work X.”
• Components:
– Ontology learning: derive a logical representation of the sentence (i.e. an OWL ontology) through FRED
– Citation type extraction: extract candidate types for the citation by looking for patterns in the FRED output via SPARQL
– Word-sense disambiguation: gather the sense of the candidate types through IMS with respect to OntoWordNet
– Alignment to CiTO: assign CiTO types to the citation through SPARQL CONSTRUCT
– Sentiment analysis: capture the sentiment polarity emerging from the text through AlchemyAPI
• Output for the example sentence: cito:extends
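The data flow on this slide can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the function names, the stem table and the sentiment lexicon are hypothetical stand-ins, and none of it calls or reproduces FRED, IMS, AlchemyAPI or the actual tool. It only shows how a sentence could pass from candidate extraction to a CiTO property, falling back to the neutral citesForInformation when nothing matches.

```python
import re

# Assumed, illustrative lexicon: verb stem -> CiTO property.
STEM_TO_CITO = {
    "extend": "cito:extends",
    "critici": "cito:critiques",
}
# Assumed toy sentiment lexicon (the real pipeline uses an external service).
NEGATIVE_WORDS = {"fails", "wrong", "flawed"}
NEUTRAL = "cito:citesForInformation"


def extract_candidates(sentence):
    """Stand-in for ontology learning and type extraction: plain tokenising."""
    return re.findall(r"[a-z]+", sentence.lower())


def polarity(tokens):
    """Stand-in for sentiment analysis: a simple lexicon lookup."""
    return "negative" if NEGATIVE_WORDS & set(tokens) else "neutral"


def align_to_cito(tokens):
    """Stand-in for disambiguation and alignment: match tokens against stems."""
    found = [prop for stem, prop in STEM_TO_CITO.items()
             if any(tok.startswith(stem) for tok in tokens)]
    return found or [NEUTRAL]


def classify(sentence):
    """Return (list of CiTO properties, sentiment polarity) for one sentence."""
    tokens = extract_candidates(sentence)
    return align_to_cito(tokens), polarity(tokens)


print(classify("It extends the research outlined in earlier work X."))
# prints (['cito:extends'], 'neutral')
```

A lookup table is used here only because it keeps the sketch self-contained; the slide describes ontology-based extraction and SPARQL queries instead.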
Result
• We asked humans to manually annotate 106 citation sentences, contained in scientific articles, according to CiTO properties
• Similarly to Teufel et al. [19], the most neutral CiTO property, citesForInformation, was the most prevalent function in our dataset too, while the second most used property was usedMethodIn
• We ran CiTalO on the same sample in 8 different configurations and compared the results with the human annotations
– No configuration emerges as the absolute best from these data
– The worst configurations were those that took into account all the proximal synsets
Thanks
