SlideShare une entreprise Scribd logo
1  sur  13
A Vague Sense Classifier for Detecting Vague
Definitions in Ontologies
Panos Alexopoulos, John Pavlopoulos
14th Conference of the European Chapter of the Association for Computational
Linguistics
Gothenburg, Sweden, 26–30 April 2014
2
Vagueness
Introduction
●Vagueness is a semantic phenomenon where predicates admit
borderline cases, i.e. cases where it is not determinately true that the
predicate applies or not (Shapiro 2006).
●This happens when predicates have blurred boundaries:
● What’s the threshold number of years separating old and not old
films?
● What are the exact criteria that distinguish modern restaurants
from non-modern?
3
Vagueness Consequences
Introduction
●The problem with vague terms in semantic data is the possibility of
disagreements!
●E.g., when we asked domain experts to provide instances of the
concept Critical Business Process, there were certain processes for
which there was a dispute among them about whether they should be
regarded as critical or not.
●The problem was that different experts had different criteria of
process criticality and could not decide which of these were
sufficient to classify a process as critical.
4
Problematic Scenarios
Introduction
1. Structuring Data with a Vague Ontology: Possible
disagreement among experts when defining class and relation
instances.
2. Utilizing Vague Facts in Ontology-Based Systems:
Reasoning results might not meet users’ expectations
3. Integrating Vague Semantic Information: The merging of
particular vague elements can lead to data that will not be
valid for all its users.
5
Problem Definition & Approach
Automatic Vagueness Detection
●Can we automatically determine whether an ontology entity (class, relation etc.)
is vague or not?
● “StrategicClient” as “A client that has a high value for the company” is
vague!
● “AmericanCompany” as “A company that has legal status in the
Unites States” is not!
Problem Definition
●We train a binary classifier that may distinguish between vague and non-vague
term word senses.
●Training is supervised, using examples from Wordnet.
●We use this classifier to determine whether a given ontology element definition
is vague or not.
Approach
6
Data
Automatic Vagueness Detection
●2,000 adjective senses from WordNet.
● 1,000 vague
● 1,000 non-vague
●Inter-agreement of vague/non-vague annotation among 3 human
judges was 0.64 (Cohen’s Kappa)
Vague Senses Non Vague Senses
• Abnormal: not normal, not typical or usual
or regularor conforming to a norm
• Compound: composed of more than one
part
• Impenitent: impervious to moral persuasion • Biweekly: occurring every two weeks.
• Notorious: known widely and usually
unfavorably
• Irregular: falling below the manufacturer's
standard
• Aroused: emotionally aroused • Outermost: situated at the farthest possible
point from a center.
7
Training and Evaluation
Automatic Vagueness Detection
●80% of the data used to train a multinomial Naive Bayes classifier.
●We removed stop words and we used the bag of words assumption to
represent each instance.
●The remaining 20% of the data was used as a test set.
●Classification accuracy was 84%!
8
Comparison with Subjectivity Analyzer
Automatic Vagueness Detection
●We also used a subjective sense classifier to classify our dataset’s
senses as subjective or objective.
●From the 1000 vague senses, only 167 were classified as subjective
while from the 1000 non-vague ones 993.
●This shows that treating vagueness in the same way as
subjectiveness is not really effective.
9
Use Case: Detecting Vagueness in CiTO Ontology
Automatic Vagueness Detection
●As an ontology use case we considered CiTO, an ontology that
enables characterization of the nature or type of citations.
●CiTO consists primarily of relations, many of which are vague (e.g.
plagiarizes).
●We selected 44 relations and we had 3 human judges manually
classify them as vague or not.
●Then we applied our Wordnet-trained vagueness classifier on the
textual definitions of the same relations.
10
Use Case: Detecting Vagueness in CiTO Ontology
Automatic Vagueness Detection
Vague Relations Non Vague Relations
• plagiarizes: A property indicating that
the author of the citing entity
plagiarizes the cited entity, by
including textual or other elements
from the cited entity without formal
acknowledgement of their source
• sharesAuthorInstitutionWith: Each
entity has at least one author that
shares a common institutional
affiliation with an author of the other
entity
• citesAsAuthority: The citing entity
cites the cited entity as one that
provides an authoritative description
or definition of the subject under
discussion.
• providesDataFor: The cited entity
presents data that are used in work
described in the citing entity.
11
Use Case: Detecting Vagueness in CiTO Ontology
Automatic Vagueness Detection
●Classification Results:
● 82% of relations were correctly classified as vague/non-vague
● 94% accuracy for non-vague relations.
● 74% accuracy for vague relations.
●Again, we classified the same relations with the subjectivity classifier:
● 40% of vague/non-vague relations were classified as
subjective/objective respectively.
● 94% of non-vague were classified as objective.
● 7% of vague relations were classified as subjective.
12
Future Work
Vagueness-Aware Semantic Data
●Incorporate the current classifier into an ontology analysis tool
●Improve the classifier by contemplating new features
●See whether it is possible to build a vague sense lexicon.
13
Questions?
Thank you!
iSOCO Madrid
Av. del Partenón, 16-18, 1º7ª
Campo de las Naciones
28042 Madrid
España
(t) +34 913 349 797
iSOCO Pamplona
Parque Tomás
Caballero, 2, 6º4ª
31006 Pamplona
España
(t) +34 948 102 408
iSOCO Valencia
C/ Prof. Beltrán Báguena, 4
Oficina 107
46009 Valencia
España
(t) +34 963 467 143
iSOCO Barcelona
Av. Torre Blanca, 57
Edificio ESADE CREAPOLIS
Oficina 3C 15
08172 Sant Cugat del Vallès
Barcelona, España
(t) +34 935 677 200
iSOCO Colombia
Complejo Ruta N
Calle 67, 52-20
Piso 3, Torre A
Medellín
Colombia
(t) +57 516 7770 ext. 1132
Key Vendor
Virtual Assistant 2013
Quieres
innovar?
Dr. Panos Alexopoulos
Semantic Applications Research
Manager
palexopoulos@isoco.com
(t) +34 913 349 797

Contenu connexe

Tendances

Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysisAmenda Joy
 
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet IJECEIAES
 
SemEval - Aspect Based Sentiment Analysis
SemEval - Aspect Based Sentiment AnalysisSemEval - Aspect Based Sentiment Analysis
SemEval - Aspect Based Sentiment AnalysisAditya Joshi
 
Project sentiment analysis
Project sentiment analysisProject sentiment analysis
Project sentiment analysisBob Prieto
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysisZahid Azam
 
Amazon Product Sentiment review
Amazon Product Sentiment reviewAmazon Product Sentiment review
Amazon Product Sentiment reviewLalit Jain
 
A review of sentiment analysis approaches in big
A review of sentiment analysis approaches in bigA review of sentiment analysis approaches in big
A review of sentiment analysis approaches in bigNurfadhlina Mohd Sharef
 
CrowdTruth for medical relation extraction - WAI talk
CrowdTruth for medical relation extraction - WAI talkCrowdTruth for medical relation extraction - WAI talk
CrowdTruth for medical relation extraction - WAI talkAnca Dumitrache
 
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET Journal
 
An overview of text mining and sentiment analysis for Decision Support System
An overview of text mining and sentiment analysis for Decision Support SystemAn overview of text mining and sentiment analysis for Decision Support System
An overview of text mining and sentiment analysis for Decision Support SystemGan Keng Hoon
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papersAshish Kulkarni
 
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
Supervised Learning Based Approach to Aspect Based Sentiment AnalysisSupervised Learning Based Approach to Aspect Based Sentiment Analysis
Supervised Learning Based Approach to Aspect Based Sentiment AnalysisTharindu Kumara
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysisishan0019
 
project sentiment analysis
project sentiment analysisproject sentiment analysis
project sentiment analysissneha penmetsa
 

Tendances (20)

Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
 
SemEval - Aspect Based Sentiment Analysis
SemEval - Aspect Based Sentiment AnalysisSemEval - Aspect Based Sentiment Analysis
SemEval - Aspect Based Sentiment Analysis
 
Project sentiment analysis
Project sentiment analysisProject sentiment analysis
Project sentiment analysis
 
295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis295B_Report_Sentiment_analysis
295B_Report_Sentiment_analysis
 
Amazon Product Sentiment review
Amazon Product Sentiment reviewAmazon Product Sentiment review
Amazon Product Sentiment review
 
A review of sentiment analysis approaches in big
A review of sentiment analysis approaches in bigA review of sentiment analysis approaches in big
A review of sentiment analysis approaches in big
 
CrowdTruth for medical relation extraction - WAI talk
CrowdTruth for medical relation extraction - WAI talkCrowdTruth for medical relation extraction - WAI talk
CrowdTruth for medical relation extraction - WAI talk
 
Project report
Project reportProject report
Project report
 
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment AnalysisIRJET- Survey of Classification of Business Reviews using Sentiment Analysis
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
 
An overview of text mining and sentiment analysis for Decision Support System
An overview of text mining and sentiment analysis for Decision Support SystemAn overview of text mining and sentiment analysis for Decision Support System
An overview of text mining and sentiment analysis for Decision Support System
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Query recommendation papers
Query recommendation papersQuery recommendation papers
Query recommendation papers
 
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
Supervised Learning Based Approach to Aspect Based Sentiment AnalysisSupervised Learning Based Approach to Aspect Based Sentiment Analysis
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
 
2 13
2 132 13
2 13
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
NLP Ecosystem
NLP EcosystemNLP Ecosystem
NLP Ecosystem
 
project sentiment analysis
project sentiment analysisproject sentiment analysis
project sentiment analysis
 
ACL-IJCNLP 2015
ACL-IJCNLP 2015ACL-IJCNLP 2015
ACL-IJCNLP 2015
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 

En vedette

DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataSebastian Hellmann
 
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
Evaluating Named Entity Recognition and Disambiguation in News and TweetsEvaluating Named Entity Recognition and Disambiguation in News and Tweets
Evaluating Named Entity Recognition and Disambiguation in News and TweetsMarieke van Erp
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Stefan Dietze
 
Gathering Alternative Surface Forms for DBpedia Entities
Gathering Alternative Surface Forms for DBpedia EntitiesGathering Alternative Surface Forms for DBpedia Entities
Gathering Alternative Surface Forms for DBpedia EntitiesHeiko Paulheim
 
Federated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataFederated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataMuhammad Saleem
 
Fast Approximate A-box Consistency Checking using Machine Learning
Fast Approximate  A-box Consistency Checking using Machine LearningFast Approximate  A-box Consistency Checking using Machine Learning
Fast Approximate A-box Consistency Checking using Machine LearningHeiko Paulheim
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataOlaf Hartig
 
Applying Linked Open Data to Public Procurement
Applying Linked Open Data to Public ProcurementApplying Linked Open Data to Public Procurement
Applying Linked Open Data to Public ProcurementJindřich Mynarz
 
Exploiting the query structure for efficient join ordering in SPARQL queries
Exploiting the query structure for efficient join ordering in SPARQL queriesExploiting the query structure for efficient join ordering in SPARQL queries
Exploiting the query structure for efficient join ordering in SPARQL queriesLuiz Henrique Zambom Santana
 
Automatic Term Ambiguity Detection
Automatic Term Ambiguity DetectionAutomatic Term Ambiguity Detection
Automatic Term Ambiguity DetectionYunyao Li
 
Exploring Linked Data content through network analysis
Exploring Linked Data content through network analysisExploring Linked Data content through network analysis
Exploring Linked Data content through network analysisChristophe Guéret
 
Linked Data: What’s the Story?
Linked Data: What’s the Story?Linked Data: What’s the Story?
Linked Data: What’s the Story?WiLS
 
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific VocabularyA Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific VocabularyTimm Heuss
 
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...Heiko Paulheim
 
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
A Provenance assisted Roadmap for Life Sciences Linked Open Data CloudA Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
A Provenance assisted Roadmap for Life Sciences Linked Open Data CloudSyed Muhammad Ali Hasnain
 

En vedette (20)

DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of DataDBpedia: A Public Data Infrastructure for the Web of Data
DBpedia: A Public Data Infrastructure for the Web of Data
 
DBpedia InsideOut
DBpedia InsideOutDBpedia InsideOut
DBpedia InsideOut
 
NLP todo
NLP todoNLP todo
NLP todo
 
Linked Data Fragments
Linked Data FragmentsLinked Data Fragments
Linked Data Fragments
 
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
Evaluating Named Entity Recognition and Disambiguation in News and TweetsEvaluating Named Entity Recognition and Disambiguation in News and Tweets
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
 
Gathering Alternative Surface Forms for DBpedia Entities
Gathering Alternative Surface Forms for DBpedia EntitiesGathering Alternative Surface Forms for DBpedia Entities
Gathering Alternative Surface Forms for DBpedia Entities
 
Federated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of DataFederated SPARQL query processing over the Web of Data
Federated SPARQL query processing over the Web of Data
 
Fast Approximate A-box Consistency Checking using Machine Learning
Fast Approximate  A-box Consistency Checking using Machine LearningFast Approximate  A-box Consistency Checking using Machine Learning
Fast Approximate A-box Consistency Checking using Machine Learning
 
LDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked DataLDQL: A Query Language for the Web of Linked Data
LDQL: A Query Language for the Web of Linked Data
 
Applying Linked Open Data to Public Procurement
Applying Linked Open Data to Public ProcurementApplying Linked Open Data to Public Procurement
Applying Linked Open Data to Public Procurement
 
Exploiting the query structure for efficient join ordering in SPARQL queries
Exploiting the query structure for efficient join ordering in SPARQL queriesExploiting the query structure for efficient join ordering in SPARQL queries
Exploiting the query structure for efficient join ordering in SPARQL queries
 
Automatic Term Ambiguity Detection
Automatic Term Ambiguity DetectionAutomatic Term Ambiguity Detection
Automatic Term Ambiguity Detection
 
Exploring Linked Data content through network analysis
Exploring Linked Data content through network analysisExploring Linked Data content through network analysis
Exploring Linked Data content through network analysis
 
Entity Search Engine
Entity Search Engine Entity Search Engine
Entity Search Engine
 
Linked Data: What’s the Story?
Linked Data: What’s the Story?Linked Data: What’s the Story?
Linked Data: What’s the Story?
 
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific VocabularyA Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
 
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
 
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
A Provenance assisted Roadmap for Life Sciences Linked Open Data CloudA Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
 

Similaire à A Vague Sense Classifier for Detecting Vague Definitions in Ontologies

How many truths can you handle?
How many truths can you handle?How many truths can you handle?
How many truths can you handle?Panos Alexopoulos
 
PSY 540 Short Presentation Guidelines and Rubric Overvi.docx
PSY 540 Short Presentation Guidelines and Rubric  Overvi.docxPSY 540 Short Presentation Guidelines and Rubric  Overvi.docx
PSY 540 Short Presentation Guidelines and Rubric Overvi.docxpotmanandrea
 
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)Gianluca Tarasconi
 
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...Dr. Amarjeet Singh
 
Discriminant Analysis.pptx
Discriminant Analysis.pptxDiscriminant Analysis.pptx
Discriminant Analysis.pptxGedaSheko
 
1) A cyber crime is a crime that involves a computer and the Inter.docx
1) A cyber crime is a crime that involves a computer and the Inter.docx1) A cyber crime is a crime that involves a computer and the Inter.docx
1) A cyber crime is a crime that involves a computer and the Inter.docxSONU61709
 
Fore FAIR ISMB 2019
Fore FAIR ISMB 2019Fore FAIR ISMB 2019
Fore FAIR ISMB 2019Ian Fore
 
How Did I Miss That Bug? Managing Cognitive Bias in Testing
How Did I Miss That Bug? Managing Cognitive Bias in TestingHow Did I Miss That Bug? Managing Cognitive Bias in Testing
How Did I Miss That Bug? Managing Cognitive Bias in TestingTechWell
 
Primary Printable Paper. Online assignment writing service.
Primary Printable Paper. Online assignment writing service.Primary Printable Paper. Online assignment writing service.
Primary Printable Paper. Online assignment writing service.Kara Webber
 
Mann core study
Mann core studyMann core study
Mann core studyMrOakes
 
Validity and Reliability of the Research Instrument; How to Test the Validati...
Validity and Reliability of the Research Instrument; How to Test the Validati...Validity and Reliability of the Research Instrument; How to Test the Validati...
Validity and Reliability of the Research Instrument; How to Test the Validati...Hamed Taherdoost
 
Class Delivery Final.pptx
Class Delivery Final.pptxClass Delivery Final.pptx
Class Delivery Final.pptxMadan Gowda
 
Analyzing Qualitative Data for_ Research
Analyzing Qualitative Data for_ ResearchAnalyzing Qualitative Data for_ Research
Analyzing Qualitative Data for_ ResearchNirmalPoudel4
 
Unit2 studyguide302
Unit2 studyguide302Unit2 studyguide302
Unit2 studyguide302tashillary
 
Study design2 6_07
Study design2 6_07Study design2 6_07
Study design2 6_07Dan Fisher
 
CHARACTERISTICS-OF-RESEARCH.pptx
CHARACTERISTICS-OF-RESEARCH.pptxCHARACTERISTICS-OF-RESEARCH.pptx
CHARACTERISTICS-OF-RESEARCH.pptxCrisonMagadan2
 

Similaire à A Vague Sense Classifier for Detecting Vague Definitions in Ontologies (20)

How many truths can you handle?
How many truths can you handle?How many truths can you handle?
How many truths can you handle?
 
PSY 540 Short Presentation Guidelines and Rubric Overvi.docx
PSY 540 Short Presentation Guidelines and Rubric  Overvi.docxPSY 540 Short Presentation Guidelines and Rubric  Overvi.docx
PSY 540 Short Presentation Guidelines and Rubric Overvi.docx
 
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
 
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
 
Discriminant Analysis.pptx
Discriminant Analysis.pptxDiscriminant Analysis.pptx
Discriminant Analysis.pptx
 
1) A cyber crime is a crime that involves a computer and the Inter.docx
1) A cyber crime is a crime that involves a computer and the Inter.docx1) A cyber crime is a crime that involves a computer and the Inter.docx
1) A cyber crime is a crime that involves a computer and the Inter.docx
 
Chap008
Chap008Chap008
Chap008
 
Fore FAIR ISMB 2019
Fore FAIR ISMB 2019Fore FAIR ISMB 2019
Fore FAIR ISMB 2019
 
Human Assessment of Ontologies
Human Assessment of OntologiesHuman Assessment of Ontologies
Human Assessment of Ontologies
 
How Did I Miss That Bug? Managing Cognitive Bias in Testing
How Did I Miss That Bug? Managing Cognitive Bias in TestingHow Did I Miss That Bug? Managing Cognitive Bias in Testing
How Did I Miss That Bug? Managing Cognitive Bias in Testing
 
Primary Printable Paper. Online assignment writing service.
Primary Printable Paper. Online assignment writing service.Primary Printable Paper. Online assignment writing service.
Primary Printable Paper. Online assignment writing service.
 
Mann core study
Mann core studyMann core study
Mann core study
 
Validity and Reliability of the Research Instrument; How to Test the Validati...
Validity and Reliability of the Research Instrument; How to Test the Validati...Validity and Reliability of the Research Instrument; How to Test the Validati...
Validity and Reliability of the Research Instrument; How to Test the Validati...
 
Class Delivery Final.pptx
Class Delivery Final.pptxClass Delivery Final.pptx
Class Delivery Final.pptx
 
Analyzing Qualitative Data for_ Research
Analyzing Qualitative Data for_ ResearchAnalyzing Qualitative Data for_ Research
Analyzing Qualitative Data for_ Research
 
Unit2 studyguide302
Unit2 studyguide302Unit2 studyguide302
Unit2 studyguide302
 
Study design2 6_07
Study design2 6_07Study design2 6_07
Study design2 6_07
 
Identification of Research Problem
Identification of Research ProblemIdentification of Research Problem
Identification of Research Problem
 
Qualitative data analysis
Qualitative data analysisQualitative data analysis
Qualitative data analysis
 
CHARACTERISTICS-OF-RESEARCH.pptx
CHARACTERISTICS-OF-RESEARCH.pptxCHARACTERISTICS-OF-RESEARCH.pptx
CHARACTERISTICS-OF-RESEARCH.pptx
 

Dernier

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 

Dernier (20)

The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 

A Vague Sense Classifier for Detecting Vague Definitions in Ontologies

  • 1. A Vague Sense Classifier for Detecting Vague Definitions in Ontologies Panos Alexopoulos, John Pavlopoulos 14th Conference of the European Chapter of the Association for Computational Linguistics Gothenburg, Sweden, 26–30 April 2014
  • 2. 2 Vagueness Introduction ●Vagueness is a semantic phenomenon where predicates admit borderline cases, i.e. cases where it is not determinately true that the predicate applies or not (Shapiro 2006). ●This happens when predicates have blurred boundaries: ● What’s the threshold number of years separating old and not old films? ● What are the exact criteria that distinguish modern restaurants from non-modern?
  • 3. 3 Vagueness Consequences Introduction ●The problem with vague terms in semantic data is the possibility of disagreements! ●E.g., when we asked domain experts to provide instances of the concept Critical Business Process, there were certain processes for which there was a dispute among them about whether they should be regarded as critical or not. ●The problem was that different experts had different criteria of process criticality and could not decide which of these were sufficient to classify a process as critical.
  • 4. 4 Problematic Scenarios Introduction 1. Structuring Data with a Vague Ontology: Possible disagreement among experts when defining class and relation instances. 2. Utilizing Vague Facts in Ontology-Based Systems: Reasoning results might not meet users’ expectations 3. Integrating Vague Semantic Information: The merging of particular vague elements can lead to data that will not be valid for all its users.
  • 5. 5 Problem Definition & Approach Automatic Vagueness Detection ●Can we automatically determine whether an ontology entity (class, relation etc.) is vague or not? ● “StrategicClient” as “A client that has a high value for the company” is vague! ● “AmericanCompany” as “A company that has legal status in the Unites States” is not! Problem Definition ●We train a binary classifier that may distinguish between vague and non-vague term word senses. ●Training is supervised, using examples from Wordnet. ●We use this classifier to determine whether a given ontology element definition is vague or not. Approach
  • 6. 6 Data Automatic Vagueness Detection ●2,000 adjective senses from WordNet. ● 1,000 vague ● 1,000 non-vague ●Inter-agreement of vague/non-vague annotation among 3 human judges was 0.64 (Cohen’s Kappa) Vague Senses Non Vague Senses • Abnormal: not normal, not typical or usual or regularor conforming to a norm • Compound: composed of more than one part • Impenitent: impervious to moral persuasion • Biweekly: occurring every two weeks. • Notorious: known widely and usually unfavorably • Irregular: falling below the manufacturer's standard • Aroused: emotionally aroused • Outermost: situated at the farthest possible point from a center.
  • 7. 7 Training and Evaluation Automatic Vagueness Detection ●80% of the data used to train a multinomial Naive Bayes classifier. ●We removed stop words and we used the bag of words assumption to represent each instance. ●The remaining 20% of the data was used as a test set. ●Classification accuracy was 84%!
  • 8. 8 Comparison with Subjectivity Analyzer Automatic Vagueness Detection ●We also used a subjective sense classifier to classify our dataset’s senses as subjective or objective. ●From the 1000 vague senses, only 167 were classified as subjective while from the 1000 non-vague ones 993. ●This shows that treating vagueness in the same way as subjectiveness is not really effective.
  • 9. 9 Use Case: Detecting Vagueness in CiTO Ontology Automatic Vagueness Detection ●As an ontology use case we considered CiTO, an ontology that enables characterization of the nature or type of citations. ●CiTO consists primarily of relations, many of which are vague (e.g. plagiarizes). ●We selected 44 relations and we had 3 human judges manually classify them as vague or not. ●Then we applied our Wordnet-trained vagueness classifier on the textual definitions of the same relations.
  • 10. 10 Use Case: Detecting Vagueness in CiTO Ontology Automatic Vagueness Detection Vague Relations Non Vague Relations • plagiarizes: A property indicating that the author of the citing entity plagiarizes the cited entity, by including textual or other elements from the cited entity without formal acknowledgement of their source • sharesAuthorInstitutionWith: Each entity has at least one author that shares a common institutional affiliation with an author of the other entity • citesAsAuthority: The citing entity cites the cited entity as one that provides an authoritative description or definition of the subject under discussion. • providesDataFor: The cited entity presents data that are used in work described in the citing entity.
  • 11. 11 Use Case: Detecting Vagueness in CiTO Ontology Automatic Vagueness Detection ●Classification Results: ● 82% of relations were correctly classified as vague/non-vague ● 94% accuracy for non-vague relations. ● 74% accuracy for vague relations. ●Again, we classified the same relations with the subjectivity classifier: ● 40% of vague/non-vague relations were classified as subjective/objective respectively. ● 94% of non-vague were classified as objective. ● 7% of vague relations were classified as subjective.
  • 12. 12 Future Work Vagueness-Aware Semantic Data ●Incorporate the current classifier into an ontology analysis tool ●Improve the classifier by contemplating new features ●See whether it is possible to build a vague sense lexicon.
  • 13. 13 Questions? Thank you! iSOCO Madrid Av. del Partenón, 16-18, 1º7ª Campo de las Naciones 28042 Madrid España (t) +34 913 349 797 iSOCO Pamplona Parque Tomás Caballero, 2, 6º4ª 31006 Pamplona España (t) +34 948 102 408 iSOCO Valencia C/ Prof. Beltrán Báguena, 4 Oficina 107 46009 Valencia España (t) +34 963 467 143 iSOCO Barcelona Av. Torre Blanca, 57 Edificio ESADE CREAPOLIS Oficina 3C 15 08172 Sant Cugat del Vallès Barcelona, España (t) +34 935 677 200 iSOCO Colombia Complejo Ruta N Calle 67, 52-20 Piso 3, Torre A Medellín Colombia (t) +57 516 7770 ext. 1132 Key Vendor Virtual Assistant 2013 Quieres innovar? Dr. Panos Alexopoulos Semantic Applications Research Manager palexopoulos@isoco.com (t) +34 913 349 797