SlideShare a Scribd company logo
1 of 29
Download to read offline
Bringing Math to LOD:
A Semantic Publishing Platform Prototype for
Scientific Collections in Mathematics
Olga Nevzorova, Nikita Zhiltsov, Danila Zaikin, Olga Zhibrik,
Alexander Kirillovich, Vladimir Nevzorov, Evgeniy Birialtsev
Kazan Federal University
Russia
October 23, 2013

1 / 29
Outline

1 Introduction
2 Approach
3 Use Cases

2 / 29
Our Contribution
Our prototype is geared to build a semantic graph of
mathematical knowledge objects, that
is extracted from a collection of mathematical
scholarly papers, and
is integrated into the LOD «cloud»

3 / 29
Research Output
IVM Data Set

LOD representation of 1 330 scholarly publications of
the «Izvestiya Vuzov. Matematika» (IVM) journal
Covers the semantics of:
article metadata
elements of the logical structure
terminology
formulas

Aligned with DBpedia, CORDIS
More than 850 000 RDF triples
SPARQL endpoint:
http://cll.niimm.ksu.ru:8890/sparql-auth∗
∗

the SPARQL endpoint is secured. Please email the authors for credentials
4 / 29
Related Work

Domain-specific languages: OMDoc, MathLang
Domain models: Cambridge Mathematical
Thesaurus, DBpedia (math-related part),
ScienceWISE Ontology
Math-related NLP: mArachna; linguistic modules of
arXMLiv

5 / 29
Outline

1 Introduction
2 Approach
3 Use Cases

6 / 29
Key Research Contributions
a thorough ontological model of the mathematical
domain
an ontology-based language-independent method for
extraction of logical structure elements in papers
an ontology-based method for extraction of
mathematical named entities from texts in Russian
a method that connects mathematical named entities
to symbolic expressions

7 / 29
Prototype’s Design

8 / 29
Domain Model

9 / 29
Ontology of Structural Elements (1)
http://cll.niimm.ksu.ru/ontologies/mocassin

Covers 15 common structural elements:

Defines 9 object properties and 4 datatype properties:

10 / 29
Ontology of Structural Elements (2)
http://cll.niimm.ksu.ru/ontologies/mocassin

3 cardinality axioms, e.g.
Proof ∧ (= 1 proves ProvableStatement† )
2 transitivity axioms for hasPart and dependsOn
properties
DL expressivity: SRIN (D)

†

i.e., Claim ∨ Corollary ∨ Lemma ∨ Proposition ∨ Theorem
11 / 29
Ontology of Mathematical Concepts (1)
http://cll.niimm.ksu.ru/ontologies/mathematics

Covers 3 450 mathematical concepts
Defines commonly used terms as well as terms from
the emerging professional vocabulary (e.g.
Bitsadze-Samarsky problem)
Supports Russian/English labels

12 / 29
Ontology of Mathematical Concepts (2)
http://cll.niimm.ksu.ru/ontologies/mathematics

Includes two taxonomies:
taxonomy of mathematical theories‡ :
number theory, set theory, algebra, analysis, geometry,
mathematical logic, discrete mathematics, theory of
computation, differential equations, numerical analysis,
probability theory and statistics

taxonomy of mathematical objects

Covers common scientific concepts, such as Problem,
Method, Statement, Formula etc.
DL expressivity: ALCHI
‡

covers just a part of the mathematical knowledge
13 / 29
Ontology of Mathematical Concepts (3)
Object properties

belongsTo/contains, e.g.
Barycentric Coordinates belongsTo Metric Geometry
defines/isDefinedBy, e.g.
Christoffel Symbol isDefinedBy Connectedness
seeAlso, e.g.
Chebyshev Iterative Method seeAlso Numerical Solution of
Linear Equation Systems

14 / 29
Ontology of Mathematical Concepts (4)
Stats

3 450 classes
27% of classes are mapped onto DBpedia
3 630 subclass-of property instances
1 140 other object property instances
Common facts about the development:
lasted for 4 months
7 pro mathematicians participated as domain experts
guided by the authors
WebProtege was used as a collaborative tool

15 / 29
Semantic Annotation

16 / 29
NLP Annotation
Relies on the OntoIntegrator facilities
Solves some of the conventional linguistic tasks, such
as:
tokenization
sentence splitting (∼ 98% F-measure§ )
morphological analysis
NP extraction (88% precision)

Special handling of math symbols, abbreviations, and
math expressions as parts of NPs
Currently supports only Russian language
§

the metrics were evaluated on real math texts with the help of
domain experts
17 / 29
Mining the Logical Structure
Supports our ontology of structural elements:
elements in real texts are instances of the ontology classes
Recognizing types of structural elements:
A string similarity based method gives 89%-100%
F-measure depending on the class
Recognizing semantic relations between them:
A decision tree learner gives 61%-95% F-measure
depending on the relation

18 / 29
Mathematical Named Entity Extraction

Supports our ontology of mathematical concepts:
assigned NPs are instances of the ontology classes
Our method employs annotations of the NP structure
and Jaccard similarity
The method gives 86% F-measure with parameters
focusing on precision/recall trade-off

19 / 29
Connecting Named Entities to Formulas

20 / 29
Connecting Named Entities to Formulas
Parsing mathematical expressions
Detection of variables
Proximity-based matching of mathematical variables
with noun phrases at 68% accuracy

21 / 29
Other supported features

22 / 29
Other supported features

Article metadata extraction (title, author names,
publication year etc.) according to AKT Portal
schema
Semi-manual interlinking¶ with existing LOD data
sets: DBpedia, CORDIS
Publishing the extracted data as an LOD-compliant
RDF data set

¶

by leveraging the Silk app
23 / 29
Outline

1 Introduction
2 Approach
3 Use Cases

24 / 29
Finding DBpedia Entities in Mathematical Formulas
http://cll.niimm.ksu.ru/iswc-demo
1

2

25 / 29
Semantic Search of Theoretical Findings
Finding articles with theorems about finite groups

PREFIX moc: <http://cll.niimm.ksu.ru/ontologies/mocassin#>
PREFIX math: <http://cll.niimm.ksu.ru/ontologies/mathematics#>
SELECT ?article WHERE {
?article moc:hasSegment ?theorem .
?theorem moc:mentions ?entity; a moc:Theorem .
?entity a math:E2183
}

26 / 29
Conclusion
We have developed a holistic approach for mining
LOD representation of scholarly papers in
mathematics
We applied the prototype to a collection of over
1 300 real math papers
We conducted a thorough evaluation of the proposed
methods with the help of domain experts
We provided several use cases to illustrate the utility
of the published data

27 / 29
Future Work

Integrating all the modules into a full-fledged toolkit
Add support of English to the NLP module
Extend our approach to texts on other natural
science domains

28 / 29
Thanks for your attention!
Questions?

29 / 29

More Related Content

What's hot

Ontology-based Data Integration
Ontology-based Data IntegrationOntology-based Data Integration
Ontology-based Data IntegrationJanna Hastings
 
10. Search Tree - Data Structures using C++ by Varsha Patil
10. Search Tree - Data Structures using C++ by Varsha Patil10. Search Tree - Data Structures using C++ by Varsha Patil
10. Search Tree - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
16. Algo analysis & Design - Data Structures using C++ by Varsha Patil
16. Algo analysis & Design - Data Structures using C++ by Varsha Patil16. Algo analysis & Design - Data Structures using C++ by Varsha Patil
16. Algo analysis & Design - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
1. Fundamental Concept - Data Structures using C++ by Varsha Patil
1. Fundamental Concept - Data Structures using C++ by Varsha Patil1. Fundamental Concept - Data Structures using C++ by Varsha Patil
1. Fundamental Concept - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
Discrete Mathematics
Discrete MathematicsDiscrete Mathematics
Discrete Mathematicsmetamath
 
Information retrieval 15 alternative algebraic models
Information retrieval 15 alternative algebraic modelsInformation retrieval 15 alternative algebraic models
Information retrieval 15 alternative algebraic modelsVaibhav Khanna
 
Stacks in algorithems & data structure
Stacks in algorithems & data structureStacks in algorithems & data structure
Stacks in algorithems & data structurefaran nawaz
 
4. Recursion - Data Structures using C++ by Varsha Patil
4. Recursion - Data Structures using C++ by Varsha Patil4. Recursion - Data Structures using C++ by Varsha Patil
4. Recursion - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mappingbutest
 
Data Structures and Algorithm - Week 4 - Trees, Binary Trees
Data Structures and Algorithm - Week 4 - Trees, Binary TreesData Structures and Algorithm - Week 4 - Trees, Binary Trees
Data Structures and Algorithm - Week 4 - Trees, Binary TreesFerdin Joe John Joseph PhD
 
Data Structure
Data StructureData Structure
Data Structuresheraz1
 
A hierarchical approach for semi structured document indexing and
A hierarchical approach for semi structured document indexing andA hierarchical approach for semi structured document indexing and
A hierarchical approach for semi structured document indexing andIbrahim Bounhas
 
9. Searching & Sorting - Data Structures using C++ by Varsha Patil
9. Searching & Sorting - Data Structures using C++ by Varsha Patil9. Searching & Sorting - Data Structures using C++ by Varsha Patil
9. Searching & Sorting - Data Structures using C++ by Varsha Patilwidespreadpromotion
 
Combining text and pattern preprocessing in an adaptive dna pattern matcher
Combining text and pattern preprocessing in an adaptive dna pattern matcherCombining text and pattern preprocessing in an adaptive dna pattern matcher
Combining text and pattern preprocessing in an adaptive dna pattern matcherIAEME Publication
 

What's hot (20)

Ontology-based Data Integration
Ontology-based Data IntegrationOntology-based Data Integration
Ontology-based Data Integration
 
10. Search Tree - Data Structures using C++ by Varsha Patil
10. Search Tree - Data Structures using C++ by Varsha Patil10. Search Tree - Data Structures using C++ by Varsha Patil
10. Search Tree - Data Structures using C++ by Varsha Patil
 
16. Algo analysis & Design - Data Structures using C++ by Varsha Patil
16. Algo analysis & Design - Data Structures using C++ by Varsha Patil16. Algo analysis & Design - Data Structures using C++ by Varsha Patil
16. Algo analysis & Design - Data Structures using C++ by Varsha Patil
 
1. Fundamental Concept - Data Structures using C++ by Varsha Patil
1. Fundamental Concept - Data Structures using C++ by Varsha Patil1. Fundamental Concept - Data Structures using C++ by Varsha Patil
1. Fundamental Concept - Data Structures using C++ by Varsha Patil
 
3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil3. Stack - Data Structures using C++ by Varsha Patil
3. Stack - Data Structures using C++ by Varsha Patil
 
5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil5. Queue - Data Structures using C++ by Varsha Patil
5. Queue - Data Structures using C++ by Varsha Patil
 
Discrete Mathematics
Discrete MathematicsDiscrete Mathematics
Discrete Mathematics
 
Information retrieval 15 alternative algebraic models
Information retrieval 15 alternative algebraic modelsInformation retrieval 15 alternative algebraic models
Information retrieval 15 alternative algebraic models
 
Week 2 - Data Structures and Algorithms
Week 2 - Data Structures and AlgorithmsWeek 2 - Data Structures and Algorithms
Week 2 - Data Structures and Algorithms
 
Stacks in algorithems & data structure
Stacks in algorithems & data structureStacks in algorithems & data structure
Stacks in algorithems & data structure
 
4. Recursion - Data Structures using C++ by Varsha Patil
4. Recursion - Data Structures using C++ by Varsha Patil4. Recursion - Data Structures using C++ by Varsha Patil
4. Recursion - Data Structures using C++ by Varsha Patil
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Data Structures and Algorithm - Week 4 - Trees, Binary Trees
Data Structures and Algorithm - Week 4 - Trees, Binary TreesData Structures and Algorithm - Week 4 - Trees, Binary Trees
Data Structures and Algorithm - Week 4 - Trees, Binary Trees
 
Data Structure
Data StructureData Structure
Data Structure
 
Week 1 - Data Structures and Algorithms
Week 1 - Data Structures and AlgorithmsWeek 1 - Data Structures and Algorithms
Week 1 - Data Structures and Algorithms
 
A hierarchical approach for semi structured document indexing and
A hierarchical approach for semi structured document indexing andA hierarchical approach for semi structured document indexing and
A hierarchical approach for semi structured document indexing and
 
Persistent Search Trees
Persistent Search TreesPersistent Search Trees
Persistent Search Trees
 
9. Searching & Sorting - Data Structures using C++ by Varsha Patil
9. Searching & Sorting - Data Structures using C++ by Varsha Patil9. Searching & Sorting - Data Structures using C++ by Varsha Patil
9. Searching & Sorting - Data Structures using C++ by Varsha Patil
 
Learning ontologies
Learning ontologiesLearning ontologies
Learning ontologies
 
Combining text and pattern preprocessing in an adaptive dna pattern matcher
Combining text and pattern preprocessing in an adaptive dna pattern matcherCombining text and pattern preprocessing in an adaptive dna pattern matcher
Combining text and pattern preprocessing in an adaptive dna pattern matcher
 

Viewers also liked (8)

Word Meaning (Semantics, Semantic Features and Prototype)
Word Meaning (Semantics, Semantic Features and Prototype)Word Meaning (Semantics, Semantic Features and Prototype)
Word Meaning (Semantics, Semantic Features and Prototype)
 
Semantics ppt
Semantics  pptSemantics  ppt
Semantics ppt
 
Semantic roles and semantic features
Semantic roles and semantic featuresSemantic roles and semantic features
Semantic roles and semantic features
 
Semantics
SemanticsSemantics
Semantics
 
Discourse analysis
Discourse analysisDiscourse analysis
Discourse analysis
 
Semantics
SemanticsSemantics
Semantics
 
Semantic Roles
Semantic RolesSemantic Roles
Semantic Roles
 
Lexical relations
Lexical relationsLexical relations
Lexical relations
 

Similar to Bringing Math to LOD

Logical Structure Analysis of Scientific Publications in Mathematics
Logical Structure Analysis of Scientific Publications in MathematicsLogical Structure Analysis of Scientific Publications in Mathematics
Logical Structure Analysis of Scientific Publications in MathematicsNikita Zhiltsov
 
Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Marcia Zeng
 
Semantic Transforms Using Collaborative Knowledge Bases
Semantic Transforms Using Collaborative Knowledge BasesSemantic Transforms Using Collaborative Knowledge Bases
Semantic Transforms Using Collaborative Knowledge BasesYegin Genc
 
MIT302 Lesson 2_Advanced Database Systems.pptx
MIT302 Lesson 2_Advanced Database Systems.pptxMIT302 Lesson 2_Advanced Database Systems.pptx
MIT302 Lesson 2_Advanced Database Systems.pptxelsagalgao
 
Probabilistic models (part 1)
Probabilistic models (part 1)Probabilistic models (part 1)
Probabilistic models (part 1)KU Leuven
 
HAN_XU_ICDMW2014
HAN_XU_ICDMW2014HAN_XU_ICDMW2014
HAN_XU_ICDMW2014Han Xu, PhD
 
Bringing Mathematics To the Web of Data: the Case of the Mathematics Subject ...
Bringing Mathematics To the Web of Data: the Case of the Mathematics Subject ...Bringing Mathematics To the Web of Data: the Case of the Mathematics Subject ...
Bringing Mathematics To the Web of Data: the Case of the Mathematics Subject ...Christoph Lange
 
Semantic Annotation of Documents
Semantic Annotation of DocumentsSemantic Annotation of Documents
Semantic Annotation of Documentssubash chandra
 
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...ijnlc
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
 
Ontology engineering of automatic text processing methods
Ontology engineering of automatic text processing methodsOntology engineering of automatic text processing methods
Ontology engineering of automatic text processing methodsIJECEIAES
 
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...kevig
 
A Text Mining Research Based on LDA Topic Modelling
A Text Mining Research Based on LDA Topic ModellingA Text Mining Research Based on LDA Topic Modelling
A Text Mining Research Based on LDA Topic Modellingcsandit
 
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLINGA TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLINGcscpconf
 
B.Tech 2nd Year CSE & CSIT AICTE Model Curriculum 2019-20.pdf
B.Tech 2nd Year CSE & CSIT AICTE Model Curriculum 2019-20.pdfB.Tech 2nd Year CSE & CSIT AICTE Model Curriculum 2019-20.pdf
B.Tech 2nd Year CSE & CSIT AICTE Model Curriculum 2019-20.pdfAnita Pal
 
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOSSKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOSMarcia Zeng
 
Towards a Query Rewriting Algorithm Over Proteomics XML Resources
Towards a Query Rewriting Algorithm Over Proteomics XML ResourcesTowards a Query Rewriting Algorithm Over Proteomics XML Resources
Towards a Query Rewriting Algorithm Over Proteomics XML ResourcesCSCJournals
 

Similar to Bringing Math to LOD (20)

Logical Structure Analysis of Scientific Publications in Mathematics
Logical Structure Analysis of Scientific Publications in MathematicsLogical Structure Analysis of Scientific Publications in Mathematics
Logical Structure Analysis of Scientific Publications in Mathematics
 
Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]Linking KOS Data [using SKOS and OWL2]
Linking KOS Data [using SKOS and OWL2]
 
Semantic Transforms Using Collaborative Knowledge Bases
Semantic Transforms Using Collaborative Knowledge BasesSemantic Transforms Using Collaborative Knowledge Bases
Semantic Transforms Using Collaborative Knowledge Bases
 
MIT302 Lesson 2_Advanced Database Systems.pptx
MIT302 Lesson 2_Advanced Database Systems.pptxMIT302 Lesson 2_Advanced Database Systems.pptx
MIT302 Lesson 2_Advanced Database Systems.pptx
 
Probabilistic models (part 1)
Probabilistic models (part 1)Probabilistic models (part 1)
Probabilistic models (part 1)
 
HAN_XU_ICDMW2014
HAN_XU_ICDMW2014HAN_XU_ICDMW2014
HAN_XU_ICDMW2014
 
Bringing Mathematics To the Web of Data: the Case of the Mathematics Subject ...
Bringing Mathematics To the Web of Data: the Case of the Mathematics Subject ...Bringing Mathematics To the Web of Data: the Case of the Mathematics Subject ...
Bringing Mathematics To the Web of Data: the Case of the Mathematics Subject ...
 
Semantic Annotation of Documents
Semantic Annotation of DocumentsSemantic Annotation of Documents
Semantic Annotation of Documents
 
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
Ontology engineering of automatic text processing methods
Ontology engineering of automatic text processing methodsOntology engineering of automatic text processing methods
Ontology engineering of automatic text processing methods
 
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
 
Classification and Clustering of arXiv Documents, Sections, and Abstracts, Co...
Classification and Clustering of arXiv Documents, Sections, and Abstracts, Co...Classification and Clustering of arXiv Documents, Sections, and Abstracts, Co...
Classification and Clustering of arXiv Documents, Sections, and Abstracts, Co...
 
A Text Mining Research Based on LDA Topic Modelling
A Text Mining Research Based on LDA Topic ModellingA Text Mining Research Based on LDA Topic Modelling
A Text Mining Research Based on LDA Topic Modelling
 
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLINGA TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
 
What's in a textbook
What's in a textbookWhat's in a textbook
What's in a textbook
 
B.Tech 2nd Year CSE & CSIT AICTE Model Curriculum 2019-20.pdf
B.Tech 2nd Year CSE & CSIT AICTE Model Curriculum 2019-20.pdfB.Tech 2nd Year CSE & CSIT AICTE Model Curriculum 2019-20.pdf
B.Tech 2nd Year CSE & CSIT AICTE Model Curriculum 2019-20.pdf
 
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOSSKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
SKOS and Its Application in Transferring Traditional Thesauri into Networked KOS
 
Towards a Query Rewriting Algorithm Over Proteomics XML Resources
Towards a Query Rewriting Algorithm Over Proteomics XML ResourcesTowards a Query Rewriting Algorithm Over Proteomics XML Resources
Towards a Query Rewriting Algorithm Over Proteomics XML Resources
 
B046021319
B046021319B046021319
B046021319
 

Recently uploaded

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 

Bringing Math to LOD

  • 1. Bringing Math to LOD: A Semantic Publishing Platform Prototype for Scientific Collections in Mathematics Olga Nevzorova, Nikita Zhiltsov, Danila Zaikin, Olga Zhibrik, Alexander Kirillovich, Vladimir Nevzorov, Evgeniy Birialtsev Kazan Federal University Russia October 23, 2013 1 / 29
  • 3. Our Contribution Our prototype is geared to build a semantic graph of mathematical knowledge objects, that is extracted from a collection of mathematical scholarly papers, and is integrated into the LOD «cloud» 3 / 29
  • 4. Research Output IVM Data Set LOD representation of 1 330 scholarly publications of the «Izvestiya Vuzov. Matematika» (IVM) journal Covers the semantics of: article metadata elements of the logical structure terminology formulas Aligned with DBpedia, CORDIS More than 850 000 RDF triples SPARQL endpoint: http://cll.niimm.ksu.ru:8890/sparql-auth∗ ∗ the SPARQL endpoint is secured. Please email the authors for credentials 4 / 29
  • 5. Related Work Domain-specific languages: OMDoc, MathLang Domain models: Cambridge Mathematical Thesaurus, DBpedia (math-related part), ScienceWISE Ontology Math-related NLP: mArachna; linguistic modules of arXMLiv 5 / 29
  • 7. Key Research Contributions a thorough ontological model of the mathematical domain an ontology-based language-independent method for extraction of logical structure elements in papers an ontology-based method for extraction of mathematical named entities from texts in Russian a method that connects mathematical named entities to symbolic expressions 7 / 29
  • 10. Ontology of Structural Elements (1) http://cll.niimm.ksu.ru/ontologies/mocassin Covers 15 common structural elements: Defines 9 object properties and 4 datatype properties: 10 / 29
  • 11. Ontology of Structural Elements (2) http://cll.niimm.ksu.ru/ontologies/mocassin 3 cardinality axioms, e.g. Proof ∧ (= 1 proves ProvableStatement† ) 2 transitivity axioms for hasPart and dependsOn properties DL expressivity: SRIN (D) † i.e., Claim ∨ Corollary ∨ Lemma ∨ Proposition ∨ Theorem 11 / 29
  • 12. Ontology of Mathematical Concepts (1) http://cll.niimm.ksu.ru/ontologies/mathematics Covers 3 450 mathematical concepts Defines commonly used terms as well as terms from the emerging professional vocabulary (e.g. Bitsadze-Samarsky problem) Supports Russian/English labels 12 / 29
  • 13. Ontology of Mathematical Concepts (2) http://cll.niimm.ksu.ru/ontologies/mathematics Includes two taxonomies: taxonomy of mathematical theories‡ : number theory, set theory, algebra, analysis, geometry, mathematical logic, discrete mathematics, theory of computation, differential equations, numerical analysis, probability theory and statistics taxonomy of mathematical objects Covers common scientific concepts, such as Problem, Method, Statement, Formula etc. DL expressivity: ALCHI ‡ covers just a part of the mathematical knowledge 13 / 29
  • 14. Ontology of Mathematical Concepts (3) Object properties belongsTo/contains, e.g. Barycentric Coordinates belongsTo Metric Geometry defines/isDefinedBy, e.g. Christoffel Symbol isDefinedBy Connectedness seeAlso, e.g. Chebyshev Iterative Method seeAlso Numerical Solution of Linear Equation Systems 14 / 29
  • 15. Ontology of Mathematical Concepts (4) Stats 3 450 classes 27% of classes are mapped onto DBpedia 3 630 subclass-of property instances 1 140 other object property instances Common facts about the development: lasted for 4 months 7 pro mathematicians participated as domain experts guided by the authors WebProtege was used as a collaborative tool 15 / 29
  • 17. NLP Annotation Relies on the OntoIntegrator facilities Solves some of the conventional linguistic tasks, such as: tokenization sentence splitting (∼ 98% F-measure§ ) morphological analysis NP extraction (88% precision) Special handling of math symbols, abbreviations, and math expressions as parts of NPs Currently supports only Russian language § the metrics were evaluated on real math texts with the help of domain experts 17 / 29
  • 18. Mining the Logical Structure Supports our ontology of structural elements: elements in real texts are instances of the ontology classes Recognizing types of structural elements: A string similarity based method gives 89%-100% F-measure depending on the class Recognizing semantic relations between them: A decision tree learner gives 61%-95% F-measure depending on the relation 18 / 29
  • 19. Mathematical Named Entity Extraction Supports our ontology of mathematical concepts: assigned NPs are instances of the ontology classes Our method employs annotations of the NP structure and Jaccard similarity The method gives 86% F-measure with parameters focusing on precision/recall trade-off 19 / 29
  • 20. Connecting Named Entities to Formulas 20 / 29
  • 21. Connecting Named Entities to Formulas Parsing mathematical expressions Detection of variables Proximity-based matching of mathematical variables with noun phrases at 68% accuracy 21 / 29
  • 23. Other supported features Article metadata extraction (title, author names, publication year etc.) according to AKT Portal schema Semi-manual interlinking¶ with existing LOD data sets: DBpedia, CORDIS Publishing the extracted data as an LOD-compliant RDF data set ¶ by leveraging the Silk app 23 / 29
  • 25. Finding DBpedia Entities in Mathematical Formulas http://cll.niimm.ksu.ru/iswc-demo 1 2 25 / 29
  • 26. Semantic Search of Theoretical Findings Finding articles with theorems about finite groups PREFIX moc: <http://cll.niimm.ksu.ru/ontologies/mocassin#> PREFIX math: <http://cll.niimm.ksu.ru/ontologies/mathematics#> SELECT ?article WHERE { ?article moc:hasSegment ?theorem . ?theorem moc:mentions ?entity; a moc:Theorem . ?entity a math:E2183 } 26 / 29
  • 27. Conclusion We have developed a holistic approach for mining LOD representation of scholarly papers in mathematics We applied the prototype to a collection of over 1 300 real math papers We conducted a thorough evaluation of the proposed methods with the help of domain experts We provided several use cases to illustrate the utility of the published data 27 / 29
  • 28. Future Work Integrating all the modules into a full-fledged toolkit Add support of English to the NLP module Extend our approach to texts on other natural science domains 28 / 29
  • 29. Thanks for your attention! Questions? 29 / 29