SlideShare une entreprise Scribd logo
1  sur  38
Télécharger pour lire hors ligne
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Evaluating Mapping Repair Systems
with Large Biomedical Ontologies
Ernesto Jiménez-Ruiz Christian Meilicke
Bernardo Cuenca Grau Ian Horrocks
Department of Computer Science, University of Oxford
Research Group Data and Web Science, University of Mannheim, Germany
26th Description Logics Workshop
24 July 2013
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Outline
Preliminaries
Ontology Alignment Evaluation Initiative
Mapping repair with Alcomo
Mapping repair with LogMap-Repair
Evaluation
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Ontology mappings
Mappings M are tuples he1, e2, n, ρi
• e1, e2 are entities in the input ontologies O1 and O2
• n a confidence value between 0 and 1
• ρ is the semantic relationship between e1 and e2 (e.g.
subsumption, equivalence or disjointness)
Formalized as OWL 2 axioms
• Where the semantic relationship ρ is one of {≡, ⊑, ⊒, ⊥}
• Confidence values n are represented as axiom annotations
• No extra semantics
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Ontology mappings
Mappings M are tuples he1, e2, n, ρi
• e1, e2 are entities in the input ontologies O1 and O2
• n a confidence value between 0 and 1
• ρ is the semantic relationship between e1 and e2 (e.g.
subsumption, equivalence or disjointness)
Formalized as OWL 2 axioms
• Where the semantic relationship ρ is one of {≡, ⊑, ⊒, ⊥}
• Confidence values n are represented as axiom annotations
• No extra semantics
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Incoherent mappings
A set of Mappings M wrt O1 and O2 is incoherente if
• there exists a class A such that
• O1 ∪ O2 ∪ M |= A ⊑ ⊥, and
• O1 ∪ O2 6|= A ⊑ ⊥.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Example: Incoherent Mappings
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Mapping Repair
R is a repair for M wrt O1 and O2
• if M  R is coherent.
• R = M is a candiate repair.
R is a diagnosis for M wrt O1 and O2
• if R′ ⊂ R is not a repair for M.
• Diagnosis are (typically) expensive to obtain.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Mapping Repair
R is a repair for M wrt O1 and O2
• if M  R is coherent.
• R = M is a candiate repair.
R is a diagnosis for M wrt O1 and O2
• if R′ ⊂ R is not a repair for M.
• Diagnosis are (typically) expensive to obtain.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Computing Mapping Repairs
Standard justification-based ontology debugging
• are expensive since require complete reasoning.
• do not scale when the number of unsatisfiabilities is large.
An approximate Repair R≈
• will not guarantee that M  R≈ is coherent;
• but aims at reducing the number of unsatisfiabilities,
• while preserving M as much as possible.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Computing Mapping Repairs
Standard justification-based ontology debugging
• are expensive since require complete reasoning.
• do not scale when the number of unsatisfiabilities is large.
An approximate Repair R≈
• will not guarantee that M  R≈ is coherent;
• but aims at reducing the number of unsatisfiabilities,
• while preserving M as much as possible.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Outline
Preliminaries
Ontology Alignment Evaluation Initiative
Mapping repair with Alcomo
Mapping repair with LogMap-Repair
Evaluation
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Evaluation of Ontology Matching Systems
Ontology Alignment Evaluation Initiative (OAEI)
• A annual campaign for the evaluation of ontology matching
systems.
• Deadline September 1.
• http://oaei.ontologymatching.org/
• The matching problems are organized in several tracks.
• Each track involves different test ontologies and reference
alignments.
• Large BioMed track: FMA, NCI and SNOMED CT.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Evaluation of Ontology Matching Systems: OAEI
Quality of a set of mappings M
• Precision and recall wrt reference mappings or gold standard
• Precision (Pre) = |M ∩ MGS|/|M|
• Recall (Rec) = |M ∩ MGS|/|MGS|
• The F-score (F) = (2 × Pre × Rec)/(Pre + Rec).
• Coherence of M wrt O1 and O2.
• Computation times are also considered.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Mapping coherence in the Large BioMed track
• FMA v2.0 (78,989 classes), NCI v.08.05d (66,724 classes) and
SNOMED CT v. Jan. 2009 (306,591 classes).
• Evaluated top-7 systems
• Unsat. provided by HermiT and ELK (in SNOMED-NCI)
System
FMA-NCI FMA-SNOMED SNOMED-NCI
Unsat. Ratio Unsat. Ratio Unsat. Ratio
LogMapnoe 9 0.01% 10 0.003% ≥0 ≥0%
LogMap 9 0.01% 10 0.003% ≥17 ≥0.005%
ServOMap 48,743 27% 273,242 71% ≥313,643 ≥84%
ServOMapL 50,334 28% 99,726 26% ≥314,939 ≥84%
GOMMA 5,574 4% 10,752 3% ≥266,051 ≥71%
GOMMAbk 12,939 9% 119,657 31% ≥313,015 ≥84%
YAM++ 50,550 29% 106,107 28% ≥269,107 ≥72%
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Mapping Repair Systems
Mapping repair with Alcomo and LogMap-Repair
• Alcomo is specialised to repair incoherent mappings.
• LogMap, originally implemented as an matching system, can
also operate as a mapping repair system (LogMap-Repair).
• Evaluated using incomplete reasoning.
• Compute and approximate repair R≈ for a given M wrt O1
and O2.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Mapping Repair Systems
Mapping repair with Alcomo and LogMap-Repair
• Alcomo is specialised to repair incoherent mappings.
• LogMap, originally implemented as an matching system, can
also operate as a mapping repair system (LogMap-Repair).
• Evaluated using incomplete reasoning.
• Compute and approximate repair R≈ for a given M wrt O1
and O2.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Outline
Preliminaries
Ontology Alignment Evaluation Initiative
Mapping repair with Alcomo
Mapping repair with LogMap-Repair
Evaluation
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Alcomo mapping repair system
Implemented Repair Techniques
• Complete Reasoning: classical Black-Box appoach (inefficient)
• Incomplete reasoning: pattern-based Approach (efficient but
incomplete)
• Combinations of both techniques (complete, but still not
efficient)
Incomplete reasoning
• Alcomo guarantees that the computed R≈ is a subset of the
diagnosis.
• Alcomo never removes too much.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Alcomo mapping repair system
Implemented Repair Techniques
• Complete Reasoning: classical Black-Box appoach (inefficient)
• Incomplete reasoning: pattern-based Approach (efficient
but incomplete)
• Combinations of both techniques (complete, but still not
efficient)
Incomplete reasoning
• Alcomo guarantees that the computed R≈ is a subset of the
diagnosis.
• Alcomo never removes too much.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Alcomo mapping repair system
Pattern-based Repair
• First classify O1 and O2
• Check each pair of mappings for conflict patterns
• See two example patterns below:
A
C
#i
#i
B#j
D#j
F#j
F#j D#j
A
C
#i
#i
B#j
D#j
F#j
F#j D#j
disjoint
disjoint
d
i
s
j
o
i
n
t
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Alcomo: algorithm used in the experiments (iii)
Input: O1, O2: input ontologies; M: input mappings
Output: M′: output mappings; R≈: approximate mapping repair.
1: M′
:= ∅
2: R≈
:= ∅
3: classify O1 and O2
4: C := ConflictPairs(O1, O2, M)
5: for each m ∈ M do ⊲ iterate over M desc. order w.r.t. confidences
6: coh := true
7: for each m′
∈ M′
do
8: if (m′
, m) ∈ C then
9: coh := false
10: break
11: end if
12: end for
13: if coh = true then M′
:= M′
∪ {m}
14: else R≈
:= R≈
∪ {m}
15: end for
16: return hM′
, R≈
i
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Outline
Preliminaries
Ontology Alignment Evaluation Initiative
Mapping repair with Alcomo
Mapping repair with LogMap-Repair
Evaluation
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
LogMap-Repair system
Propositional Horn representation
• LogMap-Repair projects the classified input ontologies and the
mappings to Propositional Horn clauses.
Propositional FMA (P1) Propositional NCI (P2)
(1) Smegma → Secretion (8) Smegma → ExocrineGlandFluid
(2) Secretion → PortionBodySusbstance (9) ExocrineGlandFluid → Anatomy
(3) PortionBodySusbstance → AnatomicalEntity (10) CellularSecretion → TransmembraneTransport
Computed mappings (M) (11) TransmembraneTransport → TransportProcess
(m4) FMA:Secretion → NCI:CellularSecretion (12) TransportProcess → BiologicalProcess
(m5) NCI:CellularSecretion → FMA:Secretion (13) Anatomy ∧ BiologicalProcess → false
(m6) FMA:Smegma → NCI:Smegma (14) ExocrineGlandFluid ∧ ExfolCells → Smegma
(m7) NCI:Smegma → FMA:Smegma
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
LogMap-Repair system
Propositional Horn SAT with Dowling-Gallier (D-G)
• LogMap-Reapir implements the SAT algorithm D-G
• D-G is call for every class C and the propositional theory PC :
• the rule (true → C);
• the propositional representations P1 and P2 of the input
ontologies; and
• the propositional representation of the mappings.
Recording conflictive mappings
• LogMap-Repair extends D-G to record mappings involved in
an unsatifiability.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
LogMap-Repair system
Propositional Horn SAT with Dowling-Gallier (D-G)
• LogMap-Reapir implements the SAT algorithm D-G
• D-G is call for every class C and the propositional theory PC :
• the rule (true → C);
• the propositional representations P1 and P2 of the input
ontologies; and
• the propositional representation of the mappings.
Recording conflictive mappings
• LogMap-Repair extends D-G to record mappings involved in
an unsatifiability.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
LogMap-Repair: Recording conflictive mappings
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
LogMap-Repair system
A “greedy” repair algorithm
• A (local) repair RC is computed for each (unsat.) class C
(top-bottom approach)
• The algorithm identifies subsets of the conflictive mappings of
increasing size, and stops when a repair is found.
• LogMap finds all repairs of “smallest” size for C.
• For example: R1 = {m4} and R2 = {m6}
• The repair with less confidence is selected .
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
LogMap-Repair system
A “greedy” repair algorithm
• A (local) repair RC is computed for each (unsat.) class C
(top-bottom approach)
• The algorithm identifies subsets of the conflictive mappings of
increasing size, and stops when a repair is found.
• LogMap finds all repairs of “smallest” size for C.
• For example: R1 = {m4} and R2 = {m6}
• The repair with less confidence is selected .
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
LogMap-Repair system
Characteristics of the repair algorithm
• LogMap-Repair identifies a (global) repair R≈ =
S
RC .
• P1 ∪ P2 ∪ M  R≈ ∪ {true → C} is satisfiable for each C
in P1 ∪ P2.
• The propositional encoding of O1 and O2 is incomplete.
• The algorithm does not ensure satisfiability of each class in
O1 ∪ O2 ∪ M  R≈.
• R≈ is an approximate repair of M wrt O1 and O2.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
LogMap-Repair system
Characteristics of the repair algorithm
• LogMap-Repair identifies a (global) repair R≈ =
S
RC .
• P1 ∪ P2 ∪ M  R≈ ∪ {true → C} is satisfiable for each C
in P1 ∪ P2.
• The propositional encoding of O1 and O2 is incomplete.
• The algorithm does not ensure satisfiability of each class in
O1 ∪ O2 ∪ M  R≈.
• R≈ is an approximate repair of M wrt O1 and O2.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Outline
Preliminaries
Ontology Alignment Evaluation Initiative
Mapping repair with Alcomo
Mapping repair with LogMap-Repair
Evaluation
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Evaluation
Evaluation settings
• Three matching problems of the OAEI 2012 Large BioMed
track: FMA-NCI, FMA-SNOMED and SNOMED-NCI.
• Top matching systems in this track apart from LogMap.
• Server with 16 CPUs and allocating 15 Gb RAM.
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Repair in the FMA-NCI matching problem
System
Matching Results OAEI 2012 Repair with Alcomo Repair with LogMap
|M| t (s) F Unsat. |R≈
| t (s) F Unsat. |R≈
| t (s) F Unsat.
ServOMap 4,932 204 0.819 48,743 97 321 0.820 9 115 21 0.819 9
ServOMapL 5,400 251 0.841 50,334 126 342 0.841 9 166 24 0.839 9
GOMMA 5,686 217 0.839 5,574 123 321 0.839 15 142 20 0.838 9
GOMMAbk 6,330 231 0.837 12,939 184 341 0.836 29 259 33 0.833 9
YAM++ 5,476 1,304 0.862 50,550 116 324 0.862 10 141 21 0.861 9
Average 5,565 441 0.840 33,628 129 330 0.840 14 165 24 0.838 9
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Repair in the FMA-SNOMED matching problem
System
Matching Results OAEI 2012 Repair with Alcomo Repair with LogMap
|M| t (s) F Unsat. |R≈
| t (s) F Unsat. |R≈
| t (s) F Unsat.
ServOMap 12,642 532 0.770 273,242 829 2,672 0.749 0 2,640 284 0.721 0
ServOMapL 13,210 517 0.794 99,729 975 2,779 0.767 0 2,953 274 0.752 0
GOMMA 11,648 1,994 0.291 10,752 463 2,840 0.293 0 618 245 0.291 0
GOMMAbk 25,660 1,893 0.708 119,657 1,726 2,809 0.698 1,363 5,295 314 0.678 0
YAM++ 14,088 23,900 0.765 106,107 783 2,800 0.760 0 3,461 262 0.720 0
Average 15,450 5,767 0.666 121,897 955 2,780 0.653 273 2,993 276 0.632 0
Preliminaries OAEI Alcomo LogMap-Repair Evaluation
Repair in the SNOMED-NCI matching problem
System
Matching Results OAEI 2012 Repair with Alcomo Repair with LogMap
|M| t (s) F Unsat. |R≈
| t (s) F Unsat. |R≈
| t (s) F Unsat.
ServOMap 24,924 654 0.664 ≥313,643 937 3,223 0.663 ≥35 1,671 276 0.666 ≥296
ServOMapL 27,928 738 0.678 ≥314,939 1,076 2,917 0.677 ≥2,055 1,656 314 0.679 ≥1,241
GOMMA 27,386 1,820 0.606 ≥266,051 1,903 2,947 0.603 ≥2,085 2,949 303 0.607 ≥37
GOMMAbk 34,090 1,940 0.635 ≥313,015 2,720 3,098 0.638 ≥30,583 5,003 435 0.641 ≥1
YAM++ 28,206 30,155 0.680 ≥269,107 757 2,964 0.679 ≥0 1,049 305 0.680 ≥0
Average 28,507 7,061 0.653 ≥295,351 1,479 3,030 0.652 ≥6,952 2,465 326 0.655 ≥315
Conclusions
• Computed (approximate) repairs are not aggressive:
• 5% (Alcomo).
• 11% (LogMap-Repair).
• The repair process does not represent a bottleneck in the
matching process.
• Very fast using LogMap-Repair.
• The impact on the F-score is (on average) negligible.
• The incoherence of the repaired mapping sets is always
significantly reduced.
• Alcomo and LogMap-Repair could complement each
other.
Conclusions and future work
• Feasibility of integrating the mapping repair techniques within
the ontology matching process.
• Good basis to compare ontology and mapping repair systems:
• Call for Mapping/Ontology repair systems (September 2013)
• http://
Questions?
• Alcomo main contact:
Christian Meilicke (@).
• LogMap-Repair main contact:
Ernesto Jimnez-Ruiz (ernesto.jimenez.ruiz@gmail.com)
Thank you for your attention
• Alcomo: http://
• LogMap-Repair:
http://www.cs.ox.ac.uk/isg/tools/LogMap/

Contenu connexe

Tendances

A time study in numerical methods programming
A time study in numerical methods programmingA time study in numerical methods programming
A time study in numerical methods programmingGlen Alleman
 
TMPA-2017: Modeling of PLC-programs by High-level Coloured Petri Nets
TMPA-2017: Modeling of PLC-programs by High-level Coloured Petri NetsTMPA-2017: Modeling of PLC-programs by High-level Coloured Petri Nets
TMPA-2017: Modeling of PLC-programs by High-level Coloured Petri NetsIosif Itkin
 
Krylov Subspace Methods in Model Order Reduction
Krylov Subspace Methods in Model Order ReductionKrylov Subspace Methods in Model Order Reduction
Krylov Subspace Methods in Model Order ReductionMohammad Umar Rehman
 
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...Mokhtar SELLAMI
 
An Introduction to Model Order Reduction
An Introduction to Model Order ReductionAn Introduction to Model Order Reduction
An Introduction to Model Order ReductionMohammad Umar Rehman
 
CHAC Algorithm ECAL'07 Presentation
CHAC Algorithm ECAL'07 PresentationCHAC Algorithm ECAL'07 Presentation
CHAC Algorithm ECAL'07 PresentationAntonio Mora
 
Automated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAutomated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAijun Zhang
 

Tendances (7)

A time study in numerical methods programming
A time study in numerical methods programmingA time study in numerical methods programming
A time study in numerical methods programming
 
TMPA-2017: Modeling of PLC-programs by High-level Coloured Petri Nets
TMPA-2017: Modeling of PLC-programs by High-level Coloured Petri NetsTMPA-2017: Modeling of PLC-programs by High-level Coloured Petri Nets
TMPA-2017: Modeling of PLC-programs by High-level Coloured Petri Nets
 
Krylov Subspace Methods in Model Order Reduction
Krylov Subspace Methods in Model Order ReductionKrylov Subspace Methods in Model Order Reduction
Krylov Subspace Methods in Model Order Reduction
 
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
CARI2020: A CGM-Based Parallel Algorithm Using the Four-Russians Speedup for ...
 
An Introduction to Model Order Reduction
An Introduction to Model Order ReductionAn Introduction to Model Order Reduction
An Introduction to Model Order Reduction
 
CHAC Algorithm ECAL'07 Presentation
CHAC Algorithm ECAL'07 PresentationCHAC Algorithm ECAL'07 Presentation
CHAC Algorithm ECAL'07 Presentation
 
Automated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform DesignsAutomated Machine Learning via Sequential Uniform Designs
Automated Machine Learning via Sequential Uniform Designs
 

Similaire à Evaluating Mapping Repair Systems with Large Biomedical Ontologies

A time study in numerical methods programming
A time study in numerical methods programmingA time study in numerical methods programming
A time study in numerical methods programmingGlen Alleman
 
The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning Jeff Z. Pan
 
Condition Monitoring of a Large-scale PV Power Plant in Australia
Condition Monitoring of a Large-scale PV Power Plant in AustraliaCondition Monitoring of a Large-scale PV Power Plant in Australia
Condition Monitoring of a Large-scale PV Power Plant in AustraliaAmit Dhoke
 
Bag of Pursuits and Neural Gas for Improved Sparse Codin
Bag of Pursuits and Neural Gas for Improved Sparse CodinBag of Pursuits and Neural Gas for Improved Sparse Codin
Bag of Pursuits and Neural Gas for Improved Sparse CodinKarlos Svoboda
 
ODD SEM SYLLABUS 2021.docx
ODD SEM SYLLABUS 2021.docxODD SEM SYLLABUS 2021.docx
ODD SEM SYLLABUS 2021.docxarivasanth
 
Effects of clearance size on the dynamic response of planar multi body system...
Effects of clearance size on the dynamic response of planar multi body system...Effects of clearance size on the dynamic response of planar multi body system...
Effects of clearance size on the dynamic response of planar multi body system...Alexander Decker
 
ARIMA Model for analysis of time series data.ppt
ARIMA Model for analysis of time series data.pptARIMA Model for analysis of time series data.ppt
ARIMA Model for analysis of time series data.pptREFOTDEBuea
 
A Discrete Optimization Approach for SVD Best Truncation Choice based on ROC ...
A Discrete Optimization Approach for SVD Best Truncation Choice based on ROC ...A Discrete Optimization Approach for SVD Best Truncation Choice based on ROC ...
A Discrete Optimization Approach for SVD Best Truncation Choice based on ROC ...Davide Chicco
 
EVOLUTION OF ONTOLOGY-BASED MAPPINGS
EVOLUTION OF ONTOLOGY-BASED MAPPINGSEVOLUTION OF ONTOLOGY-BASED MAPPINGS
EVOLUTION OF ONTOLOGY-BASED MAPPINGSAksw Group
 

Similaire à Evaluating Mapping Repair Systems with Large Biomedical Ontologies (20)

A time study in numerical methods programming
A time study in numerical methods programmingA time study in numerical methods programming
A time study in numerical methods programming
 
The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning The Maze of Deletion in Ontology Stream Reasoning
The Maze of Deletion in Ontology Stream Reasoning
 
Condition Monitoring of a Large-scale PV Power Plant in Australia
Condition Monitoring of a Large-scale PV Power Plant in AustraliaCondition Monitoring of a Large-scale PV Power Plant in Australia
Condition Monitoring of a Large-scale PV Power Plant in Australia
 
Arima
ArimaArima
Arima
 
Event 32
Event 32Event 32
Event 32
 
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
 
Ch06 alignment
Ch06 alignmentCh06 alignment
Ch06 alignment
 
Bag of Pursuits and Neural Gas for Improved Sparse Codin
Bag of Pursuits and Neural Gas for Improved Sparse CodinBag of Pursuits and Neural Gas for Improved Sparse Codin
Bag of Pursuits and Neural Gas for Improved Sparse Codin
 
ODD SEM SYLLABUS 2021.docx
ODD SEM SYLLABUS 2021.docxODD SEM SYLLABUS 2021.docx
ODD SEM SYLLABUS 2021.docx
 
Effects of clearance size on the dynamic response of planar multi body system...
Effects of clearance size on the dynamic response of planar multi body system...Effects of clearance size on the dynamic response of planar multi body system...
Effects of clearance size on the dynamic response of planar multi body system...
 
Computational Chemistry Robots
Computational Chemistry RobotsComputational Chemistry Robots
Computational Chemistry Robots
 
ARIMA Model.ppt
ARIMA Model.pptARIMA Model.ppt
ARIMA Model.ppt
 
ARIMA Model for analysis of time series data.ppt
ARIMA Model for analysis of time series data.pptARIMA Model for analysis of time series data.ppt
ARIMA Model for analysis of time series data.ppt
 
ARIMA Model.ppt
ARIMA Model.pptARIMA Model.ppt
ARIMA Model.ppt
 
A Discrete Optimization Approach for SVD Best Truncation Choice based on ROC ...
A Discrete Optimization Approach for SVD Best Truncation Choice based on ROC ...A Discrete Optimization Approach for SVD Best Truncation Choice based on ROC ...
A Discrete Optimization Approach for SVD Best Truncation Choice based on ROC ...
 
Plan economico
Plan economicoPlan economico
Plan economico
 
Plan economico
Plan economicoPlan economico
Plan economico
 
Plan economico del 2017
Plan economico del 2017Plan economico del 2017
Plan economico del 2017
 
Fine Grained Complexity
Fine Grained ComplexityFine Grained Complexity
Fine Grained Complexity
 
EVOLUTION OF ONTOLOGY-BASED MAPPINGS
EVOLUTION OF ONTOLOGY-BASED MAPPINGSEVOLUTION OF ONTOLOGY-BASED MAPPINGS
EVOLUTION OF ONTOLOGY-BASED MAPPINGS
 

Dernier

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 

Dernier (20)

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Evaluating Mapping Repair Systems with Large Biomedical Ontologies

  • 1. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Evaluating Mapping Repair Systems with Large Biomedical Ontologies Ernesto Jiménez-Ruiz Christian Meilicke Bernardo Cuenca Grau Ian Horrocks Department of Computer Science, University of Oxford Research Group Data and Web Science, University of Mannheim, Germany 26th Description Logics Workshop 24 July 2013
  • 2. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Outline Preliminaries Ontology Alignment Evaluation Initiative Mapping repair with Alcomo Mapping repair with LogMap-Repair Evaluation
  • 3. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Ontology mappings Mappings M are tuples he1, e2, n, ρi • e1, e2 are entities in the input ontologies O1 and O2 • n a confidence value between 0 and 1 • ρ is the semantic relationship between e1 and e2 (e.g. subsumption, equivalence or disjointness) Formalized as OWL 2 axioms • Where the semantic relationship ρ is one of {≡, ⊑, ⊒, ⊥} • Confidence values n are represented as axiom annotations • No extra semantics
  • 4. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Ontology mappings Mappings M are tuples he1, e2, n, ρi • e1, e2 are entities in the input ontologies O1 and O2 • n a confidence value between 0 and 1 • ρ is the semantic relationship between e1 and e2 (e.g. subsumption, equivalence or disjointness) Formalized as OWL 2 axioms • Where the semantic relationship ρ is one of {≡, ⊑, ⊒, ⊥} • Confidence values n are represented as axiom annotations • No extra semantics
  • 5. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Incoherent mappings A set of Mappings M wrt O1 and O2 is incoherente if • there exists a class A such that • O1 ∪ O2 ∪ M |= A ⊑ ⊥, and • O1 ∪ O2 6|= A ⊑ ⊥.
  • 6. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Example: Incoherent Mappings
  • 7. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Mapping Repair R is a repair for M wrt O1 and O2 • if M R is coherent. • R = M is a candiate repair. R is a diagnosis for M wrt O1 and O2 • if R′ ⊂ R is not a repair for M. • Diagnosis are (typically) expensive to obtain.
  • 8. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Mapping Repair R is a repair for M wrt O1 and O2 • if M R is coherent. • R = M is a candiate repair. R is a diagnosis for M wrt O1 and O2 • if R′ ⊂ R is not a repair for M. • Diagnosis are (typically) expensive to obtain.
  • 9. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Computing Mapping Repairs Standard justification-based ontology debugging • are expensive since require complete reasoning. • do not scale when the number of unsatisfiabilities is large. An approximate Repair R≈ • will not guarantee that M R≈ is coherent; • but aims at reducing the number of unsatisfiabilities, • while preserving M as much as possible.
  • 10. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Computing Mapping Repairs Standard justification-based ontology debugging • are expensive since require complete reasoning. • do not scale when the number of unsatisfiabilities is large. An approximate Repair R≈ • will not guarantee that M R≈ is coherent; • but aims at reducing the number of unsatisfiabilities, • while preserving M as much as possible.
  • 11. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Outline Preliminaries Ontology Alignment Evaluation Initiative Mapping repair with Alcomo Mapping repair with LogMap-Repair Evaluation
  • 12. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Evaluation of Ontology Matching Systems Ontology Alignment Evaluation Initiative (OAEI) • A annual campaign for the evaluation of ontology matching systems. • Deadline September 1. • http://oaei.ontologymatching.org/ • The matching problems are organized in several tracks. • Each track involves different test ontologies and reference alignments. • Large BioMed track: FMA, NCI and SNOMED CT.
  • 13. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Evaluation of Ontology Matching Systems: OAEI Quality of a set of mappings M • Precision and recall wrt reference mappings or gold standard • Precision (Pre) = |M ∩ MGS|/|M| • Recall (Rec) = |M ∩ MGS|/|MGS| • The F-score (F) = (2 × Pre × Rec)/(Pre + Rec). • Coherence of M wrt O1 and O2. • Computation times are also considered.
  • 14. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Mapping coherence in the Large BioMed track • FMA v2.0 (78,989 classes), NCI v.08.05d (66,724 classes) and SNOMED CT v. Jan. 2009 (306,591 classes). • Evaluated top-7 systems • Unsat. provided by HermiT and ELK (in SNOMED-NCI) System FMA-NCI FMA-SNOMED SNOMED-NCI Unsat. Ratio Unsat. Ratio Unsat. Ratio LogMapnoe 9 0.01% 10 0.003% ≥0 ≥0% LogMap 9 0.01% 10 0.003% ≥17 ≥0.005% ServOMap 48,743 27% 273,242 71% ≥313,643 ≥84% ServOMapL 50,334 28% 99,726 26% ≥314,939 ≥84% GOMMA 5,574 4% 10,752 3% ≥266,051 ≥71% GOMMAbk 12,939 9% 119,657 31% ≥313,015 ≥84% YAM++ 50,550 29% 106,107 28% ≥269,107 ≥72%
  • 15. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Mapping Repair Systems Mapping repair with Alcomo and LogMap-Repair • Alcomo is specialised to repair incoherent mappings. • LogMap, originally implemented as an matching system, can also operate as a mapping repair system (LogMap-Repair). • Evaluated using incomplete reasoning. • Compute and approximate repair R≈ for a given M wrt O1 and O2.
  • 16. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Mapping Repair Systems Mapping repair with Alcomo and LogMap-Repair • Alcomo is specialised to repair incoherent mappings. • LogMap, originally implemented as an matching system, can also operate as a mapping repair system (LogMap-Repair). • Evaluated using incomplete reasoning. • Compute and approximate repair R≈ for a given M wrt O1 and O2.
  • 17. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Outline Preliminaries Ontology Alignment Evaluation Initiative Mapping repair with Alcomo Mapping repair with LogMap-Repair Evaluation
  • 18. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Alcomo mapping repair system Implemented Repair Techniques • Complete Reasoning: classical Black-Box appoach (inefficient) • Incomplete reasoning: pattern-based Approach (efficient but incomplete) • Combinations of both techniques (complete, but still not efficient) Incomplete reasoning • Alcomo guarantees that the computed R≈ is a subset of the diagnosis. • Alcomo never removes too much.
  • 19. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Alcomo mapping repair system Implemented Repair Techniques • Complete Reasoning: classical Black-Box appoach (inefficient) • Incomplete reasoning: pattern-based Approach (efficient but incomplete) • Combinations of both techniques (complete, but still not efficient) Incomplete reasoning • Alcomo guarantees that the computed R≈ is a subset of the diagnosis. • Alcomo never removes too much.
  • 20. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Alcomo mapping repair system Pattern-based Repair • First classify O1 and O2 • Check each pair of mappings for conflict patterns • See two example patterns below: A C #i #i B#j D#j F#j F#j D#j A C #i #i B#j D#j F#j F#j D#j disjoint disjoint d i s j o i n t
  • 21. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Alcomo: algorithm used in the experiments (iii) Input: O1, O2: input ontologies; M: input mappings Output: M′: output mappings; R≈: approximate mapping repair. 1: M′ := ∅ 2: R≈ := ∅ 3: classify O1 and O2 4: C := ConflictPairs(O1, O2, M) 5: for each m ∈ M do ⊲ iterate over M desc. order w.r.t. confidences 6: coh := true 7: for each m′ ∈ M′ do 8: if (m′ , m) ∈ C then 9: coh := false 10: break 11: end if 12: end for 13: if coh = true then M′ := M′ ∪ {m} 14: else R≈ := R≈ ∪ {m} 15: end for 16: return hM′ , R≈ i
  • 22. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Outline Preliminaries Ontology Alignment Evaluation Initiative Mapping repair with Alcomo Mapping repair with LogMap-Repair Evaluation
  • 23. Preliminaries OAEI Alcomo LogMap-Repair Evaluation LogMap-Repair system Propositional Horn representation • LogMap-Repair projects the classified input ontologies and the mappings to Propositional Horn clauses. Propositional FMA (P1) Propositional NCI (P2) (1) Smegma → Secretion (8) Smegma → ExocrineGlandFluid (2) Secretion → PortionBodySusbstance (9) ExocrineGlandFluid → Anatomy (3) PortionBodySusbstance → AnatomicalEntity (10) CellularSecretion → TransmembraneTransport Computed mappings (M) (11) TransmembraneTransport → TransportProcess (m4) FMA:Secretion → NCI:CellularSecretion (12) TransportProcess → BiologicalProcess (m5) NCI:CellularSecretion → FMA:Secretion (13) Anatomy ∧ BiologicalProcess → false (m6) FMA:Smegma → NCI:Smegma (14) ExocrineGlandFluid ∧ ExfolCells → Smegma (m7) NCI:Smegma → FMA:Smegma
  • 24. Preliminaries OAEI Alcomo LogMap-Repair Evaluation LogMap-Repair system Propositional Horn SAT with Dowling-Gallier (D-G) • LogMap-Reapir implements the SAT algorithm D-G • D-G is call for every class C and the propositional theory PC : • the rule (true → C); • the propositional representations P1 and P2 of the input ontologies; and • the propositional representation of the mappings. Recording conflictive mappings • LogMap-Repair extends D-G to record mappings involved in an unsatifiability.
  • 25. Preliminaries OAEI Alcomo LogMap-Repair Evaluation LogMap-Repair system Propositional Horn SAT with Dowling-Gallier (D-G) • LogMap-Reapir implements the SAT algorithm D-G • D-G is call for every class C and the propositional theory PC : • the rule (true → C); • the propositional representations P1 and P2 of the input ontologies; and • the propositional representation of the mappings. Recording conflictive mappings • LogMap-Repair extends D-G to record mappings involved in an unsatifiability.
  • 26. Preliminaries OAEI Alcomo LogMap-Repair Evaluation LogMap-Repair: Recording conflictive mappings
  • 27. Preliminaries OAEI Alcomo LogMap-Repair Evaluation LogMap-Repair system A “greedy” repair algorithm • A (local) repair RC is computed for each (unsat.) class C (top-bottom approach) • The algorithm identifies subsets of the conflictive mappings of increasing size, and stops when a repair is found. • LogMap finds all repairs of “smallest” size for C. • For example: R1 = {m4} and R2 = {m6} • The repair with less confidence is selected .
  • 28. Preliminaries OAEI Alcomo LogMap-Repair Evaluation LogMap-Repair system A “greedy” repair algorithm • A (local) repair RC is computed for each (unsat.) class C (top-bottom approach) • The algorithm identifies subsets of the conflictive mappings of increasing size, and stops when a repair is found. • LogMap finds all repairs of “smallest” size for C. • For example: R1 = {m4} and R2 = {m6} • The repair with less confidence is selected .
  • 29. Preliminaries OAEI Alcomo LogMap-Repair Evaluation LogMap-Repair system Characteristics of the repair algorithm • LogMap-Repair identifies a (global) repair R≈ = S RC . • P1 ∪ P2 ∪ M R≈ ∪ {true → C} is satisfiable for each C in P1 ∪ P2. • The propositional encoding of O1 and O2 is incomplete. • The algorithm does not ensure satisfiability of each class in O1 ∪ O2 ∪ M R≈. • R≈ is an approximate repair of M wrt O1 and O2.
  • 30. Preliminaries OAEI Alcomo LogMap-Repair Evaluation LogMap-Repair system Characteristics of the repair algorithm • LogMap-Repair identifies a (global) repair R≈ = S RC . • P1 ∪ P2 ∪ M R≈ ∪ {true → C} is satisfiable for each C in P1 ∪ P2. • The propositional encoding of O1 and O2 is incomplete. • The algorithm does not ensure satisfiability of each class in O1 ∪ O2 ∪ M R≈. • R≈ is an approximate repair of M wrt O1 and O2.
  • 31. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Outline Preliminaries Ontology Alignment Evaluation Initiative Mapping repair with Alcomo Mapping repair with LogMap-Repair Evaluation
  • 32. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Evaluation Evaluation settings • Three matching problems of the OAEI 2012 Large BioMed track: FMA-NCI, FMA-SNOMED and SNOMED-NCI. • Top matching systems in this track apart from LogMap. • Server with 16 CPUs and allocating 15 Gb RAM.
  • 33. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Repair in the FMA-NCI matching problem System Matching Results OAEI 2012 Repair with Alcomo Repair with LogMap |M| t (s) F Unsat. |R≈ | t (s) F Unsat. |R≈ | t (s) F Unsat. ServOMap 4,932 204 0.819 48,743 97 321 0.820 9 115 21 0.819 9 ServOMapL 5,400 251 0.841 50,334 126 342 0.841 9 166 24 0.839 9 GOMMA 5,686 217 0.839 5,574 123 321 0.839 15 142 20 0.838 9 GOMMAbk 6,330 231 0.837 12,939 184 341 0.836 29 259 33 0.833 9 YAM++ 5,476 1,304 0.862 50,550 116 324 0.862 10 141 21 0.861 9 Average 5,565 441 0.840 33,628 129 330 0.840 14 165 24 0.838 9
  • 34. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Repair in the FMA-SNOMED matching problem System Matching Results OAEI 2012 Repair with Alcomo Repair with LogMap |M| t (s) F Unsat. |R≈ | t (s) F Unsat. |R≈ | t (s) F Unsat. ServOMap 12,642 532 0.770 273,242 829 2,672 0.749 0 2,640 284 0.721 0 ServOMapL 13,210 517 0.794 99,729 975 2,779 0.767 0 2,953 274 0.752 0 GOMMA 11,648 1,994 0.291 10,752 463 2,840 0.293 0 618 245 0.291 0 GOMMAbk 25,660 1,893 0.708 119,657 1,726 2,809 0.698 1,363 5,295 314 0.678 0 YAM++ 14,088 23,900 0.765 106,107 783 2,800 0.760 0 3,461 262 0.720 0 Average 15,450 5,767 0.666 121,897 955 2,780 0.653 273 2,993 276 0.632 0
  • 35. Preliminaries OAEI Alcomo LogMap-Repair Evaluation Repair in the SNOMED-NCI matching problem System Matching Results OAEI 2012 Repair with Alcomo Repair with LogMap |M| t (s) F Unsat. |R≈ | t (s) F Unsat. |R≈ | t (s) F Unsat. ServOMap 24,924 654 0.664 ≥313,643 937 3,223 0.663 ≥35 1,671 276 0.666 ≥296 ServOMapL 27,928 738 0.678 ≥314,939 1,076 2,917 0.677 ≥2,055 1,656 314 0.679 ≥1,241 GOMMA 27,386 1,820 0.606 ≥266,051 1,903 2,947 0.603 ≥2,085 2,949 303 0.607 ≥37 GOMMAbk 34,090 1,940 0.635 ≥313,015 2,720 3,098 0.638 ≥30,583 5,003 435 0.641 ≥1 YAM++ 28,206 30,155 0.680 ≥269,107 757 2,964 0.679 ≥0 1,049 305 0.680 ≥0 Average 28,507 7,061 0.653 ≥295,351 1,479 3,030 0.652 ≥6,952 2,465 326 0.655 ≥315
  • 36. Conclusions • Computed (approximate) repairs are not aggressive: • 5% (Alcomo). • 11% (LogMap-Repair). • The repair process does not represent a bottleneck in the matching process. • Very fast using LogMap-Repair. • The impact on the F-score is (on average) negligible. • The incoherence of the repaired mapping sets is always significantly reduced. • Alcomo and LogMap-Repair could complement each other.
  • 37. Conclusions and future work • Feasibility of integrating the mapping repair techniques within the ontology matching process. • Good basis to compare ontology and mapping repair systems: • Call for Mapping/Ontology repair systems (September 2013) • http://
  • 38. Questions? • Alcomo main contact: Christian Meilicke (@). • LogMap-Repair main contact: Ernesto Jimnez-Ruiz (ernesto.jimenez.ruiz@gmail.com) Thank you for your attention • Alcomo: http:// • LogMap-Repair: http://www.cs.ox.ac.uk/isg/tools/LogMap/