SlideShare a Scribd company logo
1 of 21
Download to read offline
Comparing XML Files with a
           DOGMA Ontology to Generate
           Ω-RIDL Annotations

           Nadejda Alkhaldi and Christophe Debruyne




16/10/11                        Herhaling titel van presentatie   1
Introduction




Comparing XML Files with a DOGMA Ontology to
Generate Ω-RIDL Annotations
16/10/11    Pag.2
Introduction

  Ontologies are a [formal,] explicit specification of a
  [shared] conceptualization (Gruber)

  Autonomously developed and maintained information
  systems commit to the ontology, a mostly manual
  activity.

  How can we automate (a part) of this process?


  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.3
Method: overview

  First we need an ontology.
  –  We used the DOGMA method for ontology engineering
  –  The development of the ontology is reported elsewhere in
     Debruyne et al. (WEBIST 2011)


  Semi-automatically annotate the data
  –  Match concept in the (structure of) the data to the ontology
  –  Generate a Ω-RIDL commitment file
  –  Review of the mappings by representative of the information
     system


  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.4
γ in Γ Context-identifiers,
      Method: DOGMA                              pointers to a community




  DOGMA Ontology Descriptions <Λ, ci, K>
  –  Λ a lexon base, a finite set of plausible binary fact types called
     lexons <γ, t1, r1, r2, t2>
     <Vendor Community, Offer, has, is of, Title>
  –  ci a partial function mapping context-identifiers and terms to
     concepts
  –  K a finite set of ontological commitments containing
       –  A selection of lexons
       –  A mapping from application symbols to ontology terms
       –  Predicates over those terms and roles to express constraints


  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.5
Method: DOGMA
     Example of a commitment




                                Ω-RIDL: Verheyden et al. (SWDB 2004), Trog et al. (RuleML 2007)
Comparing XML Files with a DOGMA Ontology to
Generate Ω-RIDL Annotations
16/10/11    Pag.6
Method:                           (Semi)-Automatic Annotation


  First … related work?
  –  Annotation Techniques:
     AeroDAML, SHOE Knowledge Annotator, S-CREAM, MnM,
     Armadillo, KIM, SemTag, Ontea.
  –  Ontology and schema matching techniques:
     CUPID, iMAP, oMAP, H-Match
  –  Looking at different aspect and reusing ideas that might be
     usable




  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.7
Method:                           (Semi)-Automatic Annotation




Comparing XML Files with a DOGMA Ontology to
Generate Ω-RIDL Annotations
16/10/11    Pag.8
Method:                           (Semi)-Automatic Annotation


  Some considerations
  –  Ontology contains explicit relations between concepts, the XML
     not
  –  XML tags can be matched concepts of the ontology, but the
     content of a tag can also represent an a concept
     E.g., <facility type=“bar”> should be typed onto the concept of
     Bar and not onto Facility of which Bar is a subtype.
  –  No XML Schema to rely on!
  –  Spelling mistakes/language variations




  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.9
Method:                           (Semi)-Automatic Annotation


  1) Element match
  –  Match tag and attribute names using string metrics


  2) Linguistic match
  –  Match tag and attribute names using an external thesaurs (e.g.,
     WordNet or a domain specific thesuarus)


  3) Content match
  –  Match the content of a tag (with respect to the tag) to identify
     the concept represented by the content
  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.10
Method:                           (Semi)-Automatic Annotation


  4) Structural Match
  –  Adjust the previously computed weighted means by looking to
     the structure of both the ontology graph and XML-tree.




  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.11
Method:                           (Semi)-Automatic Annotation


  To summarize:
   –  using an XML and a DOGMA ontology
   –  a series of mapping scores are calculated based on element,
      linguistic and content match
   –  Those scores are then refined using the structural match
   –  The refined scores are then compared against a threshold to
      produce the Ω-RIDL mappings.




  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.12
Method: Summary
–  using an XML and a DOGMA ontology
–  a series of mapping scores are calculated based on element,
   linguistic and content match
–  Those scores are then refined using the structural match
–  The refined scores are then compared against a threshold to
   produce the Ω-RIDL mappings.

–  The user can then use the generated mappings to get an idea
   how his application can commit to the ontology and then decide
   how to do so.




Comparing XML Files with a DOGMA Ontology to
Generate Ω-RIDL Annotations
16/10/11    Pag.13
Tool




Comparing XML Files with a DOGMA Ontology to
Generate Ω-RIDL Annotations
16/10/11    Pag.14
Experiment

  Data of the COMDRIVE RFP project
   –  Holiday Packages in the winter sports domain




  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.15
Experiment

  Data of the COMDRIVE RFP project
   –  Holiday Packages in the winter sports domain

  Ontology developed in several iterations in the project
   –  Bootstrapping of the ontology
   –  Meeting with vendor experts
   –  Meeting with consumer experts




  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.16
Experiment

  Data of the COMDRIVE RFP project
   –  Holiday Packages in the winter sports domain

  Ontology developed in several iterations in the project
   –  Bootstrapping of the ontology
   –  Meeting with vendor experts
   –  Meeting with consumer experts




  Comparing XML Files with a DOGMA Ontology to
  Generate Ω-RIDL Annotations
  16/10/11    Pag.17
Experiment

  Some generated mappings
   –  map ‘‘/countries/country/sumary/code’’ on
      Code identifies / identified by Commodity.
   –  map ‘‘/countries/country/regions/region’’ on Region.
   –  map ‘‘/countries/country/regions/region’’ on
      Ski Area destination of / with destination Holiday Package.
   –  map ‘‘/countries/country/regions/region/cities/city’’ City.
   –  …




   Comparing XML Files with a DOGMA Ontology to
   Generate Ω-RIDL Annotations
   16/10/11    Pag.18
Conclusions

  The four heuristics were able to tackle the considerations
  mentioned.

  The algorithm depends on a good choice of parameters, otherwise
  a lot of “nonsense” mappings are generated

  The structural match needs to be revisited to cope with more
  complicated cases such as:
   –  map ‘‘/countries/country/regions/region/summary/description’’
      on Description of / has RFP.

  Appropriate for suggesting the user mappings (needs testing)
   Comparing XML Files with a DOGMA Ontology to
   Generate Ω-RIDL Annotations
   16/10/11    Pag.19
Future work

  Revision of the structural match

  Integration with tool suite (e.g., Business Semantics Studio)

  Additional testing




   Comparing XML Files with a DOGMA Ontology to
   Generate Ω-RIDL Annotations
   16/10/11    Pag.20
Questions?




Comparing XML Files with a DOGMA Ontology to
Generate Ω-RIDL Annotations
16/10/11    Pag.21

More Related Content

What's hot

Triplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataTriplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataRoberto García
 
EDF2012 Mariana Damova - Factforge
EDF2012   Mariana Damova - FactforgeEDF2012   Mariana Damova - Factforge
EDF2012 Mariana Damova - FactforgeEuropean Data Forum
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2Dimitris Kontokostas
 
The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...
The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...
The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...Jenn Riley
 
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...Dimitris Kontokostas
 
Full Material Declarations: Removing Barriers to Environmental Data Reporting
Full Material Declarations: Removing Barriers to Environmental Data ReportingFull Material Declarations: Removing Barriers to Environmental Data Reporting
Full Material Declarations: Removing Barriers to Environmental Data ReportingRoger L. Franz
 

What's hot (7)

Triplificating and linking XBRL financial data
Triplificating and linking XBRL financial dataTriplificating and linking XBRL financial data
Triplificating and linking XBRL financial data
 
EDF2012 Mariana Damova - Factforge
EDF2012   Mariana Damova - FactforgeEDF2012   Mariana Damova - Factforge
EDF2012 Mariana Damova - Factforge
 
Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
 
The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...
The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...
The Evolution of Library Descriptive Practices: Bibliographic Control? Descri...
 
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
 
Full Material Declarations: Removing Barriers to Environmental Data Reporting
Full Material Declarations: Removing Barriers to Environmental Data ReportingFull Material Declarations: Removing Barriers to Environmental Data Reporting
Full Material Declarations: Removing Barriers to Environmental Data Reporting
 
Dublin Core Intro
Dublin Core IntroDublin Core Intro
Dublin Core Intro
 

Viewers also liked

Using Semantic Technologies to Create Virtual Families from Historical Vital ...
Using Semantic Technologies to Create Virtual Families from Historical Vital ...Using Semantic Technologies to Create Virtual Families from Historical Vital ...
Using Semantic Technologies to Create Virtual Families from Historical Vital ...Christophe Debruyne
 
Publishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpacePublishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpaceChristophe Debruyne
 
Semantic Interoperation of Information Systems by Evolving Ontologies through...
Semantic Interoperation of Information Systems by Evolving Ontologies through...Semantic Interoperation of Information Systems by Evolving Ontologies through...
Semantic Interoperation of Information Systems by Evolving Ontologies through...Christophe Debruyne
 
超級房仲家改版
超級房仲家改版超級房仲家改版
超級房仲家改版Nick Hung
 
2014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-20142014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-2014Christophe Debruyne
 
SDRule-L: Managing Semantically Rich Business Decision Processes
SDRule-L: Managing Semantically Rich Business Decision ProcessesSDRule-L: Managing Semantically Rich Business Decision Processes
SDRule-L: Managing Semantically Rich Business Decision ProcessesChristophe Debruyne
 
A Methodological Framework for Ontology and Multilingual Termontological Data...
A Methodological Framework for Ontology and Multilingual Termontological Data...A Methodological Framework for Ontology and Multilingual Termontological Data...
A Methodological Framework for Ontology and Multilingual Termontological Data...Christophe Debruyne
 
The Relation between a Framework for Collaborative Ontology Engineering and N...
The Relation between a Framework for Collaborative Ontology Engineering and N...The Relation between a Framework for Collaborative Ontology Engineering and N...
The Relation between a Framework for Collaborative Ontology Engineering and N...Christophe Debruyne
 
Doença trofoblastica gestacional
Doença trofoblastica gestacionalDoença trofoblastica gestacional
Doença trofoblastica gestacionalbia26
 
What Every Teacher Should Know About Handwriting
What Every Teacher Should Know About HandwritingWhat Every Teacher Should Know About Handwriting
What Every Teacher Should Know About HandwritingDownhill Publishing LLC
 

Viewers also liked (16)

Using Semantic Technologies to Create Virtual Families from Historical Vital ...
Using Semantic Technologies to Create Virtual Families from Historical Vital ...Using Semantic Technologies to Create Virtual Families from Historical Vital ...
Using Semantic Technologies to Create Virtual Families from Historical Vital ...
 
Publishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information SpacePublishing open data and services for the Flemish Research Information Space
Publishing open data and services for the Flemish Research Information Space
 
Indulge At The Jupiter Hotel 2011
Indulge At The Jupiter Hotel 2011Indulge At The Jupiter Hotel 2011
Indulge At The Jupiter Hotel 2011
 
Semantic Interoperation of Information Systems by Evolving Ontologies through...
Semantic Interoperation of Information Systems by Evolving Ontologies through...Semantic Interoperation of Information Systems by Evolving Ontologies through...
Semantic Interoperation of Information Systems by Evolving Ontologies through...
 
Award Maker 4 Teachers
Award Maker 4 TeachersAward Maker 4 Teachers
Award Maker 4 Teachers
 
Nameplate Maker 4 Teachers
Nameplate Maker 4 TeachersNameplate Maker 4 Teachers
Nameplate Maker 4 Teachers
 
Handwriting Worksheet Maker
Handwriting Worksheet MakerHandwriting Worksheet Maker
Handwriting Worksheet Maker
 
超級房仲家改版
超級房仲家改版超級房仲家改版
超級房仲家改版
 
2014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-20142014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-2014
 
SDRule-L: Managing Semantically Rich Business Decision Processes
SDRule-L: Managing Semantically Rich Business Decision ProcessesSDRule-L: Managing Semantically Rich Business Decision Processes
SDRule-L: Managing Semantically Rich Business Decision Processes
 
A Methodological Framework for Ontology and Multilingual Termontological Data...
A Methodological Framework for Ontology and Multilingual Termontological Data...A Methodological Framework for Ontology and Multilingual Termontological Data...
A Methodological Framework for Ontology and Multilingual Termontological Data...
 
What is Linked Data?
What is Linked Data?What is Linked Data?
What is Linked Data?
 
The Relation between a Framework for Collaborative Ontology Engineering and N...
The Relation between a Framework for Collaborative Ontology Engineering and N...The Relation between a Framework for Collaborative Ontology Engineering and N...
The Relation between a Framework for Collaborative Ontology Engineering and N...
 
Students Practice Tracing Letters
Students Practice Tracing LettersStudents Practice Tracing Letters
Students Practice Tracing Letters
 
Doença trofoblastica gestacional
Doença trofoblastica gestacionalDoença trofoblastica gestacional
Doença trofoblastica gestacional
 
What Every Teacher Should Know About Handwriting
What Every Teacher Should Know About HandwritingWhat Every Teacher Should Know About Handwriting
What Every Teacher Should Know About Handwriting
 

Similar to Generating Annotations by Comparing XML and Ontology

Ks2008 Semanticweb In Action
Ks2008 Semanticweb In ActionKs2008 Semanticweb In Action
Ks2008 Semanticweb In ActionRinke Hoekstra
 
Compare And Merge Scripts
Compare And Merge ScriptsCompare And Merge Scripts
Compare And Merge ScriptsOctavian Nadolu
 
Xml and xml processor
Xml and xml processorXml and xml processor
Xml and xml processorHimanshu Soni
 
Xml and xml processor
Xml and xml processorXml and xml processor
Xml and xml processorHimanshu Soni
 
XML and XML Applications - Lecture 04 - Web Information Systems (WE-DINF-11912)
XML and XML Applications - Lecture 04 - Web Information Systems (WE-DINF-11912)XML and XML Applications - Lecture 04 - Web Information Systems (WE-DINF-11912)
XML and XML Applications - Lecture 04 - Web Information Systems (WE-DINF-11912)Beat Signer
 
XML, XML Databases and MPEG-7
XML, XML Databases and MPEG-7XML, XML Databases and MPEG-7
XML, XML Databases and MPEG-7Deniz Kılınç
 
Semantic RDF based integration framework for heterogeneous XML data sources
Semantic RDF based integration framework for heterogeneous XML data sourcesSemantic RDF based integration framework for heterogeneous XML data sources
Semantic RDF based integration framework for heterogeneous XML data sourcesDeniz Kılınç
 
Effective Data Retrieval in XML using TreeMatch Algorithm
Effective Data Retrieval in XML using TreeMatch AlgorithmEffective Data Retrieval in XML using TreeMatch Algorithm
Effective Data Retrieval in XML using TreeMatch AlgorithmIRJET Journal
 
Syntax Reuse: XSLT as a Metalanguage for Knowledge Representation Languages
Syntax Reuse: XSLT as a Metalanguage for Knowledge Representation LanguagesSyntax Reuse: XSLT as a Metalanguage for Knowledge Representation Languages
Syntax Reuse: XSLT as a Metalanguage for Knowledge Representation LanguagesTara Athan
 
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationGetty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationVladimir Alexiev, PhD, PMP
 
Xml For Dummies Chapter 14 Processing Xml it-slideshares.blogspot.com
Xml For Dummies   Chapter 14 Processing Xml it-slideshares.blogspot.comXml For Dummies   Chapter 14 Processing Xml it-slideshares.blogspot.com
Xml For Dummies Chapter 14 Processing Xml it-slideshares.blogspot.comphanleson
 
Falcon-AO: Results for OAEI 2007
Falcon-AO: Results for OAEI 2007Falcon-AO: Results for OAEI 2007
Falcon-AO: Results for OAEI 2007Gong Cheng
 
Comparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryComparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryGhislain Atemezing
 
Facilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic WebFacilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic WebRoberto García
 
Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...
Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...
Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...Ralf Stockmann
 
Ontologies and Semantic in OpenSource projects
Ontologies and Semantic in OpenSource projectsOntologies and Semantic in OpenSource projects
Ontologies and Semantic in OpenSource projectsjgato
 
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...Fabrizio Orlandi
 

Similar to Generating Annotations by Comparing XML and Ontology (20)

Ks2008 Semanticweb In Action
Ks2008 Semanticweb In ActionKs2008 Semanticweb In Action
Ks2008 Semanticweb In Action
 
Compare And Merge Scripts
Compare And Merge ScriptsCompare And Merge Scripts
Compare And Merge Scripts
 
Xml and xml processor
Xml and xml processorXml and xml processor
Xml and xml processor
 
Xml and xml processor
Xml and xml processorXml and xml processor
Xml and xml processor
 
XML and XML Applications - Lecture 04 - Web Information Systems (WE-DINF-11912)
XML and XML Applications - Lecture 04 - Web Information Systems (WE-DINF-11912)XML and XML Applications - Lecture 04 - Web Information Systems (WE-DINF-11912)
XML and XML Applications - Lecture 04 - Web Information Systems (WE-DINF-11912)
 
XML, XML Databases and MPEG-7
XML, XML Databases and MPEG-7XML, XML Databases and MPEG-7
XML, XML Databases and MPEG-7
 
Semantic RDF based integration framework for heterogeneous XML data sources
Semantic RDF based integration framework for heterogeneous XML data sourcesSemantic RDF based integration framework for heterogeneous XML data sources
Semantic RDF based integration framework for heterogeneous XML data sources
 
Effective Data Retrieval in XML using TreeMatch Algorithm
Effective Data Retrieval in XML using TreeMatch AlgorithmEffective Data Retrieval in XML using TreeMatch Algorithm
Effective Data Retrieval in XML using TreeMatch Algorithm
 
Basic concepts of xml
Basic concepts of xmlBasic concepts of xml
Basic concepts of xml
 
Syntax Reuse: XSLT as a Metalanguage for Knowledge Representation Languages
Syntax Reuse: XSLT as a Metalanguage for Knowledge Representation LanguagesSyntax Reuse: XSLT as a Metalanguage for Knowledge Representation Languages
Syntax Reuse: XSLT as a Metalanguage for Knowledge Representation Languages
 
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationGetty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
 
Xml For Dummies Chapter 14 Processing Xml it-slideshares.blogspot.com
Xml For Dummies   Chapter 14 Processing Xml it-slideshares.blogspot.comXml For Dummies   Chapter 14 Processing Xml it-slideshares.blogspot.com
Xml For Dummies Chapter 14 Processing Xml it-slideshares.blogspot.com
 
Falcon-AO: Results for OAEI 2007
Falcon-AO: Results for OAEI 2007Falcon-AO: Results for OAEI 2007
Falcon-AO: Results for OAEI 2007
 
Comparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their GeometryComparing Vocabularies for Representing Geographical Features and Their Geometry
Comparing Vocabularies for Representing Geographical Features and Their Geometry
 
E05412327
E05412327E05412327
E05412327
 
Facilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic WebFacilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic Web
 
Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...
Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...
Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...
 
Ontologies and Semantic in OpenSource projects
Ontologies and Semantic in OpenSource projectsOntologies and Semantic in OpenSource projects
Ontologies and Semantic in OpenSource projects
 
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
 
Poster
PosterPoster
Poster
 

More from Christophe Debruyne

One year of DALIDA Data Literacy Workshops for Adults: a Report
One year of DALIDA Data Literacy Workshops for Adults: a ReportOne year of DALIDA Data Literacy Workshops for Adults: a Report
One year of DALIDA Data Literacy Workshops for Adults: a ReportChristophe Debruyne
 
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologieProjet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologieChristophe Debruyne
 
Knowledge Graphs: Concept, mogelijkheden en aandachtspunten
Knowledge Graphs: Concept, mogelijkheden en aandachtspuntenKnowledge Graphs: Concept, mogelijkheden en aandachtspunten
Knowledge Graphs: Concept, mogelijkheden en aandachtspuntenChristophe Debruyne
 
Reusable SHACL Constraint Components for Validating Geospatial Linked Data
Reusable SHACL Constraint Components for Validating Geospatial Linked DataReusable SHACL Constraint Components for Validating Geospatial Linked Data
Reusable SHACL Constraint Components for Validating Geospatial Linked DataChristophe Debruyne
 
Hidden Amongst the Data: the Beyond 2022 Knowledge Graph
Hidden Amongst the Data: the Beyond 2022 Knowledge GraphHidden Amongst the Data: the Beyond 2022 Knowledge Graph
Hidden Amongst the Data: the Beyond 2022 Knowledge GraphChristophe Debruyne
 
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainFacilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainChristophe Debruyne
 
Using Maps for Interlinking Geospatial Linked Data
Using Maps for Interlinking Geospatial Linked DataUsing Maps for Interlinking Geospatial Linked Data
Using Maps for Interlinking Geospatial Linked DataChristophe Debruyne
 
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Christophe Debruyne
 
Towards Generating Policy-compliant Datasets (poster)
Towards GeneratingPolicy-compliant Datasets (poster)Towards GeneratingPolicy-compliant Datasets (poster)
Towards Generating Policy-compliant Datasets (poster)Christophe Debruyne
 
Towards Generating Policy-compliant Datasets
Towards Generating Policy-compliant DatasetsTowards Generating Policy-compliant Datasets
Towards Generating Policy-compliant DatasetsChristophe Debruyne
 
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsGenerating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsChristophe Debruyne
 
Uplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RMLUplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RMLChristophe Debruyne
 
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...Christophe Debruyne
 
Client-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
Client-side Processing of GeoSPARQL Functions with Triple Pattern FragmentsClient-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
Client-side Processing of GeoSPARQL Functions with Triple Pattern FragmentsChristophe Debruyne
 
Serving Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataServing Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataChristophe Debruyne
 
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)Christophe Debruyne
 
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML MappingsR2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML MappingsChristophe Debruyne
 
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...Christophe Debruyne
 
Creating and Consuming Metadata from Transcribed Historical Vital Records for...
Creating and Consuming Metadata from Transcribed Historical Vital Records for...Creating and Consuming Metadata from Transcribed Historical Vital Records for...
Creating and Consuming Metadata from Transcribed Historical Vital Records for...Christophe Debruyne
 
User Satisfaction of a Hybrid Ontology-Engineering Tool
User Satisfaction of a Hybrid Ontology-Engineering ToolUser Satisfaction of a Hybrid Ontology-Engineering Tool
User Satisfaction of a Hybrid Ontology-Engineering ToolChristophe Debruyne
 

More from Christophe Debruyne (20)

One year of DALIDA Data Literacy Workshops for Adults: a Report
One year of DALIDA Data Literacy Workshops for Adults: a ReportOne year of DALIDA Data Literacy Workshops for Adults: a Report
One year of DALIDA Data Literacy Workshops for Adults: a Report
 
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologieProjet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
Projet TOXIN : Des graphes de connaissances pour la recherche en toxicologie
 
Knowledge Graphs: Concept, mogelijkheden en aandachtspunten
Knowledge Graphs: Concept, mogelijkheden en aandachtspuntenKnowledge Graphs: Concept, mogelijkheden en aandachtspunten
Knowledge Graphs: Concept, mogelijkheden en aandachtspunten
 
Reusable SHACL Constraint Components for Validating Geospatial Linked Data
Reusable SHACL Constraint Components for Validating Geospatial Linked DataReusable SHACL Constraint Components for Validating Geospatial Linked Data
Reusable SHACL Constraint Components for Validating Geospatial Linked Data
 
Hidden Amongst the Data: the Beyond 2022 Knowledge Graph
Hidden Amongst the Data: the Beyond 2022 Knowledge GraphHidden Amongst the Data: the Beyond 2022 Knowledge Graph
Hidden Amongst the Data: the Beyond 2022 Knowledge Graph
 
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainFacilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
 
Using Maps for Interlinking Geospatial Linked Data
Using Maps for Interlinking Geospatial Linked DataUsing Maps for Interlinking Geospatial Linked Data
Using Maps for Interlinking Geospatial Linked Data
 
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
Linked Data Publication and Interlinking Research within the SFI funded ADAPT...
 
Towards Generating Policy-compliant Datasets (poster)
Towards GeneratingPolicy-compliant Datasets (poster)Towards GeneratingPolicy-compliant Datasets (poster)
Towards Generating Policy-compliant Datasets (poster)
 
Towards Generating Policy-compliant Datasets
Towards Generating Policy-compliant DatasetsTowards Generating Policy-compliant Datasets
Towards Generating Policy-compliant Datasets
 
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure DefinitionsGenerating Executable Mappings from RDF Data Cube Data Structure Definitions
Generating Executable Mappings from RDF Data Cube Data Structure Definitions
 
Uplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RMLUplift – Generating RDF datasets from non-RDF data with R2RML
Uplift – Generating RDF datasets from non-RDF data with R2RML
 
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
A Lightweight Approach to Explore, Enrich and Use Data with a Geospatial Dime...
 
Client-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
Client-side Processing of GeoSPARQL Functions with Triple Pattern FragmentsClient-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
Client-side Processing of GeoSPARQL Functions with Triple Pattern Fragments
 
Serving Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked DataServing Ireland's Geospatial Information as Linked Data
Serving Ireland's Geospatial Information as Linked Data
 
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
Serving Ireland's Geospatial Information as Linked Data (ISWC 2016 Poster)
 
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML MappingsR2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings
 
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
Towards a Project Centric Metadata Model and Lifecycle for Ontology Mapping G...
 
Creating and Consuming Metadata from Transcribed Historical Vital Records for...
Creating and Consuming Metadata from Transcribed Historical Vital Records for...Creating and Consuming Metadata from Transcribed Historical Vital Records for...
Creating and Consuming Metadata from Transcribed Historical Vital Records for...
 
User Satisfaction of a Hybrid Ontology-Engineering Tool
User Satisfaction of a Hybrid Ontology-Engineering ToolUser Satisfaction of a Hybrid Ontology-Engineering Tool
User Satisfaction of a Hybrid Ontology-Engineering Tool
 

Recently uploaded

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 

Recently uploaded (20)

A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 

Generating Annotations by Comparing XML and Ontology

  • 1. Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations Nadejda Alkhaldi and Christophe Debruyne 16/10/11 Herhaling titel van presentatie 1
  • 2. Introduction Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.2
  • 3. Introduction   Ontologies are a [formal,] explicit specification of a [shared] conceptualization (Gruber)   Autonomously developed and maintained information systems commit to the ontology, a mostly manual activity.   How can we automate (a part) of this process? Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.3
  • 4. Method: overview   First we need an ontology. –  We used the DOGMA method for ontology engineering –  The development of the ontology is reported elsewhere in Debruyne et al. (WEBIST 2011)   Semi-automatically annotate the data –  Match concept in the (structure of) the data to the ontology –  Generate a Ω-RIDL commitment file –  Review of the mappings by representative of the information system Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.4
  • 5. γ in Γ Context-identifiers, Method: DOGMA pointers to a community   DOGMA Ontology Descriptions <Λ, ci, K> –  Λ a lexon base, a finite set of plausible binary fact types called lexons <γ, t1, r1, r2, t2> <Vendor Community, Offer, has, is of, Title> –  ci a partial function mapping context-identifiers and terms to concepts –  K a finite set of ontological commitments containing –  A selection of lexons –  A mapping from application symbols to ontology terms –  Predicates over those terms and roles to express constraints Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.5
  • 6. Method: DOGMA   Example of a commitment Ω-RIDL: Verheyden et al. (SWDB 2004), Trog et al. (RuleML 2007) Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.6
  • 7. Method: (Semi)-Automatic Annotation   First … related work? –  Annotation Techniques: AeroDAML, SHOE Knowledge Annotator, S-CREAM, MnM, Armadillo, KIM, SemTag, Ontea. –  Ontology and schema matching techniques: CUPID, iMAP, oMAP, H-Match –  Looking at different aspect and reusing ideas that might be usable Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.7
  • 8. Method: (Semi)-Automatic Annotation Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.8
  • 9. Method: (Semi)-Automatic Annotation   Some considerations –  Ontology contains explicit relations between concepts, the XML not –  XML tags can be matched concepts of the ontology, but the content of a tag can also represent an a concept E.g., <facility type=“bar”> should be typed onto the concept of Bar and not onto Facility of which Bar is a subtype. –  No XML Schema to rely on! –  Spelling mistakes/language variations Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.9
  • 10. Method: (Semi)-Automatic Annotation   1) Element match –  Match tag and attribute names using string metrics   2) Linguistic match –  Match tag and attribute names using an external thesaurs (e.g., WordNet or a domain specific thesuarus)   3) Content match –  Match the content of a tag (with respect to the tag) to identify the concept represented by the content Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.10
  • 11. Method: (Semi)-Automatic Annotation   4) Structural Match –  Adjust the previously computed weighted means by looking to the structure of both the ontology graph and XML-tree. Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.11
  • 12. Method: (Semi)-Automatic Annotation   To summarize: –  using an XML and a DOGMA ontology –  a series of mapping scores are calculated based on element, linguistic and content match –  Those scores are then refined using the structural match –  The refined scores are then compared against a threshold to produce the Ω-RIDL mappings. Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.12
  • 13. Method: Summary –  using an XML and a DOGMA ontology –  a series of mapping scores are calculated based on element, linguistic and content match –  Those scores are then refined using the structural match –  The refined scores are then compared against a threshold to produce the Ω-RIDL mappings. –  The user can then use the generated mappings to get an idea how his application can commit to the ontology and then decide how to do so. Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.13
  • 14. Tool Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.14
  • 15. Experiment   Data of the COMDRIVE RFP project –  Holiday Packages in the winter sports domain Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.15
  • 16. Experiment   Data of the COMDRIVE RFP project –  Holiday Packages in the winter sports domain   Ontology developed in several iterations in the project –  Bootstrapping of the ontology –  Meeting with vendor experts –  Meeting with consumer experts Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.16
  • 17. Experiment   Data of the COMDRIVE RFP project –  Holiday Packages in the winter sports domain   Ontology developed in several iterations in the project –  Bootstrapping of the ontology –  Meeting with vendor experts –  Meeting with consumer experts Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.17
  • 18. Experiment   Some generated mappings –  map ‘‘/countries/country/sumary/code’’ on Code identifies / identified by Commodity. –  map ‘‘/countries/country/regions/region’’ on Region. –  map ‘‘/countries/country/regions/region’’ on Ski Area destination of / with destination Holiday Package. –  map ‘‘/countries/country/regions/region/cities/city’’ City. –  … Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.18
  • 19. Conclusions   The four heuristics were able to tackle the considerations mentioned.   The algorithm depends on a good choice of parameters, otherwise a lot of “nonsense” mappings are generated   The structural match needs to be revisited to cope with more complicated cases such as: –  map ‘‘/countries/country/regions/region/summary/description’’ on Description of / has RFP.   Appropriate for suggesting the user mappings (needs testing) Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.19
  • 20. Future work   Revision of the structural match   Integration with tool suite (e.g., Business Semantics Studio)   Additional testing Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.20
  • 21. Questions? Comparing XML Files with a DOGMA Ontology to Generate Ω-RIDL Annotations 16/10/11 Pag.21