SlideShare une entreprise Scribd logo
1  sur  46
Knowledge engineering
and the Web
Guus Schreiber
VU University Amsterdam
Computer Science, Network
Institute
Overview of this talk
• Web data representation
– a meta view
• Knowledge for the Web: categories
– key sources
– Alignment
• Using knowledge: visualization and
search
My journey
knowledge engineering
• design patterns for
problem solving
• methodology for
knowledge systems
• models of domain
knowledge
• ontology
engineering
My journey
access to digital heritage
My journey
Web standards
Chair of
•Web metadata: RDF 1.1
•OWL Web Ontology Language 1.0
•SKOS model for publishing vocabularies
on the Web
•Deployment & best practices
A few words about
Web standardization
• Key success factor!
• Consensus process actually works
– Some of the time at least
• Public review
– Taking every comment seriously
• The danger of over-designing
– Principle of minimal commitment
Example: W3C RDF 1.1 group
• 8K group messages (publicly visible)
• 2K messages about external comments
• 125+ teleconferences
• 200 issues resolved
Web data representation
Caution
• Representation languages are there for
you
• And not the other way around ….
HTML5: a leap forward
Rationale
•Consistent separation of
content and presentation
•Semantics of the
structure of information
Typical new elements
<article>
<section>
<aside>
<header>
<footer>
RDF: triples and graphs
RDF is simply labeling resources and links
RDF: multiple graphs
www.example.org/bob
Data modeling on the Web
RDF
•Class hierarchy
•Property hierarchy
•Domain and range
restrictions
•Data types
• OWL
• Property characteristics
– E.g., inverse, functional,
transitive, …..
• Identify management
– E.g., same as,
equivalent class
• ……..
I prefer a pick-and-choose approach
Writing in an ontology language
does not make it an ontology!
• Ontology is vehicle for sharing
• Papers about your own idiosyncratic
“university ontology” should be rejected
at conferences
• The quality of an ontology does not
depend on the number of, for example,
OWL constructs used
Rationale
•A vocabulary represents
distilled knowledge of a
community
•Typically product of a
consensus process over
longer period of time
Use
•200+ vocabularies
published
•E.g.: Library of
Congress Subject
Headings
•Mainly in library field
SKOS: making existing
vocabularies Web accessible
The strength of SKOS lies its
simplicity
Baker et al: Key choices
in the design of SKOS
Beware of ontological over-
commitment
• We have the understandable tendency
to use semantic modeling constructs
whenever we can
• Better is to limit any Web model to the
absolute minimum
Knowledge on the web:
categories
The concept triad
Source: http://www.jamesodell.com/Ontology_White_Paper_2011-07-15.pdf.
Categorization
• OWL (Description logic) takes an
extensional view of classes
– A set is completely defined by its members
• This puts the emphasis on specifying
class boundaries
• Work of Rosch et al. takes a different
view
20
Categories (Rosch)
• Help us to organize the world
• Tools for perception
• Basic-level categories
– Are the prime categories used by people
– Have the highest number of common and
distinctive attributes
– What those basic-level categories are may
depend on context
21
Basic-level categories
22
23
FOAF: Friend of a Friend
24
Dublin Core: metadata of Web resources
Iconclass
categorizing image scene
schema.org
categories for TV programs
schema.org
the notion of “Role”
schema.org issues
• Top-down versus bottom-up
• Ownership and control
• Who can update/extend?
• Does use for general search bias the
vocabulary?
The myth of a unified
vocabulary
• In large virtual collections there are always
multiple vocabularies
– In multiple languages
• Every vocabulary has its own perspective
– You can’t just merge them
• But you can use vocabularies jointly by
defining a limited set of links
– “Vocabulary alignment”
Category alignment vs.
identity disambiguation
• Alignment concerns finding links
between (similar) categories, which
typically have no identity in the real world
• Identity disambiguation is finding out
whether two or more IDs point to the
same object in the real world (e.g.,
person, building, ship)
• The distinction is more subtle that “class
versus instance”
Alignment techniques
• Syntax: comparison of characters of the terms
– Measures of syntactic distance
– Language processing
• E.g. Tokenization, single/plural,
• Relate to lexical resource
– Relate terms to place in WordNet hierarchy
• Taxonomy comparison
– Look for common parents/children in taxonomy
• Instance based mapping
– Two classes are similar if their instances are similar.
Alignment evaluation
Limitations of categorical
thinking
Be modest! Don’t recreate,
but enrich and align
• Knowledge engineers should refrain
from developing their own idiosyncratic
ontologies
• Instead, they should make the available
rich vocabularies, thesauri and
databases available in an interoperable
(web) format
• Techniques: learning, alignment
Using knowledge: visualization
and search
Visualising piracy events
Extracting piracy events
from piracy reports & Web sources
Enriching description of
search results
Using alignment in search
“Tokugawa”
SVCN period
Edo
SVCN is local in-house
ethnology thesaurus
AAT style/period
Edo (Japanese period)
Tokugawa
AAT is Getty’s
Art & Architecture Thesaurus
Sample graph search algorithm
From search term (literal) to art work
•Find resources with matching label
•Find path from resource to art work
– Cost of each step (step when above cost
threshold)
– Special treatment of semantics: sameAs,
inverseOf, …
•Cluster results based on path similarities
Graph search
Example of path clustering
Issues:
•number of clusters
•path length
Location-based search:
Moulin de la Galette
relatively easy
Relation search:
Picasso, Matisse & Braque
Acknowledgements
• Long list of people
• Projects: COMMIT, Agora, PrestoPrime,
EuropeanaConnect, Poseidon,
BiographyNet, Multimedian E-Culture

Contenu connexe

Tendances

Principles for knowledge engineering on the Web
Principles for knowledge engineering on the WebPrinciples for knowledge engineering on the Web
Principles for knowledge engineering on the WebGuus Schreiber
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringeswcsummerschool
 
Mdst3703 2013-10-01-hypertext-and-history
Mdst3703 2013-10-01-hypertext-and-historyMdst3703 2013-10-01-hypertext-and-history
Mdst3703 2013-10-01-hypertext-and-historyRafael Alvarado
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Lora Aroyo
 
UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18Rafael Alvarado
 
Bloggen dhd (von Laurent Romary)
Bloggen dhd  (von Laurent Romary)Bloggen dhd  (von Laurent Romary)
Bloggen dhd (von Laurent Romary)MaxWeberStiftung
 
Introduction to digital scholarship tools
Introduction to digital scholarship toolsIntroduction to digital scholarship tools
Introduction to digital scholarship toolslibrarianrafia
 
Envisioning Social Applications of Library Linked Data
Envisioning Social Applications of Library Linked DataEnvisioning Social Applications of Library Linked Data
Envisioning Social Applications of Library Linked DataUldis Bojars
 
Exploring the Networks in Open Public Data
Exploring the Networks in Open Public DataExploring the Networks in Open Public Data
Exploring the Networks in Open Public DataUldis Bojars
 
Semantic Technologies in Learning Environments
Semantic Technologies in Learning EnvironmentsSemantic Technologies in Learning Environments
Semantic Technologies in Learning EnvironmentsDragan Gasevic
 
Commons Creativity (resources)
Commons Creativity (resources)Commons Creativity (resources)
Commons Creativity (resources)Dom Pates
 
Michalis Vafopoulos: Initial thoughts about existence in the Web
Michalis Vafopoulos: Initial thoughts about existence in the WebMichalis Vafopoulos: Initial thoughts about existence in the Web
Michalis Vafopoulos: Initial thoughts about existence in the WebPhiloWeb
 
Digital Humanities: An Introduction
Digital Humanities: An IntroductionDigital Humanities: An Introduction
Digital Humanities: An IntroductionDilip Barad
 

Tendances (19)

Principles for knowledge engineering on the Web
Principles for knowledge engineering on the WebPrinciples for knowledge engineering on the Web
Principles for knowledge engineering on the Web
 
Fri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineeringFri schreiber key_knowledge engineering
Fri schreiber key_knowledge engineering
 
Mdst3703 2013-10-01-hypertext-and-history
Mdst3703 2013-10-01-hypertext-and-historyMdst3703 2013-10-01-hypertext-and-history
Mdst3703 2013-10-01-hypertext-and-history
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013
 
UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18UVA MDST 3703 Thematic Research Collections 2012-09-18
UVA MDST 3703 Thematic Research Collections 2012-09-18
 
Bloggen dhd (von Laurent Romary)
Bloggen dhd  (von Laurent Romary)Bloggen dhd  (von Laurent Romary)
Bloggen dhd (von Laurent Romary)
 
20080606 VöGler GöTtingen E Humanities
20080606 VöGler GöTtingen E Humanities20080606 VöGler GöTtingen E Humanities
20080606 VöGler GöTtingen E Humanities
 
Introduction to digital scholarship tools
Introduction to digital scholarship toolsIntroduction to digital scholarship tools
Introduction to digital scholarship tools
 
The Blossoming of the Semantic Web
The Blossoming of the Semantic WebThe Blossoming of the Semantic Web
The Blossoming of the Semantic Web
 
Envisioning Social Applications of Library Linked Data
Envisioning Social Applications of Library Linked DataEnvisioning Social Applications of Library Linked Data
Envisioning Social Applications of Library Linked Data
 
Exploring the Networks in Open Public Data
Exploring the Networks in Open Public DataExploring the Networks in Open Public Data
Exploring the Networks in Open Public Data
 
Semantic Technologies in Learning Environments
Semantic Technologies in Learning EnvironmentsSemantic Technologies in Learning Environments
Semantic Technologies in Learning Environments
 
What is Digital Public History? Teaching and Practice

What is Digital Public History? Teaching and Practice
 What is Digital Public History? Teaching and Practice

What is Digital Public History? Teaching and Practice

 
Is Linked Open Data the way forward?
Is Linked Open Data the way forward?Is Linked Open Data the way forward?
Is Linked Open Data the way forward?
 
Digicraft and 'Systemic' Thinking in Digital Humanities
Digicraft and 'Systemic' Thinking  in Digital HumanitiesDigicraft and 'Systemic' Thinking  in Digital Humanities
Digicraft and 'Systemic' Thinking in Digital Humanities
 
Commons Creativity (resources)
Commons Creativity (resources)Commons Creativity (resources)
Commons Creativity (resources)
 
Michalis Vafopoulos: Initial thoughts about existence in the Web
Michalis Vafopoulos: Initial thoughts about existence in the WebMichalis Vafopoulos: Initial thoughts about existence in the Web
Michalis Vafopoulos: Initial thoughts about existence in the Web
 
Digital Humanities, Digital Libraries and Information Science: what relation?
Digital Humanities, Digital Libraries and Information Science: what relation?Digital Humanities, Digital Libraries and Information Science: what relation?
Digital Humanities, Digital Libraries and Information Science: what relation?
 
Digital Humanities: An Introduction
Digital Humanities: An IntroductionDigital Humanities: An Introduction
Digital Humanities: An Introduction
 

Similaire à Knowledge engineering and the Web

Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Janet Leu
 
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...Keith.May
 
TSS 2017: Terminology and Knowledge Organization Systems
TSS 2017: Terminology and Knowledge Organization SystemsTSS 2017: Terminology and Knowledge Organization Systems
TSS 2017: Terminology and Knowledge Organization SystemsMichael Wetzel
 
Tutorial: Building and using ontologies - E.Simperl - ESWC SS 2014
 Tutorial: Building and using ontologies -  E.Simperl - ESWC SS 2014 Tutorial: Building and using ontologies -  E.Simperl - ESWC SS 2014
Tutorial: Building and using ontologies - E.Simperl - ESWC SS 2014eswcsummerschool
 
Building and using ontologies
Building and using ontologies Building and using ontologies
Building and using ontologies Elena Simperl
 
Web 3 final(1)
Web 3 final(1)Web 3 final(1)
Web 3 final(1)Venky Dood
 
Porting Library Vocabularies to the Semantic Web - IFLA 2010
Porting Library Vocabularies to the Semantic Web - IFLA 2010Porting Library Vocabularies to the Semantic Web - IFLA 2010
Porting Library Vocabularies to the Semantic Web - IFLA 2010Bernard Vatant
 
Ontology Engineering: Introduction
Ontology Engineering: IntroductionOntology Engineering: Introduction
Ontology Engineering: IntroductionGuus Schreiber
 
Semantically-enabled Browsing of Large Multilingual Document Collections
Semantically-enabled Browsing of Large Multilingual Document CollectionsSemantically-enabled Browsing of Large Multilingual Document Collections
Semantically-enabled Browsing of Large Multilingual Document CollectionsCarlos Badenes-Olmedo
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011Antoine Isaac
 
Semantic Technologies in HE Seminar - Learning Societies Lab
Semantic Technologies in HE Seminar - Learning Societies LabSemantic Technologies in HE Seminar - Learning Societies Lab
Semantic Technologies in HE Seminar - Learning Societies Labthanassis
 
Innovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPInnovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPariadnenetwork
 
Federated to library discovery platfoms
Federated to library discovery platfomsFederated to library discovery platfoms
Federated to library discovery platfomsNikesh Narayanan
 

Similaire à Knowledge engineering and the Web (20)

Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.Taxonomy, ontology, folksonomies & SKOS.
Taxonomy, ontology, folksonomies & SKOS.
 
Ontologies Fmi 042010
Ontologies Fmi 042010Ontologies Fmi 042010
Ontologies Fmi 042010
 
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
EAA2014 Istanbul - Barriers and Opportunities for Linked Open Data use in Arc...
 
TSS 2017: Terminology and Knowledge Organization Systems
TSS 2017: Terminology and Knowledge Organization SystemsTSS 2017: Terminology and Knowledge Organization Systems
TSS 2017: Terminology and Knowledge Organization Systems
 
Tutorial: Building and using ontologies - E.Simperl - ESWC SS 2014
 Tutorial: Building and using ontologies -  E.Simperl - ESWC SS 2014 Tutorial: Building and using ontologies -  E.Simperl - ESWC SS 2014
Tutorial: Building and using ontologies - E.Simperl - ESWC SS 2014
 
Building and using ontologies
Building and using ontologies Building and using ontologies
Building and using ontologies
 
Web 3 final(1)
Web 3 final(1)Web 3 final(1)
Web 3 final(1)
 
Porting Library Vocabularies to the Semantic Web - IFLA 2010
Porting Library Vocabularies to the Semantic Web - IFLA 2010Porting Library Vocabularies to the Semantic Web - IFLA 2010
Porting Library Vocabularies to the Semantic Web - IFLA 2010
 
Ontology Engineering: Introduction
Ontology Engineering: IntroductionOntology Engineering: Introduction
Ontology Engineering: Introduction
 
Semantically-enabled Browsing of Large Multilingual Document Collections
Semantically-enabled Browsing of Large Multilingual Document CollectionsSemantically-enabled Browsing of Large Multilingual Document Collections
Semantically-enabled Browsing of Large Multilingual Document Collections
 
sw owl
 sw owl sw owl
sw owl
 
EDS for JIBS
EDS for JIBSEDS for JIBS
EDS for JIBS
 
Taxonomies and Metadata
Taxonomies and MetadataTaxonomies and Metadata
Taxonomies and Metadata
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
W3C Library Linked Data Incubator Group - 2011
W3C Library Linked Data Incubator Group  - 2011W3C Library Linked Data Incubator Group  - 2011
W3C Library Linked Data Incubator Group - 2011
 
A theory of Metadata enriching & filtering
A theory of  Metadata enriching & filteringA theory of  Metadata enriching & filtering
A theory of Metadata enriching & filtering
 
Semantic Technologies in HE Seminar - Learning Societies Lab
Semantic Technologies in HE Seminar - Learning Societies LabSemantic Technologies in HE Seminar - Learning Societies Lab
Semantic Technologies in HE Seminar - Learning Societies Lab
 
Innovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLPInnovative methods for data integration: Linked Data and NLP
Innovative methods for data integration: Linked Data and NLP
 
Ontology
OntologyOntology
Ontology
 
Federated to library discovery platfoms
Federated to library discovery platfomsFederated to library discovery platfoms
Federated to library discovery platfoms
 

Plus de Guus Schreiber

Ontologies: vehicles for reuse
Ontologies: vehicles for reuseOntologies: vehicles for reuse
Ontologies: vehicles for reuseGuus Schreiber
 
Linking historical ship records to a newspaper archive
Linking historical ship records to a newspaper archiveLinking historical ship records to a newspaper archive
Linking historical ship records to a newspaper archiveGuus Schreiber
 
CommonKADS project management
CommonKADS project managementCommonKADS project management
CommonKADS project managementGuus Schreiber
 
UML notations used by CommonKADS
UML notations used by CommonKADSUML notations used by CommonKADS
UML notations used by CommonKADSGuus Schreiber
 
Advanced knowledge modelling
Advanced knowledge modellingAdvanced knowledge modelling
Advanced knowledge modellingGuus Schreiber
 
CommonKADS design and implementation
CommonKADS design and implementationCommonKADS design and implementation
CommonKADS design and implementationGuus Schreiber
 
CommonKADS communication model
CommonKADS communication modelCommonKADS communication model
CommonKADS communication modelGuus Schreiber
 
CommonKADS knowledge modelling process
CommonKADS knowledge modelling processCommonKADS knowledge modelling process
CommonKADS knowledge modelling processGuus Schreiber
 
CommonKADS knowledge model templates
CommonKADS knowledge model templatesCommonKADS knowledge model templates
CommonKADS knowledge model templatesGuus Schreiber
 
CommonKADS knowledge modelling basics
CommonKADS knowledge modelling basicsCommonKADS knowledge modelling basics
CommonKADS knowledge modelling basicsGuus Schreiber
 
CommonKADS knowledge management
CommonKADS knowledge managementCommonKADS knowledge management
CommonKADS knowledge managementGuus Schreiber
 
CommonKADS context models
CommonKADS context modelsCommonKADS context models
CommonKADS context modelsGuus Schreiber
 
Semantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsSemantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsGuus Schreiber
 
The Semantic Web: status and prospects
The Semantic Web: status and prospectsThe Semantic Web: status and prospects
The Semantic Web: status and prospectsGuus Schreiber
 
E-Culture semantic search pilot
E-Culture semantic search pilotE-Culture semantic search pilot
E-Culture semantic search pilotGuus Schreiber
 
Ontology Engineering: Ontology Use
Ontology Engineering: Ontology UseOntology Engineering: Ontology Use
Ontology Engineering: Ontology UseGuus Schreiber
 
Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignmentGuus Schreiber
 
Ontology Engineering: Ontology evaluation
Ontology Engineering: Ontology evaluationOntology Engineering: Ontology evaluation
Ontology Engineering: Ontology evaluationGuus Schreiber
 

Plus de Guus Schreiber (20)

Ontologies: vehicles for reuse
Ontologies: vehicles for reuseOntologies: vehicles for reuse
Ontologies: vehicles for reuse
 
Linking historical ship records to a newspaper archive
Linking historical ship records to a newspaper archiveLinking historical ship records to a newspaper archive
Linking historical ship records to a newspaper archive
 
CommonKADS project management
CommonKADS project managementCommonKADS project management
CommonKADS project management
 
UML notations used by CommonKADS
UML notations used by CommonKADSUML notations used by CommonKADS
UML notations used by CommonKADS
 
Advanced knowledge modelling
Advanced knowledge modellingAdvanced knowledge modelling
Advanced knowledge modelling
 
CommonKADS design and implementation
CommonKADS design and implementationCommonKADS design and implementation
CommonKADS design and implementation
 
CommonKADS communication model
CommonKADS communication modelCommonKADS communication model
CommonKADS communication model
 
CommonKADS knowledge modelling process
CommonKADS knowledge modelling processCommonKADS knowledge modelling process
CommonKADS knowledge modelling process
 
CommonKADS knowledge model templates
CommonKADS knowledge model templatesCommonKADS knowledge model templates
CommonKADS knowledge model templates
 
CommonKADS knowledge modelling basics
CommonKADS knowledge modelling basicsCommonKADS knowledge modelling basics
CommonKADS knowledge modelling basics
 
CommonKADS knowledge management
CommonKADS knowledge managementCommonKADS knowledge management
CommonKADS knowledge management
 
CommonKADS context models
CommonKADS context modelsCommonKADS context models
CommonKADS context models
 
Introduction
IntroductionIntroduction
Introduction
 
Semantic Web: From Representations to Applications
Semantic Web: From Representations to ApplicationsSemantic Web: From Representations to Applications
Semantic Web: From Representations to Applications
 
The Semantic Web: status and prospects
The Semantic Web: status and prospectsThe Semantic Web: status and prospects
The Semantic Web: status and prospects
 
E-Culture semantic search pilot
E-Culture semantic search pilotE-Culture semantic search pilot
E-Culture semantic search pilot
 
Vista-TV overview
Vista-TV overviewVista-TV overview
Vista-TV overview
 
Ontology Engineering: Ontology Use
Ontology Engineering: Ontology UseOntology Engineering: Ontology Use
Ontology Engineering: Ontology Use
 
Ontology engineering: Ontology alignment
Ontology engineering: Ontology alignmentOntology engineering: Ontology alignment
Ontology engineering: Ontology alignment
 
Ontology Engineering: Ontology evaluation
Ontology Engineering: Ontology evaluationOntology Engineering: Ontology evaluation
Ontology Engineering: Ontology evaluation
 

Dernier

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 

Dernier (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 

Knowledge engineering and the Web

  • 1. Knowledge engineering and the Web Guus Schreiber VU University Amsterdam Computer Science, Network Institute
  • 2. Overview of this talk • Web data representation – a meta view • Knowledge for the Web: categories – key sources – Alignment • Using knowledge: visualization and search
  • 3. My journey knowledge engineering • design patterns for problem solving • methodology for knowledge systems • models of domain knowledge • ontology engineering
  • 4. My journey access to digital heritage
  • 5. My journey Web standards Chair of •Web metadata: RDF 1.1 •OWL Web Ontology Language 1.0 •SKOS model for publishing vocabularies on the Web •Deployment & best practices
  • 6. A few words about Web standardization • Key success factor! • Consensus process actually works – Some of the time at least • Public review – Taking every comment seriously • The danger of over-designing – Principle of minimal commitment
  • 7. Example: W3C RDF 1.1 group • 8K group messages (publicly visible) • 2K messages about external comments • 125+ teleconferences • 200 issues resolved
  • 9. Caution • Representation languages are there for you • And not the other way around ….
  • 10. HTML5: a leap forward Rationale •Consistent separation of content and presentation •Semantics of the structure of information Typical new elements <article> <section> <aside> <header> <footer>
  • 11. RDF: triples and graphs RDF is simply labeling resources and links
  • 13. Data modeling on the Web RDF •Class hierarchy •Property hierarchy •Domain and range restrictions •Data types • OWL • Property characteristics – E.g., inverse, functional, transitive, ….. • Identify management – E.g., same as, equivalent class • …….. I prefer a pick-and-choose approach
  • 14. Writing in an ontology language does not make it an ontology! • Ontology is vehicle for sharing • Papers about your own idiosyncratic “university ontology” should be rejected at conferences • The quality of an ontology does not depend on the number of, for example, OWL constructs used
  • 15. Rationale •A vocabulary represents distilled knowledge of a community •Typically product of a consensus process over longer period of time Use •200+ vocabularies published •E.g.: Library of Congress Subject Headings •Mainly in library field SKOS: making existing vocabularies Web accessible
  • 16. The strength of SKOS lies its simplicity Baker et al: Key choices in the design of SKOS
  • 17. Beware of ontological over- commitment • We have the understandable tendency to use semantic modeling constructs whenever we can • Better is to limit any Web model to the absolute minimum
  • 18. Knowledge on the web: categories
  • 19. The concept triad Source: http://www.jamesodell.com/Ontology_White_Paper_2011-07-15.pdf.
  • 20. Categorization • OWL (Description logic) takes an extensional view of classes – A set is completely defined by its members • This puts the emphasis on specifying class boundaries • Work of Rosch et al. takes a different view 20
  • 21. Categories (Rosch) • Help us to organize the world • Tools for perception • Basic-level categories – Are the prime categories used by people – Have the highest number of common and distinctive attributes – What those basic-level categories are may depend on context 21
  • 23. 23 FOAF: Friend of a Friend
  • 24. 24 Dublin Core: metadata of Web resources
  • 28. schema.org issues • Top-down versus bottom-up • Ownership and control • Who can update/extend? • Does use for general search bias the vocabulary?
  • 29. The myth of a unified vocabulary • In large virtual collections there are always multiple vocabularies – In multiple languages • Every vocabulary has its own perspective – You can’t just merge them • But you can use vocabularies jointly by defining a limited set of links – “Vocabulary alignment”
  • 30. Category alignment vs. identity disambiguation • Alignment concerns finding links between (similar) categories, which typically have no identity in the real world • Identity disambiguation is finding out whether two or more IDs point to the same object in the real world (e.g., person, building, ship) • The distinction is more subtle that “class versus instance”
  • 31. Alignment techniques • Syntax: comparison of characters of the terms – Measures of syntactic distance – Language processing • E.g. Tokenization, single/plural, • Relate to lexical resource – Relate terms to place in WordNet hierarchy • Taxonomy comparison – Look for common parents/children in taxonomy • Instance based mapping – Two classes are similar if their instances are similar.
  • 34. Be modest! Don’t recreate, but enrich and align • Knowledge engineers should refrain from developing their own idiosyncratic ontologies • Instead, they should make the available rich vocabularies, thesauri and databases available in an interoperable (web) format • Techniques: learning, alignment
  • 37. Extracting piracy events from piracy reports & Web sources
  • 39. Using alignment in search “Tokugawa” SVCN period Edo SVCN is local in-house ethnology thesaurus AAT style/period Edo (Japanese period) Tokugawa AAT is Getty’s Art & Architecture Thesaurus
  • 40. Sample graph search algorithm From search term (literal) to art work •Find resources with matching label •Find path from resource to art work – Cost of each step (step when above cost threshold) – Special treatment of semantics: sameAs, inverseOf, … •Cluster results based on path similarities
  • 42. Example of path clustering Issues: •number of clusters •path length
  • 43. Location-based search: Moulin de la Galette relatively easy
  • 45.
  • 46. Acknowledgements • Long list of people • Projects: COMMIT, Agora, PrestoPrime, EuropeanaConnect, Poseidon, BiographyNet, Multimedian E-Culture