SlideShare une entreprise Scribd logo
1  sur  19
Télécharger pour lire hors ligne
Crafting a Knowledge Graph Strategy
What to think about
Panos Alexopoulos
Head of Ontology
2.5 years ago I got a mission ...
• “Good morning Mr. Alexopoulos. Your mission,
should you decide to accept it, is to build an
HR and Recruitment Knowledge Graph.”
• “As always, should you or any member of your
team be caught or killed, the company will
disavow any knowledge of your actions. “
• “This tape will self-destruct in 5 seconds.”
… which I accepted as I had the perfect recipe ...
… that often didn’t quite work as expected ...
… leading to several hard-learned lessons ...
… one of them being:
Is it easy to build a
Knowledge Graph?
A tale of two graphs
• EU’s ESCO knowledge graph:
• Released September 2017
• 12K Skill and 3K Profession interrelated concepts in 27
languages
• Built completely manually by experts in 6 years and cost
~15 million Euros
• To be updated once a year by a maintenance committee
• DBPedia knowledge graph
• First release in 2007
• Currently >17M entities in 127 languages
• Automatically extracted from Wikipedia infoboxes
• Live version and monthly releases
Why the difference?
• Types of entities: Abstract entities more difficult to define than concrete
ones (e.g. “Ontology” v.s. “The Netherlands”)
• Types of relations: Vague relations harder to define and achieve
consensus (e.g. “has essential skill” v.s. “was born in”)
• Available data sources: There are no Wikipedia-like infoboxes for
Professions and Skills that could enable automatic extraction.
• Quality requirements: ESCO wanted very high accuracy.
Knowledge Graphs and
Strategy
Strategy is about “What” rather than “How”
• What domains of knowledge we want our
Knowledge Graph to cover?
• What data we want to connect?
• What applications we want to power?
• What users we want to serve?
• What quality dimensions are important and
at what levels?
Strategy is about Choices, Dilemmas and Decisions
• Domain A or Domain B?
• Accuracy or Completeness?
• Application A or Application B?
• Data-Driven or Human-Driven?
• Democracy, Oligarchy or Dictatorship?
• Single or Multiple Truths?
Crafting a Knowledge
Graph Strategy
Know thy Goals
• What business/technical goals are you trying to
achieve via the Knowledge Graph?
• Imitate the others?
• Enhance current products/processes and get
ahead of the competition ? If so, which?
• Lay the foundations for new products/processes?
If so, which?
• Integrate dispersed data and improve their
governance?
• …
• Extract!: Entity and Relation Tagging and Disambiguation in CVs and Vacancies
• Search! and Match!: Query Understanding and Expansion, Semantic Similarity
• Jobfeed: Labour Market Analytics
Example: Products enhanced by the Textkernel Knowledge
Graph
Know thy Context
• Domain(s) and Semantics
• What elements does the Knowledge Graph need to provide
(entity types, relations, etc)?
• Is there ambiguity, vagueness, semantic drift? To what extent?
• Technology Context
• What knowledge can be automatically mined with the
technology we have?
• How fast and how accurately?
• Organizational Context
• Data, technology and products
• People, knowledge and capabilities
• Processes, practices and attitudes
• Business Context
• Business strategy and model(s)
• Clients, partners and competitors
• Trends, hypes, expectations and narratives
Use Context to make choices and break dilemmas
• If your semantics are highly vague and cause disagreements you
might need to support multiple truths.
• If the best extraction system for relation X is only 50% accurate, you
will probably need manual curation.
• If there is already a non-RDF semantics management infrastructure
in the organization, it may not worth replacing it with an RDF one!
• If product A generates more revenue than product B, you might need
to optimize the KG for A.
• If your competitor outperforms you in coverage because of more data
and better infrastructure, you might want to compete on accuracy or
some niche complex domain..
Questions to think when you return to the office
Do w w y e
bu n (or t) a
Kno d G ap ?
Do w w o t
an r u “Wha ” an
b e d e m ?
➔ High-level goals are ok but
you need concrete goals
➔ Any answer with “AI”, “Big
Data” or other buzzword
should be ignored!
➔ Semantics
➔ Data
➔ Technology
➔ Organization
➔ Business
➔ Basic philosophy and
principles
➔ Definition of success (and
failure)
➔ Decision making processes
Do w w e c l e
Con t?
Thank you!
Panos Alexopoulos
Head of Ontology
E-mail: alexopoulos@textkernel.nl
Web: http://www.panosalexopoulos.com
LinkedIn: www.linkedin.com/in/panosalexopoulos
Twitter: @PAlexop

Contenu connexe

Tendances

Edmc use cases 2018 nyc
Edmc use cases 2018   nycEdmc use cases 2018   nyc
Edmc use cases 2018 nycMarty Loughlin
 
How a global manufacturing company built a data science capability from scratch
How a global manufacturing company built a data science capability from scratchHow a global manufacturing company built a data science capability from scratch
How a global manufacturing company built a data science capability from scratchCarlo Torniai
 
State street edmc swaps pilot
State street edmc swaps pilotState street edmc swaps pilot
State street edmc swaps pilotMarty Loughlin
 
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...Dr. Haxel Consult
 
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlowMachine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlowandrejusb
 
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...Dr. Haxel Consult
 
5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework
5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework
5. Building the Cancer Research Data Commons with Neo4j: The Bento FrameworkNeo4j
 
Neo4j: What's Under the Hood & How Knowing This Can Help You
Neo4j: What's Under the Hood & How Knowing This Can Help You Neo4j: What's Under the Hood & How Knowing This Can Help You
Neo4j: What's Under the Hood & How Knowing This Can Help You Neo4j
 
Talk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotTalk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotLuc Bors
 
How LinkedIn leverages data to build scalable payments strategy
How LinkedIn leverages data to build scalable payments strategyHow LinkedIn leverages data to build scalable payments strategy
How LinkedIn leverages data to build scalable payments strategyChi-Yi Kuan
 
GraphTour - DXC - Digital Explorer
GraphTour - DXC - Digital ExplorerGraphTour - DXC - Digital Explorer
GraphTour - DXC - Digital ExplorerNeo4j
 
Neo4j: What's Under the Hood
Neo4j: What's Under the HoodNeo4j: What's Under the Hood
Neo4j: What's Under the HoodNeo4j
 
Compliance made easy: Lynx webinar #1
Compliance made easy: Lynx webinar #1Compliance made easy: Lynx webinar #1
Compliance made easy: Lynx webinar #1Lynx Project
 
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...semanticsconference
 
Session 2.1 ontological representation of the telecom domain for advanced a...
Session 2.1   ontological representation of the telecom domain for advanced a...Session 2.1   ontological representation of the telecom domain for advanced a...
Session 2.1 ontological representation of the telecom domain for advanced a...semanticsconference
 
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
Graph Data Science: The Secret to Accelerating Innovation with AI/MLGraph Data Science: The Secret to Accelerating Innovation with AI/ML
Graph Data Science: The Secret to Accelerating Innovation with AI/MLNeo4j
 
W3C-XBRL International Workshop 2009
W3C-XBRL International Workshop 2009W3C-XBRL International Workshop 2009
W3C-XBRL International Workshop 2009ashubhatnagar
 
Test strategy for Conversational AI
Test strategy for Conversational AITest strategy for Conversational AI
Test strategy for Conversational AIShama Ugale
 
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...Dr. Haxel Consult
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4jjexp
 

Tendances (20)

Edmc use cases 2018 nyc
Edmc use cases 2018   nycEdmc use cases 2018   nyc
Edmc use cases 2018 nyc
 
How a global manufacturing company built a data science capability from scratch
How a global manufacturing company built a data science capability from scratchHow a global manufacturing company built a data science capability from scratch
How a global manufacturing company built a data science capability from scratch
 
State street edmc swaps pilot
State street edmc swaps pilotState street edmc swaps pilot
State street edmc swaps pilot
 
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
AI-SDV 2021: Jay ven Eman - implementation-of-new-technology-within-a-big-pha...
 
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlowMachine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow
Machine Learning Applied - Contextual Chatbots Coding, Oracle JET and TensorFlow
 
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
AI-SDV 2021: Stefan Geissler - AI support for creating and maintaining vocabu...
 
5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework
5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework
5. Building the Cancer Research Data Commons with Neo4j: The Bento Framework
 
Neo4j: What's Under the Hood & How Knowing This Can Help You
Neo4j: What's Under the Hood & How Knowing This Can Help You Neo4j: What's Under the Hood & How Knowing This Can Help You
Neo4j: What's Under the Hood & How Knowing This Can Help You
 
Talk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular ChatbotTalk to me Goose: Going beyond your regular Chatbot
Talk to me Goose: Going beyond your regular Chatbot
 
How LinkedIn leverages data to build scalable payments strategy
How LinkedIn leverages data to build scalable payments strategyHow LinkedIn leverages data to build scalable payments strategy
How LinkedIn leverages data to build scalable payments strategy
 
GraphTour - DXC - Digital Explorer
GraphTour - DXC - Digital ExplorerGraphTour - DXC - Digital Explorer
GraphTour - DXC - Digital Explorer
 
Neo4j: What's Under the Hood
Neo4j: What's Under the HoodNeo4j: What's Under the Hood
Neo4j: What's Under the Hood
 
Compliance made easy: Lynx webinar #1
Compliance made easy: Lynx webinar #1Compliance made easy: Lynx webinar #1
Compliance made easy: Lynx webinar #1
 
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
 
Session 2.1 ontological representation of the telecom domain for advanced a...
Session 2.1   ontological representation of the telecom domain for advanced a...Session 2.1   ontological representation of the telecom domain for advanced a...
Session 2.1 ontological representation of the telecom domain for advanced a...
 
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
Graph Data Science: The Secret to Accelerating Innovation with AI/MLGraph Data Science: The Secret to Accelerating Innovation with AI/ML
Graph Data Science: The Secret to Accelerating Innovation with AI/ML
 
W3C-XBRL International Workshop 2009
W3C-XBRL International Workshop 2009W3C-XBRL International Workshop 2009
W3C-XBRL International Workshop 2009
 
Test strategy for Conversational AI
Test strategy for Conversational AITest strategy for Conversational AI
Test strategy for Conversational AI
 
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
AI-SDV 2020: Using Transformer technology to build an AI based personal News ...
 
Intro to Graphs and Neo4j
Intro to Graphs and Neo4jIntro to Graphs and Neo4j
Intro to Graphs and Neo4j
 

Similaire à Crafting a Knowledge Graph Strategy - What to think about

What Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceAnnie Flippo
 
Crafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeArushi Prakash, Ph.D.
 
Certified Data Analyst - Ain GenX (Pvt.) Ltd.-3.pdf
Certified Data Analyst - Ain GenX (Pvt.) Ltd.-3.pdfCertified Data Analyst - Ain GenX (Pvt.) Ltd.-3.pdf
Certified Data Analyst - Ain GenX (Pvt.) Ltd.-3.pdfnosharwan1
 
CTO School Meetup - Jan 2013 Becoming Better Technical Leader
CTO School Meetup - Jan 2013   Becoming Better Technical LeaderCTO School Meetup - Jan 2013   Becoming Better Technical Leader
CTO School Meetup - Jan 2013 Becoming Better Technical LeaderJean Barmash
 
Be the Captain of Your Career
Be the Captain of Your Career Be the Captain of Your Career
Be the Captain of Your Career Jack Molisani
 
Lessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 yearLessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 yearYao Yao
 
InfoVision_PM101_RPadaki
InfoVision_PM101_RPadakiInfoVision_PM101_RPadaki
InfoVision_PM101_RPadakiRavi Padaki
 
DACS Presentation January 2015
DACS Presentation January 2015DACS Presentation January 2015
DACS Presentation January 2015John Barry
 
DACS Presentation January 2015
DACS Presentation January 2015 DACS Presentation January 2015
DACS Presentation January 2015 amandablair203
 
Are you ready for Data science? A 12 point test
Are you ready for Data science? A 12 point testAre you ready for Data science? A 12 point test
Are you ready for Data science? A 12 point testBertil Hatt
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsApplause
 
Data and analytic strategies for developing ethical it
Data and analytic strategies for developing ethical itData and analytic strategies for developing ethical it
Data and analytic strategies for developing ethical itHyoun Park
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsVivastream
 
1 data science with python
1 data science with python1 data science with python
1 data science with pythonVishal Sathawane
 
How to start your data career
How to start your data careerHow to start your data career
How to start your data careerAdwait Bhave
 
Enterprise search Information
Enterprise search Information Enterprise search Information
Enterprise search Information Netwoven Inc.
 
Data Solutions for Products by Leadspace Director of Analytics
Data Solutions for Products by Leadspace Director of AnalyticsData Solutions for Products by Leadspace Director of Analytics
Data Solutions for Products by Leadspace Director of AnalyticsProduct School
 

Similaire à Crafting a Knowledge Graph Strategy - What to think about (20)

Lean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science teamLean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science team
 
What Managers Need to Know about Data Science
What Managers Need to Know about Data ScienceWhat Managers Need to Know about Data Science
What Managers Need to Know about Data Science
 
Crafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science Resume
 
Synergis60: 6 Critical Steps to Implementing Data Managment
Synergis60: 6 Critical Steps to Implementing Data ManagmentSynergis60: 6 Critical Steps to Implementing Data Managment
Synergis60: 6 Critical Steps to Implementing Data Managment
 
Certified Data Analyst - Ain GenX (Pvt.) Ltd.-3.pdf
Certified Data Analyst - Ain GenX (Pvt.) Ltd.-3.pdfCertified Data Analyst - Ain GenX (Pvt.) Ltd.-3.pdf
Certified Data Analyst - Ain GenX (Pvt.) Ltd.-3.pdf
 
CTO School Meetup - Jan 2013 Becoming Better Technical Leader
CTO School Meetup - Jan 2013   Becoming Better Technical LeaderCTO School Meetup - Jan 2013   Becoming Better Technical Leader
CTO School Meetup - Jan 2013 Becoming Better Technical Leader
 
Be the Captain of Your Career
Be the Captain of Your Career Be the Captain of Your Career
Be the Captain of Your Career
 
Lessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 yearLessons after working as a data scientist for 1 year
Lessons after working as a data scientist for 1 year
 
How to win StartupWeekend
How to win StartupWeekendHow to win StartupWeekend
How to win StartupWeekend
 
InfoVision_PM101_RPadaki
InfoVision_PM101_RPadakiInfoVision_PM101_RPadaki
InfoVision_PM101_RPadaki
 
DACS Presentation January 2015
DACS Presentation January 2015DACS Presentation January 2015
DACS Presentation January 2015
 
DACS Presentation January 2015
DACS Presentation January 2015 DACS Presentation January 2015
DACS Presentation January 2015
 
Are you ready for Data science? A 12 point test
Are you ready for Data science? A 12 point testAre you ready for Data science? A 12 point test
Are you ready for Data science? A 12 point test
 
Scaling Training Data for AI Applications
Scaling Training Data for AI ApplicationsScaling Training Data for AI Applications
Scaling Training Data for AI Applications
 
Data and analytic strategies for developing ethical it
Data and analytic strategies for developing ethical itData and analytic strategies for developing ethical it
Data and analytic strategies for developing ethical it
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisions
 
1 data science with python
1 data science with python1 data science with python
1 data science with python
 
How to start your data career
How to start your data careerHow to start your data career
How to start your data career
 
Enterprise search Information
Enterprise search Information Enterprise search Information
Enterprise search Information
 
Data Solutions for Products by Leadspace Director of Analytics
Data Solutions for Products by Leadspace Director of AnalyticsData Solutions for Products by Leadspace Director of Analytics
Data Solutions for Products by Leadspace Director of Analytics
 

Plus de Connected Data World

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenConnected Data World
 
Graph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaGraph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaConnected Data World
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Connected Data World
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine LearningConnected Data World
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is hereConnected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2Connected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3Connected Data World
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data ModelConnected Data World
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseConnected Data World
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Connected Data World
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Connected Data World
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleConnected Data World
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Connected Data World
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the WebConnected Data World
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsConnected Data World
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsConnected Data World
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...Connected Data World
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGOConnected Data World
 

Plus de Connected Data World (20)

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van Harmelen
 
Graph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaGraph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora Lassila
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine Learning
 
Graphs in sustainable finance
Graphs in sustainable financeGraphs in sustainable finance
Graphs in sustainable finance
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
 
Graph Realities
Graph RealitiesGraph Realities
Graph Realities
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scale
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the Web
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGO
 

Dernier

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 

Dernier (20)

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 

Crafting a Knowledge Graph Strategy - What to think about

  • 1. Crafting a Knowledge Graph Strategy What to think about Panos Alexopoulos Head of Ontology
  • 2. 2.5 years ago I got a mission ... • “Good morning Mr. Alexopoulos. Your mission, should you decide to accept it, is to build an HR and Recruitment Knowledge Graph.” • “As always, should you or any member of your team be caught or killed, the company will disavow any knowledge of your actions. “ • “This tape will self-destruct in 5 seconds.”
  • 3. … which I accepted as I had the perfect recipe ...
  • 4. … that often didn’t quite work as expected ...
  • 5. … leading to several hard-learned lessons ...
  • 6. … one of them being:
  • 7. Is it easy to build a Knowledge Graph?
  • 8. A tale of two graphs • EU’s ESCO knowledge graph: • Released September 2017 • 12K Skill and 3K Profession interrelated concepts in 27 languages • Built completely manually by experts in 6 years and cost ~15 million Euros • To be updated once a year by a maintenance committee • DBPedia knowledge graph • First release in 2007 • Currently >17M entities in 127 languages • Automatically extracted from Wikipedia infoboxes • Live version and monthly releases
  • 9. Why the difference? • Types of entities: Abstract entities more difficult to define than concrete ones (e.g. “Ontology” v.s. “The Netherlands”) • Types of relations: Vague relations harder to define and achieve consensus (e.g. “has essential skill” v.s. “was born in”) • Available data sources: There are no Wikipedia-like infoboxes for Professions and Skills that could enable automatic extraction. • Quality requirements: ESCO wanted very high accuracy.
  • 11. Strategy is about “What” rather than “How” • What domains of knowledge we want our Knowledge Graph to cover? • What data we want to connect? • What applications we want to power? • What users we want to serve? • What quality dimensions are important and at what levels?
  • 12. Strategy is about Choices, Dilemmas and Decisions • Domain A or Domain B? • Accuracy or Completeness? • Application A or Application B? • Data-Driven or Human-Driven? • Democracy, Oligarchy or Dictatorship? • Single or Multiple Truths?
  • 14. Know thy Goals • What business/technical goals are you trying to achieve via the Knowledge Graph? • Imitate the others? • Enhance current products/processes and get ahead of the competition ? If so, which? • Lay the foundations for new products/processes? If so, which? • Integrate dispersed data and improve their governance? • …
  • 15. • Extract!: Entity and Relation Tagging and Disambiguation in CVs and Vacancies • Search! and Match!: Query Understanding and Expansion, Semantic Similarity • Jobfeed: Labour Market Analytics Example: Products enhanced by the Textkernel Knowledge Graph
  • 16. Know thy Context • Domain(s) and Semantics • What elements does the Knowledge Graph need to provide (entity types, relations, etc)? • Is there ambiguity, vagueness, semantic drift? To what extent? • Technology Context • What knowledge can be automatically mined with the technology we have? • How fast and how accurately? • Organizational Context • Data, technology and products • People, knowledge and capabilities • Processes, practices and attitudes • Business Context • Business strategy and model(s) • Clients, partners and competitors • Trends, hypes, expectations and narratives
  • 17. Use Context to make choices and break dilemmas • If your semantics are highly vague and cause disagreements you might need to support multiple truths. • If the best extraction system for relation X is only 50% accurate, you will probably need manual curation. • If there is already a non-RDF semantics management infrastructure in the organization, it may not worth replacing it with an RDF one! • If product A generates more revenue than product B, you might need to optimize the KG for A. • If your competitor outperforms you in coverage because of more data and better infrastructure, you might want to compete on accuracy or some niche complex domain..
  • 18. Questions to think when you return to the office Do w w y e bu n (or t) a Kno d G ap ? Do w w o t an r u “Wha ” an b e d e m ? ➔ High-level goals are ok but you need concrete goals ➔ Any answer with “AI”, “Big Data” or other buzzword should be ignored! ➔ Semantics ➔ Data ➔ Technology ➔ Organization ➔ Business ➔ Basic philosophy and principles ➔ Definition of success (and failure) ➔ Decision making processes Do w w e c l e Con t?
  • 19. Thank you! Panos Alexopoulos Head of Ontology E-mail: alexopoulos@textkernel.nl Web: http://www.panosalexopoulos.com LinkedIn: www.linkedin.com/in/panosalexopoulos Twitter: @PAlexop