05/29/23 Heiko Paulheim 1
Knowledge Graph Generation
from Wikipedia in the Age of ChatGPT:
Knowledge Extraction
or Knowledge Hallucination?
Heiko Paulheim
Yeah, I’ve been Invited for this Keynote!
A Brief History of Knowledge Graphs
[Timeline figure, entries: Google’s Announcement, DBpedia, YAGO, ResearchCyc, Wikidata, Freebase, NELL]
Wikipedia as a Knowledge Graph
• Wikipedia-based Knowledge Graphs
– DBpedia: launched 2007
– YAGO: launched 2008
– Extraction from Wikipedia using mappings & heuristics
• Present
– Two of the most used knowledge graphs
– ...with Wikidata catching up
[Figure: labels city, campus, state]
• Mapping to a central schema/ontology
[Schema figure: classes University, Organisation, Agent, Person, Place; properties chancellor, campus; edge labels domain, range, subclass of]
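The idea of a central schema can be written down as plain data. The following is a minimal Python sketch, assuming one plausible reading of the figure: University is a subclass of Organisation, Organisation and Person are subclasses of Agent, chancellor links a University to a Person, and campus links a University to a Place. The function names and the example identifiers are hypothetical, and this is not the DBpedia extraction framework's code.

```python
# Illustrative schema; the exact hierarchy is an assumption read off the figure.
SUBCLASS_OF = {
    "University": "Organisation",
    "Organisation": "Agent",
    "Person": "Agent",
}

# property -> (domain, range)
PROPERTIES = {
    "chancellor": ("University", "Person"),
    "campus": ("University", "Place"),
}


def superclasses(cls):
    """Return cls followed by all of its ancestors in the hierarchy."""
    chain = [cls]
    while chain[-1] in SUBCLASS_OF:
        chain.append(SUBCLASS_OF[chain[-1]])
    return chain


def infer_types(subject, prop, obj):
    """Derive types for subject and object from the property's domain and range."""
    domain, range_ = PROPERTIES[prop]
    return {
        subject: superclasses(domain),
        obj: superclasses(range_),
    }


# Hypothetical identifiers, used only for illustration.
types = infer_types("Example_University", "chancellor", "Example_Person")
print(types)
```

With such a schema, a single mapped infobox key (here chancellor) is enough to type both ends of the resulting statement.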
DBpedia Extraction, ChatGPT Style
• Looks nice, but there are some glitches…
– Handling datatypes:
– Handling coordinates:
• But maybe we can resolve this with better prompt engineering...
Knowledge Graph Completion, ChatGPT Style
Knowledge Graph Hallucination, ChatGPT Style
• Some more findings:
• None of those are real!
• cf. DBpedia:
• My first reaction:
• My second reaction:
Mannheim is a city in the southwestern part of
Germany, the third-largest in the German state of
Baden-Württemberg after Stuttgart and Karlsruhe with a
2019 population of approximately 309,000 inhabitants.
But While We’re at it...
• Hey ChatGPT, did you know this paper?
Back to my Original Presentation
Flashback to 2018
• Much of the missing information is in the Wikipedia text
• ...and already in the abstracts
• Abstracts follow a structure
[Figure: links in an example abstract labeled municipality, state, country; candidates marked + or -]
• The first three populated places linked in an abstract about a town are that town’s municipality, state, and country
• All genres linked in an abstract about a writer are that writer’s genres
• The first place linked in an abstract about a person is that person’s birthplace
• The types are already in DBpedia
• Automatically finding those patterns: we can use existing relations as training data
– Using a local closed world assumption for creating negative examples
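The labeling step can be sketched as follows. This is a minimal Python illustration of the local closed world assumption, not the code from the original work: the function name, the dictionary layout of the knowledge graph, and the identifiers in the toy data are all hypothetical.

```python
def label_candidates(kg, relation, subject, linked_entities):
    """Label the entities linked in a subject's abstract for one relation.

    kg maps (subject, relation) to the set of objects already known.
    Under the local closed world assumption, a subject with at least one
    known object is treated as complete for that relation: linked entities
    among the known objects are positives, all other linked entities are
    negatives. Subjects without any known object yield no examples.
    """
    known = kg.get((subject, relation), set())
    if not known:
        return []
    return [(entity, entity in known) for entity in linked_entities]


# Toy data with hypothetical identifiers (not taken from DBpedia).
kg = {("Town_A", "country"): {"Country_X"}}

examples = label_candidates(
    kg, "country", "Town_A", ["Municipality_M", "State_S", "Country_X"]
)
print(examples)
# [('Municipality_M', False), ('State_S', False), ('Country_X', True)]

# Town_B has no known country, so nothing can be assumed about it.
unlabeled = label_candidates(kg, "country", "Town_B", ["Country_X"])
print(unlabeled)  # []
```

Skipping subjects without a known value is what keeps the assumption "local": missing information is never turned into negative examples.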
• Target: use only models that have >95% precision
– We want extra knowledge, but not much extra noise
• Outcome
– Models could be learned for 99 relations
– Almost 1M additional statements
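The precision target acts as a filter over the learned models. A minimal Python sketch, assuming each relation's model has been evaluated on held-out labeled pairs; the function names, the data layout, and the toy numbers are hypothetical:

```python
def precision(predictions):
    """predictions: list of (predicted_positive, actually_positive) pairs."""
    predicted = [actual for pred, actual in predictions if pred]
    if not predicted:
        return 0.0
    return sum(predicted) / len(predicted)


def select_relations(results, threshold=0.95):
    """Keep only relations whose model exceeds the precision threshold."""
    return [rel for rel, preds in results.items() if precision(preds) > threshold]


# Toy evaluation results (made-up numbers for illustration).
results = {
    "relation_a": [(True, True)] * 20,                    # precision 1.0
    "relation_b": [(True, True)] * 9 + [(True, False)],  # precision 0.9
}
selected = select_relations(results)
print(selected)  # ['relation_a']
```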
Relation Extraction from Wikipedia Abstracts:
ChatGPT Style
Only the first three facts are extracted from the abstract
DBpedia uses dbo:federalState here
• In the original paper, we trained general ML models...
Flashback to 2018
• We used solely position and type features
– Nothing language specific
– i.e., we can apply this to any language
• Extension to the 12 largest language editions of DBpedia
– Exploiting inter-language links
– 187 relations (was: 99), 1.6M axioms (was: 1M), at precision >0.95
– The number of statements per language correlates with the number of language links to English!
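Position and type features need no text processing at all, which is why they carry over to other languages. A minimal Python sketch of such features, assuming the links of an abstract are available in order of appearance together with their types; the feature names and the toy identifiers are hypothetical and not the exact feature set of the original work:

```python
def link_features(links):
    """Build language-independent features for each link in an abstract.

    links: list of (entity, type) pairs in order of appearance.
    Features: the link's overall position and its position among
    the links of the same type (both counted from 1).
    """
    seen_per_type = {}
    features = []
    for position, (entity, entity_type) in enumerate(links, start=1):
        seen_per_type[entity_type] = seen_per_type.get(entity_type, 0) + 1
        features.append({
            "entity": entity,
            "type": entity_type,
            "position": position,
            "position_within_type": seen_per_type[entity_type],
        })
    return features


# Toy abstract about a town (hypothetical identifiers).
links = [
    ("Municipality_M", "PopulatedPlace"),
    ("River_R", "River"),
    ("State_S", "PopulatedPlace"),
    ("Country_X", "PopulatedPlace"),
]
features = link_features(links)
print(features[3])
```

In this toy abstract, Country_X is the fourth link overall but the third populated place, which is the kind of signal a pattern such as "the third populated place is the country" relies on.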
Relation Extraction from Wikipedia Abstracts:
ChatGPT Style
• Let’s challenge ChatGPT a bit more...
Mostly hallucination… this is not the population value from the abstract!
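One simple way to catch this kind of error in a human-in-the-loop setup is to check whether an "extracted" number literally occurs in the source text. The following is a minimal Python sketch using the Mannheim abstract quoted on an earlier slide; the function names are hypothetical, and the second test value is made up purely to illustrate a failing check.

```python
import re


def numbers_in(text):
    """Return all integers in the text, with thousands separators removed."""
    return {int(m.replace(",", "")) for m in re.findall(r"\d[\d,]*\d|\d", text)}


def is_grounded(value, source_text):
    """True if the numeric value literally occurs in the source text."""
    return value in numbers_in(source_text)


abstract = (
    "Mannheim is a city in the southwestern part of Germany, "
    "the third-largest in the German state of Baden-Württemberg "
    "after Stuttgart and Karlsruhe with a 2019 population of "
    "approximately 309,000 inhabitants."
)

print(is_grounded(309000, abstract))  # True
print(is_grounded(310658, abstract))  # False (made-up value, not in the text)
```

A check like this only flags values that cannot come from the text; it does not confirm that a value found in the text was attached to the right property.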
Knowledge Graph Hallucination, ChatGPT Style
• ChatGPT seemed eager to “extract” coordinates from infoboxes and abstracts
• At least all of them are coordinates in Mannheim, although each one is different
Funny Footnote –
Even more Knowledge Hallucination
• Trying to create the input file for the Google map on the previous slide:
Even more hallucination… many of these values are not from the responses
Back to my Original Presentation
Cat2Ax: Axiomatizing Wikipedia Categories
→ dbo:Album
→ dbo:artist.{dbr:Nine_Inch_Nails}
→ dbo:genre.{dbr:Rock_Music}
See: ISWC 2019 Paper on Uncovering the Semantics of Wikipedia Categories
– Frequency: how often does the pattern occur in a category?
• e.g.: what share of instances have dbo:genre.{dbr:Rock_Music}?
– Lexical score: likelihood of a term being a surface form of the object
• e.g.: how often is Rock used to refer to dbr:Rock_Music?
– Sibling score: how likely are sibling categories to share similar patterns?
• e.g.: are there sibling categories with a high score for dbo:genre?
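The first two scores can be sketched as simple ratios. This is a minimal Python illustration under stated assumptions, not the Cat2Ax implementation: the function names, the data layout, the counts, and the identifiers other than dbo:genre and dbr:Rock_Music are hypothetical, and the sibling score and the way the scores are combined are left out.

```python
def frequency_score(category_members, facts, prop, obj):
    """Share of a category's members that have the statement (prop, obj)."""
    if not category_members:
        return 0.0
    hits = sum(1 for m in category_members if (prop, obj) in facts.get(m, set()))
    return hits / len(category_members)


def lexical_score(surface_form_counts, term, entity):
    """How often `term` refers to `entity`, relative to all uses of `term`."""
    counts = surface_form_counts.get(term, {})
    total = sum(counts.values())
    return counts.get(entity, 0) / total if total else 0.0


# Toy data (made-up members and counts).
members = ["Album_1", "Album_2", "Album_3", "Album_4"]
facts = {
    "Album_1": {("dbo:genre", "dbr:Rock_Music")},
    "Album_2": {("dbo:genre", "dbr:Rock_Music")},
    "Album_3": {("dbo:genre", "dbr:Rock_Music")},
    "Album_4": set(),
}
surface_forms = {"Rock": {"dbr:Rock_Music": 8, "Other_Meaning_Of_Rock": 2}}

freq = frequency_score(members, facts, "dbo:genre", "dbr:Rock_Music")
lex = lexical_score(surface_forms, "Rock", "dbr:Rock_Music")
print(freq, lex)  # 0.75 0.8
```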
Cat2Ax: ChatGPT Style
CaLiGraph Example
Category: Musical Groups established in 1987
List of symphonic metal bands
Category: Swedish death metal bands
List of Swedes in Music
CaLiGraph: ChatGPT Style
Back to my Original Presentation
Improving Entity Coverage:
Lists in Wikipedia
• Only existing pages have categories
– Lists may also link to non-existing pages
Pushing Entity Coverage Further
• Beyond red links (2020)
• Beyond explicit lists (2021)
Cat2Ax: ChatGPT Style
Entity Extraction from Listings: ChatGPT Style
Entity Hallucination from Listings:
ChatGPT goes Rogue
This went on for a while, but led nowhere.
Revisiting CaLiGraph: Entity Disambiguation
• Examples: Wikipedia pages of Die Krupps and Eisbrecher
?
Proper solution: “NASTyLinker: NIL-Aware Scalable Transformer-based Entity Linker”
Tuesday, 12 am
Entity Disambiguation: ChatGPT Bloopers
Take Aways
• Basic KG creation with ChatGPT can work
– At least in a human-in-the-loop setup
• Reinforcement signals might help here
– Main challenge: hallucinations
• On the other hand: consider them “extraction of additional facts”
• Isn’t that just like heuristic KG completion?
• Disclaimer:
– No PhD students were harmed or replaced by ChatGPT.
• Full ChatGPT protocol available here.

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 

Knowledge Graph Generation from Wikipedia in the Age of ChatGPT: Knowledge Extraction or Knowledge Hallucination?

  • 1. 05/29/23 Heiko Paulheim 1 Knowledge Graph Generation from Wikipedia in the Age of ChatGPT: Knowledge Extraction or Knowledge Hallucination? Heiko Paulheim
  • 2. Yeah, I’ve been Invited for this Keynote!
  • 3. A Brief History of Knowledge Graphs Google’s Announcement DBpedia YAGO ResearchCyc Wikidata Freebase NELL
  • 4. A Brief History of Knowledge Graphs
  • 5. Wikipedia as a Knowledge Graph • Wikipedia-based Knowledge Graphs – DBpedia: launched 2007 – YAGO: launched 2008 – Extraction from Wikipedia using mappings & heuristics • Present – Two of the most used knowledge graphs – ...with Wikidata catching up
  • 6. Wikipedia as a Knowledge Graph
  • 7. Wikipedia as a Knowledge Graph city campus state city
  • 8. Wikipedia as a Knowledge Graph • Mapping to a central schema/ontology University chancellor Person Organisation Agent campus Place range range domain domain subclass of subclass of subclass of
  • 9. Wikipedia as a Knowledge Graph
  • 10. DBpedia Extraction, ChatGPT Style
  • 11. DBpedia Extraction, ChatGPT Style
  • 12. DBpedia Extraction, ChatGPT Style
  • 13. DBpedia Extraction, ChatGPT Style • Looks nice, but there are some glitches… – Handling datatypes: – Handling coordinates: • But maybe we can resolve this with better prompt engineering...
  • 14. DBpedia Extraction, ChatGPT Style
  • 15. DBpedia Extraction, ChatGPT Style
  • 16. Knowledge Graph Completion, ChatGPT Style
  • 17. Knowledge Graph Hallucination, ChatGPT Style • Some more findings: • None of those are real! • cf. DBpedia:
  • 18. Knowledge Graph Completion, ChatGPT Style
  • 19. Knowledge Graph Hallucination, ChatGPT Style
  • 20. Knowledge Graph Hallucination, ChatGPT Style • My first reaction: • My second reaction:
  • 21. Knowledge Graph Hallucination, ChatGPT Style Mannheim is a city in the southwestern part of Germany, the third-largest in the German state of Baden-Württemberg after Stuttgart and Karlsruhe with a 2019 population of approximately 309,000 inhabitants.
  • 22. But While We’re at it... • Hey ChatGPT, did you know this paper?
  • 23. Back to my Original Presentation
  • 24. Flashback to 2018 • Much of the missing information is in the Wikipedia text • ...and already in the abstracts • Abstracts follow a structure municipality state country + + - -
  • 25. Flashback to 2018 • The first three populated places linked in an abstract about a town are that town’s municipality, state, and country • All genres linked in an abstract about a writer are that writer’s genres • The first place linked in an abstract about a person is that person’s birthplace • The types are already in DBpedia • Automatically finding those patterns: We can use existing relations as training data – Using a local closed world assumption for creating negative examples
  • 26. Flashback to 2018 • Target: use only models that have >95% precision – We want extra knowledge, but not much extra noise • Outcome – Models could be learned for 99 relations – Almost 1M additional statements
  • 27. Relation Extraction from Wikipedia Abstracts: ChatGPT Style
  • 28. Relation Extraction from Wikipedia Abstracts: ChatGPT Style
  • 29. Relation Extraction from Wikipedia Abstracts: ChatGPT Style Only the first three facts are extracted from the abstract
  • 30. Relation Extraction from Wikipedia Abstracts: ChatGPT Style DBpedia uses dbo:federalState here
  • 31. Relation Extraction from Wikipedia Abstracts: ChatGPT Style • In the original paper, we trained general ML models...
  • 32. Flashback to 2018 • We used solely position and type features – Nothing language specific – i.e.: we can apply this to any language • Extension to 12 largest language editions of DBpedia – Exploiting inter-language links – 187 relations (was: 99), 1.6M axioms (was: 1M), at precision >0.95 – #statements per language correlates with #language links to English!
  • 33. Relation Extraction from Wikipedia Abstracts: ChatGPT Style
  • 34. Relation Extraction from Wikipedia Abstracts: ChatGPT Style
  • 35. Relation Extraction from Wikipedia Abstracts: ChatGPT Style
  • 36. Relation Extraction from Wikipedia Abstracts: ChatGPT Style • Let’s challenge ChatGPT a bit more...
  • 37. Relation Extraction from Wikipedia Abstracts: ChatGPT Style
  • 38. Relation Extraction from Wikipedia Abstracts: ChatGPT Style Mostly hallucination… this is not the population value from the abstract!
  • 39. Relation Extraction from Wikipedia Abstracts: ChatGPT Style
  • 40. Relation Extraction from Wikipedia Abstracts: ChatGPT Style
  • 41. Knowledge Graph Hallucination, ChatGPT Style • ChatGPT seemed eager to “extract” coordinates from infoboxes and abstracts
  • 42. Knowledge Graph Hallucination, ChatGPT Style • At least, all are different coordinates in Mannheim
  • 43. Funny Footnote – Even more Knowledge Hallucination • Trying to create the input file for Google Map on the previous slide: Even more hallucination… many of these values are not from the responses
  • 44. Back to my Original Presentation
  • 45. Cat2Ax: Axiomatizing Wikipedia Categories  dbo:Album  dbo:artist.{dbr:Nine_Inch_Nails}  dbo:genre.{dbr:Rock_Music} See: ISWC 2019 Paper on Uncovering the Semantics of Wikipedia Categories
  • 46. Cat2Ax: Axiomatizing Wikipedia Categories – Frequency: how often does the pattern occur in a category? • i.e.: share of instances that have dbo:genre.{dbr:Rock_Music}? – Lexical score: likelihood of term as a surface form of object • i.e.: how often is Rock used to refer to dbr:Rock_Music? – Sibling score: how likely are sibling categories to share similar patterns? • i.e., are there sibling categories with a high score for dbo:genre?
  • 47. Cat2Ax: ChatGPT Style
  • 48. Cat2Ax: ChatGPT Style
  • 49. Cat2Ax: ChatGPT Style
  • 50. Cat2Ax: ChatGPT Style
  • 51. CaLiGraph Example Category: Musical Groups established in 1987 List of symphonic metal bands Category: Swedish death metal bands List of Swedes in Music
  • 52. CaLiGraph: ChatGPT Style
  • 53. CaLiGraph: ChatGPT Style
  • 54. CaLiGraph: ChatGPT Style
  • 55. Back to my Original Presentation
  • 56. Improving Entity Coverage: Lists in Wikipedia • Only existing pages have categories – Lists may also link to non-existing pages
  • 57. Pushing Entity Coverage Further • Beyond red links (2020) • Beyond explicit lists (2021)
  • 58. Cat2Ax: ChatGPT Style
  • 59. Entity Extraction from Listings: ChatGPT Style
  • 60. Entity Extraction from Listings: ChatGPT Style
  • 61. Entity Extraction from Listings: ChatGPT Style
  • 62. Entity Hallucination from Listings: ChatGPT goes Rogue
  • 63. Entity Hallucination from Listings: ChatGPT goes Rogue
  • 64. Entity Hallucination from Listings: ChatGPT goes Rogue
  • 65. Entity Hallucination from Listings: ChatGPT goes Rogue
  • 66. Entity Hallucination from Listings: ChatGPT goes Rogue This went on for a while, but led nowhere.
  • 67. Revisiting CaLiGraph: Entity Disambiguation • Examples: Wikipedia pages of Die Krupps and Eisbrecher ?
  • 68. Revisiting CaLiGraph: Entity Disambiguation Proper solution: “NASTyLinker: NIL-Aware Scalable Transformer-based Entity Linker” Tuesday, 12 am
  • 69. Entity Disambiguation: ChatGPT Bloopers
  • 70. Entity Disambiguation: ChatGPT Bloopers
  • 71. Take Aways • Basic KG creation with ChatGPT can work – At least in a human-in-the-loop setup • Reinforcement signals might help here – Main challenge: hallucinations • On the other hand: consider them “extraction of additional facts” • Isn’t that just like heuristic KG completion? • Disclaimer: – No PhD students were harmed or replaced by ChatGPT. • Full ChatGPT protocol available here.
  • 72. Knowledge Graph Generation from Wikipedia in the Age of ChatGPT: Knowledge Extraction or Knowledge Hallucination? Heiko Paulheim
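
The pattern-learning idea from the "Flashback to 2018" slides (25 and 26) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the original method: the slides say general ML models were trained on position and type features, whereas this sketch uses a plain position rule. All entity names, the toy `KG`, and the toy `ABSTRACT_LINKS` are hypothetical. What it does take from the slides is the local closed world assumption for negative examples and the >95% precision cut-off.

```python
from collections import defaultdict

# Hypothetical toy knowledge graph: (subject, relation) -> known objects.
KG = {
    ("Town_A", "country"): {"Germany"},
    ("Town_B", "country"): {"France"},
    ("Town_C", "country"): {"Italy"},
}

# Hypothetical entities linked in each abstract, in order of appearance.
ABSTRACT_LINKS = {
    "Town_A": ["District_A", "State_A", "Germany"],
    "Town_B": ["District_B", "State_B", "France"],
    "Town_C": ["District_C", "State_C", "Italy"],
    "Town_D": ["District_D", "State_D", "Spain"],  # no country known yet
}


def label_candidates(relation):
    """Yield (link position, is_positive) pairs as training examples.

    Local closed world assumption: if the graph already knows at least one
    object for (subject, relation), every other linked entity is treated as
    a negative example. Subjects with no known object are skipped.
    """
    for subject, links in ABSTRACT_LINKS.items():
        known = KG.get((subject, relation))
        if not known:
            continue
        for position, obj in enumerate(links):
            yield position, obj in known


def precision_by_position(relation):
    """Precision of the rule 'the link at position p is the object'."""
    hits, total = defaultdict(int), defaultdict(int)
    for position, is_positive in label_candidates(relation):
        total[position] += 1
        hits[position] += is_positive
    return {p: hits[p] / total[p] for p in total}


def accepted_positions(relation, min_precision=0.95):
    """Keep only positions whose precision exceeds the threshold."""
    scores = precision_by_position(relation)
    return [p for p in sorted(scores) if scores[p] > min_precision]


def predict(subject, relation, min_precision=0.95):
    """Propose new objects for a subject using the accepted positions."""
    links = ABSTRACT_LINKS[subject]
    positions = accepted_positions(relation, min_precision)
    return [links[p] for p in positions if p < len(links)]


if __name__ == "__main__":
    print(precision_by_position("country"))
    print(predict("Town_D", "country"))
```

In this toy setup only the third link position survives the precision filter, so the rule proposes a country for the town that has none in the graph. A real pipeline would need far more training data per relation and would combine position with type features, as the slides describe.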