SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
NOUS: Construction and
Querying of Dynamic
Knowledge Graphs
SUTANAY CHOUDHURY1, KHUSHBU AGARWAL1, SUMIT PUROHIT1, BAICHUAN
ZHANG2, MEG PIRRUNG1, WILLIAM SMITH1, MATHEW THOMAS1
1 : Pacific Northwest National Laboratory, Richland WA
2 : Purdue University, Indianapolis IN
April 22, 2017 2
Introduction
Knowledge Graphs
Challenges
Motivation (Use Cases)
Domain Specific Querying of Web
Question-answering for Climate Science
Technical Approach
NLP, Entity Disambiguation, Relation Learning
Frequent Pattern Mining, Question Answering
What’s unique
Results
Status and Future Work
Outline
April 22, 2017 3
Knowledge Graphs: Why do we care?
A collection of facts about people, places, things and relationships between them,
in a given context
I am a scientist. Read papers for me.
I am an analyst drowning in data. Help me find interesting events
I am a doctor, show me latest breakthrough to consider for this patient.
April 22, 2017 4
KG: Construction and Analytical
Challenges
Construction
No Base KB
Natural Language Processing is inherently noisy
Unseen entities and relationships
How do we determine its class
Should we include every new relation and entity in KB?
Analysis
Identifying target set of user questions and ontology needed to
answer them.
Mapping user questions to graph analytical tasks
Executing analytical tasks at scale
Pattern discovery: What’s emerging and what’s fading?
Question Answering:
Entity based Querying (What, Who, When, Where)
Hypothesis Generation (Why)
April 22, 2017 5
Introduction
Knowledge Graphs
Challenges
Motivation (Use Cases)
Domain Specific Querying of Web
Question-answering for Climate Science
Technical Approach
NLP, Entity Disambiguation, Relation Learning
Frequent Pattern Mining, Question Answering
What’s unique
Results
Status and Future Work
Outline
April 22, 2017 6
Use Case : Domain-specific Querying of
the Web
Input: multi-month web crawls obtained using domain expert suggestions
Analytic questions:
Find top manufacturers and model names
Find components and features
Who is popular?
What are new releases?
How are companies and components related?
How is country and product related?
Question-answering for Climate Science
DOE’s ARM program wants to make focused investments on
experimental campaigns
Monitor scientific literature to find which campaigns or instruments or
data products are being cited
Our target problems:
"What are the papers written on __aerosols______?"
"What datasets are used for ___aerosols______ publications?”
“What primary measurements are represented in the subset of aerosol
publications?”
"What instruments are represented in the aerosol publications?”
“What sites are most represented in the aerosol publications”?
April 22, 2017 7
April 22, 2017 8
NOUS Motivation
Tasks involved in building KB are not domain specific
Most questions can be mapped to a set of common graph analytical
tasks.
Tell me about X
Tell me about X in context of Y
What are recent trends about X
How are X and Y related
Why did X do Y
April 22, 2017 9
Introduction
Knowledge Graphs
Challenges
Motivation (Use Cases)
Domain Specific Querying of Web
Question-answering for Climate Science
Technical Approach
NLP, Entity Disambiguation, Relation Learning
Frequent Pattern Mining, Question Answering
What’s unique
Results
Status and Future Work
Outline
10
NOUS Workflow
Knowledge	
Graph	Model
Streaming	
Data
Knowledge	
Graph	Tasks
• Continuous pattern discovery.
• Question Answering
Knowledge	Graph		
Verification• Natural Language
processing
• Collective Entity Linking
• Distant supervision based
relationship discovery
Base KB + Streaming Data
Bayesian Personalized
Ranking-based link
prediction to assign
confidence
Protected Information | Proprietary Information
Apache	Spark
Triple Extraction from Natural Language
April 22, 2017 11
the	United	States	government crack
an	iPhone	that	belonged	to	a	
gunman	in	the	San	Bernardino
Apple repair
the	particular	iPhone	hole	that	the	
government	hacked.
Federal	officials specify
the	procedure	used	to	open	the	
iPhone
Federal	officials deny
to	specify	the	procedure	used	to	
open	the	iPhone.
Jay	Kaplan,	chief	executive	of	
the	tech	securAppley company	
Synack and	a	former	National	
Security	Agency	analyst. say
Apple	has	to	earn	the	trust	of	
Apples	customers,”
the	F.B.I. crack Mr.	Farook’s
LegbaCore,	which	previously	
found	and	fixed	flaws	for	
Apple. find flaws	for	Apple.
LegbaCore,	which	previously	
found	and	fixed	flaws	for	
Apple. fix flaws	for	Apple.
The	challenge:	turning	this	into	a	
high	quality	representation
Use existing tools
Stanford Core NLP
Open IE
April 22, 2017 12
Entity Disambiguation and Relation
Learning
Implements Collective Entity Linking from Han et al, SIGIR
2011
Key idea: Search the graph with matches for all the above terms,
build a mesh of terms and their related terms and pick the most
densely connected combination
Out of box performance low (higher 60%), when combined with
domain-specific rules (85-90%+)
Han	et	al	in	"Collective	Entity	
Linking	in	Web	Text:	A	Graph-
based	Method,	SIGIR	2011"
Relation Learning
Implements Distance Supervision
April 22, 2017 13
Now we have a Graph!
14
A Different Approach to Querying
Let’s not demand users learn SQL or SPARQL
Think in plain English, and we will transparently translate queries in background
Protected Information | Proprietary Information
April 22, 2017 15
Task 1: Finding Patterns from Data
Stream
A Pattern Growth approach : VLDB 2017 [Under Review]
Task 1: Finding Patterns from Data
Stream (Contd.)
April 22, 2017 16
We	discover	behavioral	patterns	of	drone	related	entities	(WSJ,	2010-2015)
Early	in	the	period
Towards	the	end
Patterns	help	us	understand	the	shift	in	a	domain,	and	discover	new	trends
Task 2: Question Answering for
Relationship Explanations
April 22, 2017 17
Our goal is to learn “patterns of explanations” by walking the graph
structure
Given a new question and the knowledge of such patterns, we can
generalize and answer questions about unseen entities
Examples:
Why would Ford buy Palladium?
Ford is-a automotive company, automotive company has-part catalytic converters,
catalytic converters has-material Palladium
Answer pattern: A is-a company B, B has-part C, C has-dependency D
Why would Sutanay visit SFO in August 15?
Sutanay has-interest Machine-Learning, SIGKDD related-to Machine-Learning,
SIGKDD has-location SFO, SIGKDD has-start-date August 13
Results – Explanatory Paths using
coherence
April 22, 2017 18
April 22, 2017 19
Use Case Results: Domain-specific
Querying of the Web
Input Size: 2 million+ webpages
Graph construction: 64-node cluster, each node with 16 cores
Graph Analytics: 16-node Spark/Hadoop cluster, each with 16 cores
April 22, 2017 20
Picking up real events by Patterns
April 22, 2017 21
What’s Phenomenal Here?
We are data scientists, not drone experts
We processed nearly two million web crawls with minimal custom
code and built the Knowledge Graph
Analyzed trends, connected dots across multiple source and presented
a hypothesis:
Is tracking autonomous drones and their use important?
Starting with the data analysis to coming up with the question : took less
than 1.5 hours
22
Key Takeaways
Open Source KB Construction and Querying Pipeline
https://github.com/streaming-graphs/NOUS
Version 1.0:
Support for extracting standard entities, relation extraction via distant
supervision,
Advanced trending and explanatory questions
The ability to answer queries where the answer is embedded across multiple
data sources
All algorithms implemented on top of Apache Spark and Scala
Protected Information | Proprietary Information
April 22, 2017 23
Status and Future Work
We are committed towards developing an open source community
A MySQL for Knowledge Graphs!
Preferred method of payment J
Add test code (in Scala or Python)
Break the code AND show an important class of problem
Version 2.0:
Improving KB quality,
Information maintenance over time,
Initial support for human-computer interaction.
Incorporate algorithms for latest advances in NLP, ED
April 22, 2017 24
Questions?

Contenu connexe

Tendances

Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI dayMohammed Barakat
 
Big Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsBig Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsChandan Rajah
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist? HackerEarth
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistLisa Cohen
 
Data science in_action
Data science in_actionData science in_action
Data science in_actionJi Li
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1CS, NcState
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningJulian Bright
 
Data science
Data scienceData science
Data science9diov
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)James Hendler
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsTao Xie
 
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...Edureka!
 
Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 
Past, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software DataPast, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software DataJeongwhan Choi
 
Life of a data scientist (pub)
Life of a data scientist (pub)Life of a data scientist (pub)
Life of a data scientist (pub)Buhwan Jeong
 

Tendances (20)

Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
Data Science: Past, Present, and Future
Data Science: Past, Present, and FutureData Science: Past, Present, and Future
Data Science: Past, Present, and Future
 
Big Data Science: Intro and Benefits
Big Data Science: Intro and BenefitsBig Data Science: Intro and Benefits
Big Data Science: Intro and Benefits
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist?
 
Tips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data ScientistTips and Tricks to be an Effective Data Scientist
Tips and Tricks to be an Effective Data Scientist
 
Data science and_analytics_for_ordinary_people_ebook
Data science and_analytics_for_ordinary_people_ebookData science and_analytics_for_ordinary_people_ebook
Data science and_analytics_for_ordinary_people_ebook
 
Amrapali Zaveri Defense
Amrapali Zaveri DefenseAmrapali Zaveri Defense
Amrapali Zaveri Defense
 
Data science in_action
Data science in_actionData science in_action
Data science in_action
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine Learning
 
Data science
Data scienceData science
Data science
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software Analytics
 
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Past, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software DataPast, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software Data
 
Data Analytics
Data AnalyticsData Analytics
Data Analytics
 
Life of a data scientist (pub)
Life of a data scientist (pub)Life of a data scientist (pub)
Life of a data scientist (pub)
 
Science Data, Responsibly
Science Data, ResponsiblyScience Data, Responsibly
Science Data, Responsibly
 

Similaire à Construction and Querying of Dynamic Knowledge Graphs

Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
 
Building AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsBuilding AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsAndre Freitas
 
Using Bibliometrics to Keep Up with the Joneses
Using Bibliometrics to Keep Up with the JonesesUsing Bibliometrics to Keep Up with the Joneses
Using Bibliometrics to Keep Up with the JonesesChristina Pikas
 
Applications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignApplications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignAnubhav Jain
 
Emil Eifrem Keynote - GraphConnect Europe 2017
Emil Eifrem Keynote - GraphConnect Europe 2017Emil Eifrem Keynote - GraphConnect Europe 2017
Emil Eifrem Keynote - GraphConnect Europe 2017Neo4j
 
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...Tao Xie
 
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...Eric Stephan
 
Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...Anubhav Jain
 
Building better knowledge graphs through social computing
Building better knowledge graphs through social computingBuilding better knowledge graphs through social computing
Building better knowledge graphs through social computingElena Simperl
 
PhD Day: Entity Linking using Generic Linked Data Datasets
PhD Day: Entity Linking using Generic Linked Data DatasetsPhD Day: Entity Linking using Generic Linked Data Datasets
PhD Day: Entity Linking using Generic Linked Data DatasetsBianca Pereira
 
Advancing Alcohol Behavior Change
Advancing Alcohol Behavior ChangeAdvancing Alcohol Behavior Change
Advancing Alcohol Behavior ChangeChad Travis
 
Intro to Neo4j Webinar
Intro to Neo4j WebinarIntro to Neo4j Webinar
Intro to Neo4j WebinarNeo4j
 
Department of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsDepartment of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsBrand Niemann
 
OSFair2017 Workshop | Text mining
OSFair2017 Workshop | Text miningOSFair2017 Workshop | Text mining
OSFair2017 Workshop | Text miningOpen Science Fair
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so farElena Simperl
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016Jessie Chuang
 

Similaire à Construction and Querying of Dynamic Knowledge Graphs (20)

Using Knowledge Graph for Promoting Cognitive Computing
Using Knowledge Graph for Promoting Cognitive ComputingUsing Knowledge Graph for Promoting Cognitive Computing
Using Knowledge Graph for Promoting Cognitive Computing
 
SLA Summer 2008
SLA Summer 2008SLA Summer 2008
SLA Summer 2008
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
Building AI Applications using Knowledge Graphs
Building AI Applications using Knowledge GraphsBuilding AI Applications using Knowledge Graphs
Building AI Applications using Knowledge Graphs
 
Using Bibliometrics to Keep Up with the Joneses
Using Bibliometrics to Keep Up with the JonesesUsing Bibliometrics to Keep Up with the Joneses
Using Bibliometrics to Keep Up with the Joneses
 
Applications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials DesignApplications of Natural Language Processing to Materials Design
Applications of Natural Language Processing to Materials Design
 
Emil Eifrem Keynote - GraphConnect Europe 2017
Emil Eifrem Keynote - GraphConnect Europe 2017Emil Eifrem Keynote - GraphConnect Europe 2017
Emil Eifrem Keynote - GraphConnect Europe 2017
 
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...
 
Kohlmeier "Innovations in Academic Search & Discovery - A Case Study From the...
Kohlmeier "Innovations in Academic Search & Discovery - A Case Study From the...Kohlmeier "Innovations in Academic Search & Discovery - A Case Study From the...
Kohlmeier "Innovations in Academic Search & Discovery - A Case Study From the...
 
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...
 
Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...Progress Towards Leveraging Natural Language Processing for Collecting Experi...
Progress Towards Leveraging Natural Language Processing for Collecting Experi...
 
Building better knowledge graphs through social computing
Building better knowledge graphs through social computingBuilding better knowledge graphs through social computing
Building better knowledge graphs through social computing
 
Text Analytics - JCC2014 Kimelfeld
Text Analytics - JCC2014 KimelfeldText Analytics - JCC2014 Kimelfeld
Text Analytics - JCC2014 Kimelfeld
 
PhD Day: Entity Linking using Generic Linked Data Datasets
PhD Day: Entity Linking using Generic Linked Data DatasetsPhD Day: Entity Linking using Generic Linked Data Datasets
PhD Day: Entity Linking using Generic Linked Data Datasets
 
Advancing Alcohol Behavior Change
Advancing Alcohol Behavior ChangeAdvancing Alcohol Behavior Change
Advancing Alcohol Behavior Change
 
Intro to Neo4j Webinar
Intro to Neo4j WebinarIntro to Neo4j Webinar
Intro to Neo4j Webinar
 
Department of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsDepartment of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data Dashboards
 
OSFair2017 Workshop | Text mining
OSFair2017 Workshop | Text miningOSFair2017 Workshop | Text mining
OSFair2017 Workshop | Text mining
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so far
 
X api chinese cop monthly meeting feb.2016
X api chinese cop monthly meeting   feb.2016X api chinese cop monthly meeting   feb.2016
X api chinese cop monthly meeting feb.2016
 

Dernier

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 

Dernier (20)

VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 

Construction and Querying of Dynamic Knowledge Graphs

  • 1. NOUS: Construction and Querying of Dynamic Knowledge Graphs SUTANAY CHOUDHURY1, KHUSHBU AGARWAL1, SUMIT PUROHIT1, BAICHUAN ZHANG2, MEG PIRRUNG1, WILLIAM SMITH1, MATHEW THOMAS1 1 : Pacific Northwest National Laboratory, Richland WA 2 : Purdue University, Indianapolis IN
  • 2. April 22, 2017 2 Introduction Knowledge Graphs Challenges Motivation (Use Cases) Domain Specific Querying of Web Question-answering for Climate Science Technical Approach NLP, Entity Disambiguation, Relation Learning Frequent Pattern Mining, Question Answering What’s unique Results Status and Future Work Outline
  • 3. April 22, 2017 3 Knowledge Graphs: Why do we care? A collection of facts about people, places, things and relationships between them, in a given context I am a scientist. Read papers for me. I am an analyst drowning in data. Help me find interesting events I am a doctor, show me latest breakthrough to consider for this patient.
  • 4. April 22, 2017 4 KG: Construction and Analytical Challenges Construction No Base KB Natural Language Processing is inherently noisy Unseen entities and relationships How do we determine its class Should we include every new relation and entity in KB? Analysis Identifying target set of user questions and ontology needed to answer them. Mapping user questions to graph analytical tasks Executing analytical tasks at scale Pattern discovery: What’s emerging and what’s fading? Question Answering: Entity based Querying (What, Who, When, Where) Hypothesis Generation (Why)
  • 5. April 22, 2017 5 Introduction Knowledge Graphs Challenges Motivation (Use Cases) Domain Specific Querying of Web Question-answering for Climate Science Technical Approach NLP, Entity Disambiguation, Relation Learning Frequent Pattern Mining, Question Answering What’s unique Results Status and Future Work Outline
  • 6. April 22, 2017 6 Use Case : Domain-specific Querying of the Web Input: multi-month web crawls obtained using domain expert suggestions Analytic questions: Find top manufacturers and model names Find components and features Who is popular? What are new releases? How are companies and components related? How is country and product related?
  • 7. Question-answering for Climate Science DOE’s ARM program wants to make focused investments on experimental campaigns Monitor scientific literature to find which campaigns or instruments or data products are being cited Our target problems: "What are the papers written on __aerosols______?" "What datasets are used for ___aerosols______ publications?” “What primary measurements are represented in the subset of aerosol publications?” "What instruments are represented in the aerosol publications?” “What sites are most represented in the aerosol publications”? April 22, 2017 7
  • 8. April 22, 2017 8 NOUS Motivation Tasks involved in building KB are not domain specific Most questions can be mapped to a set of common graph analytical tasks. Tell me about X Tell me about X in context of Y What are recent trends about X How are X and Y related Why did X do Y
  • 9. April 22, 2017 9 Introduction Knowledge Graphs Challenges Motivation (Use Cases) Domain Specific Querying of Web Question-answering for Climate Science Technical Approach NLP, Entity Disambiguation, Relation Learning Frequent Pattern Mining, Question Answering What’s unique Results Status and Future Work Outline
  • 10. 10 NOUS Workflow Knowledge Graph Model Streaming Data Knowledge Graph Tasks • Continuous pattern discovery. • Question Answering Knowledge Graph Verification• Natural Language processing • Collective Entity Linking • Distant supervision based relationship discovery Base KB + Streaming Data Bayesian Personalized Ranking-based link prediction to assign confidence Protected Information | Proprietary Information Apache Spark
  • 11. Triple Extraction from Natural Language April 22, 2017 11 the United States government crack an iPhone that belonged to a gunman in the San Bernardino Apple repair the particular iPhone hole that the government hacked. Federal officials specify the procedure used to open the iPhone Federal officials deny to specify the procedure used to open the iPhone. Jay Kaplan, chief executive of the tech securAppley company Synack and a former National Security Agency analyst. say Apple has to earn the trust of Apples customers,” the F.B.I. crack Mr. Farook’s LegbaCore, which previously found and fixed flaws for Apple. find flaws for Apple. LegbaCore, which previously found and fixed flaws for Apple. fix flaws for Apple. The challenge: turning this into a high quality representation Use existing tools Stanford Core NLP Open IE
  • 12. April 22, 2017 12 Entity Disambiguation and Relation Learning Implements Collective Entity Linking from Han et al, SIGIR 2011 Key idea: Search the graph with matches for all the above terms, build a mesh of terms and their related terms and pick the most densely connected combination Out of box performance low (higher 60%), when combined with domain-specific rules (85-90%+) Han et al in "Collective Entity Linking in Web Text: A Graph- based Method, SIGIR 2011" Relation Learning Implements Distance Supervision
  • 13. April 22, 2017 13 Now we have a Graph!
  • 14. 14 A Different Approach to Querying Let’s not demand users learn SQL or SPARQL Think in plain English, and we will transparently translate queries in background Protected Information | Proprietary Information
  • 15. April 22, 2017 15 Task 1: Finding Patterns from Data Stream A Pattern Growth approach : VLDB 2017 [Under Review]
  • 16. Task 1: Finding Patterns from Data Stream (Contd.) April 22, 2017 16 We discover behavioral patterns of drone related entities (WSJ, 2010-2015) Early in the period Towards the end Patterns help us understand the shift in a domain, and discover new trends
  • 17. Task 2: Question Answering for Relationship Explanations April 22, 2017 17 Our goal is to learn “patterns of explanations” by walking the graph structure Given a new question and the knowledge of such patterns, we can generalize and answer questions about unseen entities Examples: Why would Ford buy Palladium? Ford is-a automotive company, automotive company has-part catalytic converters, catalytic converters has-material Palladium Answer pattern: A is-a company B, B has-part C, C has-dependency D Why would Sutanay visit SFO in August 15? Sutanay has-interest Machine-Learning, SIGKDD related-to Machine-Learning, SIGKDD has-location SFO, SIGKDD has-start-date August 13
  • 18. Results – Explanatory Paths using coherence April 22, 2017 18
  • 19. April 22, 2017 19 Use Case Results: Domain-specific Querying of the Web Input Size: 2 million+ webpages Graph construction: 64-node cluster, each node with 16 cores Graph Analytics: 16-node Spark/Hadoop cluster, each with 16 cores
  • 20. April 22, 2017 20 Picking up real events by Patterns
  • 21. April 22, 2017 21 What’s Phenomenal Here? We are data scientists, not drone experts We processed nearly two million web crawls with minimal custom code and built the Knowledge Graph Analyzed trends, connected dots across multiple source and presented a hypothesis: Is tracking autonomous drones and their use important? Starting with the data analysis to coming up with the question : took less than 1.5 hours
  • 22. 22 Key Takeaways Open Source KB Construction and Querying Pipeline https://github.com/streaming-graphs/NOUS Version 1.0: Support for extracting standard entities, relation extraction via distant supervision, Advanced trending and explanatory questions The ability to answer queries where the answer is embedded across multiple data sources All algorithms implemented on top of Apache Spark and Scala Protected Information | Proprietary Information
  • 23. April 22, 2017 23 Status and Future Work We are committed towards developing an open source community A MySQL for Knowledge Graphs! Preferred method of payment J Add test code (in Scala or Python) Break the code AND show an important class of problem Version 2.0: Improving KB quality, Information maintenance over time, Initial support for human-computer interaction. Incorporate algorithms for latest advances in NLP, ED
  • 24. April 22, 2017 24 Questions?