SlideShare une entreprise Scribd logo
1  sur  27
Télécharger pour lire hors ligne
Graph Exploration w/ Neo4j
1
https://s3.amazonaws.com/dev.assets.neo4j.com/wp-content/uploads/graph-data-technologies-graph-databases-for-beginners.png
Our Project Partners
2
3GRAPH EXPLORATION
Efficiently extracting knowledge from graph data
even if we do not know exactly what we are looking for
Graph Exploration: From Users to Large Graphs.
CIKM 2016, SIGMOD 2017, KDD 2018
Adaptive Databases
GRAPH EXPLORATION
4
Graph Exploration Stack
Intuitive queries
Interactive algorithms
Users
Graph
Adaptive Databases
Intuitive queries
Interactive algorithms
GRAPH EXPLORATION
5
Graph Exploration Stack
Users
Graph
This project is about ...
6
3,27 × 109
base pairs
Graph Exploration in Biology - Complex Graphs
http://jcs.biologists.org/content/joces/118/21/4947/F3.large.jpg
Graph Exploration in Biology - Status Quo
MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype)
WHERE p1.name = 'foo1‘ AND p2.name = 'foo2p‘ AND a1.p < 0.01 AND a2.p < 0.01
WITH DISTINCT snpORDER BY snp.sid
RETURN collect(snp.sid)
7
MATCH
(p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype
)
WHERE p1.name = 'foo1'
AND p2.name = ‘foo2'
AND a1.p < 0.01 AND a2.p < 0.01
WITH DISTINCT snp
MATCH (snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene)
WHERE l.feature= 'gene'
RETURN collect(DISTINCT g.name)
MATCH
(p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype)
WHEREp1.name = 'foo1'
ANDp2.name = ‘foo2'
ANDa1.p < 0.01 AND a2.p < 0.01
WITH DISTINCT snp
MATCH(snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene)
WHERE l.feature= 'gene'
WITH DISTINCT g ORDER BY g.name
MATCH(g)-[:CODES]-(:Transcript)-[:CODES]-(p:Protein)-[:MEMBER]-(go:Goterm)
WHERE go.namespace= 'biological_process'
WITH DISTINCT go,p
RETURN go.name, count(p) ORDER BY count(p)DESC
LIMIT 10
MATCH
(p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype)
WHERE p1.name = 'foo1'
AND p2.name = ‘foo2'AND a1.p < 0.01 AND a2.p < 0.01
WITH DISTINCT snpMATCH (snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene)
WHERE l.feature= 'gene'
WITH DISTINCT gORDER BY g.name
MATCH (g)-[:CODES]-(:Transcript)-[:IS]-(ps:Probeset)-[:SIG]-(s:Sample)
WHERE s.name= 'mustafavi'
RETURN DISTINCT g.name
Can we do better?
8
Problem
● Given two node sets:
How similar are they in my understanding?
● Example
→ Set of movies I like
→ Set of movies I don’t know
→ Will I like the movies I don’t know?
9
As
Good
As It
Gets
Hell or
High
Water
Pulp
Fiction
The
Matrix
Skyfall
Avatar
?
What is a Knowledge Graph?
● (directed) graph G : ⟨V, E, φ, ψ⟩, where 
○ V is a set of nodes,
○ E ⊆ V × V is a set of edges,
○ φ : V → LV
 is an edge labeling function
and
○ ψ : E → LE
is a node labeling function
We refer to the elements of LV
and LE
as node
labels and edge labels
10
What are Meta-Paths?
Graph Path
Graph Schema Meta-Path
A meta-path for a path ⟨n1
, ..., nt
⟩, ni
∈ V , 1 ≤ i ≤ t is a sequence 
P : ⟨φ(n1
),ψ (n1
, n2
), ..., ψ (nt−1
, nt
), φ(nt
)⟩ that alternates node- and
edge-types along the path. 
11
Motivating Example
Q: How famous is Diane Kruger in America?
MATCH(n:Person)
WHERE n.name = “Diane Kruger”
RETURN n
MATCH(m:Movie)
WHERE m.location = “America”
RETURN m
Diane
Kruger
As
Good
As It
Gets
Stand
By MeTop
Gun
Pulp
Fiction A Few
Good
Men
The
Matrix
Up
12
How similar are they?
● Similarity depends on
○ expert knowledge
○ connections among nodes
13
Individualized
exploration
Extract ratingsCompute Meta-Paths
What does the System do and how?
Overview
✓✗
◎
Learn representation
for meta-paths
Calculate
similarity
14
Approximate Meta-Paths
Problem: How to compute all meta-paths fast?
Approx. Solution: Mine meta-paths using the graph’s schema!
Meta-Paths Computation
Compute schema
15
Approximate Meta-Paths
Problem: How to compute all meta-paths fast?
Approx. Solution: Mine meta-paths using the graph’s schema!
Meta-Paths Computation
Meta-Path
approximation
• Extract real and some
non-existent
meta-paths via DFS
on schema
Classifier
• Real meta-paths as
training data
• Predict probability of
meta-path
Exhaustive Meta-Path
computation
• Extract real
meta-paths via DFS
• Early termination
Training data Inference data
Neo4j Graph Algorithm Procedures
Use procedures for schema and meta-paths computation
16
Learning a Meta-Path Embedding
Problem: Vector representation required for active
learning and preference prediction.
Meta-Paths Embedding
?
(3 5 1)T
17
Learning a Meta-Path Embedding
Problem: Vector representation required for active
learning and preference prediction.
Solution: Embed meta-paths
→ Similar meta-paths should have similar vectors.
Challenges:
- Multiple similarity interpretations for meta-paths
- Large number of meta-paths.
- Shared parts between meta-paths.
Our method: Modified word2vec (and paragraph
vectors) method.
Meta-Paths Embedding
(3 5 1)T
18
Learn the Domain Value of all Meta-Paths
- Problem: Users don’t want to rate all meta-paths
→ too many
→ time-consuming
→ tedious and boring
- Solution: Label only a few, but very informative paths
→ explore the space of all meta-paths
→ sequentially query the most uncertain datapoint
⇒ use active learning strategy
Active Learning
✓✗
◎
19
Active Learning
20
Use Learned Preferences for Graph Exploration
Result Explanation
Icons made by Eucalyp from www.flaticon.com is
licensed by CC 3.0 BY
Graph (with
meta-paths)
Domain Knowledge
What is
important in
the graph?
Personalized Exploration
Tool
Similarity Measure
Related Nodes
Stats
21
Personalized Exploration Tool
Result Explanation
Transform Nodes to
Vectors
(Graph-Embedding)
Adapt Vectors Using
Domain-Knowledge
Personalized Vector
Space
precomputed
22
Personalized Exploration Tool
Result Explanation
23
Problem: Suggest similar nodes with regard to given node sets and
user preferences.
Solution: Learn personalized similarity metric between nodes →
Similar nodes should have similar vector space embeddings.
Challenges:
- Different view of abstraction between meta-path preferences and
node instances
- High computational costs but online adaption of embedding space
needed
Our method: Modified node2vec through bias on already
pretrained embeddings
Personalized Exploration Tool
Result Explanation
What nodes
are close to
my selection?How close are
my sets?
Find clusters!
What are
outliers?Personalized Vector
Space
24
System Architecture - How does it work with Neo4j?
Neo4j Graph Database
Neo4j Graph Algorithm Procedures
Containing Meta-Paths Computation
Python Backend Server
ReactJS Frontend
Meta-Path
Embedding
Active Learning Explanation
Node selection
Meta-Path
ordering
Result
visualization
25
● Easy to get your code running in neo4j.
● Neo4j-graph-algorithms: efficiency vs convenience.
● Sometimes no stack-trace when an error occurs.
● Great support and community. Always available.
● Cypher: Easy to begin with, hard to master.
(hpi)-[:LIKES]->(neo4j)
What about neo4j?
Meta-Paths Computation
26
Outlook
4 research topics:
● Efficient meta-paths computation
● Sensible meta-paths embedding
● Precise active learning of meta-paths ranking
● Explanation and exploration using ranked meta-paths
Open Source!
● Meta-paths computation in neo4j graph algos
● MetaExp Repository
● You can use and improve as well!
https://hpi.de/mueller/metaexp/ 27

Contenu connexe

Tendances

Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!TigerGraph
 
Training Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryTraining Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryNeo4j
 
Neo4j GraphTalk Basel - Health & Life Sciences
Neo4j GraphTalk Basel - Health & Life SciencesNeo4j GraphTalk Basel - Health & Life Sciences
Neo4j GraphTalk Basel - Health & Life SciencesNeo4j
 
Training Week: Build APIs with Neo4j GraphQL Library
Training Week: Build APIs with Neo4j GraphQL LibraryTraining Week: Build APIs with Neo4j GraphQL Library
Training Week: Build APIs with Neo4j GraphQL LibraryNeo4j
 
Graphs and Financial Services Analytics
Graphs and Financial Services AnalyticsGraphs and Financial Services Analytics
Graphs and Financial Services AnalyticsNeo4j
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareTigerGraph
 
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AIGraph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AITigerGraph
 
MongoDB company and case studies - john hong
MongoDB company and case studies - john hong MongoDB company and case studies - john hong
MongoDB company and case studies - john hong Ha-Yang(White) Moon
 
Graph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphGraph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphTigerGraph
 
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...TigerGraph
 
Democratizing Data
Democratizing DataDemocratizing Data
Democratizing DataDatabricks
 
Predicting Influence and Communities Using Graph Algorithms
Predicting Influence and Communities Using Graph AlgorithmsPredicting Influence and Communities Using Graph Algorithms
Predicting Influence and Communities Using Graph AlgorithmsDatabricks
 
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...TigerGraph
 
Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 TigerGraph
 
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesWebinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesMongoDB
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesNeo4j
 
Plume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis LibraryPlume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis LibraryTigerGraph
 
Transform Your Telecom Operations with Graph Technologies
Transform Your Telecom Operations with Graph TechnologiesTransform Your Telecom Operations with Graph Technologies
Transform Your Telecom Operations with Graph TechnologiesNeo4j
 
The convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on HadoopThe convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on HadoopDataWorks Summit
 
An Introduction to Graph: Database, Analytics, and Cloud Services
An Introduction to Graph:  Database, Analytics, and Cloud ServicesAn Introduction to Graph:  Database, Analytics, and Cloud Services
An Introduction to Graph: Database, Analytics, and Cloud ServicesJean Ihm
 

Tendances (20)

Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!Graph Hardware Architecture - Enterprise graphs deserve great hardware!
Graph Hardware Architecture - Enterprise graphs deserve great hardware!
 
Training Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL LibraryTraining Series: Build APIs with Neo4j GraphQL Library
Training Series: Build APIs with Neo4j GraphQL Library
 
Neo4j GraphTalk Basel - Health & Life Sciences
Neo4j GraphTalk Basel - Health & Life SciencesNeo4j GraphTalk Basel - Health & Life Sciences
Neo4j GraphTalk Basel - Health & Life Sciences
 
Training Week: Build APIs with Neo4j GraphQL Library
Training Week: Build APIs with Neo4j GraphQL LibraryTraining Week: Build APIs with Neo4j GraphQL Library
Training Week: Build APIs with Neo4j GraphQL Library
 
Graphs and Financial Services Analytics
Graphs and Financial Services AnalyticsGraphs and Financial Services Analytics
Graphs and Financial Services Analytics
 
Fast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA HardwareFast Parallel Similarity Calculations with FPGA Hardware
Fast Parallel Similarity Calculations with FPGA Hardware
 
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AIGraph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
Graph Gurus 21: Integrating Real-Time Deep-Link Graph Analytics with Spark AI
 
MongoDB company and case studies - john hong
MongoDB company and case studies - john hong MongoDB company and case studies - john hong
MongoDB company and case studies - john hong
 
Graph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphGraph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise Graph
 
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
Graph Gurus Episode 25: Unleash the Business Value of Your Data Lake with Gra...
 
Democratizing Data
Democratizing DataDemocratizing Data
Democratizing Data
 
Predicting Influence and Communities Using Graph Algorithms
Predicting Influence and Communities Using Graph AlgorithmsPredicting Influence and Communities Using Graph Algorithms
Predicting Influence and Communities Using Graph Algorithms
 
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
Using Graph Algorithms For Advanced Analytics - Part 4 Similarity 30 graph al...
 
Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4 Graph Gurus 15: Introducing TigerGraph 2.4
Graph Gurus 15: Introducing TigerGraph 2.4
 
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data LakesWebinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
 
Plume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis LibraryPlume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis Library
 
Transform Your Telecom Operations with Graph Technologies
Transform Your Telecom Operations with Graph TechnologiesTransform Your Telecom Operations with Graph Technologies
Transform Your Telecom Operations with Graph Technologies
 
The convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on HadoopThe convergence of reporting and interactive BI on Hadoop
The convergence of reporting and interactive BI on Hadoop
 
An Introduction to Graph: Database, Analytics, and Cloud Services
An Introduction to Graph:  Database, Analytics, and Cloud ServicesAn Introduction to Graph:  Database, Analytics, and Cloud Services
An Introduction to Graph: Database, Analytics, and Cloud Services
 

Similaire à Neo4j MeetUp - Graph Exploration with MetaExp

Graph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXGraph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXBenjamin Bengfort
 
Dagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsDagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsArijit Khan
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneDoug Needham
 
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022ArangoDB Database
 
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....ArangoDB Database
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingNesreen K. Ahmed
 
Knowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningKnowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningConnected Data World
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning ProjectAdeyemi Fowe
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsEnrico Palumbo
 
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Eiji Sekiya
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Doug Needham
 
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxImprove ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxNeo4j
 
Generative modeling with Convolutional Neural Networks
Generative modeling with Convolutional Neural NetworksGenerative modeling with Convolutional Neural Networks
Generative modeling with Convolutional Neural NetworksDenis Dus
 
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022ArangoDB Database
 
Introduction to Graph neural networks @ Vienna Deep Learning meetup
Introduction to Graph neural networks @  Vienna Deep Learning meetupIntroduction to Graph neural networks @  Vienna Deep Learning meetup
Introduction to Graph neural networks @ Vienna Deep Learning meetupLiad Magen
 
Relational machine-learning
Relational machine-learningRelational machine-learning
Relational machine-learningBhushan Kotnis
 
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GANRecent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GANHao-Wen (Herman) Dong
 

Similaire à Neo4j MeetUp - Graph Exploration with MetaExp (20)

Graph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkXGraph Analyses with Python and NetworkX
Graph Analyses with Python and NetworkX
 
Dagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsDagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphs
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZone
 
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
Machine Learning + Graph Databases for Better Recommendations V2 08/20/2022
 
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
ATO 2022 - Machine Learning + Graph Databases for Better Recommendations (3)....
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and Modeling
 
Knowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningKnowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep Learning
 
Machine Learning Project
Machine Learning ProjectMachine Learning Project
Machine Learning Project
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
 
Ghost
GhostGhost
Ghost
 
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview.
 
PointNet
PointNetPointNet
PointNet
 
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptxImprove ml predictions using graph algorithms (webinar july 23_19).pptx
Improve ml predictions using graph algorithms (webinar july 23_19).pptx
 
Generative modeling with Convolutional Neural Networks
Generative modeling with Convolutional Neural NetworksGenerative modeling with Convolutional Neural Networks
Generative modeling with Convolutional Neural Networks
 
Yarn spark next_gen_hadoop_8_jan_2014
Yarn spark next_gen_hadoop_8_jan_2014Yarn spark next_gen_hadoop_8_jan_2014
Yarn spark next_gen_hadoop_8_jan_2014
 
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
Machine Learning + Graph Databases for Better Recommendations V1 08/06/2022
 
Introduction to Graph neural networks @ Vienna Deep Learning meetup
Introduction to Graph neural networks @  Vienna Deep Learning meetupIntroduction to Graph neural networks @  Vienna Deep Learning meetup
Introduction to Graph neural networks @ Vienna Deep Learning meetup
 
Relational machine-learning
Relational machine-learningRelational machine-learning
Relational machine-learning
 
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GANRecent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
Recent Progress on Utilizing Tag Information with GANs - StarGAN & TD-GAN
 

Dernier

Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingShane Coughlan
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringHironori Washizaki
 
Ronisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited CatalogueRonisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited Catalogueitservices996
 
2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shardsChristopher Curtin
 
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...OnePlan Solutions
 
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesAmazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesKrzysztofKkol1
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLionel Briand
 
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxReal-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxRTS corp
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identityteam-WIBU
 
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonLeveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonApplitools
 
OpenChain Education Work Group Monthly Meeting - 2024-04-10 - Full Recording
OpenChain Education Work Group Monthly Meeting - 2024-04-10 - Full RecordingOpenChain Education Work Group Monthly Meeting - 2024-04-10 - Full Recording
OpenChain Education Work Group Monthly Meeting - 2024-04-10 - Full RecordingShane Coughlan
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jNeo4j
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecturerahul_net
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolsosttopstonverter
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slidesvaideheekore1
 
Osi security architecture in network.pptx
Osi security architecture in network.pptxOsi security architecture in network.pptx
Osi security architecture in network.pptxVinzoCenzo
 
Strategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsStrategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsJean Silva
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfDrew Moseley
 

Dernier (20)

Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
 
Machine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their EngineeringMachine Learning Software Engineering Patterns and Their Engineering
Machine Learning Software Engineering Patterns and Their Engineering
 
Ronisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited CatalogueRonisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited Catalogue
 
2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards
 
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
 
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilitiesAmazon Bedrock in Action - presentation of the Bedrock's capabilities
Amazon Bedrock in Action - presentation of the Bedrock's capabilities
 
Large Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and RepairLarge Language Models for Test Case Evolution and Repair
Large Language Models for Test Case Evolution and Repair
 
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptxReal-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
Real-time Tracking and Monitoring with Cargo Cloud Solutions.pptx
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identity
 
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonLeveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
 
OpenChain Education Work Group Monthly Meeting - 2024-04-10 - Full Recording
OpenChain Education Work Group Monthly Meeting - 2024-04-10 - Full RecordingOpenChain Education Work Group Monthly Meeting - 2024-04-10 - Full Recording
OpenChain Education Work Group Monthly Meeting - 2024-04-10 - Full Recording
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecture
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration tools
 
Introduction to Firebase Workshop Slides
Introduction to Firebase Workshop SlidesIntroduction to Firebase Workshop Slides
Introduction to Firebase Workshop Slides
 
Osi security architecture in network.pptx
Osi security architecture in network.pptxOsi security architecture in network.pptx
Osi security architecture in network.pptx
 
Strategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsStrategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero results
 
Comparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdfComparing Linux OS Image Update Models - EOSS 2024.pdf
Comparing Linux OS Image Update Models - EOSS 2024.pdf
 

Neo4j MeetUp - Graph Exploration with MetaExp

  • 1. Graph Exploration w/ Neo4j 1 https://s3.amazonaws.com/dev.assets.neo4j.com/wp-content/uploads/graph-data-technologies-graph-databases-for-beginners.png
  • 3. 3GRAPH EXPLORATION Efficiently extracting knowledge from graph data even if we do not know exactly what we are looking for Graph Exploration: From Users to Large Graphs. CIKM 2016, SIGMOD 2017, KDD 2018
  • 4. Adaptive Databases GRAPH EXPLORATION 4 Graph Exploration Stack Intuitive queries Interactive algorithms Users Graph
  • 5. Adaptive Databases Intuitive queries Interactive algorithms GRAPH EXPLORATION 5 Graph Exploration Stack Users Graph This project is about ...
  • 6. 6 3,27 × 109 base pairs Graph Exploration in Biology - Complex Graphs http://jcs.biologists.org/content/joces/118/21/4947/F3.large.jpg
  • 7. Graph Exploration in Biology - Status Quo MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype) WHERE p1.name = 'foo1‘ AND p2.name = 'foo2p‘ AND a1.p < 0.01 AND a2.p < 0.01 WITH DISTINCT snpORDER BY snp.sid RETURN collect(snp.sid) 7 MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype ) WHERE p1.name = 'foo1' AND p2.name = ‘foo2' AND a1.p < 0.01 AND a2.p < 0.01 WITH DISTINCT snp MATCH (snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene) WHERE l.feature= 'gene' RETURN collect(DISTINCT g.name) MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype) WHEREp1.name = 'foo1' ANDp2.name = ‘foo2' ANDa1.p < 0.01 AND a2.p < 0.01 WITH DISTINCT snp MATCH(snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene) WHERE l.feature= 'gene' WITH DISTINCT g ORDER BY g.name MATCH(g)-[:CODES]-(:Transcript)-[:CODES]-(p:Protein)-[:MEMBER]-(go:Goterm) WHERE go.namespace= 'biological_process' WITH DISTINCT go,p RETURN go.name, count(p) ORDER BY count(p)DESC LIMIT 10 MATCH (p1:Phenotype)-[:HAS]-(a1:Association)-[:HAS]-(snp:Snp)-[:HAS]-(a2:Association)-[:HAS]-(p2:Phenotype) WHERE p1.name = 'foo1' AND p2.name = ‘foo2'AND a1.p < 0.01 AND a2.p < 0.01 WITH DISTINCT snpMATCH (snp)-[:IN]-(pw:PositionWindow)<-[:IN]-(l:Locus)--(g:Gene) WHERE l.feature= 'gene' WITH DISTINCT gORDER BY g.name MATCH (g)-[:CODES]-(:Transcript)-[:IS]-(ps:Probeset)-[:SIG]-(s:Sample) WHERE s.name= 'mustafavi' RETURN DISTINCT g.name
  • 8. Can we do better? 8
  • 9. Problem ● Given two node sets: How similar are they in my understanding? ● Example → Set of movies I like → Set of movies I don’t know → Will I like the movies I don’t know? 9 As Good As It Gets Hell or High Water Pulp Fiction The Matrix Skyfall Avatar ?
  • 10. What is a Knowledge Graph? ● (directed) graph G : ⟨V, E, φ, ψ⟩, where  ○ V is a set of nodes, ○ E ⊆ V × V is a set of edges, ○ φ : V → LV  is an edge labeling function and ○ ψ : E → LE is a node labeling function We refer to the elements of LV and LE as node labels and edge labels 10
  • 11. What are Meta-Paths? Graph Path Graph Schema Meta-Path A meta-path for a path ⟨n1 , ..., nt ⟩, ni ∈ V , 1 ≤ i ≤ t is a sequence  P : ⟨φ(n1 ),ψ (n1 , n2 ), ..., ψ (nt−1 , nt ), φ(nt )⟩ that alternates node- and edge-types along the path.  11
  • 12. Motivating Example Q: How famous is Diane Kruger in America? MATCH(n:Person) WHERE n.name = “Diane Kruger” RETURN n MATCH(m:Movie) WHERE m.location = “America” RETURN m Diane Kruger As Good As It Gets Stand By MeTop Gun Pulp Fiction A Few Good Men The Matrix Up 12
  • 13. How similar are they? ● Similarity depends on ○ expert knowledge ○ connections among nodes 13
  • 14. Individualized exploration Extract ratingsCompute Meta-Paths What does the System do and how? Overview ✓✗ ◎ Learn representation for meta-paths Calculate similarity 14
  • 15. Approximate Meta-Paths Problem: How to compute all meta-paths fast? Approx. Solution: Mine meta-paths using the graph’s schema! Meta-Paths Computation Compute schema 15
  • 16. Approximate Meta-Paths Problem: How to compute all meta-paths fast? Approx. Solution: Mine meta-paths using the graph’s schema! Meta-Paths Computation Meta-Path approximation • Extract real and some non-existent meta-paths via DFS on schema Classifier • Real meta-paths as training data • Predict probability of meta-path Exhaustive Meta-Path computation • Extract real meta-paths via DFS • Early termination Training data Inference data Neo4j Graph Algorithm Procedures Use procedures for schema and meta-paths computation 16
  • 17. Learning a Meta-Path Embedding Problem: Vector representation required for active learning and preference prediction. Meta-Paths Embedding ? (3 5 1)T 17
  • 18. Learning a Meta-Path Embedding Problem: Vector representation required for active learning and preference prediction. Solution: Embed meta-paths → Similar meta-paths should have similar vectors. Challenges: - Multiple similarity interpretations for meta-paths - Large number of meta-paths. - Shared parts between meta-paths. Our method: Modified word2vec (and paragraph vectors) method. Meta-Paths Embedding (3 5 1)T 18
  • 19. Learn the Domain Value of all Meta-Paths - Problem: Users don’t want to rate all meta-paths → too many → time-consuming → tedious and boring - Solution: Label only a few, but very informative paths → explore the space of all meta-paths → sequentially query the most uncertain datapoint ⇒ use active learning strategy Active Learning ✓✗ ◎ 19
  • 21. Use Learned Preferences for Graph Exploration Result Explanation Icons made by Eucalyp from www.flaticon.com is licensed by CC 3.0 BY Graph (with meta-paths) Domain Knowledge What is important in the graph? Personalized Exploration Tool Similarity Measure Related Nodes Stats 21
  • 22. Personalized Exploration Tool Result Explanation Transform Nodes to Vectors (Graph-Embedding) Adapt Vectors Using Domain-Knowledge Personalized Vector Space precomputed 22
  • 23. Personalized Exploration Tool Result Explanation 23 Problem: Suggest similar nodes with regard to given node sets and user preferences. Solution: Learn personalized similarity metric between nodes → Similar nodes should have similar vector space embeddings. Challenges: - Different view of abstraction between meta-path preferences and node instances - High computational costs but online adaption of embedding space needed Our method: Modified node2vec through bias on already pretrained embeddings
  • 24. Personalized Exploration Tool Result Explanation What nodes are close to my selection?How close are my sets? Find clusters! What are outliers?Personalized Vector Space 24
  • 25. System Architecture - How does it work with Neo4j? Neo4j Graph Database Neo4j Graph Algorithm Procedures Containing Meta-Paths Computation Python Backend Server ReactJS Frontend Meta-Path Embedding Active Learning Explanation Node selection Meta-Path ordering Result visualization 25
  • 26. ● Easy to get your code running in neo4j. ● Neo4j-graph-algorithms: efficiency vs convenience. ● Sometimes no stack-trace when an error occurs. ● Great support and community. Always available. ● Cypher: Easy to begin with, hard to master. (hpi)-[:LIKES]->(neo4j) What about neo4j? Meta-Paths Computation 26
  • 27. Outlook 4 research topics: ● Efficient meta-paths computation ● Sensible meta-paths embedding ● Precise active learning of meta-paths ranking ● Explanation and exploration using ranked meta-paths Open Source! ● Meta-paths computation in neo4j graph algos ● MetaExp Repository ● You can use and improve as well! https://hpi.de/mueller/metaexp/ 27