SlideShare a Scribd company logo
1 of 27
Download to read offline
-­‐
-­‐
-­‐
-­‐
-­‐
Social  network  
Human  Disease  Network  
[Barabasi 2007]
Food  Web  [2007]
Terrorist  Network
[Krebs  2002]Internet  (AS)  [2005]
Gene  Regulatory  Network  
[Decourty 2008]
Protein  Interactions  
[breast  cancer]
Political  blogs
Power  grid
input 0 …
1 …
0 …
Feature	
  
Engineering
features
1 …
1 … 0
0
1
0
0
Learning	
  
AlgorithmModel
Prediction	
  Task
Link	
  prediction
Classification	
  
Anomaly	
  detection
input 0 …
1 …
0 …
Feature	
  
Engineering
features
1 …
1 … 0
0
1
0
0
Learning	
  
AlgorithmModel
Prediction	
  Task
Automatic	
  
Feature	
  Learning
Link	
  prediction
Classification	
  
Anomaly	
  detection
§ Goal:  Learn  representation  (features)  for  a  set  of  graph  
elements  (nodes,  edges,  etc.)
§ Key  intuition:  Map  the  graph  elements  (e.g.,  nodes)  to  the  
d-­‐dimension  space,  while  preserving  node  similarity
§ Use  the  features  for  any  downstream  prediction  task
Recent  work:  Map  nodes  based  on  their  proximity  in  the  
input  graph  – (nearby  nodes  are  close  together)
DeepWalk Model
Perrozi et	
  al.	
  KDD	
  2014
Recent  work:  Map  nodes  based  on  their  proximity  in  the  
input  graph  – (nearby  nodes  are  close  together)
How  to  get  nearby  nodes?
Perrozi et	
  al.	
  KDD	
  2014
Grover	
  et	
  al.	
  KDD	
  2016
Recent  work:  Map  nodes  based  on  their  proximity  in  the  
input  graph  – (nearby  nodes  are  close  together)
§ A  (conditional)  walk/path  is  a  finite  sequence  of  adjacent  
vertices  in  the  graph
How  to  get  nearby  nodes?
Perrozi et	
  al.	
  KDD	
  2014
Grover	
  et	
  al.	
  KDD	
  2016
V1
V3
V4
V2
V5
The  random  walk  traversed  link  V1  -­‐-­‐-­‐ V2
Evaluating  next  step  at  node  V2
Mikolov et	
  al.	
  ICLR	
  2013
Perrozi et	
  al.	
  KDD	
  2014
focus	
  vertex
§ No  support  for  inductive/transfer  learning
• features  are  learned  for  node  identities  
• features  do  not  generalize  beyond  the  input  graph
§ Map  nodes  based  on  their  proximity  only
§ No  notion  of  attributes
§ No  notion  of  structural  similarity
Communities:  cohesive  subsets  of  nodes
Roles:  represent  structural  patterns
-­‐ two  nodes  belong  to  the  same  role  if  they’ve  similar  structural  patterns
Cj#
Ci#
Ck#
Rossi	
  &	
  Ahmed	
  TKDE	
  2015
Ahmed	
  et	
  al.	
  AAAI	
  2017
Goal:  Find  a  mapping  of  nodes  to  d-­‐dimensions  that  preserves  
proximity  and  node  similarity
Using  structure  +  attributes  (if  any)
Ahmed	
  et.	
  al	
  2017
A  (conditional)  attributed  walk  is  a  finite  sequence  of  adjacent  
node  types  (words)  in  the  graph
Ahmed	
  et.	
  al	
  2017
The  random  walk  traversed  link                            ,  
Evaluating  next  step  at  node  V2
focus	
  vertex
Ahmed	
  et.	
  al	
  2017
G1
1
G2
3
2
G3
4
G4
5
6
G5
7
8
G6
9
G7
10
11
12
G9
15
G8
13
14
Network  Motifs:  Simple  Building  Blocks  of  Complex  Networks  – [Milo  et  al.  – Science  2002]
The  Structure  and  Function  of  Complex  Networks  – [Newman  – Siam  Review  2003]
Applied  to  food,  biologcal,  genetic,  neural,  web,  and  other  networks
§ Predict  which  pairs  of  nodes  are  likely  to  connect
§ Applications: social  network  analysis,  biological  networks,  
terrorist  networks,  etc.
Deepwalk (DW)  – Perrozi et  al.  KDD  2014
node2vec    (N2V)  – Grover  et  al.  KDD  2016
LINE:  Tang  et  al.  – WWW  2015
1 2 4 8 12 16
0
2
4
6
8
10
12
14
16
Number of processing units
Speedup
socfb−MIT
bio−dmela
soc−gowalla
tech−RL−caida
web−wikipedia09
1 2 4 8 12 16
0
2
4
6
8
10
12
14
16
Number of processing units
Speedup
Strong  scaling  results
Using  Intel  Xeon  E5-­‐2687W  server,  16  cores
Motif  Counting
§ We  propose  a  generic  framework  for  learning  representation  
in  large  attributed  graphs
§ Maps  nodes  based  on  Structural  similarity  +  proximity  +  
attributes  (if  any)
§ Learns  universal  features  that  can  generalize  across  
networks/graphs
§ Useful  for  inductive/transfer learning
§ Scalable  for  large  graphs
§ Generalizing  other  deep  graph  models
§ Theoretical  analysis  
§ Choice  of  mapping  functions
§ Impact  of  sampling  strategy  
§ Evaluation  on  other  ML  tasks
§ Efficient  estimation  of  word  representations  in  vector  space.  ICLR  2013  [Mikolov et.  al]
§ A  Framework  for  Generalizing  Graph-­‐based  Representation  Learning  Methods.  arXiv:1709.04596    2017  [Ahmed  et.  al]
§ Role  Discovery  in  Networks.  TKDE  2015  [Rossi  &  Ahmed]
§ A  Higher-­‐order  Latent  Space  Network  Model.  AAAI  2017  [Ahmed,  Rossi,  Willke,  Zhou]
§ node2vec:  Scalable  Feature  Learning  for  Networks.  KDD  2016  [Grover,  Leskovec]
§ DeepWalk:  online  learning  of  social  representations.  KDD  2014  [Perozzi,  Al-­‐Rafou,  Skiena]
§ Efficient  Graphlet Counting  for  Large  Networks.  ICDM  2015,  [Ahmed  et  al.]
§ Graphlet Decomposition:  Framework,  Algorithms,  and  Applications.  J.  Know.  &  Info.  2016  [Ahmed  et  al.]
§ Network  Motifs:  Simple  Building  Blocks  of  Complex  Networks.  Science  2002,  [Milo  et  al.]
§ Uncovering  Biological  Network  Function  via  Graphlet Degree  Signatures.  Cancer  Informatics  2008  [Milenković-­‐Pržulj]
§ Graph  Kernels.  JMLR  2010,  [Vishwanathan et  al.]
§ The  Structure   and  Function  of  Complex  Networks.  SIAM  Review  2003,  [Newman]
§ Biological  network  comparison  using  graphlet degree  distribution.  Bioinformatics  2007  [Pržulj]
§ Efficient  Graphlet Kernels  for  Large  Graph  Comparison.  AISTAT  2009  [Shervashidze et  al.]
§ Local  structure   in  social  networks.  Sociological  methodology  1976,  [Holland-­‐Leinhardt]
§ The  strength   of  weak  ties:  A  network  theory  revisited.  Sociological  theory 1983  [Granovetter]
Thank  you!
Questions?
nesreen.k.ahmed@intel.com
http://nesreenahmed.com

More Related Content

What's hot

Machine learning with graph
Machine learning with graphMachine learning with graph
Machine learning with graphDing Li
 
Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning Wanjin Yu
 
network mining and representation learning
network mining and representation learningnetwork mining and representation learning
network mining and representation learningsun peiyuan
 
20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworks20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworkstm1966
 
Higher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoIHigher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoIAustin Benson
 
Architecture Design for Deep Neural Networks I
Architecture Design for Deep Neural Networks IArchitecture Design for Deep Neural Networks I
Architecture Design for Deep Neural Networks IWanjin Yu
 
Social Network Analysis and Visualization
Social Network Analysis and VisualizationSocial Network Analysis and Visualization
Social Network Analysis and VisualizationAlberto Ramirez
 
LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
 LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
LCF: A Temporal Approach to Link Prediction in Dynamic Social NetworksIJCSIS Research Publications
 
Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachElena Sügis
 
Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace
Gradient-Based Meta-Learning with Learned Layerwise Metric and SubspaceGradient-Based Meta-Learning with Learned Layerwise Metric and Subspace
Gradient-Based Meta-Learning with Learned Layerwise Metric and SubspaceYoonho Lee
 
Convolutional neural networks for image classification — evidence from Kaggle...
Convolutional neural networks for image classification — evidence from Kaggle...Convolutional neural networks for image classification — evidence from Kaggle...
Convolutional neural networks for image classification — evidence from Kaggle...Dmytro Mishkin
 
[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)Kunwoo Park
 
Link Prediction in (Partially) Aligned Heterogeneous Social Networks
Link Prediction in (Partially) Aligned Heterogeneous Social NetworksLink Prediction in (Partially) Aligned Heterogeneous Social Networks
Link Prediction in (Partially) Aligned Heterogeneous Social NetworksSina Sajadmanesh
 
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics Kolja Kleineberg
 
Probabilistic Relational Models for Link Prediction Problem
Probabilistic Relational Models for Link Prediction ProblemProbabilistic Relational Models for Link Prediction Problem
Probabilistic Relational Models for Link Prediction ProblemSina Sajadmanesh
 
Geometric correlations in multiplexes and how they make them more robust
Geometric correlations in multiplexes and how they make them more robustGeometric correlations in multiplexes and how they make them more robust
Geometric correlations in multiplexes and how they make them more robustKolja Kleineberg
 
VLDB 2015 Tutorial: On Uncertain Graph Modeling and Queries
VLDB 2015 Tutorial: On Uncertain Graph Modeling and QueriesVLDB 2015 Tutorial: On Uncertain Graph Modeling and Queries
VLDB 2015 Tutorial: On Uncertain Graph Modeling and QueriesArijit Khan
 
Networks, Deep Learning and COVID-19
Networks, Deep Learning and COVID-19Networks, Deep Learning and COVID-19
Networks, Deep Learning and COVID-19tm1966
 

What's hot (20)

Machine learning with graph
Machine learning with graphMachine learning with graph
Machine learning with graph
 
Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning Big Data Intelligence: from Correlation Discovery to Causal Reasoning
Big Data Intelligence: from Correlation Discovery to Causal Reasoning
 
network mining and representation learning
network mining and representation learningnetwork mining and representation learning
network mining and representation learning
 
20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworks20191107 deeplearningapproachesfornetworks
20191107 deeplearningapproachesfornetworks
 
Higher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoIHigher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoI
 
Architecture Design for Deep Neural Networks I
Architecture Design for Deep Neural Networks IArchitecture Design for Deep Neural Networks I
Architecture Design for Deep Neural Networks I
 
Social Network Analysis and Visualization
Social Network Analysis and VisualizationSocial Network Analysis and Visualization
Social Network Analysis and Visualization
 
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
 
LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
 LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
LCF: A Temporal Approach to Link Prediction in Dynamic Social Networks
 
Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approach
 
Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace
Gradient-Based Meta-Learning with Learned Layerwise Metric and SubspaceGradient-Based Meta-Learning with Learned Layerwise Metric and Subspace
Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace
 
3D 딥러닝 동향
3D 딥러닝 동향3D 딥러닝 동향
3D 딥러닝 동향
 
Convolutional neural networks for image classification — evidence from Kaggle...
Convolutional neural networks for image classification — evidence from Kaggle...Convolutional neural networks for image classification — evidence from Kaggle...
Convolutional neural networks for image classification — evidence from Kaggle...
 
[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)
 
Link Prediction in (Partially) Aligned Heterogeneous Social Networks
Link Prediction in (Partially) Aligned Heterogeneous Social NetworksLink Prediction in (Partially) Aligned Heterogeneous Social Networks
Link Prediction in (Partially) Aligned Heterogeneous Social Networks
 
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
The Hidden Geometry of Multiplex Networks @ Next Generation Network Analytics
 
Probabilistic Relational Models for Link Prediction Problem
Probabilistic Relational Models for Link Prediction ProblemProbabilistic Relational Models for Link Prediction Problem
Probabilistic Relational Models for Link Prediction Problem
 
Geometric correlations in multiplexes and how they make them more robust
Geometric correlations in multiplexes and how they make them more robustGeometric correlations in multiplexes and how they make them more robust
Geometric correlations in multiplexes and how they make them more robust
 
VLDB 2015 Tutorial: On Uncertain Graph Modeling and Queries
VLDB 2015 Tutorial: On Uncertain Graph Modeling and QueriesVLDB 2015 Tutorial: On Uncertain Graph Modeling and Queries
VLDB 2015 Tutorial: On Uncertain Graph Modeling and Queries
 
Networks, Deep Learning and COVID-19
Networks, Deep Learning and COVID-19Networks, Deep Learning and COVID-19
Networks, Deep Learning and COVID-19
 

Similar to Representation Learning in Large Attributed Graphs

High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingNesreen K. Ahmed
 
Using Set Cover to Optimize a Large-Scale Low Latency Distributed Graph
Using Set Cover to Optimize a Large-Scale Low Latency Distributed GraphUsing Set Cover to Optimize a Large-Scale Low Latency Distributed Graph
Using Set Cover to Optimize a Large-Scale Low Latency Distributed GraphRui Wang
 
Automating Software Development Using Artificial Intelligence (AI)
Automating Software Development Using Artificial Intelligence (AI)Automating Software Development Using Artificial Intelligence (AI)
Automating Software Development Using Artificial Intelligence (AI)Jeremy Bradbury
 
Principles of Data Visualization
Principles of Data VisualizationPrinciples of Data Visualization
Principles of Data VisualizationEamonn Maguire
 
ArXiv Literature Exploration using Social Network Analysis
ArXiv Literature Exploration using Social Network AnalysisArXiv Literature Exploration using Social Network Analysis
ArXiv Literature Exploration using Social Network AnalysisTanat Iempreedee
 
The Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsThe Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsNeo4j
 
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
Leveraging Multiple GPUs and CPUs for  Graphlet Counting in Large Networks Leveraging Multiple GPUs and CPUs for  Graphlet Counting in Large Networks
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks Ryan Rossi
 
Information Technology in Industry(ITII) - November Issue 2018
Information Technology in Industry(ITII) - November Issue 2018Information Technology in Industry(ITII) - November Issue 2018
Information Technology in Industry(ITII) - November Issue 2018ITIIIndustries
 
Integrating GIS utility data in the UK
Integrating GIS utility data in the UKIntegrating GIS utility data in the UK
Integrating GIS utility data in the UKAntArch
 
Architectural Support for Model-driven Performance Prediction of Distributed ...
Architectural Support for Model-driven Performance Prediction of Distributed ...Architectural Support for Model-driven Performance Prediction of Distributed ...
Architectural Support for Model-driven Performance Prediction of Distributed ...Vanea Chiprianov
 
03 interlinking-dass
03 interlinking-dass03 interlinking-dass
03 interlinking-dassDiego Pessoa
 
Graph based Clustering
Graph based ClusteringGraph based Clustering
Graph based Clustering怡秀 林
 
From Signal to Symbols
From Signal to SymbolsFrom Signal to Symbols
From Signal to Symbolsgpano
 
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...Skolkovo Robotics Center
 
Script Identification for printed document images at text-line level using DC...
Script Identification for printed document images at text-line level using DC...Script Identification for printed document images at text-line level using DC...
Script Identification for printed document images at text-line level using DC...IOSR Journals
 
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Eiji Sekiya
 
LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...TELKOMNIKA JOURNAL
 
Обучение нейросетей компьютерного зрения в видеоиграх
Обучение нейросетей компьютерного зрения в видеоиграхОбучение нейросетей компьютерного зрения в видеоиграх
Обучение нейросетей компьютерного зрения в видеоиграхAnatol Alizar
 
User Interfaces that Design Themselves: Talk given at Data-Driven Design Day ...
User Interfaces that Design Themselves: Talk given at Data-Driven Design Day ...User Interfaces that Design Themselves: Talk given at Data-Driven Design Day ...
User Interfaces that Design Themselves: Talk given at Data-Driven Design Day ...Aalto University
 

Similar to Representation Learning in Large Attributed Graphs (20)

High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and Modeling
 
Using Set Cover to Optimize a Large-Scale Low Latency Distributed Graph
Using Set Cover to Optimize a Large-Scale Low Latency Distributed GraphUsing Set Cover to Optimize a Large-Scale Low Latency Distributed Graph
Using Set Cover to Optimize a Large-Scale Low Latency Distributed Graph
 
Automating Software Development Using Artificial Intelligence (AI)
Automating Software Development Using Artificial Intelligence (AI)Automating Software Development Using Artificial Intelligence (AI)
Automating Software Development Using Artificial Intelligence (AI)
 
Principles of Data Visualization
Principles of Data VisualizationPrinciples of Data Visualization
Principles of Data Visualization
 
ArXiv Literature Exploration using Social Network Analysis
ArXiv Literature Exploration using Social Network AnalysisArXiv Literature Exploration using Social Network Analysis
ArXiv Literature Exploration using Social Network Analysis
 
The Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsThe Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing Systems
 
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
Leveraging Multiple GPUs and CPUs for  Graphlet Counting in Large Networks Leveraging Multiple GPUs and CPUs for  Graphlet Counting in Large Networks
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
 
Information Technology in Industry(ITII) - November Issue 2018
Information Technology in Industry(ITII) - November Issue 2018Information Technology in Industry(ITII) - November Issue 2018
Information Technology in Industry(ITII) - November Issue 2018
 
ThesisProposal
ThesisProposalThesisProposal
ThesisProposal
 
Integrating GIS utility data in the UK
Integrating GIS utility data in the UKIntegrating GIS utility data in the UK
Integrating GIS utility data in the UK
 
Architectural Support for Model-driven Performance Prediction of Distributed ...
Architectural Support for Model-driven Performance Prediction of Distributed ...Architectural Support for Model-driven Performance Prediction of Distributed ...
Architectural Support for Model-driven Performance Prediction of Distributed ...
 
03 interlinking-dass
03 interlinking-dass03 interlinking-dass
03 interlinking-dass
 
Graph based Clustering
Graph based ClusteringGraph based Clustering
Graph based Clustering
 
From Signal to Symbols
From Signal to SymbolsFrom Signal to Symbols
From Signal to Symbols
 
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
Burnaev and Notchenko. Skoltech. Bridging gap between 2D and 3D with Deep Lea...
 
Script Identification for printed document images at text-line level using DC...
Script Identification for printed document images at text-line level using DC...Script Identification for printed document images at text-line level using DC...
Script Identification for printed document images at text-line level using DC...
 
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
Semi-Supervised Classification with Graph Convolutional Networks @ICLR2017読み会
 
LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...LPCNN: convolutional neural network for link prediction based on network stru...
LPCNN: convolutional neural network for link prediction based on network stru...
 
Обучение нейросетей компьютерного зрения в видеоиграх
Обучение нейросетей компьютерного зрения в видеоиграхОбучение нейросетей компьютерного зрения в видеоиграх
Обучение нейросетей компьютерного зрения в видеоиграх
 
User Interfaces that Design Themselves: Talk given at Data-Driven Design Day ...
User Interfaces that Design Themselves: Talk given at Data-Driven Design Day ...User Interfaces that Design Themselves: Talk given at Data-Driven Design Day ...
User Interfaces that Design Themselves: Talk given at Data-Driven Design Day ...
 

More from Nesreen K. Ahmed

Sampling from Massive Graph Streams: A Unifying Framework
Sampling from Massive Graph Streams: A Unifying FrameworkSampling from Massive Graph Streams: A Unifying Framework
Sampling from Massive Graph Streams: A Unifying FrameworkNesreen K. Ahmed
 
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...Nesreen K. Ahmed
 
Sampling for Approximate Bipartite Network Projection
Sampling for Approximate Bipartite Network ProjectionSampling for Approximate Bipartite Network Projection
Sampling for Approximate Bipartite Network ProjectionNesreen K. Ahmed
 
On Sampling from Massive Graph Streams
On Sampling from Massive Graph StreamsOn Sampling from Massive Graph Streams
On Sampling from Massive Graph StreamsNesreen K. Ahmed
 
Graph Sample and Hold: A Framework for Big Graph Analytics
Graph Sample and Hold: A Framework for Big Graph AnalyticsGraph Sample and Hold: A Framework for Big Graph Analytics
Graph Sample and Hold: A Framework for Big Graph AnalyticsNesreen K. Ahmed
 
Fast Graphlet Decomposition: Theory, Algorithms, and Applications
Fast Graphlet Decomposition: Theory, Algorithms, and ApplicationsFast Graphlet Decomposition: Theory, Algorithms, and Applications
Fast Graphlet Decomposition: Theory, Algorithms, and ApplicationsNesreen K. Ahmed
 

More from Nesreen K. Ahmed (6)

Sampling from Massive Graph Streams: A Unifying Framework
Sampling from Massive Graph Streams: A Unifying FrameworkSampling from Massive Graph Streams: A Unifying Framework
Sampling from Massive Graph Streams: A Unifying Framework
 
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
 
Sampling for Approximate Bipartite Network Projection
Sampling for Approximate Bipartite Network ProjectionSampling for Approximate Bipartite Network Projection
Sampling for Approximate Bipartite Network Projection
 
On Sampling from Massive Graph Streams
On Sampling from Massive Graph StreamsOn Sampling from Massive Graph Streams
On Sampling from Massive Graph Streams
 
Graph Sample and Hold: A Framework for Big Graph Analytics
Graph Sample and Hold: A Framework for Big Graph AnalyticsGraph Sample and Hold: A Framework for Big Graph Analytics
Graph Sample and Hold: A Framework for Big Graph Analytics
 
Fast Graphlet Decomposition: Theory, Algorithms, and Applications
Fast Graphlet Decomposition: Theory, Algorithms, and ApplicationsFast Graphlet Decomposition: Theory, Algorithms, and Applications
Fast Graphlet Decomposition: Theory, Algorithms, and Applications
 

Recently uploaded

Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 

Recently uploaded (20)

Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 

Representation Learning in Large Attributed Graphs

  • 1.
  • 2. -­‐ -­‐ -­‐ -­‐ -­‐ Social  network   Human  Disease  Network   [Barabasi 2007] Food  Web  [2007] Terrorist  Network [Krebs  2002]Internet  (AS)  [2005] Gene  Regulatory  Network   [Decourty 2008] Protein  Interactions   [breast  cancer] Political  blogs Power  grid
  • 3. input 0 … 1 … 0 … Feature   Engineering features 1 … 1 … 0 0 1 0 0 Learning   AlgorithmModel Prediction  Task Link  prediction Classification   Anomaly  detection
  • 4. input 0 … 1 … 0 … Feature   Engineering features 1 … 1 … 0 0 1 0 0 Learning   AlgorithmModel Prediction  Task Automatic   Feature  Learning Link  prediction Classification   Anomaly  detection
  • 5. § Goal:  Learn  representation  (features)  for  a  set  of  graph   elements  (nodes,  edges,  etc.) § Key  intuition:  Map  the  graph  elements  (e.g.,  nodes)  to  the   d-­‐dimension  space,  while  preserving  node  similarity § Use  the  features  for  any  downstream  prediction  task
  • 6. Recent  work:  Map  nodes  based  on  their  proximity  in  the   input  graph  – (nearby  nodes  are  close  together) DeepWalk Model Perrozi et  al.  KDD  2014
  • 7. Recent  work:  Map  nodes  based  on  their  proximity  in  the   input  graph  – (nearby  nodes  are  close  together) How  to  get  nearby  nodes? Perrozi et  al.  KDD  2014 Grover  et  al.  KDD  2016
  • 8. Recent  work:  Map  nodes  based  on  their  proximity  in  the   input  graph  – (nearby  nodes  are  close  together) § A  (conditional)  walk/path  is  a  finite  sequence  of  adjacent   vertices  in  the  graph How  to  get  nearby  nodes? Perrozi et  al.  KDD  2014 Grover  et  al.  KDD  2016
  • 9. V1 V3 V4 V2 V5 The  random  walk  traversed  link  V1  -­‐-­‐-­‐ V2 Evaluating  next  step  at  node  V2
  • 10. Mikolov et  al.  ICLR  2013 Perrozi et  al.  KDD  2014 focus  vertex
  • 11.
  • 12. § No  support  for  inductive/transfer  learning • features  are  learned  for  node  identities   • features  do  not  generalize  beyond  the  input  graph § Map  nodes  based  on  their  proximity  only § No  notion  of  attributes § No  notion  of  structural  similarity
  • 13. Communities:  cohesive  subsets  of  nodes Roles:  represent  structural  patterns -­‐ two  nodes  belong  to  the  same  role  if  they’ve  similar  structural  patterns Cj# Ci# Ck# Rossi  &  Ahmed  TKDE  2015 Ahmed  et  al.  AAAI  2017
  • 14. Goal:  Find  a  mapping  of  nodes  to  d-­‐dimensions  that  preserves   proximity  and  node  similarity Using  structure  +  attributes  (if  any)
  • 16. A  (conditional)  attributed  walk  is  a  finite  sequence  of  adjacent   node  types  (words)  in  the  graph Ahmed  et.  al  2017
  • 17. The  random  walk  traversed  link                            ,   Evaluating  next  step  at  node  V2
  • 20. G1 1 G2 3 2 G3 4 G4 5 6 G5 7 8 G6 9 G7 10 11 12 G9 15 G8 13 14 Network  Motifs:  Simple  Building  Blocks  of  Complex  Networks  – [Milo  et  al.  – Science  2002] The  Structure  and  Function  of  Complex  Networks  – [Newman  – Siam  Review  2003] Applied  to  food,  biologcal,  genetic,  neural,  web,  and  other  networks
  • 21. § Predict  which  pairs  of  nodes  are  likely  to  connect § Applications: social  network  analysis,  biological  networks,   terrorist  networks,  etc.
  • 22. Deepwalk (DW)  – Perrozi et  al.  KDD  2014 node2vec    (N2V)  – Grover  et  al.  KDD  2016 LINE:  Tang  et  al.  – WWW  2015
  • 23. 1 2 4 8 12 16 0 2 4 6 8 10 12 14 16 Number of processing units Speedup socfb−MIT bio−dmela soc−gowalla tech−RL−caida web−wikipedia09 1 2 4 8 12 16 0 2 4 6 8 10 12 14 16 Number of processing units Speedup Strong  scaling  results Using  Intel  Xeon  E5-­‐2687W  server,  16  cores Motif  Counting
  • 24. § We  propose  a  generic  framework  for  learning  representation   in  large  attributed  graphs § Maps  nodes  based  on  Structural  similarity  +  proximity  +   attributes  (if  any) § Learns  universal  features  that  can  generalize  across   networks/graphs § Useful  for  inductive/transfer learning § Scalable  for  large  graphs
  • 25. § Generalizing  other  deep  graph  models § Theoretical  analysis   § Choice  of  mapping  functions § Impact  of  sampling  strategy   § Evaluation  on  other  ML  tasks
  • 26. § Efficient  estimation  of  word  representations  in  vector  space.  ICLR  2013  [Mikolov et.  al] § A  Framework  for  Generalizing  Graph-­‐based  Representation  Learning  Methods.  arXiv:1709.04596    2017  [Ahmed  et.  al] § Role  Discovery  in  Networks.  TKDE  2015  [Rossi  &  Ahmed] § A  Higher-­‐order  Latent  Space  Network  Model.  AAAI  2017  [Ahmed,  Rossi,  Willke,  Zhou] § node2vec:  Scalable  Feature  Learning  for  Networks.  KDD  2016  [Grover,  Leskovec] § DeepWalk:  online  learning  of  social  representations.  KDD  2014  [Perozzi,  Al-­‐Rafou,  Skiena] § Efficient  Graphlet Counting  for  Large  Networks.  ICDM  2015,  [Ahmed  et  al.] § Graphlet Decomposition:  Framework,  Algorithms,  and  Applications.  J.  Know.  &  Info.  2016  [Ahmed  et  al.] § Network  Motifs:  Simple  Building  Blocks  of  Complex  Networks.  Science  2002,  [Milo  et  al.] § Uncovering  Biological  Network  Function  via  Graphlet Degree  Signatures.  Cancer  Informatics  2008  [Milenković-­‐Pržulj] § Graph  Kernels.  JMLR  2010,  [Vishwanathan et  al.] § The  Structure   and  Function  of  Complex  Networks.  SIAM  Review  2003,  [Newman] § Biological  network  comparison  using  graphlet degree  distribution.  Bioinformatics  2007  [Pržulj] § Efficient  Graphlet Kernels  for  Large  Graph  Comparison.  AISTAT  2009  [Shervashidze et  al.] § Local  structure   in  social  networks.  Sociological  methodology  1976,  [Holland-­‐Leinhardt] § The  strength   of  weak  ties:  A  network  theory  revisited.  Sociological  theory 1983  [Granovetter]