Graph Neural Networks (GNNs) Meet Reinforcement
Learning (RL)
05/01/2021
Graph Neural Networks (GNNs)
Kipf, Thomas N., and Max Welling. "Semi-supervised classification with graph
convolutional networks." arXiv preprint arXiv:1609.02907 (2016).
● Target real-world problems whose underlying graph
structures are non-Euclidean.
○ Euclidean data is data that is sensibly modelled as
being plotted in n-dimensional linear space, e.g.,
images (where the x and y coordinates refer to the
location of each pixel, and the z coordinate refers to its
colour/intensity).
○ Non-Euclidean data carries information in its edges,
in the values associated with nodes, or in the
directionality of edges.
Graph Neural Networks (GNNs)
Tasks
Graph Neural Networks (GNNs)
Neural Networks Forward Propagation
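● $H^{[i+1]} = \sigma(W^{[i]} H^{[i]} + b^{[i]})$
● $W^{[i]}$ is the weight matrix, $H^{[i]}$ is the previous hidden state, $b^{[i]}$ is the bias, and
$\sigma$ is the nonlinear activation function.
● When i = 0, $H^{[0]} = X$, where X is the input.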
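The forward propagation rule described above, H[i+1] = sigma(W[i] H[i] + b[i]) with H[0] = X, can be sketched in NumPy as follows. This is a minimal illustration, not code from the slides: the layer sizes, the ReLU choice for sigma, and the row-vector convention (`H @ W` instead of `W H`) are assumptions.

```python
import numpy as np

def relu(z):
    """One common choice for the nonlinearity sigma (an assumption here)."""
    return np.maximum(z, 0.0)

def forward(X, weights, biases, sigma=relu):
    """Apply H[i+1] = sigma(H[i] @ W[i] + b[i]) layer by layer, with H[0] = X."""
    H = X
    for W, b in zip(weights, biases):
        H = sigma(H @ W + b)
    return H

# Hypothetical sizes: 4 samples, 3 input features, hidden width 5, output width 2.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 3))
weights = [rng.normal(size=(3, 5)), rng.normal(size=(5, 2))]
biases = [np.zeros(5), np.zeros(2)]

out = forward(X, weights, biases)
print(out.shape)
```

Each row of `X` is treated as one sample, so the weight matrices multiply from the right.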
Graph Neural Networks
● Sparse.
● Flexible connection.
Graph Neural Networks (GNNs)
Definitions
● Adjacency matrix (A): an N x N matrix filled
with either 0 or 1, where N is the total number
of nodes.
● Node attributes matrix (X): represents the
features or attributes of each node. If there are
N nodes and the size of node attributes is F,
then the shape of this matrix is N x F.
● Edge attributes matrix (E): edges can also have
their own attributes.
Adjacency matrix
Node attributes matrix (6 x 6)
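As a small illustration of these definitions, the sketch below builds A and X for a toy graph. The edge list, N = 4, and F = 2 are made-up values chosen only for the example, and the graph is assumed to be undirected.

```python
import numpy as np

# Hypothetical undirected graph with N = 4 nodes.
edges = [(0, 1), (0, 2), (1, 2), (2, 3)]
N = 4

# Adjacency matrix A: N x N, entry 1 where an edge exists, 0 otherwise.
A = np.zeros((N, N), dtype=int)
for u, v in edges:
    A[u, v] = 1
    A[v, u] = 1  # symmetric because the graph is undirected

# Node attributes matrix X: N x F, one row of F features per node.
F = 2
X = np.arange(N * F, dtype=float).reshape(N, F)

# Row sums of A give the degree of each node.
degrees = A.sum(axis=1)
print(A)
print(X.shape, degrees)
```

For a directed graph, only `A[u, v]` would be set, and A would no longer be symmetric.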
Graph Neural Networks (GNNs)
Graph
● G = (V, E) denotes a graph.
● Node feature vectors X_v for v in V.
Graph Neural Networks[1]
● Step 1: aggregate information from neighbors.
● Step 2: combine the encoded neighbors' information with the node's
own information.
● Step 3: use a READOUT function to extract the
graph-level representation.
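The three steps above can be sketched as a single message-passing layer followed by a readout. This is one possible instantiation under stated assumptions: mean aggregation, a linear combine with ReLU, and a sum readout are illustrative choices, and the names `message_passing_layer`, `W_self`, and `W_neigh` are hypothetical.

```python
import numpy as np

def message_passing_layer(A, H, W_self, W_neigh):
    """One layer: aggregate neighbor features, then combine with own features."""
    deg = A.sum(axis=1, keepdims=True)
    # Step 1: aggregate (mean over neighbors; isolated nodes get zeros).
    neigh_mean = (A @ H) / np.maximum(deg, 1)
    # Step 2: combine the aggregated message with the node's own features.
    return np.maximum(H @ W_self + neigh_mean @ W_neigh, 0.0)

def readout(H):
    """Step 3: sum node embeddings into a single graph-level vector."""
    return H.sum(axis=0)

# Toy star graph: node 0 is connected to nodes 1 and 2.
A = np.array([[0, 1, 1],
              [1, 0, 0],
              [1, 0, 0]], dtype=float)
X = np.eye(3)  # one-hot node features
rng = np.random.default_rng(0)
W_self = rng.normal(size=(3, 4))
W_neigh = rng.normal(size=(3, 4))

H1 = message_passing_layer(A, X, W_self, W_neigh)
g = readout(H1)
print(H1.shape, g.shape)
```

Because the readout is a sum, relabeling the nodes leaves the graph-level vector unchanged.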
Graph Convolutional Neural Network (GCN)
Kipf, Thomas N., and Max Welling. "Semi-supervised classification with graph
convolutional networks." arXiv preprint arXiv:1609.02907 (2016).
Aggregator + Combine:
Representation from the paper
●
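For reference, the layer-wise propagation rule in the Kipf and Welling paper is H^(l+1) = sigma(D~^(-1/2) A~ D~^(-1/2) H^(l) W^(l)), where A~ = A + I adds self-loops and D~ is the degree matrix of A~. The sketch below implements that rule in NumPy; the toy graph, the feature sizes, and the ReLU nonlinearity are assumptions made for illustration.

```python
import numpy as np

def normalize_adjacency(A):
    """Return D~^(-1/2) (A + I) D~^(-1/2), the symmetrically normalized adjacency."""
    A_tilde = A + np.eye(A.shape[0])   # add self-loops
    d = A_tilde.sum(axis=1)            # degrees including self-loops (always >= 1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    return D_inv_sqrt @ A_tilde @ D_inv_sqrt

def gcn_layer(A, H, W):
    """One GCN layer: aggregate and combine in a single normalized matrix product."""
    return np.maximum(normalize_adjacency(A) @ H @ W, 0.0)

# Toy path graph 0 - 1 - 2 with hypothetical feature sizes.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
rng = np.random.default_rng(0)
H0 = rng.normal(size=(3, 4))
W0 = rng.normal(size=(4, 2))

H1 = gcn_layer(A, H0, W0)
print(H1.shape)
```

The self-loops are what fold the "combine" step into the aggregation: each node's own features enter the same weighted sum as its neighbors' features.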
GraphSAGE
Hamilton, William L., Rex Ying, and Jure Leskovec. "Inductive representation
learning on large graphs." arXiv preprint arXiv:1706.02216 (2017).
● Aggregator:
● Combine:
Representation from the paper
● Different aggregators:
○ Mean, LSTM, Pooling.
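A GraphSAGE-style layer with the mean aggregator can be sketched as below: aggregate the neighbors' vectors, concatenate the result with the node's own vector, apply a weight matrix and nonlinearity, then L2-normalize. The neighbor dictionary, sizes, and function names are hypothetical, and neighbor sampling, which the paper uses on large graphs, is omitted.

```python
import numpy as np

def mean_aggregate(H, nbrs):
    """Mean aggregator: average the neighbors' feature vectors."""
    return H[nbrs].mean(axis=0) if nbrs else np.zeros(H.shape[1])

def sage_layer(neighbors, H, W, aggregate=mean_aggregate):
    """Aggregate, then combine by concatenation: relu(concat(h_v, h_N(v)) @ W)."""
    rows = []
    for v in range(H.shape[0]):
        h_neigh = aggregate(H, neighbors.get(v, []))
        h_cat = np.concatenate([H[v], h_neigh])
        rows.append(np.maximum(h_cat @ W, 0.0))
    H_new = np.stack(rows)
    # L2-normalize each node embedding (guarding against all-zero rows).
    norms = np.linalg.norm(H_new, axis=1, keepdims=True)
    return H_new / np.maximum(norms, 1e-12)

# Hypothetical neighbor lists; node 3 is isolated.
neighbors = {0: [1, 2], 1: [0], 2: [0], 3: []}
rng = np.random.default_rng(0)
H = rng.normal(size=(4, 3))
W = rng.normal(size=(6, 5))   # input width is 2 * 3 because of the concatenation

H_next = sage_layer(neighbors, H, W)
print(H_next.shape)
```

A pooling or LSTM aggregator would be passed in through the `aggregate` argument, with the rest of the layer unchanged.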
Literature Review
Dong, Jiqian, et al. "A DRL-based Multiagent
Cooperative Control Framework for CAV
Networks: a Graphic Convolution Q Network."
arXiv preprint arXiv:2010.05437 (2020).
● Combines a graph convolutional network (GCN)
with a Deep Q-Network (DQN)
to solve the lane-changing problem in mixed
traffic with multiple autonomous vehicles (AVs).
● Off-policy Q-learning.
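As a reminder of what off-policy Q-learning optimizes, the sketch below computes the standard DQN temporal-difference target y = r + gamma * max_a' Q_target(s', a'). It is generic and not the cited paper's implementation: in a GCN-based agent the Q-values would come from a graph network over the vehicles, whereas here `next_q_target` is simply a made-up array.

```python
import numpy as np

def td_targets(rewards, next_q_target, dones, gamma=0.99):
    """DQN target: r + gamma * max_a' Q_target(s', a'), with no bootstrap at terminal states."""
    return rewards + gamma * (1.0 - dones) * next_q_target.max(axis=1)

# Hypothetical batch of 3 transitions with 2 actions each.
rewards = np.array([1.0, 0.0, -1.0])
next_q_target = np.array([[0.5, 2.0],
                          [1.0, 0.0],
                          [3.0, 4.0]])
dones = np.array([0.0, 0.0, 1.0])   # the last transition ends the episode

y = td_targets(rewards, next_q_target, dones, gamma=0.9)
print(y)
```

The method is off-policy because the max over next actions does not depend on which action the behavior policy actually took, so transitions can be replayed from a buffer.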
Attention (Self-attention)
● Self-attention, also known as intra-attention, is an attention
mechanism relating different positions of a single sequence in
order to compute a representation of the same sequence.
Attention (self-attention, intra-attention)
● Different attention mechanisms.
Attention (self-attention, intra-attention)
● Originally developed for machine translation.
Attention (self-attention, intra-attention)
● Learnable attention matrices: W_Q, W_K, and W_V.
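A minimal sketch of scaled dot-product self-attention with the three learnable matrices is given below. The sequence length, model width, and random initialization are assumptions; in a trained model W_Q, W_K, and W_V would be learned by gradient descent.

```python
import numpy as np

def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, W_Q, W_K, W_V):
    """Every position attends to every position of the same sequence."""
    Q, K, V = X @ W_Q, X @ W_K, X @ W_V
    scores = Q @ K.T / np.sqrt(K.shape[-1])   # scaled dot products
    weights = softmax(scores, axis=-1)        # each row sums to 1
    return weights @ V, weights

# Hypothetical sequence of 5 positions with width 8, projected to width 4.
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 8))
W_Q, W_K, W_V = (rng.normal(size=(8, 4)) for _ in range(3))

out, weights = self_attention(X, W_Q, W_K, W_V)
print(out.shape, weights.shape)
```

Row i of `weights` says how much position i draws from each other position when forming its new representation.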
Previous Works
● Iqbal, Shariq, and Fei Sha. "Actor-attention-critic for multi-agent
reinforcement learning." International Conference on Machine Learning.
PMLR, 2019.
The attention critic
dynamically selects which agents to
attend to at each time point during
training, improving performance in
multi-agent domains with complex
interactions.
Graph Convolutional Reinforcement Learning (DGN)
Jiang, Jiechuan, et al. "Graph convolutional
reinforcement learning." arXiv preprint
arXiv:1810.09202 (2018).
● Graph convolution adapts to the dynamics of
the underlying graph of the multi-agent
environment, and relation kernels capture the
interplay between agents by their relation
representations.
● Uses self-attention to extract features.
● Off-policy Q-learning.
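One way to picture self-attention over a multi-agent graph is to mask the attention scores with the adjacency matrix, so each agent only attends to itself and its current neighbors. The sketch below is an illustrative single-head version under that assumption, not the paper's exact relation kernel; the toy graph and all sizes are made up.

```python
import numpy as np

def neighbor_attention(A, H, W_Q, W_K, W_V):
    """Single-head attention where agent i only attends to itself and its neighbors."""
    N = A.shape[0]
    mask = (A + np.eye(N)) > 0                 # allowed (agent, neighbor) pairs
    Q, K, V = H @ W_Q, H @ W_K, H @ W_V
    scores = Q @ K.T / np.sqrt(K.shape[1])
    scores = np.where(mask, scores, -np.inf)   # block non-neighbors
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)                   # exp(-inf) = 0 for blocked pairs
    weights = weights / weights.sum(axis=1, keepdims=True)
    return weights @ V, weights

# Toy agent graph 0 - 1 - 2: agents 0 and 2 are not neighbors.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
rng = np.random.default_rng(0)
H = rng.normal(size=(3, 4))
W_Q, W_K, W_V = (rng.normal(size=(4, 4)) for _ in range(3))

out, weights = neighbor_attention(A, H, W_Q, W_K, W_V)
print(weights.round(3))
```

When the environment changes which agents are adjacent, only `A` changes; the learned matrices stay the same, which is how such a layer can adapt to a dynamic graph.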
GAT on Autonomous Driving
Wang, Jiawei, et al. "Multi-agent Graph Reinforcement Learning for Connected
Automated Driving", 2020
● On-policy PPO.
● Lane-following.
References
1. Dong, Jiqian, et al. "A DRL-based Multiagent Cooperative Control Framework for CAV Networks: a
Graphic Convolution Q Network." arXiv preprint arXiv:2010.05437 (2020).
2. Gupta, Jayesh K., Maxim Egorov, and Mykel Kochenderfer. "Cooperative multi-agent control using
deep reinforcement learning." International Conference on Autonomous Agents and Multiagent
Systems. Springer, Cham, 2017.
3. Lin, Kaixiang, et al. "Efficient large-scale fleet management via multi-agent deep reinforcement
learning." Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery
& Data Mining. 2018.
4. https://ai.stackexchange.com/questions/11226/what-is-non-euclidean-data#:~:text=Since%20Euclidean%20spaces%20are%20prototypically,coordinate%20refers%20to%20its%20colour%2F
5. https://towardsdatascience.com/understanding-graph-convolutional-networks-for-node-classification-a2bfdb7aba7b
6. Kipf, Thomas N., and Max Welling. "Semi-supervised classification with graph convolutional
networks." arXiv preprint arXiv:1609.02907 (2016).
Thanks!
