SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
GASGD: Stochastic Gradient Descent for Distributed
Asynchronous Matrix Completion via Graph Partitioning
Fabio Petroni
Cyber Intelligence and information Security
CIS Sapienza
Department of Computer Control and
Management Engineering Antonio Ruberti,
Sapienza University of Rome -
petroni|querzoni@dis.uniroma1.it
Foster City, Silicon Valley
6th-10th October 2014
and Leonardo Querzoni
Matrix Completion & SGD
R
Q
x
P
?
item vector
user vector
LOSS(P, Q) =
X
(Rij Pi Qj )2
+ ...
I Stochastic Gradient Descent works by taking steps
proportional to the negative of the gradient of the LOSS.
I stochastic = P and Q are updated for each given training
case by a small step, toward the average gradient descent.
2 of 18
Scalability
7 Lengthy training stages;
7 high computational costs;
7 especially on large data sets;
7 input data may not fit in main memory.
I goal = e ciently exploit computer cluster architectures.
...
3 of 18
Distributed Asynchronous SGD
R
Q3
P3
Q4
P4
Q2
P2
Q1
P1
1 2 3 4
R1 R4
R3R2
I R is splitted;
I vectors are
replicated;
I replicas
concurrently
updated;
I replicas deviate
inconsistently;
I synchronization.
4 of 18
Bulk Synchronous Processing Model
5 of 18
Challenges 1/2
R
1 2 3 4
R1 R4
R3R2
I 1. Load balance
B ensure that computing nodes are
fed with the same load.
I 2. Minimize communication
B minimize vector replicas.
6 of 18
Challenges 2/2
...
I 3. Tune synchronization frequency
among computing nodes.
I Current implementations synchronize
vector copies:
B continuously during the epoch
(waste of resources);
B once after every epoch
(slow convergence).
I epoch = a single iteration over the ratings.
7 of 18
Contributions
3 We mitigate the load imbalance by proposing an input
slicing solution based on graph partitioning algorithms;
3 we show how to reduce the number of shared data by
properly leveraging known characteristics of the input dataset
(bipartite power-law nature);
3 we show how to leverage the tradeo↵ between
communication cost and algorithm convergence rate by
tuning the frequency of the bulk synchronization phase
during the computation.
8 of 18
Graph representation
I The rating matrix describes a bipartite graph.
x
x x
x
x
x
xx
x
x
x
x
R
I Real data: skewed power-law degree distribution.
9 of 18
Input partitioner
I vertex-cut performs better than edge-cut in power-law graphs.
I Assumption: the input
data doesn’t fit in
main memory.
I Streaming algorithm.
I Balanced k-way
vertex-cut graph
partitioning:
B minimize replicas;
B balance edge load.
10 of 18
Balanced Vertex-Cut Streaming Algorithms
I Hashing: pseudo-random edge assignment;
I Grid: shu✏e and split the rating matrix in identical blocks;
I Greedy: [Gonzalez et al. 2012] and [Ahmed et al. 2013].
Bipartite Aware Greedy Algorithm
I Real word bipartite graphs are often significantly skewed: one
of the two sets is much bigger than the other.
I By perfectly splitting the bigger set it is possible to achieve a
smaller replication factor.
I GIP (Greedy - Item Partitioned)
I GUP (Greedy - User Partitioned)
11 of 18
Evaluation: The Data Sets
Degree distributions:
100
101
102
10
3
10
4
100
101
102
103
104
105
numberofvertices
degree
items
users
MovieLens 10M
100
101
102
10
3
10
4
100
101
102
103
104
105
numberofvertices
degree
items
users
Netflix
12 of 18
Experiments: Partitioning quality
MovieLens
1
10
100
8 16 32 64 128 256
replicationfactor
partitions
hashing
grid
greedy
GIP
GUP
1
10
100
8 16 32 64 128 256
loadstandarddeviation(%)
partitions
hashing
grid
greedy
GIP
GUP
Netflix
1
10
100
8 16 32 64 128 256
replicationfactor
partitions
hashing
grid
greedy
GIP
GUP
1
10
100
8 16 32 64 128 256
loadstandarddeviation(%)
partitions
hashing
grid
greedy
GIP
GUP
I RF
Replication
Factor
I RSD
Relative
Standard
Deviation
13 of 18
Synchronization frequency
f = 1
f = 100
Netflix
5
6
7
10
20
30
40
50
60
0 5 10 15 20 25 30
loss(×10
7
)
epoch
grid
greedy
GUP
centralized
5
6
7
8
9
10
20
30
40
50
60
0 5 10 15 20 25 30
loss(×10
7
)
epoch
grid
greedy
GUP
centralized
I f = synchronization
frequency parameter
B number of
synchronization steps
during an epoch.
I tradeo↵ between
communication cost and
convergence rate.
14 of 18
Evaluation: SSE and Communication cost
MovieLens
1011
1012
10
13
10
14
10
15
100
101
102
103
SSE
frequency
grid, |C|=32
greedy, |C|=32
GUP, |C|=32
grid, |C|=8
greedy, |C|=8
GUP, |C|=8
1
2
3
4
5
6
7
8
9
10
100
101
102
103
104
105
communacationcost(×107
)
frequency
hashing
grid
greedy
GUP
0.01
0.05
0.1
0.15
0.2
0.25
101
Zoom
Netflix
1013
1014
1015
10
16
1017
100
101
102
103
SSE
frequency
grid, |C|=32
greedy, |C|=32
GUP, |C|=32
grid, |C|=8
greedy, |C|=8
GUP, |C|=8
1
2
3
4
5
6
7
8
9
10
100
101
102
103
104
105
106
communacationcost(×10
9
)
frequency
hashing
grid
greedy
GUP
0.01
0.05
0.1
0.15
0.2
10
1
Zoom
I SSE
between
ASGD
variants and
SGD curves
I CC
15 of 18
Communication cost
I T = the training set
I U = users set
I I = items set
I V = U [ I
I C = processing nodes
I RF = replication factor
I RFU = users’ RF
I RFI = items’ RF
RF =
|U|RFU + |I|RFI
|V |
f = 1 ! CC ⇡ 2(|U|RFU + |I|RFI ) = 2|V |RF
f =
|T|
|C|
! CC ⇡ |T|(RFU + RFI )
16 of 18
Conclusions
I three distinct contributions aimed at improving the e ciency
and scalability of ASGD:
1. we proposed an input slicing solution based on graph
partitioning approach that mitigates the load imbalance
among SGD instances (i.e. better scalability);
2. we further reduce the amount of shared data by exploiting
specific characteristics of the training dataset. This provides
lower communication costs during the algorithm execution
(i.e. better e ciency);
3. we introduced a synchronization frequency parameter driving
a tradeo↵ that can be accurately leveraged to further improve
the algorithm e ciency.
17 of 18
Thank you!
Questions?
Fabio Petroni
Rome, Italy
Current position:
PhD Student in Computer Engineering, Sapienza University of Rome
Research Areas:
Recommendation Systems, Collaborative Filtering, Distributed Systems
petroni@dis.uniroma1.it
21 of 21

Contenu connexe

Tendances

IRJET- Comparison for Max-Flow Min-Cut Algorithms for Optimal Assignment Problem
IRJET- Comparison for Max-Flow Min-Cut Algorithms for Optimal Assignment ProblemIRJET- Comparison for Max-Flow Min-Cut Algorithms for Optimal Assignment Problem
IRJET- Comparison for Max-Flow Min-Cut Algorithms for Optimal Assignment ProblemIRJET Journal
 
The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)James McMurray
 
Paper id 37201520
Paper id 37201520Paper id 37201520
Paper id 37201520IJRAT
 
Traffic sign classification
Traffic sign classificationTraffic sign classification
Traffic sign classificationBill Kromydas
 
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLABFAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLABJournal For Research
 
Colfax-Winograd-Summary _final (1)
Colfax-Winograd-Summary _final (1)Colfax-Winograd-Summary _final (1)
Colfax-Winograd-Summary _final (1)Sangamesh Ragate
 
CFM Challenge - Course Project
CFM Challenge - Course ProjectCFM Challenge - Course Project
CFM Challenge - Course ProjectKhalilBergaoui
 
Size measurement and estimation
Size measurement and estimationSize measurement and estimation
Size measurement and estimationLouis A. Poulin
 
Analysis of Impact of Graph Theory in Computer Application
Analysis of Impact of Graph Theory in Computer ApplicationAnalysis of Impact of Graph Theory in Computer Application
Analysis of Impact of Graph Theory in Computer ApplicationIRJET Journal
 
Graph Matching
Graph MatchingGraph Matching
Graph Matchinggraphitech
 
Cari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufoCari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufoMokhtar SELLAMI
 
Sigmod11 outsource shortest path
Sigmod11 outsource shortest pathSigmod11 outsource shortest path
Sigmod11 outsource shortest pathredhatdb
 

Tendances (20)

IRJET- Comparison for Max-Flow Min-Cut Algorithms for Optimal Assignment Problem
IRJET- Comparison for Max-Flow Min-Cut Algorithms for Optimal Assignment ProblemIRJET- Comparison for Max-Flow Min-Cut Algorithms for Optimal Assignment Problem
IRJET- Comparison for Max-Flow Min-Cut Algorithms for Optimal Assignment Problem
 
The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)
 
Fourier Transform Assignment Help
Fourier Transform Assignment HelpFourier Transform Assignment Help
Fourier Transform Assignment Help
 
Rn d presentation_gurulingannk
Rn d presentation_gurulingannkRn d presentation_gurulingannk
Rn d presentation_gurulingannk
 
03 hdrf presentation
03 hdrf presentation03 hdrf presentation
03 hdrf presentation
 
Paper id 37201520
Paper id 37201520Paper id 37201520
Paper id 37201520
 
Traffic sign classification
Traffic sign classificationTraffic sign classification
Traffic sign classification
 
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLABFAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
 
CLIM Program: Remote Sensing Workshop, Statistical Emulation with Dimension R...
CLIM Program: Remote Sensing Workshop, Statistical Emulation with Dimension R...CLIM Program: Remote Sensing Workshop, Statistical Emulation with Dimension R...
CLIM Program: Remote Sensing Workshop, Statistical Emulation with Dimension R...
 
Neural Style Transfer in practice
Neural Style Transfer in practiceNeural Style Transfer in practice
Neural Style Transfer in practice
 
Colfax-Winograd-Summary _final (1)
Colfax-Winograd-Summary _final (1)Colfax-Winograd-Summary _final (1)
Colfax-Winograd-Summary _final (1)
 
CFM Challenge - Course Project
CFM Challenge - Course ProjectCFM Challenge - Course Project
CFM Challenge - Course Project
 
Size measurement and estimation
Size measurement and estimationSize measurement and estimation
Size measurement and estimation
 
Slide1
Slide1Slide1
Slide1
 
Computation Assignment Help
Computation Assignment Help Computation Assignment Help
Computation Assignment Help
 
Analysis of Impact of Graph Theory in Computer Application
Analysis of Impact of Graph Theory in Computer ApplicationAnalysis of Impact of Graph Theory in Computer Application
Analysis of Impact of Graph Theory in Computer Application
 
Graph Matching
Graph MatchingGraph Matching
Graph Matching
 
Cari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufoCari presentation maurice-tchoupe-joskelngoufo
Cari presentation maurice-tchoupe-joskelngoufo
 
Unit 3
Unit 3Unit 3
Unit 3
 
Sigmod11 outsource shortest path
Sigmod11 outsource shortest pathSigmod11 outsource shortest path
Sigmod11 outsource shortest path
 

En vedette

CORE: Context-Aware Open Relation Extraction with Factorization Machines
CORE: Context-Aware Open Relation Extraction with Factorization MachinesCORE: Context-Aware Open Relation Extraction with Factorization Machines
CORE: Context-Aware Open Relation Extraction with Factorization MachinesFabio Petroni, PhD
 
HSIENA: a hybrid publish/subscribe system
HSIENA: a hybrid publish/subscribe systemHSIENA: a hybrid publish/subscribe system
HSIENA: a hybrid publish/subscribe systemFabio Petroni, PhD
 
Machine Learning Methods for Parameter Acquisition in a Human ...
Machine Learning Methods for Parameter Acquisition in a Human ...Machine Learning Methods for Parameter Acquisition in a Human ...
Machine Learning Methods for Parameter Acquisition in a Human ...butest
 
MALT: Distributed Data-Parallelism for Existing ML Applications (Distributed ...
MALT: Distributed Data-Parallelism for Existing ML Applications (Distributed ...MALT: Distributed Data-Parallelism for Existing ML Applications (Distributed ...
MALT: Distributed Data-Parallelism for Existing ML Applications (Distributed ...asimkadav
 
Spark Summit EU talk by Rolf Jagerman
Spark Summit EU talk by Rolf JagermanSpark Summit EU talk by Rolf Jagerman
Spark Summit EU talk by Rolf JagermanSpark Summit
 
Scalable Deep Learning Platform On Spark In Baidu
Scalable Deep Learning Platform On Spark In BaiduScalable Deep Learning Platform On Spark In Baidu
Scalable Deep Learning Platform On Spark In BaiduJen Aman
 
Challenges on Distributed Machine Learning
Challenges on Distributed Machine LearningChallenges on Distributed Machine Learning
Challenges on Distributed Machine Learningjie cao
 

En vedette (7)

CORE: Context-Aware Open Relation Extraction with Factorization Machines
CORE: Context-Aware Open Relation Extraction with Factorization MachinesCORE: Context-Aware Open Relation Extraction with Factorization Machines
CORE: Context-Aware Open Relation Extraction with Factorization Machines
 
HSIENA: a hybrid publish/subscribe system
HSIENA: a hybrid publish/subscribe systemHSIENA: a hybrid publish/subscribe system
HSIENA: a hybrid publish/subscribe system
 
Machine Learning Methods for Parameter Acquisition in a Human ...
Machine Learning Methods for Parameter Acquisition in a Human ...Machine Learning Methods for Parameter Acquisition in a Human ...
Machine Learning Methods for Parameter Acquisition in a Human ...
 
MALT: Distributed Data-Parallelism for Existing ML Applications (Distributed ...
MALT: Distributed Data-Parallelism for Existing ML Applications (Distributed ...MALT: Distributed Data-Parallelism for Existing ML Applications (Distributed ...
MALT: Distributed Data-Parallelism for Existing ML Applications (Distributed ...
 
Spark Summit EU talk by Rolf Jagerman
Spark Summit EU talk by Rolf JagermanSpark Summit EU talk by Rolf Jagerman
Spark Summit EU talk by Rolf Jagerman
 
Scalable Deep Learning Platform On Spark In Baidu
Scalable Deep Learning Platform On Spark In BaiduScalable Deep Learning Platform On Spark In Baidu
Scalable Deep Learning Platform On Spark In Baidu
 
Challenges on Distributed Machine Learning
Challenges on Distributed Machine LearningChallenges on Distributed Machine Learning
Challenges on Distributed Machine Learning
 

Similaire à GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Completion via Graph Partitioning

A simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsA simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsDevansh16
 
Graphical Model Selection for Big Data
Graphical Model Selection for Big DataGraphical Model Selection for Big Data
Graphical Model Selection for Big DataAlexander Jung
 
13th Athens Big Data Meetup - 2nd Talk - Training Neural Networks With Enterp...
13th Athens Big Data Meetup - 2nd Talk - Training Neural Networks With Enterp...13th Athens Big Data Meetup - 2nd Talk - Training Neural Networks With Enterp...
13th Athens Big Data Meetup - 2nd Talk - Training Neural Networks With Enterp...Athens Big Data
 
VJAI Paper Reading#3-KDD2019-ClusterGCN
VJAI Paper Reading#3-KDD2019-ClusterGCNVJAI Paper Reading#3-KDD2019-ClusterGCN
VJAI Paper Reading#3-KDD2019-ClusterGCNDat Nguyen
 
Learning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLLearning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLlauratoni4
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Laplacian-regularized Graph Bandits
Laplacian-regularized Graph BanditsLaplacian-regularized Graph Bandits
Laplacian-regularized Graph Banditslauratoni4
 
Stochastic Computing Correlation Utilization in Convolutional Neural Network ...
Stochastic Computing Correlation Utilization in Convolutional Neural Network ...Stochastic Computing Correlation Utilization in Convolutional Neural Network ...
Stochastic Computing Correlation Utilization in Convolutional Neural Network ...TELKOMNIKA JOURNAL
 
RDataMining slides-regression-classification
RDataMining slides-regression-classificationRDataMining slides-regression-classification
RDataMining slides-regression-classificationYanchang Zhao
 
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...IJERD Editor
 
Multimedia Security - JPEG Artifact details
Multimedia Security - JPEG Artifact detailsMultimedia Security - JPEG Artifact details
Multimedia Security - JPEG Artifact detailsSebastiano Battiato
 
Clustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofClustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofijfcstjournal
 

Similaire à GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Completion via Graph Partitioning (20)

2019 GDRR: Blockchain Data Analytics - Dissecting Blockchain Price Analytics...
2019 GDRR: Blockchain Data Analytics  - Dissecting Blockchain Price Analytics...2019 GDRR: Blockchain Data Analytics  - Dissecting Blockchain Price Analytics...
2019 GDRR: Blockchain Data Analytics - Dissecting Blockchain Price Analytics...
 
A simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representationsA simple framework for contrastive learning of visual representations
A simple framework for contrastive learning of visual representations
 
Graphical Model Selection for Big Data
Graphical Model Selection for Big DataGraphical Model Selection for Big Data
Graphical Model Selection for Big Data
 
T24144148
T24144148T24144148
T24144148
 
Presentation
PresentationPresentation
Presentation
 
13th Athens Big Data Meetup - 2nd Talk - Training Neural Networks With Enterp...
13th Athens Big Data Meetup - 2nd Talk - Training Neural Networks With Enterp...13th Athens Big Data Meetup - 2nd Talk - Training Neural Networks With Enterp...
13th Athens Big Data Meetup - 2nd Talk - Training Neural Networks With Enterp...
 
VJAI Paper Reading#3-KDD2019-ClusterGCN
VJAI Paper Reading#3-KDD2019-ClusterGCNVJAI Paper Reading#3-KDD2019-ClusterGCN
VJAI Paper Reading#3-KDD2019-ClusterGCN
 
A detailed analysis of the supervised machine Learning Algorithms
A detailed analysis of the supervised machine Learning AlgorithmsA detailed analysis of the supervised machine Learning Algorithms
A detailed analysis of the supervised machine Learning Algorithms
 
Fcm1
Fcm1Fcm1
Fcm1
 
Fcm1
Fcm1Fcm1
Fcm1
 
Learning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLLearning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RL
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Laplacian-regularized Graph Bandits
Laplacian-regularized Graph BanditsLaplacian-regularized Graph Bandits
Laplacian-regularized Graph Bandits
 
Stochastic Computing Correlation Utilization in Convolutional Neural Network ...
Stochastic Computing Correlation Utilization in Convolutional Neural Network ...Stochastic Computing Correlation Utilization in Convolutional Neural Network ...
Stochastic Computing Correlation Utilization in Convolutional Neural Network ...
 
An35225228
An35225228An35225228
An35225228
 
RDataMining slides-regression-classification
RDataMining slides-regression-classificationRDataMining slides-regression-classification
RDataMining slides-regression-classification
 
G04654247
G04654247G04654247
G04654247
 
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
A Blind Steganalysis on JPEG Gray Level Image Based on Statistical Features a...
 
Multimedia Security - JPEG Artifact details
Multimedia Security - JPEG Artifact detailsMultimedia Security - JPEG Artifact details
Multimedia Security - JPEG Artifact details
 
Clustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining ofClustbigfim frequent itemset mining of
Clustbigfim frequent itemset mining of
 

Dernier

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 

Dernier (20)

(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 

GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Completion via Graph Partitioning

  • 1. GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Completion via Graph Partitioning Fabio Petroni Cyber Intelligence and information Security CIS Sapienza Department of Computer Control and Management Engineering Antonio Ruberti, Sapienza University of Rome - petroni|querzoni@dis.uniroma1.it Foster City, Silicon Valley 6th-10th October 2014 and Leonardo Querzoni
  • 2. Matrix Completion & SGD R Q x P ? item vector user vector LOSS(P, Q) = X (Rij Pi Qj )2 + ... I Stochastic Gradient Descent works by taking steps proportional to the negative of the gradient of the LOSS. I stochastic = P and Q are updated for each given training case by a small step, toward the average gradient descent. 2 of 18
  • 3. Scalability 7 Lengthy training stages; 7 high computational costs; 7 especially on large data sets; 7 input data may not fit in main memory. I goal = e ciently exploit computer cluster architectures. ... 3 of 18
  • 4. Distributed Asynchronous SGD R Q3 P3 Q4 P4 Q2 P2 Q1 P1 1 2 3 4 R1 R4 R3R2 I R is splitted; I vectors are replicated; I replicas concurrently updated; I replicas deviate inconsistently; I synchronization. 4 of 18
  • 6. Challenges 1/2 R 1 2 3 4 R1 R4 R3R2 I 1. Load balance B ensure that computing nodes are fed with the same load. I 2. Minimize communication B minimize vector replicas. 6 of 18
  • 7. Challenges 2/2 ... I 3. Tune synchronization frequency among computing nodes. I Current implementations synchronize vector copies: B continuously during the epoch (waste of resources); B once after every epoch (slow convergence). I epoch = a single iteration over the ratings. 7 of 18
  • 8. Contributions 3 We mitigate the load imbalance by proposing an input slicing solution based on graph partitioning algorithms; 3 we show how to reduce the number of shared data by properly leveraging known characteristics of the input dataset (bipartite power-law nature); 3 we show how to leverage the tradeo↵ between communication cost and algorithm convergence rate by tuning the frequency of the bulk synchronization phase during the computation. 8 of 18
  • 9. Graph representation I The rating matrix describes a bipartite graph. x x x x x x xx x x x x R I Real data: skewed power-law degree distribution. 9 of 18
  • 10. Input partitioner I vertex-cut performs better than edge-cut in power-law graphs. I Assumption: the input data doesn’t fit in main memory. I Streaming algorithm. I Balanced k-way vertex-cut graph partitioning: B minimize replicas; B balance edge load. 10 of 18
  • 11. Balanced Vertex-Cut Streaming Algorithms I Hashing: pseudo-random edge assignment; I Grid: shu✏e and split the rating matrix in identical blocks; I Greedy: [Gonzalez et al. 2012] and [Ahmed et al. 2013]. Bipartite Aware Greedy Algorithm I Real word bipartite graphs are often significantly skewed: one of the two sets is much bigger than the other. I By perfectly splitting the bigger set it is possible to achieve a smaller replication factor. I GIP (Greedy - Item Partitioned) I GUP (Greedy - User Partitioned) 11 of 18
  • 12. Evaluation: The Data Sets Degree distributions: 100 101 102 10 3 10 4 100 101 102 103 104 105 numberofvertices degree items users MovieLens 10M 100 101 102 10 3 10 4 100 101 102 103 104 105 numberofvertices degree items users Netflix 12 of 18
  • 13. Experiments: Partitioning quality MovieLens 1 10 100 8 16 32 64 128 256 replicationfactor partitions hashing grid greedy GIP GUP 1 10 100 8 16 32 64 128 256 loadstandarddeviation(%) partitions hashing grid greedy GIP GUP Netflix 1 10 100 8 16 32 64 128 256 replicationfactor partitions hashing grid greedy GIP GUP 1 10 100 8 16 32 64 128 256 loadstandarddeviation(%) partitions hashing grid greedy GIP GUP I RF Replication Factor I RSD Relative Standard Deviation 13 of 18
  • 14. Synchronization frequency f = 1 f = 100 Netflix 5 6 7 10 20 30 40 50 60 0 5 10 15 20 25 30 loss(×10 7 ) epoch grid greedy GUP centralized 5 6 7 8 9 10 20 30 40 50 60 0 5 10 15 20 25 30 loss(×10 7 ) epoch grid greedy GUP centralized I f = synchronization frequency parameter B number of synchronization steps during an epoch. I tradeo↵ between communication cost and convergence rate. 14 of 18
  • 15. Evaluation: SSE and Communication cost MovieLens 1011 1012 10 13 10 14 10 15 100 101 102 103 SSE frequency grid, |C|=32 greedy, |C|=32 GUP, |C|=32 grid, |C|=8 greedy, |C|=8 GUP, |C|=8 1 2 3 4 5 6 7 8 9 10 100 101 102 103 104 105 communacationcost(×107 ) frequency hashing grid greedy GUP 0.01 0.05 0.1 0.15 0.2 0.25 101 Zoom Netflix 1013 1014 1015 10 16 1017 100 101 102 103 SSE frequency grid, |C|=32 greedy, |C|=32 GUP, |C|=32 grid, |C|=8 greedy, |C|=8 GUP, |C|=8 1 2 3 4 5 6 7 8 9 10 100 101 102 103 104 105 106 communacationcost(×10 9 ) frequency hashing grid greedy GUP 0.01 0.05 0.1 0.15 0.2 10 1 Zoom I SSE between ASGD variants and SGD curves I CC 15 of 18
  • 16. Communication cost I T = the training set I U = users set I I = items set I V = U [ I I C = processing nodes I RF = replication factor I RFU = users’ RF I RFI = items’ RF RF = |U|RFU + |I|RFI |V | f = 1 ! CC ⇡ 2(|U|RFU + |I|RFI ) = 2|V |RF f = |T| |C| ! CC ⇡ |T|(RFU + RFI ) 16 of 18
  • 17. Conclusions I three distinct contributions aimed at improving the e ciency and scalability of ASGD: 1. we proposed an input slicing solution based on graph partitioning approach that mitigates the load imbalance among SGD instances (i.e. better scalability); 2. we further reduce the amount of shared data by exploiting specific characteristics of the training dataset. This provides lower communication costs during the algorithm execution (i.e. better e ciency); 3. we introduced a synchronization frequency parameter driving a tradeo↵ that can be accurately leveraged to further improve the algorithm e ciency. 17 of 18
  • 18. Thank you! Questions? Fabio Petroni Rome, Italy Current position: PhD Student in Computer Engineering, Sapienza University of Rome Research Areas: Recommendation Systems, Collaborative Filtering, Distributed Systems petroni@dis.uniroma1.it 21 of 21