SlideShare a Scribd company logo
1 of 22
Gossip-based Partitioning
and Replication
Middle-ware for
Online Social Networks
Muhammad Anis Uddin Nasir
(EMDC/ICT/LCN)
Supervisor: Šarūnas Girdzijauskas
Examiner: Johan Montelius
Online Social Networks
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
•Vertices •Edges •Metadata
Ioanna Antonio Vaidas
Aras
Vasia
Anis
Mudit
Manos
2
LeandroJohan
Existing Solutions
• Relational Databases
- MySQL Cluster
• Key-Value stores
- Cassandra, Amazon Dynamo
• Document Databases
- MongoDB, CouchDB
• Graph Databases
- Neo4j, Titans
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 3
Why Existing Solutions are not
enough?
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
4
Why Existing Solutions are not
enough?
• Random Partitioning
• Social Request
- E.g., gather new feeds
from all the friends
• Enforcing Data
Locality
• Random partitioning
can lead to full
replication!
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
1 4 7 82 3 5 6 10 9
1’ 4’ 7’ 8’ 9’ 2’ 3’ 6’5’ 10’
5
Social Graphs are not Random
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 6
Graph Partitioning
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 7
JA-BE-JA- edge-cut
8/27/2013
Muhammad Anis Uddin Nasir- Gossip-based Partitioning and
Replication Middle-ware
Server A Server B
6
3
5
2
1
4
76’
3’
1’
4’
7’
• Edge Cut = 3 links, 3+2=5 replicas to
maintain
8
SPAR- Minimizing Replicas
8/27/2013
Muhammad Anis Uddin Nasir- Gossip-based Partitioning and
Replication Middle-ware
Server A Server B
6
3
5
2
1
4
76’
3’2’
5’
• Edge Cut = 4 links, 2+2=4 replicas to
maintain
9
Initialization
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
1 4 7 82 3 5 6 10 9
1’ 4’ 7’ 8’ 9’ 2’ 3’ 6’5’ 10’
• Node Addition
- Assign it to server with
minimum master
• Edge Addition
- Check if Nodes are Local
- Else create replicas to
maintain locality
10
Gossip Phase
• Cost Function
- Count number of replicas
- For current and new server
• Peer Selection
- Local, Random, Hybrid
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
1 4 7 82 3 5 6 10 9
1’ 4’ 7’ 8’ 9’ 5’ 10’
11
2’ 3’ 6’
Gossip Phase
• Cost Function
- Count number of replicas
- For existing and new server
• Peer Selection
- Local, Random, Hybrid
• Simulated Annealing
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
6 4 7 82 3 5 1 10 9
4’ 8’ 9’ 3’ 5’ 10’6’ 1’
4 10
12
Simulated Annealing
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 13
Algorithms
Algorithm Random SPAR JA-BE-JA Gossip-based
Data locality
Decentralized
Load Balancing
Fault tolerance
Avoiding Local
Optima
Availability
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 14
Datasets
Datasets Vertices Edges
Synth-C 2,000 20,000
Synth-HC 2,000 20,000
Synth-PL 2,000 20,000
SNAP-Facebook 4,039 88,234
WSON-Facebook 60,290 1,545,686
SNAP-Twitter 81,306 1,768,149
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 15
Evaluation- with datasets
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
0
2
4
6
8
10
12
Random
SPAR
JA-BE-JA
Gossip-based
ReplicationOverhead
>3x gain
compared to
Random
Partitioning
≈2x gain
compared to
SPAR
• Number of Servers =16, Replication factor=2
16
Evaluation- with replication factor
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
ReplicationOverhead
• Number of Servers =16
0
1
2
3
4
5
6
7
8
9
10
f=0
f=2
Random Graphs
generates maximum
replication overhead Real Graphs
generates minimum
replication
overhead
Data locality is
achieved by fault
tolerance replicas
17
Evaluation- with servers
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
ReplicationOverhead
• Replication factor =2
Number of Servers
WSON-Facebook
18
0
2
4
6
8
10
12
14
16
18
20
8 16 32 64
Random
SPAR
JA-BE-JA
Gossip-based
Gossip-based
generates minimum
replication
overhead
Replication
overhead
increases non
linearly
>4x gain
compared to
Random
Partitioning
0
2
4
6
8
10
12
14
16
18
20
8 16 32 64
Gossip-based
Evaluation- dynamicity
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
• Number of Servers =16, Replication factor=2
0.2
0.25
0.3
0.35
0.4
0.45
0
50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
0.2
0.25
0.3
0.35
0.4
0.45
0
50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
950
1000
SNAP-Twitter SNAP-Facebook
Number of cycles Number of cycles
ReplicationOverhead
ReplicationOverhead
Spikes show
bulk edge
addition
Algorithm
Stabilization
19
Transition state,
i.e., reducing the
number of replicas
after new edge
additions
Conclusion
• Random Partitioning does not provide efficient
solution of Online Social Networks
• Minimizing Replicas can help to achieve better
partitioning
• Gossip-based heuristic was proposed to solve the
minimization problem while achieving the global
optima
• Algorithm able to handle different datasets and
adjusts with dynamic nature of OSNs
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 20
Gossip-based Partitioning
and Replication
Middle-ware for
Online Social Networks
Muhammad Anis Uddin Nasir
(EMDC/ICT/LCN)
Supervisor: Šarūnas Girdzijauskas
Examiner: Johan Montelius
Future Work
• Execution of the algorithm with large datasets using
parallel graph processing frameworks like
GraphLab and Apache Girpah
• Load Balancing using both Master and Replicas and
providing different consistency levels
• Smart Replication to provide data locality for highly
interactive nodes
• Implement different consistency strategies based to
access patterns
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 22

More Related Content

Viewers also liked

The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...Anis Nasir
 
Gossipy2
Gossipy2Gossipy2
Gossipy2NBCC
 
World War Two
World War TwoWorld War Two
World War Twosystemed1
 
Software-Defined Networking , Survey of HotSDN 2012
Software-Defined Networking , Survey of HotSDN 2012Software-Defined Networking , Survey of HotSDN 2012
Software-Defined Networking , Survey of HotSDN 2012Jason TC HOU (侯宗成)
 
Workplace Gossip
Workplace GossipWorkplace Gossip
Workplace GossipYuGao
 
Software-Defined Networking(SDN):A New Approach to Networking
Software-Defined Networking(SDN):A New Approach to NetworkingSoftware-Defined Networking(SDN):A New Approach to Networking
Software-Defined Networking(SDN):A New Approach to NetworkingAnju Ann
 
Good and Bad Power Point Examples Ed Tech
Good and Bad Power Point Examples Ed TechGood and Bad Power Point Examples Ed Tech
Good and Bad Power Point Examples Ed TechLynnylu
 

Viewers also liked (14)

The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
 
Gossipy2
Gossipy2Gossipy2
Gossipy2
 
World War Two
World War TwoWorld War Two
World War Two
 
Gossiping
GossipingGossiping
Gossiping
 
Software-Defined Networking , Survey of HotSDN 2012
Software-Defined Networking , Survey of HotSDN 2012Software-Defined Networking , Survey of HotSDN 2012
Software-Defined Networking , Survey of HotSDN 2012
 
Gossip
GossipGossip
Gossip
 
Workplace Gossip
Workplace GossipWorkplace Gossip
Workplace Gossip
 
Gossip
GossipGossip
Gossip
 
How To Deal With Gossip
How To Deal With GossipHow To Deal With Gossip
How To Deal With Gossip
 
Gossip & Rumor In the Workplace
Gossip & Rumor In the WorkplaceGossip & Rumor In the Workplace
Gossip & Rumor In the Workplace
 
Men V/S Women
Men V/S WomenMen V/S Women
Men V/S Women
 
Software-Defined Networking(SDN):A New Approach to Networking
Software-Defined Networking(SDN):A New Approach to NetworkingSoftware-Defined Networking(SDN):A New Approach to Networking
Software-Defined Networking(SDN):A New Approach to Networking
 
Good and Bad Power Point Examples Ed Tech
Good and Bad Power Point Examples Ed TechGood and Bad Power Point Examples Ed Tech
Good and Bad Power Point Examples Ed Tech
 
STEAL THIS PRESENTATION!
STEAL THIS PRESENTATION! STEAL THIS PRESENTATION!
STEAL THIS PRESENTATION!
 

Similar to Gossip based partitioning and replication for Online Social Networks

Getting started with Cassandra 2.1
Getting started with Cassandra 2.1Getting started with Cassandra 2.1
Getting started with Cassandra 2.1Viswanath J
 
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Johnny Miller
 
Using Riak for Events storage and analysis at Booking.com
Using Riak for Events storage and analysis at Booking.comUsing Riak for Events storage and analysis at Booking.com
Using Riak for Events storage and analysis at Booking.comDamien Krotkine
 
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMsGlobal Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMsMarco Obinu
 
Data Science Lab Meetup: Cassandra and Spark
Data Science Lab Meetup: Cassandra and SparkData Science Lab Meetup: Cassandra and Spark
Data Science Lab Meetup: Cassandra and SparkChristopher Batey
 
Sharding Methods for MongoDB
Sharding Methods for MongoDBSharding Methods for MongoDB
Sharding Methods for MongoDBMongoDB
 
Lawrence Livermore Labs talk 2011
Lawrence Livermore Labs talk 2011Lawrence Livermore Labs talk 2011
Lawrence Livermore Labs talk 2011MapR Technologies
 
Agility and Scalability with MongoDB
Agility and Scalability with MongoDBAgility and Scalability with MongoDB
Agility and Scalability with MongoDBMongoDB
 
D108636GC10_les01.pptx
D108636GC10_les01.pptxD108636GC10_les01.pptx
D108636GC10_les01.pptxSuresh569521
 
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)Alex Rasmussen
 
10 Ways to Scale with Redis - LA Redis Meetup 2019
10 Ways to Scale with Redis - LA Redis Meetup 201910 Ways to Scale with Redis - LA Redis Meetup 2019
10 Ways to Scale with Redis - LA Redis Meetup 2019Dave Nielsen
 
Scaling with sync_replication using Galera and EC2
Scaling with sync_replication using Galera and EC2Scaling with sync_replication using Galera and EC2
Scaling with sync_replication using Galera and EC2Marco Tusa
 
impervious cover
impervious coverimpervious cover
impervious coverJames Yang
 
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Data Con LA
 
Highly available, scalable and secure data with Cassandra and DataStax Enterp...
Highly available, scalable and secure data with Cassandra and DataStax Enterp...Highly available, scalable and secure data with Cassandra and DataStax Enterp...
Highly available, scalable and secure data with Cassandra and DataStax Enterp...Johnny Miller
 
lastfm contentdashboards project description
lastfm contentdashboards project descriptionlastfm contentdashboards project description
lastfm contentdashboards project descriptionGaurav Bhardwaj
 
Shaping the Future of Travel with MongoDB
Shaping the Future of Travel with MongoDBShaping the Future of Travel with MongoDB
Shaping the Future of Travel with MongoDBMongoDB
 

Similar to Gossip based partitioning and replication for Online Social Networks (20)

Getting started with Cassandra 2.1
Getting started with Cassandra 2.1Getting started with Cassandra 2.1
Getting started with Cassandra 2.1
 
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
 
Geode - Day 1
Geode - Day 1Geode - Day 1
Geode - Day 1
 
MySQL vs. MonetDB
MySQL vs. MonetDBMySQL vs. MonetDB
MySQL vs. MonetDB
 
Using Riak for Events storage and analysis at Booking.com
Using Riak for Events storage and analysis at Booking.comUsing Riak for Events storage and analysis at Booking.com
Using Riak for Events storage and analysis at Booking.com
 
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMsGlobal Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
 
EDNS0 Client-Subnet for DNS Based CDNs
EDNS0 Client-Subnet for DNS Based CDNs EDNS0 Client-Subnet for DNS Based CDNs
EDNS0 Client-Subnet for DNS Based CDNs
 
Data Science Lab Meetup: Cassandra and Spark
Data Science Lab Meetup: Cassandra and SparkData Science Lab Meetup: Cassandra and Spark
Data Science Lab Meetup: Cassandra and Spark
 
Sharding Methods for MongoDB
Sharding Methods for MongoDBSharding Methods for MongoDB
Sharding Methods for MongoDB
 
Lawrence Livermore Labs talk 2011
Lawrence Livermore Labs talk 2011Lawrence Livermore Labs talk 2011
Lawrence Livermore Labs talk 2011
 
Agility and Scalability with MongoDB
Agility and Scalability with MongoDBAgility and Scalability with MongoDB
Agility and Scalability with MongoDB
 
D108636GC10_les01.pptx
D108636GC10_les01.pptxD108636GC10_les01.pptx
D108636GC10_les01.pptx
 
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
 
10 Ways to Scale with Redis - LA Redis Meetup 2019
10 Ways to Scale with Redis - LA Redis Meetup 201910 Ways to Scale with Redis - LA Redis Meetup 2019
10 Ways to Scale with Redis - LA Redis Meetup 2019
 
Scaling with sync_replication using Galera and EC2
Scaling with sync_replication using Galera and EC2Scaling with sync_replication using Galera and EC2
Scaling with sync_replication using Galera and EC2
 
impervious cover
impervious coverimpervious cover
impervious cover
 
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
 
Highly available, scalable and secure data with Cassandra and DataStax Enterp...
Highly available, scalable and secure data with Cassandra and DataStax Enterp...Highly available, scalable and secure data with Cassandra and DataStax Enterp...
Highly available, scalable and secure data with Cassandra and DataStax Enterp...
 
lastfm contentdashboards project description
lastfm contentdashboards project descriptionlastfm contentdashboards project description
lastfm contentdashboards project description
 
Shaping the Future of Travel with MongoDB
Shaping the Future of Travel with MongoDBShaping the Future of Travel with MongoDB
Shaping the Future of Travel with MongoDB
 

Recently uploaded

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 

Recently uploaded (20)

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 

Gossip based partitioning and replication for Online Social Networks

  • 1. Gossip-based Partitioning and Replication Middle-ware for Online Social Networks Muhammad Anis Uddin Nasir (EMDC/ICT/LCN) Supervisor: Šarūnas Girdzijauskas Examiner: Johan Montelius
  • 2. Online Social Networks 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware •Vertices •Edges •Metadata Ioanna Antonio Vaidas Aras Vasia Anis Mudit Manos 2 LeandroJohan
  • 3. Existing Solutions • Relational Databases - MySQL Cluster • Key-Value stores - Cassandra, Amazon Dynamo • Document Databases - MongoDB, CouchDB • Graph Databases - Neo4j, Titans 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 3
  • 4. Why Existing Solutions are not enough? 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 4
  • 5. Why Existing Solutions are not enough? • Random Partitioning • Social Request - E.g., gather new feeds from all the friends • Enforcing Data Locality • Random partitioning can lead to full replication! 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 1 4 7 82 3 5 6 10 9 1’ 4’ 7’ 8’ 9’ 2’ 3’ 6’5’ 10’ 5
  • 6. Social Graphs are not Random 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 6
  • 7. Graph Partitioning 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 7
  • 8. JA-BE-JA- edge-cut 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware Server A Server B 6 3 5 2 1 4 76’ 3’ 1’ 4’ 7’ • Edge Cut = 3 links, 3+2=5 replicas to maintain 8
  • 9. SPAR- Minimizing Replicas 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware Server A Server B 6 3 5 2 1 4 76’ 3’2’ 5’ • Edge Cut = 4 links, 2+2=4 replicas to maintain 9
  • 10. Initialization 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 1 4 7 82 3 5 6 10 9 1’ 4’ 7’ 8’ 9’ 2’ 3’ 6’5’ 10’ • Node Addition - Assign it to server with minimum master • Edge Addition - Check if Nodes are Local - Else create replicas to maintain locality 10
  • 11. Gossip Phase • Cost Function - Count number of replicas - For current and new server • Peer Selection - Local, Random, Hybrid 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 1 4 7 82 3 5 6 10 9 1’ 4’ 7’ 8’ 9’ 5’ 10’ 11 2’ 3’ 6’
  • 12. Gossip Phase • Cost Function - Count number of replicas - For existing and new server • Peer Selection - Local, Random, Hybrid • Simulated Annealing 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 6 4 7 82 3 5 1 10 9 4’ 8’ 9’ 3’ 5’ 10’6’ 1’ 4 10 12
  • 13. Simulated Annealing 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 13
  • 14. Algorithms Algorithm Random SPAR JA-BE-JA Gossip-based Data locality Decentralized Load Balancing Fault tolerance Avoiding Local Optima Availability 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 14
  • 15. Datasets Datasets Vertices Edges Synth-C 2,000 20,000 Synth-HC 2,000 20,000 Synth-PL 2,000 20,000 SNAP-Facebook 4,039 88,234 WSON-Facebook 60,290 1,545,686 SNAP-Twitter 81,306 1,768,149 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 15
  • 16. Evaluation- with datasets 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 0 2 4 6 8 10 12 Random SPAR JA-BE-JA Gossip-based ReplicationOverhead >3x gain compared to Random Partitioning ≈2x gain compared to SPAR • Number of Servers =16, Replication factor=2 16
  • 17. Evaluation- with replication factor 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware ReplicationOverhead • Number of Servers =16 0 1 2 3 4 5 6 7 8 9 10 f=0 f=2 Random Graphs generates maximum replication overhead Real Graphs generates minimum replication overhead Data locality is achieved by fault tolerance replicas 17
  • 18. Evaluation- with servers 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware ReplicationOverhead • Replication factor =2 Number of Servers WSON-Facebook 18 0 2 4 6 8 10 12 14 16 18 20 8 16 32 64 Random SPAR JA-BE-JA Gossip-based Gossip-based generates minimum replication overhead Replication overhead increases non linearly >4x gain compared to Random Partitioning 0 2 4 6 8 10 12 14 16 18 20 8 16 32 64 Gossip-based
  • 19. Evaluation- dynamicity 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware • Number of Servers =16, Replication factor=2 0.2 0.25 0.3 0.35 0.4 0.45 0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 0.2 0.25 0.3 0.35 0.4 0.45 0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950 1000 SNAP-Twitter SNAP-Facebook Number of cycles Number of cycles ReplicationOverhead ReplicationOverhead Spikes show bulk edge addition Algorithm Stabilization 19 Transition state, i.e., reducing the number of replicas after new edge additions
  • 20. Conclusion • Random Partitioning does not provide efficient solution of Online Social Networks • Minimizing Replicas can help to achieve better partitioning • Gossip-based heuristic was proposed to solve the minimization problem while achieving the global optima • Algorithm able to handle different datasets and adjusts with dynamic nature of OSNs 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 20
  • 21. Gossip-based Partitioning and Replication Middle-ware for Online Social Networks Muhammad Anis Uddin Nasir (EMDC/ICT/LCN) Supervisor: Šarūnas Girdzijauskas Examiner: Johan Montelius
  • 22. Future Work • Execution of the algorithm with large datasets using parallel graph processing frameworks like GraphLab and Apache Girpah • Load Balancing using both Master and Replicas and providing different consistency levels • Smart Replication to provide data locality for highly interactive nodes • Implement different consistency strategies based to access patterns 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 22