SlideShare une entreprise Scribd logo
1  sur  77
Télécharger pour lire hors ligne
Distributed Hash Tables
Amir H. Payberah
amir@sics.se
Amirkabir University of Technology
(Tehran Polytechnic)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 1 / 62
What is the Problem?
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 2 / 62
What is the Problem?
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 3 / 62
Possible Solutions (1/3)
Central directory
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 4 / 62
Possible Solutions (2/3)
Flooding
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 5 / 62
Possible Solutions (3/3)
Distributed Hash Table (DHT)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 6 / 62
Distributed Hash Table
(DHT)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 7 / 62
Distributed Hash Table
An ordinary hash-table, which is ...
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 8 / 62
Distributed Hash Table
An ordinary hash-table, which is distributed.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 9 / 62
Steps to Build a DHT
Step 1: decide on common key space for nodes and values.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 10 / 62
Steps to Build a DHT
Step 1: decide on common key space for nodes and values.
Step 2: connect the nodes smartly.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 10 / 62
Steps to Build a DHT
Step 1: decide on common key space for nodes and values.
Step 2: connect the nodes smartly.
Step 3: make a strategy for assigning items to nodes.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 10 / 62
Chord: an Example of a DHT
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 11 / 62
Construct Chord - Step 1
Use a logical name space, called the id space, consisting of identifiers
{0, 1, 2, · · · , N − 1}.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 12 / 62
Construct Chord - Step 1
Use a logical name space, called the id space, consisting of identifiers
{0, 1, 2, · · · , N − 1}.
Id space is a logical ring modulo N.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 12 / 62
Construct Chord - Step 1
Use a logical name space, called the id space, consisting of identifiers
{0, 1, 2, · · · , N − 1}.
Id space is a logical ring modulo N.
Every node picks a random id though Hash H.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 12 / 62
Construct Chord - Step 1
Use a logical name space, called the id space, consisting of identifiers
{0, 1, 2, · · · , N − 1}.
Id space is a logical ring modulo N.
Every node picks a random id though Hash H.
Example:
• Space N = 16{0, · · · , 15}
• Five nodes a, b, c, d, e.
• H(a) = 6
• H(b) = 5
• H(c) = 0
• H(d) = 11
• H(e) = 2
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 12 / 62
Construct Chord - Step 2 (1/2)
The successor of an id is the first node met going in clockwise
direction starting at the id.
succ(x): is the first node on the ring with id greater than or equal
x.
• succ(12) = 0
• succ(1) = 2
• succ(6) = 6
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 13 / 62
Construct Chord - Step 2 (2/2)
Each node points to its successor.
The successor of a node n is succ(n + 1).
• 0’s successor is succ(1) = 2.
• 2’s successor is succ(3) = 5.
• 11’s successor is succ(12) = 0.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 14 / 62
Construct Chord - Step 3
Where to store data?
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 15 / 62
Construct Chord - Step 3
Where to store data?
Use globally known hash function H.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 15 / 62
Construct Chord - Step 3
Where to store data?
Use globally known hash function H.
Each item key, value gets identifier H(key) = k.
• Space N = 16{0, · · · , 15}
• Five nodes a, b, c, d, e.
• H(Fatemeh) = 12
• H(Cosmin) = 2
• H(Seif) = 9
• H(Sarunas) = 14
• H(Tallat) = 4
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 15 / 62
Construct Chord - Step 3
Where to store data?
Use globally known hash function H.
Each item key, value gets identifier H(key) = k.
• Space N = 16{0, · · · , 15}
• Five nodes a, b, c, d, e.
• H(Fatemeh) = 12
• H(Cosmin) = 2
• H(Seif) = 9
• H(Sarunas) = 14
• H(Tallat) = 4
Store each item at its successor.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 16 / 62
Construct Chord - Step 3
Where to store data?
Use globally known hash function H.
Each item key, value gets identifier H(key) = k.
• Space N = 16{0, · · · , 15}
• Five nodes a, b, c, d, e.
• H(Fatemeh) = 12
• H(Cosmin) = 2
• H(Seif) = 9
• H(Sarunas) = 14
• H(Tallat) = 4
Store each item at its successor.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 16 / 62
Construct Chord - Step 3
Where to store data?
Use globally known hash function H.
Each item key, value gets identifier H(key) = k.
• Space N = 16{0, · · · , 15}
• Five nodes a, b, c, d, e.
• H(Fatemeh) = 12
• H(Cosmin) = 2
• H(Seif) = 9
• H(Sarunas) = 14
• H(Tallat) = 4
Store each item at its successor.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 17 / 62
How to Lookup?
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 18 / 62
Lookup (1/2)
To lookup a key k:
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 19 / 62
Lookup (1/2)
To lookup a key k:
• Calculate H(k).
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 19 / 62
Lookup (1/2)
To lookup a key k:
• Calculate H(k).
• Follow succ pointers until item k is found.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 19 / 62
Lookup (1/2)
To lookup a key k:
• Calculate H(k).
• Follow succ pointers until item k is found.
Example:
• Lookup Seif at node 2.
• H(Seif) = 9
• Traverse nodes: 2, 5, 6, 11
• Return Stockholm to initiator
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 19 / 62
Lookup (2/2)
Algorithm 1 Ask node n to find the successor of id
1: procedure n.findSuccessor(id)
2: if pred = Ø and id ∈ (pred, n] then
3: return n
4: else if id ∈ (n, succ] then
5: return succ
6: else// forward the query around the circle
7: return succ.findSuccessor(id)
8: end if
9: end procedure
(a, b] the segment of the ring moving clockwise from but not includ-
ing a until and including b.
n.foo(.) denotes an RPC of foo(.) to node n.
n.bar denotes and RPC to fetch the value of the variable bar in
node n.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 20 / 62
Put and Get
Algorithm 2 Store value with key id in the DHT
1: procedure n.put(id, value)
2: n = findSuccessor(id)
3: s.store(id, value)
4: end procedure
Algorithm 3 Retrieve the value of the key id from the DHT
1: procedure n.get(id)
2: n = findSuccessor(id)
3: return s.retrieve(id)
4: end procedure
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 21 / 62
Any Improvement?
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 22 / 62
Improvement
Speeding up lookups.
If only the successor pointers are used:
• Worst case lookup time is N, for N nodes.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 23 / 62
Speeding up Lookups (1/2)
Finger/routing table:
• Point to succ(n + 1)
• Point to succ(n + 2)
• Point to succ(n + 4)
• ...
• Point to succ(n + 2M−1
) (N = 2M
, N: the id space size)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 24 / 62
Speeding up Lookups (1/2)
Finger/routing table:
• Point to succ(n + 1)
• Point to succ(n + 2)
• Point to succ(n + 4)
• ...
• Point to succ(n + 2M−1
) (N = 2M
, N: the id space size)
Distance always halved to the
destination.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 24 / 62
Speeding up Lookups (2/2)
Every node n knows succ(n + 2i−1) for i = 1, · · · , M.
Size of routing tables is logarithmic:
• Routing table size: M, where N = 2M
.
• Routing entries = log2(N).
• Example: Log2(1000000) ≈ 20
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 25 / 62
Lookup Improvement (1/3)
Algorithm 4 Ask node n to find the successor of id
1: procedure n.findSuccessor(id)
2: if pred = Ø and id ∈ (pred, n] then
3: return n
4: else if id ∈ (n, succ] then
5: return succ
6: else// forward the query around the circle
7: return succ.findSuccessor(id)
8: end if
9: end procedure
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 26 / 62
Lookup Improvement (2/3)
Algorithm 5 Ask node n to find the successor of id
1: procedure n.findSuccessor(id)
2: if pred = Ø and id ∈ (pred, n] then
3: return n
4: else if id ∈ (n, succ] then
5: return succ
6: else// forward the query around the circle
7: p ← closestPrecedingNode(id)
8: return p.findSuccessor(id)
9: end if
10: end procedure
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 27 / 62
Lookup Improvement (3/3)
Algorithm 6 Search locally for the highest predecessor of id
1: procedure closestPrecedingNode(id)
2: for i = m downto 1 do
3: if finget[i] ∈ (n, id) then
4: return finger[i]
5: end if
6: end for
7: end procedure
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 28 / 62
Lookups (1/7)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 29 / 62
Lookups (2/7)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 30 / 62
Lookups (3/7)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 31 / 62
Lookups (4/7)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 32 / 62
Lookups (5/7)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 33 / 62
Lookups (6/7)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 34 / 62
Lookups (7/7)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 35 / 62
How to Maintain the Ring?
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 36 / 62
Periodic Stabilization (1/2)
In Chord, in addition to the successor pointer, every node has a
predecessor pointer.
• Predecessor of node n is the first node met in anti-clockwise
direction starting at n.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 37 / 62
Periodic Stabilization (1/2)
In Chord, in addition to the successor pointer, every node has a
predecessor pointer.
• Predecessor of node n is the first node met in anti-clockwise
direction starting at n.
Periodic stabilization is used to make pointers eventually correct.
• Pointing succ to closest alive
successor.
• Pointing pred to closest alive
predecessor.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 37 / 62
Periodic Stabilization (2/2)
Algorithm 7 Periodically at n
1: procedure n.stabilize()
2: v ← succ.pred
3: if v = Ø and v ∈ (n, succ] then
4: succ ← v
5: end if
6: send notify(n) to succ
7: end procedure
Algorithm 8 Upon receipt a notify(p) at node m
1: on receive notify | p from n do
2: if pred = Ø or p ∈ (pred, m] then
3: pred ← p
4: end if
5: end event
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 38 / 62
Handling Join
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 39 / 62
Handling Join
When n joins:
• Find n’s successor with lookup(n).
• Set succ to n’s successor.
• Stabilization fixes the rest.
Algorithm 9 Join a Chord ring containing node m
1: procedure n.join(m)
2: pred ← Ø
3: succ ← m.findSuccessor(n)
4: end procedure
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 40 / 62
Join (1/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 41 / 62
Join (2/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 42 / 62
Join (3/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 43 / 62
Join (4/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 44 / 62
Join (5/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 45 / 62
Fix Fingers (1/4)
Periodically refresh finger table entries, and store the index of the
next finger to fix.
Algorithm 10 When receiving notify(p) at n
1: procedure n.fixFingers()
2: next ← next + 1
3: if next > m then
4: next ← 1
5: end if
6: finger[next] ← findSuccessor(n ⊕ 2next−1)
7: end procedure
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 46 / 62
Fix Fingers (2/4)
Current situation: succ(N48) = N60
succ(21 ⊕ 25) = succ(53) = N60
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 47 / 62
Fix Fingers (3/4)
succ(21 ⊕ 25) = succ(53) =?
New node N56 joins and stabilizes successor pointer.
Finger 6 of node N21 is wrong now.
N21 eventually try to fix finger 6 by looking up 53 which stops at
N48.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 48 / 62
Fix Fingers (4/4)
succ(21 ⊕ 25) = succ(53) = N56
N48 will eventually stabilize its successor.
This means the ring is correct now.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 49 / 62
Handling Failure
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 50 / 62
Successor List (1/2)
A node has a successors list of size r containing the immediate r
successors.
• succ(n + 1)
• succ(succ(n + 1) + 1)
• succ(succ(succ(n + 1) + 1) + 1)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 51 / 62
Successor List (1/2)
A node has a successors list of size r containing the immediate r
successors.
• succ(n + 1)
• succ(succ(n + 1) + 1)
• succ(succ(succ(n + 1) + 1) + 1)
How big should r be?
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 51 / 62
Successor List (1/2)
A node has a successors list of size r containing the immediate r
successors.
• succ(n + 1)
• succ(succ(n + 1) + 1)
• succ(succ(succ(n + 1) + 1) + 1)
How big should r be? log2(N)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 51 / 62
Successor List (2/2)
Algorithm 11 Join a Chord ring containing node m
1: procedure n.join(m)
2: pred ← Ø
3: succ ← m.findSuccessor(n)
4: updateSuccesorList(succ.successorList)
5: end procedure
Algorithm 12 Periodically at n
1: procedure n.stabilize()
2: succ ← find first alive node in successor list
3: v ← succ.pred
4: if v = Ø and v ∈ (n, succ] then
5: succ ← v
6: end if
7: send notify(n) to succ updateSuccessorList(succ.successorList)
8: end procedure
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 52 / 62
Dealing with Failure
Periodic stabilization
If successor fails: replace with closest alive successor
If predecessor fails: set predecessor to nil
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 53 / 62
Failure (1/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 54 / 62
Failure (2/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 55 / 62
Failure (3/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 56 / 62
Failure (4/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 57 / 62
Failure (5/5)
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 58 / 62
Summary
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 59 / 62
Summary
DHTs: distributed key, value
Lookup service
Put and Get
Finger list: improve the lookup
Periodically stabilization
Successor list
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 60 / 62
References:
Ion Stoica et al., Chord: A scalable peer-to-peer lookup service for
internet applications, SIGCOMM, 2001.
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 61 / 62
Questions?
Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 62 / 62

Contenu connexe

Tendances

Tendances (20)

Minimization of DFA.pptx
Minimization of DFA.pptxMinimization of DFA.pptx
Minimization of DFA.pptx
 
The Floyd–Warshall algorithm
The Floyd–Warshall algorithmThe Floyd–Warshall algorithm
The Floyd–Warshall algorithm
 
3.9 external sorting
3.9 external sorting3.9 external sorting
3.9 external sorting
 
Quick sort
Quick sortQuick sort
Quick sort
 
Directory structure
Directory structureDirectory structure
Directory structure
 
Chapter 8 - Main Memory
Chapter 8 - Main MemoryChapter 8 - Main Memory
Chapter 8 - Main Memory
 
Hashing and separate chain
Hashing and separate chainHashing and separate chain
Hashing and separate chain
 
Lec1
Lec1Lec1
Lec1
 
Unit 2 in daa
Unit 2 in daaUnit 2 in daa
Unit 2 in daa
 
Chapter 12 ds
Chapter 12 dsChapter 12 ds
Chapter 12 ds
 
Floyd warshall-algorithm
Floyd warshall-algorithmFloyd warshall-algorithm
Floyd warshall-algorithm
 
Unit I-Data structures stack & Queue
Unit I-Data structures stack & QueueUnit I-Data structures stack & Queue
Unit I-Data structures stack & Queue
 
Hashing Part Two: Static Perfect Hashing
Hashing Part Two: Static Perfect HashingHashing Part Two: Static Perfect Hashing
Hashing Part Two: Static Perfect Hashing
 
CONTEXT FREE GRAMMAR
CONTEXT FREE GRAMMAR CONTEXT FREE GRAMMAR
CONTEXT FREE GRAMMAR
 
Disjoint sets
Disjoint setsDisjoint sets
Disjoint sets
 
Dijkstra s algorithm
Dijkstra s algorithmDijkstra s algorithm
Dijkstra s algorithm
 
Dynamic Partition Pruning in Apache Spark
Dynamic Partition Pruning in Apache SparkDynamic Partition Pruning in Apache Spark
Dynamic Partition Pruning in Apache Spark
 
Disk scheduling
Disk schedulingDisk scheduling
Disk scheduling
 
Chapter 7 - Deadlocks
Chapter 7 - DeadlocksChapter 7 - Deadlocks
Chapter 7 - Deadlocks
 
A presentation on prim's and kruskal's algorithm
A presentation on prim's and kruskal's algorithmA presentation on prim's and kruskal's algorithm
A presentation on prim's and kruskal's algorithm
 

En vedette

Chord presentation
Chord presentationChord presentation
Chord presentation
GertThijs
 
Consistent hashing
Consistent hashingConsistent hashing
Consistent hashing
Jooho Lee
 

En vedette (20)

Distributed Hash Table
Distributed Hash TableDistributed Hash Table
Distributed Hash Table
 
Distributed Hash Table and Consistent Hashing
Distributed Hash Table and Consistent HashingDistributed Hash Table and Consistent Hashing
Distributed Hash Table and Consistent Hashing
 
Chord presentation
Chord presentationChord presentation
Chord presentation
 
Process Management - Part3
Process Management - Part3Process Management - Part3
Process Management - Part3
 
Process Management - Part1
Process Management - Part1Process Management - Part1
Process Management - Part1
 
Introduction to Operating Systems - Part3
Introduction to Operating Systems - Part3Introduction to Operating Systems - Part3
Introduction to Operating Systems - Part3
 
Threads
ThreadsThreads
Threads
 
Google File System
Google File SystemGoogle File System
Google File System
 
Process Management - Part2
Process Management - Part2Process Management - Part2
Process Management - Part2
 
Introduction to Operating Systems - Part2
Introduction to Operating Systems - Part2Introduction to Operating Systems - Part2
Introduction to Operating Systems - Part2
 
Process Synchronization - Part2
Process Synchronization -  Part2Process Synchronization -  Part2
Process Synchronization - Part2
 
Process Synchronization - Part1
Process Synchronization -  Part1Process Synchronization -  Part1
Process Synchronization - Part1
 
Introduction to Operating Systems - Part1
Introduction to Operating Systems - Part1Introduction to Operating Systems - Part1
Introduction to Operating Systems - Part1
 
Wanhive vs Chord Distributed Hash Table
Wanhive vs Chord Distributed Hash TableWanhive vs Chord Distributed Hash Table
Wanhive vs Chord Distributed Hash Table
 
O(1) DHT
O(1) DHTO(1) DHT
O(1) DHT
 
Consistent hashing
Consistent hashingConsistent hashing
Consistent hashing
 
key distribution in network security
key distribution in network securitykey distribution in network security
key distribution in network security
 
Hash tables
Hash tablesHash tables
Hash tables
 
Basics of Content Chunking
Basics of Content ChunkingBasics of Content Chunking
Basics of Content Chunking
 
Hashing and Hash Tables
Hashing and Hash TablesHashing and Hash Tables
Hashing and Hash Tables
 

Similaire à Distributed Hash Table

computer notes - Data Structures - 26
computer notes - Data Structures - 26computer notes - Data Structures - 26
computer notes - Data Structures - 26
ecomputernotes
 
Secure information aggregation in sensor networks
Secure information aggregation in sensor networksSecure information aggregation in sensor networks
Secure information aggregation in sensor networks
Aleksandr Yampolskiy
 
Bigdata presentation
Bigdata presentationBigdata presentation
Bigdata presentation
Yonas Gidey
 

Similaire à Distributed Hash Table (20)

MapReduce
MapReduceMapReduce
MapReduce
 
Hashing
HashingHashing
Hashing
 
Hashing
HashingHashing
Hashing
 
Spark
SparkSpark
Spark
 
Scala
ScalaScala
Scala
 
computer notes - Data Structures - 26
computer notes - Data Structures - 26computer notes - Data Structures - 26
computer notes - Data Structures - 26
 
6 attributed grammars
6  attributed grammars6  attributed grammars
6 attributed grammars
 
Distributed algorithms for big data @ GeeCon
Distributed algorithms for big data @ GeeConDistributed algorithms for big data @ GeeCon
Distributed algorithms for big data @ GeeCon
 
Recursive algorithms
Recursive algorithmsRecursive algorithms
Recursive algorithms
 
Cs419 lec12 semantic analyzer
Cs419 lec12  semantic analyzerCs419 lec12  semantic analyzer
Cs419 lec12 semantic analyzer
 
Secure information aggregation in sensor networks
Secure information aggregation in sensor networksSecure information aggregation in sensor networks
Secure information aggregation in sensor networks
 
Sketching and locality sensitive hashing for alignment
Sketching and locality sensitive hashing for alignmentSketching and locality sensitive hashing for alignment
Sketching and locality sensitive hashing for alignment
 
Searching Informed Search.pdf
Searching Informed Search.pdfSearching Informed Search.pdf
Searching Informed Search.pdf
 
Hashing in datastructure
Hashing in datastructureHashing in datastructure
Hashing in datastructure
 
Bigdata presentation
Bigdata presentationBigdata presentation
Bigdata presentation
 
Bigdata Presentation
Bigdata PresentationBigdata Presentation
Bigdata Presentation
 
A Star Search
A Star SearchA Star Search
A Star Search
 
A Star Search
A Star SearchA Star Search
A Star Search
 
AI Greedy and A-STAR Search
AI Greedy and A-STAR SearchAI Greedy and A-STAR Search
AI Greedy and A-STAR Search
 
Hashing.pptx
Hashing.pptxHashing.pptx
Hashing.pptx
 

Plus de Amir Payberah

Plus de Amir Payberah (20)

P2P Content Distribution Network
P2P Content Distribution NetworkP2P Content Distribution Network
P2P Content Distribution Network
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Data Intensive Computing Frameworks
Data Intensive Computing FrameworksData Intensive Computing Frameworks
Data Intensive Computing Frameworks
 
The Stratosphere Big Data Analytics Platform
The Stratosphere Big Data Analytics PlatformThe Stratosphere Big Data Analytics Platform
The Stratosphere Big Data Analytics Platform
 
The Spark Big Data Analytics Platform
The Spark Big Data Analytics PlatformThe Spark Big Data Analytics Platform
The Spark Big Data Analytics Platform
 
Security
SecuritySecurity
Security
 
Protection
ProtectionProtection
Protection
 
Linux Module Programming
Linux Module ProgrammingLinux Module Programming
Linux Module Programming
 
IO Systems
IO SystemsIO Systems
IO Systems
 
File System Implementation - Part2
File System Implementation - Part2File System Implementation - Part2
File System Implementation - Part2
 
File System Implementation - Part1
File System Implementation - Part1File System Implementation - Part1
File System Implementation - Part1
 
File System Interface
File System InterfaceFile System Interface
File System Interface
 
Storage
StorageStorage
Storage
 
Virtual Memory - Part2
Virtual Memory - Part2Virtual Memory - Part2
Virtual Memory - Part2
 
Virtual Memory - Part1
Virtual Memory - Part1Virtual Memory - Part1
Virtual Memory - Part1
 
Main Memory - Part2
Main Memory - Part2Main Memory - Part2
Main Memory - Part2
 
Main Memory - Part1
Main Memory - Part1Main Memory - Part1
Main Memory - Part1
 
Deadlocks
DeadlocksDeadlocks
Deadlocks
 
CPU Scheduling - Part2
CPU Scheduling - Part2CPU Scheduling - Part2
CPU Scheduling - Part2
 
CPU Scheduling - Part1
CPU Scheduling - Part1CPU Scheduling - Part1
CPU Scheduling - Part1
 

Dernier

introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
VishalKumarJha10
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 

Dernier (20)

W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
Pharm-D Biostatistics and Research methodology
Pharm-D Biostatistics and Research methodologyPharm-D Biostatistics and Research methodology
Pharm-D Biostatistics and Research methodology
 
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 

Distributed Hash Table

  • 1. Distributed Hash Tables Amir H. Payberah amir@sics.se Amirkabir University of Technology (Tehran Polytechnic) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 1 / 62
  • 2. What is the Problem? Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 2 / 62
  • 3. What is the Problem? Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 3 / 62
  • 4. Possible Solutions (1/3) Central directory Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 4 / 62
  • 5. Possible Solutions (2/3) Flooding Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 5 / 62
  • 6. Possible Solutions (3/3) Distributed Hash Table (DHT) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 6 / 62
  • 7. Distributed Hash Table (DHT) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 7 / 62
  • 8. Distributed Hash Table An ordinary hash-table, which is ... Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 8 / 62
  • 9. Distributed Hash Table An ordinary hash-table, which is distributed. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 9 / 62
  • 10. Steps to Build a DHT Step 1: decide on common key space for nodes and values. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 10 / 62
  • 11. Steps to Build a DHT Step 1: decide on common key space for nodes and values. Step 2: connect the nodes smartly. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 10 / 62
  • 12. Steps to Build a DHT Step 1: decide on common key space for nodes and values. Step 2: connect the nodes smartly. Step 3: make a strategy for assigning items to nodes. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 10 / 62
  • 13. Chord: an Example of a DHT Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 11 / 62
  • 14. Construct Chord - Step 1 Use a logical name space, called the id space, consisting of identifiers {0, 1, 2, · · · , N − 1}. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 12 / 62
  • 15. Construct Chord - Step 1 Use a logical name space, called the id space, consisting of identifiers {0, 1, 2, · · · , N − 1}. Id space is a logical ring modulo N. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 12 / 62
  • 16. Construct Chord - Step 1 Use a logical name space, called the id space, consisting of identifiers {0, 1, 2, · · · , N − 1}. Id space is a logical ring modulo N. Every node picks a random id though Hash H. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 12 / 62
  • 17. Construct Chord - Step 1 Use a logical name space, called the id space, consisting of identifiers {0, 1, 2, · · · , N − 1}. Id space is a logical ring modulo N. Every node picks a random id though Hash H. Example: • Space N = 16{0, · · · , 15} • Five nodes a, b, c, d, e. • H(a) = 6 • H(b) = 5 • H(c) = 0 • H(d) = 11 • H(e) = 2 Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 12 / 62
  • 18. Construct Chord - Step 2 (1/2) The successor of an id is the first node met going in clockwise direction starting at the id. succ(x): is the first node on the ring with id greater than or equal x. • succ(12) = 0 • succ(1) = 2 • succ(6) = 6 Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 13 / 62
  • 19. Construct Chord - Step 2 (2/2) Each node points to its successor. The successor of a node n is succ(n + 1). • 0’s successor is succ(1) = 2. • 2’s successor is succ(3) = 5. • 11’s successor is succ(12) = 0. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 14 / 62
  • 20. Construct Chord - Step 3 Where to store data? Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 15 / 62
  • 21. Construct Chord - Step 3 Where to store data? Use globally known hash function H. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 15 / 62
  • 22. Construct Chord - Step 3 Where to store data? Use globally known hash function H. Each item key, value gets identifier H(key) = k. • Space N = 16{0, · · · , 15} • Five nodes a, b, c, d, e. • H(Fatemeh) = 12 • H(Cosmin) = 2 • H(Seif) = 9 • H(Sarunas) = 14 • H(Tallat) = 4 Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 15 / 62
  • 23. Construct Chord - Step 3 Where to store data? Use globally known hash function H. Each item key, value gets identifier H(key) = k. • Space N = 16{0, · · · , 15} • Five nodes a, b, c, d, e. • H(Fatemeh) = 12 • H(Cosmin) = 2 • H(Seif) = 9 • H(Sarunas) = 14 • H(Tallat) = 4 Store each item at its successor. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 16 / 62
  • 24. Construct Chord - Step 3 Where to store data? Use globally known hash function H. Each item key, value gets identifier H(key) = k. • Space N = 16{0, · · · , 15} • Five nodes a, b, c, d, e. • H(Fatemeh) = 12 • H(Cosmin) = 2 • H(Seif) = 9 • H(Sarunas) = 14 • H(Tallat) = 4 Store each item at its successor. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 16 / 62
  • 25. Construct Chord - Step 3 Where to store data? Use globally known hash function H. Each item key, value gets identifier H(key) = k. • Space N = 16{0, · · · , 15} • Five nodes a, b, c, d, e. • H(Fatemeh) = 12 • H(Cosmin) = 2 • H(Seif) = 9 • H(Sarunas) = 14 • H(Tallat) = 4 Store each item at its successor. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 17 / 62
  • 26. How to Lookup? Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 18 / 62
  • 27. Lookup (1/2) To lookup a key k: Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 19 / 62
  • 28. Lookup (1/2) To lookup a key k: • Calculate H(k). Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 19 / 62
  • 29. Lookup (1/2) To lookup a key k: • Calculate H(k). • Follow succ pointers until item k is found. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 19 / 62
  • 30. Lookup (1/2) To lookup a key k: • Calculate H(k). • Follow succ pointers until item k is found. Example: • Lookup Seif at node 2. • H(Seif) = 9 • Traverse nodes: 2, 5, 6, 11 • Return Stockholm to initiator Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 19 / 62
  • 31. Lookup (2/2) Algorithm 1 Ask node n to find the successor of id 1: procedure n.findSuccessor(id) 2: if pred = Ø and id ∈ (pred, n] then 3: return n 4: else if id ∈ (n, succ] then 5: return succ 6: else// forward the query around the circle 7: return succ.findSuccessor(id) 8: end if 9: end procedure (a, b] the segment of the ring moving clockwise from but not includ- ing a until and including b. n.foo(.) denotes an RPC of foo(.) to node n. n.bar denotes and RPC to fetch the value of the variable bar in node n. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 20 / 62
  • 32. Put and Get Algorithm 2 Store value with key id in the DHT 1: procedure n.put(id, value) 2: n = findSuccessor(id) 3: s.store(id, value) 4: end procedure Algorithm 3 Retrieve the value of the key id from the DHT 1: procedure n.get(id) 2: n = findSuccessor(id) 3: return s.retrieve(id) 4: end procedure Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 21 / 62
  • 33. Any Improvement? Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 22 / 62
  • 34. Improvement Speeding up lookups. If only the successor pointers are used: • Worst case lookup time is N, for N nodes. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 23 / 62
  • 35. Speeding up Lookups (1/2) Finger/routing table: • Point to succ(n + 1) • Point to succ(n + 2) • Point to succ(n + 4) • ... • Point to succ(n + 2M−1 ) (N = 2M , N: the id space size) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 24 / 62
  • 36. Speeding up Lookups (1/2) Finger/routing table: • Point to succ(n + 1) • Point to succ(n + 2) • Point to succ(n + 4) • ... • Point to succ(n + 2M−1 ) (N = 2M , N: the id space size) Distance always halved to the destination. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 24 / 62
  • 37. Speeding up Lookups (2/2) Every node n knows succ(n + 2i−1) for i = 1, · · · , M. Size of routing tables is logarithmic: • Routing table size: M, where N = 2M . • Routing entries = log2(N). • Example: Log2(1000000) ≈ 20 Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 25 / 62
  • 38. Lookup Improvement (1/3) Algorithm 4 Ask node n to find the successor of id 1: procedure n.findSuccessor(id) 2: if pred = Ø and id ∈ (pred, n] then 3: return n 4: else if id ∈ (n, succ] then 5: return succ 6: else// forward the query around the circle 7: return succ.findSuccessor(id) 8: end if 9: end procedure Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 26 / 62
  • 39. Lookup Improvement (2/3) Algorithm 5 Ask node n to find the successor of id 1: procedure n.findSuccessor(id) 2: if pred = Ø and id ∈ (pred, n] then 3: return n 4: else if id ∈ (n, succ] then 5: return succ 6: else// forward the query around the circle 7: p ← closestPrecedingNode(id) 8: return p.findSuccessor(id) 9: end if 10: end procedure Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 27 / 62
  • 40. Lookup Improvement (3/3) Algorithm 6 Search locally for the highest predecessor of id 1: procedure closestPrecedingNode(id) 2: for i = m downto 1 do 3: if finget[i] ∈ (n, id) then 4: return finger[i] 5: end if 6: end for 7: end procedure Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 28 / 62
  • 41. Lookups (1/7) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 29 / 62
  • 42. Lookups (2/7) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 30 / 62
  • 43. Lookups (3/7) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 31 / 62
  • 44. Lookups (4/7) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 32 / 62
  • 45. Lookups (5/7) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 33 / 62
  • 46. Lookups (6/7) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 34 / 62
  • 47. Lookups (7/7) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 35 / 62
  • 48. How to Maintain the Ring? Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 36 / 62
  • 49. Periodic Stabilization (1/2) In Chord, in addition to the successor pointer, every node has a predecessor pointer. • Predecessor of node n is the first node met in anti-clockwise direction starting at n. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 37 / 62
  • 50. Periodic Stabilization (1/2) In Chord, in addition to the successor pointer, every node has a predecessor pointer. • Predecessor of node n is the first node met in anti-clockwise direction starting at n. Periodic stabilization is used to make pointers eventually correct. • Pointing succ to closest alive successor. • Pointing pred to closest alive predecessor. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 37 / 62
  • 51. Periodic Stabilization (2/2) Algorithm 7 Periodically at n 1: procedure n.stabilize() 2: v ← succ.pred 3: if v = Ø and v ∈ (n, succ] then 4: succ ← v 5: end if 6: send notify(n) to succ 7: end procedure Algorithm 8 Upon receipt a notify(p) at node m 1: on receive notify | p from n do 2: if pred = Ø or p ∈ (pred, m] then 3: pred ← p 4: end if 5: end event Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 38 / 62
  • 52. Handling Join Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 39 / 62
  • 53. Handling Join When n joins: • Find n’s successor with lookup(n). • Set succ to n’s successor. • Stabilization fixes the rest. Algorithm 9 Join a Chord ring containing node m 1: procedure n.join(m) 2: pred ← Ø 3: succ ← m.findSuccessor(n) 4: end procedure Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 40 / 62
  • 54. Join (1/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 41 / 62
  • 55. Join (2/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 42 / 62
  • 56. Join (3/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 43 / 62
  • 57. Join (4/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 44 / 62
  • 58. Join (5/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 45 / 62
  • 59. Fix Fingers (1/4) Periodically refresh finger table entries, and store the index of the next finger to fix. Algorithm 10 When receiving notify(p) at n 1: procedure n.fixFingers() 2: next ← next + 1 3: if next > m then 4: next ← 1 5: end if 6: finger[next] ← findSuccessor(n ⊕ 2next−1) 7: end procedure Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 46 / 62
  • 60. Fix Fingers (2/4) Current situation: succ(N48) = N60 succ(21 ⊕ 25) = succ(53) = N60 Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 47 / 62
  • 61. Fix Fingers (3/4) succ(21 ⊕ 25) = succ(53) =? New node N56 joins and stabilizes successor pointer. Finger 6 of node N21 is wrong now. N21 eventually try to fix finger 6 by looking up 53 which stops at N48. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 48 / 62
  • 62. Fix Fingers (4/4) succ(21 ⊕ 25) = succ(53) = N56 N48 will eventually stabilize its successor. This means the ring is correct now. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 49 / 62
  • 63. Handling Failure Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 50 / 62
  • 64. Successor List (1/2) A node has a successors list of size r containing the immediate r successors. • succ(n + 1) • succ(succ(n + 1) + 1) • succ(succ(succ(n + 1) + 1) + 1) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 51 / 62
  • 65. Successor List (1/2) A node has a successors list of size r containing the immediate r successors. • succ(n + 1) • succ(succ(n + 1) + 1) • succ(succ(succ(n + 1) + 1) + 1) How big should r be? Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 51 / 62
  • 66. Successor List (1/2) A node has a successors list of size r containing the immediate r successors. • succ(n + 1) • succ(succ(n + 1) + 1) • succ(succ(succ(n + 1) + 1) + 1) How big should r be? log2(N) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 51 / 62
  • 67. Successor List (2/2) Algorithm 11 Join a Chord ring containing node m 1: procedure n.join(m) 2: pred ← Ø 3: succ ← m.findSuccessor(n) 4: updateSuccesorList(succ.successorList) 5: end procedure Algorithm 12 Periodically at n 1: procedure n.stabilize() 2: succ ← find first alive node in successor list 3: v ← succ.pred 4: if v = Ø and v ∈ (n, succ] then 5: succ ← v 6: end if 7: send notify(n) to succ updateSuccessorList(succ.successorList) 8: end procedure Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 52 / 62
  • 68. Dealing with Failure Periodic stabilization If successor fails: replace with closest alive successor If predecessor fails: set predecessor to nil Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 53 / 62
  • 69. Failure (1/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 54 / 62
  • 70. Failure (2/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 55 / 62
  • 71. Failure (3/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 56 / 62
  • 72. Failure (4/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 57 / 62
  • 73. Failure (5/5) Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 58 / 62
  • 74. Summary Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 59 / 62
  • 75. Summary DHTs: distributed key, value Lookup service Put and Get Finger list: improve the lookup Periodically stabilization Successor list Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 60 / 62
  • 76. References: Ion Stoica et al., Chord: A scalable peer-to-peer lookup service for internet applications, SIGCOMM, 2001. Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 61 / 62
  • 77. Questions? Amir H. Payberah (Tehran Polytechnic) DHTs 1393/7/12 62 / 62