Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore)

•

0 j'aime•160 vues

This document discusses using distributed locks to synchronize access to shared resources in distributed applications. It provides examples of when distributed locks are needed to prevent data corruption or loss from concurrent requests. The document recommends using proven distributed lock libraries like redis-semaphore and redlock-rb to implement distributed locks instead of recreating the logic. Tips are given on testing for concurrency issues and choosing correctness over efficiency with locks.

Technologie

Distributed locks
KnapsackPro.com API case study
Artur Trzop

CircleCI container
Execute Rspec tests
Run all tests in single container

First container
Execute Rspec tests
Second container
Execute Rspec tests
Split tests between container
CI build for git commit

First container
Execute Rspec
tests fetched from work queue
Second container
Execute Rspec
tests fetched from work queue
Dynamic tests split between container
with knapsack_pro gem
Tests in work queue on
KnapsackPro.com API server

First requests from any container
should create a new work queue
for the git commit
def test_files
create_queue unless queue_exists?
test_files_from_top_of_the_queue
end

When this code can happen
at the same time?
● development - webrick is single threaded (bug won’t happen)
● Production (bug happens):
○ unicorn (multiple processes)
○ puma (multiple threads and/or multiple processes)

What is distributed lock?
synchronize access to shared resources
It means different processes must operate with shared resources
in a mutually exclusive way

Why you want a lock in a
distributed application?
● Efficiency
○ Avoid expensive computation
○ Saves time & money

Why you want a lock in a
distributed application?
● Correctness
○ Prevent corrupted data
○ Prevent data loss
○ Prevent data inconsistency

What tool I could use?
https://github.com/dv/redis-semaphore

How I solve the problem?
● Write tests to do concurrent requests against staging
○ Requests in separate threads
● Verify if the problem exists after implementing fix

Implementation fix
def test_files
semaphore = Redis::Semaphore.new(:semaphore_for_ci_build_id, host: "localhost")
semaphore.lock(5) do
create_queue unless queue_exists?
end
test_files_from_top_of_the_queue
end

What should I remember
● Locks are hard
● Distributed locks are even harder
● Don't reinvent the wheel, use proven solutions
● Most web apps are not thread-safe due to missing locks.
○ Expect edge cases while you grow.

Another example
def save
build = find_build || new_build
do_something_complex_with_build
build.save
end

With distributed lock
def save
semaphore.lock(5) do
build = find_build || new_build
do_something_complex_with_build
build.save
end
end

What distributed lock
allows us?
● When single redis DB
○ You can have multiple unicorn/puma processes
○ You can have multiple machines if each use the same redis DB

What distributed lock
allows us?
● When multiple redis DBs
○ Use library that supports Redlock algorithm
■ redlock-rb gem
■ https://redis.io/topics/distlock
● Is Redlock perfect? Kind of, because it has timing assumptions.
https://martin.kleppmann.com/2016/02/08/how-to-do-distributed-locking.html
Response from Redis author http://antirez.com/news/101

Tips
● Be aware of trade-off. Do you care about efficiency or correctness?
● Test your endpoints with concurrent requests to reproduce problem
● Use transactions when changing many records
● Use unique index to ensure data consistency in DB
● Use tested distribution lock solutions

Nice to read
https://makandracards.com/makandra/31937-differences-between-transactions-an
d-locking

Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore)

Contenu connexe

Tendances

Developing Rich Internet Applications with Perl and JavaScriptnohuhu

Ruby eventmachine pres at rubybdxMathieu Elie

Git+jenkins+rex presentationDwi Sasongko Supriyadi

BBC's GraphDB (formerly Owlim) AWS Cloud Migrationlogomachy

Introduction to Node.jsSetyo Nugroho

Build a chatroom!SheilaJimenezMorejon

Add a backend and deploy!SheilaJimenezMorejon

Are you using an opensource library? There's a good chance you are vulnerable...Codemotion

RSYSLOG v8 improvements and how to write plugins in any language.Rainer Gerhards

CTU June 2011 - C# 5.0 - ASYNC & AwaitSpiffy

TDD for joomla extensionsRoberto Segura

openSUSE Conference 2017 - The Docker at Travis Presentationlslezak

grifork - fast propagative task runner -IKEDA Kiyoshi

Intro to Node.jsJames Carr

Rust Programming LanguageJaeju Kim

Running of nist testMuhammad Hamid

"fireap" - fast task runner on consulIKEDA Kiyoshi

Deep drive into rust programming languageVigneshwer Dhinakaran

DevOps in realtimeAndriy Samilyak

ruby + websocket + haproxyMathieu Elie

Tendances (20)

Developing Rich Internet Applications with Perl and JavaScript

Ruby eventmachine pres at rubybdx

Git+jenkins+rex presentation

BBC's GraphDB (formerly Owlim) AWS Cloud Migration

Introduction to Node.js

Build a chatroom!

Add a backend and deploy!

Are you using an opensource library? There's a good chance you are vulnerable...

RSYSLOG v8 improvements and how to write plugins in any language.

CTU June 2011 - C# 5.0 - ASYNC & Await

TDD for joomla extensions

openSUSE Conference 2017 - The Docker at Travis Presentation

grifork - fast propagative task runner -

Intro to Node.js

Rust Programming Language

Running of nist test

"fireap" - fast task runner on consul

Deep drive into rust programming language

DevOps in realtime

ruby + websocket + haproxy

Similaire à Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore)

Parallelizing CI using Docker Swarm-ModeAkihiro Suda

Introduction to containersNitish Jadia

Deploying to DigitalOcean With GitHub ActionsDigitalOcean

Gr8conf EU 2013 Speed up your development: GroovyServ and Grails Improx PluginYasuharu Nakano

Distributed ElixirÓscar De Arriba González

Containers: from development to production at DevNation 2015Jérôme Petazzoni

MERIMeeting du 27 mai 2014 - Parallel ProgrammingOlivier NAVARRE

Hands-on Lab: Red Hat Container Development & OpenShiftAmazon Web Services

Let's Talk Locks!C4Media

Mono RepoZacky Pickholz

StorageOS, Storage for Containers Shouldn't Be Annoying at Container Camp UKStorageOS

Hashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdfssuser705051

Terraform-2.pdfrutiksankapal21

Build optimization mechanisms in GitLab and DockerDmytro Patkovskyi

Codeception: introduction to php testing (v2 - Aberdeen php)Engineor

Demo 0.9.4eTimeline, LLC

It's always sunny with OpenJ9DanHeidinga

Docker 102 - Immutable InfrastructureAdrian Otto

The State of the Veil FrameworkVeilFramework

Software TestingAndrew Wang

Similaire à Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore) (20)

Parallelizing CI using Docker Swarm-Mode

Introduction to containers

Deploying to DigitalOcean With GitHub Actions

Gr8conf EU 2013 Speed up your development: GroovyServ and Grails Improx Plugin

Distributed Elixir

Containers: from development to production at DevNation 2015

MERIMeeting du 27 mai 2014 - Parallel Programming

Hands-on Lab: Red Hat Container Development & OpenShift

Let's Talk Locks!

Mono Repo

StorageOS, Storage for Containers Shouldn't Be Annoying at Container Camp UK

Hashicorp-Terraform-Deep-Dive-with-no-Fear-Victor-Turbinsky-Texuna.pdf

Terraform-2.pdf

Build optimization mechanisms in GitLab and Docker

Codeception: introduction to php testing (v2 - Aberdeen php)

Demo 0.9.4

It's always sunny with OpenJ9

Docker 102 - Immutable Infrastructure

The State of the Veil Framework

Software Testing

Dernier

How to convert PDF to text with Nanonetsnaman860154

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

🐬 The future of MySQL is Postgres 🐘RTylerCroy

[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Histor y of HAM Radio presentation slidevu2urc

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

Evaluating the top large language models.pdfChristopherTHyatt

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Dernier (20)

How to convert PDF to text with Nanonets

GenCyber Cyber Security Day Presentation

Axa Assurance Maroc - Insurer Innovation Award 2024

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

IAC 2024 - IA Fast Track to Search Focused AI Solutions

Boost PC performance: How more available memory can improve productivity

🐬 The future of MySQL is Postgres 🐘

[2024]Digital Global Overview Report 2024 Meltwater.pdf

Strategies for Landing an Oracle DBA Job as a Fresher

Handwritten Text Recognition for manuscripts and early printed texts

Histor y of HAM Radio presentation slide

presentation ICT roal in 21st century education

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

Evaluating the top large language models.pdf

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

What Are The Drone Anti-jamming Systems Technology?

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Powerful Google developer tools for immediate impact! (2023-24 C)

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore)

1. Distributed locks KnapsackPro.com API case study Artur Trzop

2. Start with a problem

3. But know the first

4. CircleCI container Execute Rspec tests Run all tests in single container

5. First container Execute Rspec tests Second container Execute Rspec tests Split tests between container CI build for git commit

6. First container Execute Rspec tests fetched from work queue Second container Execute Rspec tests fetched from work queue Dynamic tests split between container with knapsack_pro gem Tests in work queue on KnapsackPro.com API server

7. First requests from any container should create a new work queue for the git commit def test_files create_queue unless queue_exists? test_files_from_top_of_the_queue end

8. When this code can happen at the same time? ● development - webrick is single threaded (bug won’t happen) ● Production (bug happens): ○ unicorn (multiple processes) ○ puma (multiple threads and/or multiple processes)

9. What is distributed lock? synchronize access to shared resources It means different processes must operate with shared resources in a mutually exclusive way

10. Why you want a lock in a distributed application? ● Efficiency ○ Avoid expensive computation ○ Saves time & money

11. Why you want a lock in a distributed application? ● Correctness ○ Prevent corrupted data ○ Prevent data loss ○ Prevent data inconsistency

12. What tool I could use? https://github.com/dv/redis-semaphore

13. How I solve the problem? ● Write tests to do concurrent requests against staging ○ Requests in separate threads ● Verify if the problem exists after implementing fix

14. Implementation fix def test_files semaphore = Redis::Semaphore.new(:semaphore_for_ci_build_id, host: "localhost") semaphore.lock(5) do create_queue unless queue_exists? end test_files_from_top_of_the_queue end

15. What should I remember ● Locks are hard ● Distributed locks are even harder ● Don't reinvent the wheel, use proven solutions ● Most web apps are not thread-safe due to missing locks. ○ Expect edge cases while you grow.

16. Another example def save build = find_build || new_build do_something_complex_with_build build.save end

17. With distributed lock def save semaphore.lock(5) do build = find_build || new_build do_something_complex_with_build build.save end end

18. What distributed lock allows us? ● When single redis DB ○ You can have multiple unicorn/puma processes ○ You can have multiple machines if each use the same redis DB

19. What distributed lock allows us? ● When multiple redis DBs ○ Use library that supports Redlock algorithm ■ redlock-rb gem ■ https://redis.io/topics/distlock ● Is Redlock perfect? Kind of, because it has timing assumptions. https://martin.kleppmann.com/2016/02/08/how-to-do-distributed-locking.html Response from Redis author http://antirez.com/news/101

20. Tips ● Be aware of trade-off. Do you care about efficiency or correctness? ● Test your endpoints with concurrent requests to reproduce problem ● Use transactions when changing many records ● Use unique index to ensure data consistency in DB ● Use tested distribution lock solutions

21. Nice to read https://makandracards.com/makandra/31937-differences-between-transactions-an d-locking

22. Thanks

Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore)

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore)

Similaire à Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore) (20)

Dernier

Dernier (20)

Distributed locks in Ruby - Correctness vs Efficiency - Knapsack Pro case study (mutex, semaphore)