How Shopify Is Scaling Up Its Redis Message Queues
• One of the oldest and largest Ruby on Rails monoliths
• 1000+ developers
• 1000 Pull Requests per day
• 170K peak RPS
• 2 billion background jobs processed per day
Background Jobs at Shopify
Architecture Overview
Multi-Tenancy, Flash Sales
Scalability Problems & Solutions
Performance Bottlenecks and Horizontal Scalability
• Asynchronous Units of Work
• Email
• Webhooks
• Checkout and payment processing
• Backfills, maintenance tasks
• Schema Migrations
• Our own library Hedwig
• Ruby
• Similar to Resque, but better fitted to our architecture
• Queues as Redis Lists
Hello everyone, my name is Moe and I work at Shopify. I’m excited to speak for the second time at Redis Day and share with you some of the challenges and insights we have at Shopify.
Shopify is the leading omni-channel commerce platform. Merchants use Shopify to design, set up, and manage their stores across multiple sales channels, including mobile, web, social media, marketplaces, brick-and-mortar locations, and pop-up shops.
We allow anyone to sell anywhere.
On the technical side of things:
We are one of the oldest and largest Ruby on Rails monoliths
We have over a thousand developers
We merge over a thousand pull requests every day
We process 170 thousand requests per second at peak times
We process 2 billion background jobs per day, which we will dive into in my talk today
The main focus of my talk today is going to be how we use Redis as our background job queue, the scalability challenges that come with that, and how we solve some of those challenges.
Background jobs are a common pattern in web development. They let us encapsulate a process or unit of work and execute it asynchronously, outside the web request cycle.
At Shopify, we use them quite a lot, from sending emails, to processing webhooks, delaying the processing of payments and checkouts for speedup, as well as for maintenance and backfills. We even use them as the backing engine for running our database schema migrations.
This logic is all encapsulated inside our own library called Hedwig. It started out as a fork of the Resque library with some patches, but we started diverging enough that it made more sense to own our own library, for simplicity and performance.
From the Redis perspective, we persist queues as lists, which represent our main use case, but we also use many other Redis data types for various metadata, such as worker heartbeats and uniqueness locks.
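To make the queue-as-list idea concrete, here is a minimal sketch of the enqueue/dequeue flow. This is not Hedwig's actual code: `FakeQueueRedis` is a stand-in for a real Redis client so the example runs without a server (with the `redis` gem, the equivalent calls would be `lpush` and `rpop`/`brpop`), and the job payload shape is hypothetical.

```ruby
# Illustrative sketch of queue-as-Redis-list semantics (not Hedwig's
# real implementation). FakeQueueRedis stands in for a Redis client.
require "json"

class FakeQueueRedis
  def initialize
    @lists = Hash.new { |h, k| h[k] = [] }
  end

  # Mimics LPUSH: prepend a value to the list stored at `key`.
  def lpush(key, value)
    @lists[key].unshift(value)
  end

  # Mimics RPOP: remove and return the last element (oldest enqueued).
  def rpop(key)
    @lists[key].pop
  end
end

# Enqueue: serialize the job payload and LPUSH it onto the queue list.
def enqueue(redis, queue, job_class, args)
  redis.lpush("queue:#{queue}", JSON.dump("class" => job_class, "args" => args))
end

# Dequeue: RPOP the oldest payload off the list, giving FIFO order.
def dequeue(redis, queue)
  payload = redis.rpop("queue:#{queue}")
  payload && JSON.parse(payload)
end

redis = FakeQueueRedis.new
enqueue(redis, "default", "SendEmailJob", [42])
job = dequeue(redis, "default")
puts job["class"] # => SendEmailJob
```

In production, workers would use the blocking `BRPOP` rather than polling `RPOP`, so they sleep until a job arrives instead of spinning.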
Flash sales are a big thing on Shopify.
These are sales, scheduled or unannounced, in which a very limited quantity of highly sought-after items goes on sale. These sales drive huge amounts of traffic to the platform.
But rather than steer our merchants away from them, we fully embraced them as a feature and as a way of continually building resiliency and scalability into our platform.
These sales can drive orders of magnitude more traffic to a single merchant, which our platform needs to respond to gracefully. One key trait of this traffic is that it’s very write-heavy: a lot of bookkeeping operations need to happen during a checkout, such as updating inventory and persisting checkout and user information.
Historically, Shopify started out as a simple, small Ruby on Rails monolith, with:
a single Redis instance supporting the background job queue,
a single MySQL instance as the main persistent store, and
workers processing both web requests and jobs.
With growing traffic, we started running into some scalability concerns.
For a while, it was possible to scale up these operations by simply getting a bigger database with more CPU power, but eventually this started posing resiliency concerns as well because a single database means a single point of failure.
So we had to look into horizontal scaling. And the first candidate for horizontal scaling was MySQL. We partitioned our MySQL instances into shards.
These shards share the exact same schema but contain different subsets of merchants and their data. A single shop belongs to a single shard, and a shard contains multiple shops.
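As a purely illustrative sketch of the shop-to-shard property described above (Shopify's real routing is more sophisticated than this; `NUM_SHARDS` and `shard_for` are hypothetical names), the key invariant is that the mapping is stable: the same shop always resolves to the same shard.

```ruby
# Hypothetical shop-to-shard routing sketch. The invariant that matters:
# a shop maps to exactly one shard, and that mapping never changes
# between calls, while a shard holds many shops.
NUM_SHARDS = 8

def shard_for(shop_id)
  shop_id % NUM_SHARDS  # stable: same shop id, same shard, every time
end

puts shard_for(1001)               # => 1
puts shard_for(1001) == shard_for(1001)  # => true
```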
Scaling up workers is usually not as big of a problem, and we are able to do that by provisioning more nodes.
So at this point in time, about 3 years ago, we had a tested mechanism for scaling up both our compute power and our MySQL cluster through sharding. However, we still had no way of scaling Redis, and this started to cause issues.
So we decided to piggyback on this concept and apply the same partitioning of shops to the Redis instances. A single MySQL and Redis partition is what we call a Shopify Pod.
These pods ensure that each subset of merchants has its own isolated, dedicated MySQL and Redis instances. Having a single Redis instance dedicated to a smaller set of shops means we can get much more throughput out of it.
Workers are not podded to allow for capacity sharing and elasticity across multiple pods.
However, in the past year, we’ve been starting to hit the limit of this scaling strategy. We are now at the point where a single merchant can drive enough traffic to hit the limit of a single Redis instance. This is mainly due to the large amount of queuing and dequeueing operations that happen on a single queue in a given Shopify Pod.
However, this is also due to some inefficient usage patterns that we found in our codebase.
We sometimes see some other symptoms such as latency spikes that lead to cascading failures and a degraded state of the platform.
To help us recover from these states, we have circuit breakers in place that allow us to fail fast and give the Redis instance a chance to recover. A circuit breaker is a software encapsulation of a given resource (like a Redis client), which keeps track of failure metrics and blocks access to that given resource if it fails more than a given threshold. This allows highly critical resources to dynamically get disconnected from sources of load when under pressure. This will hopefully allow that resource to recover faster.
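The breaker described above can be sketched in a few lines of Ruby. This is a minimal illustration of the pattern, not Shopify's production implementation (Shopify has open-sourced a production-grade take on this idea as Semian); the class and parameter names here are hypothetical.

```ruby
# Minimal circuit-breaker sketch around a protected resource call.
class CircuitBreaker
  class OpenCircuitError < StandardError; end

  def initialize(error_threshold:, recovery_timeout:)
    @error_threshold  = error_threshold   # failures before the circuit opens
    @recovery_timeout = recovery_timeout  # seconds before we try again
    @failures  = 0
    @opened_at = nil
  end

  # Wraps a call to the resource (e.g. a Redis command).
  def call
    if open?
      # Fail fast instead of piling more load onto a struggling resource.
      raise OpenCircuitError if Time.now - @opened_at < @recovery_timeout
      @opened_at = nil  # half-open: let one trial request through
    end
    result = yield
    @failures = 0  # a success resets the failure count
    result
  rescue OpenCircuitError
    raise
  rescue StandardError
    @failures += 1
    @opened_at = Time.now if @failures >= @error_threshold
    raise
  end

  def open?
    !@opened_at.nil?
  end
end
```

Once the failure threshold is crossed, every call fails immediately with `OpenCircuitError` until the recovery timeout elapses, which is exactly the "disconnect sources of load under pressure" behavior described above.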
This is potentially costly. Although the access patterns are programmed with fallbacks, having open circuits can occasionally lead to inconsistent state and cause our merchants and our developers a lot of trouble to fix.
When a Ruby exception occurs in Shopify, we need to generate a payload with some metadata and send it to Bugsnag, a service we use to aggregate metrics on exceptions. This requires a call to the Bugsnag API, which we execute in a background job.
This was a fine use case for background jobs on top of Redis, but during massive flash sales that caused spikes in exceptions, it meant that the Redis instance, already drowning in exception-reporting jobs, had even less capacity to perform queueing and dequeueing for critical application operations.
An important trait of this background job is that it has no dependency on Rails or Ruby itself. It’s simply an abstraction around an HTTP call.
For this reason, we deemed that a message streaming bus was better suited for this use case. Especially since we already have operational expertise with Kafka with a dedicated team maintaining it.
So we built a simple Kafka consumer in Go and made our web and job workers produce payloads to a Kafka topic instead.
The consumer then took care of relaying those messages to Bugsnag. By doing this, we freed up around 25% CPU capacity during peak loads on Redis for job queueing and dequeuing.
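The producer side of this change can be sketched as follows. This is illustrative only: `FakeProducer` stands in for a real Kafka client (such as the rdkafka gem's producer), and the topic name and payload fields are hypothetical, not Shopify's actual schema.

```ruby
# Sketch of the reporting path after the change: instead of enqueuing a
# Redis-backed job, the exception payload is produced to a Kafka topic.
require "json"

class FakeProducer
  attr_reader :messages

  def initialize
    @messages = []
  end

  # Stand-in for a real Kafka produce call.
  def produce(topic:, payload:)
    @messages << [topic, payload]
  end
end

def report_exception(producer, error)
  payload = JSON.dump(
    "class"     => error.class.name,
    "message"   => error.message,
    "backtrace" => (error.backtrace || []).first(10)
  )
  # Producing to Kafka keeps this load entirely off the job-queue Redis;
  # a separate consumer relays the payloads to Bugsnag.
  producer.produce(topic: "exception-reports", payload: payload)
end

producer = FakeProducer.new
begin
  raise ArgumentError, "boom"
rescue => e
  report_exception(producer, e)
end

topic, payload = producer.messages.first
puts topic                         # => exception-reports
puts JSON.parse(payload)["class"]  # => ArgumentError
```

This works precisely because the job had no dependency on Rails or Ruby state: it is just a payload plus an HTTP call, so any consumer in any language can drain the topic.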
The next problematic pattern was how we handled capacity sharing. In a given cluster, all job workers connect to all Redis instances and process jobs from each one in a round-robin fashion. This is great to share load across multiple workers, but this means that each Redis instance needs to maintain a connection to thousands of workers in a cluster.
We noticed that an estimated 20% of Redis CPU time was spent on connection handling.
Our first attempt at mitigating this was by isolating subsets of Redis instances with smaller dedicated worker pools, which allowed for less capacity sharing between pods, but also reduced the number of connections per Redis instance drastically.
However, a truly future-proof solution for this was to use a proxy. We are currently in the process of deploying Envoy as a proxy. This comes with the many benefits of proxies:
Solves the problem of having too many connections, since the Proxy can maintain a connection pool with a large deployment of workers and reduce overhead on the upstream Redis servers (also keeps connections alive)
Allows us to distribute load across multiple Redis instances
Allows for high availability by dynamically routing commands to a “master” and failing over to replicas
Some background jobs written by our developers need to run uniquely: we don’t want multiple instances of certain jobs running at the same time.
For this, we use Redis to store uniqueness locks. Before a job worker processes a job, it acquires the lock for that job, processes it, and then releases the lock.
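The acquire/process/release cycle can be sketched like this. It is not Hedwig's real locking code: `FakeLockRedis` mimics Redis's `SET key value NX` semantics in memory so the example runs anywhere, and in production the lock would also carry an `EX` TTL so a crashed worker cannot hold it forever.

```ruby
# Illustrative uniqueness lock over SET-NX semantics (not Hedwig's
# actual implementation).
class FakeLockRedis
  def initialize
    @store = {}
  end

  # Mimics `SET key value NX`: only sets the key if it is absent,
  # returning whether the set happened. (A real lock would add EX
  # for a TTL safety net against crashed workers.)
  def set_nx(key, value)
    return false if @store.key?(key)
    @store[key] = value
    true
  end

  def del(key)
    @store.delete(key)
  end
end

def with_unique_lock(redis, job_key)
  # If another instance of this job holds the lock, skip this run.
  return :skipped unless redis.set_nx("lock:#{job_key}", "1")
  begin
    yield
    :ran
  ensure
    redis.del("lock:#{job_key}")  # release so the next instance can run
  end
end

redis = FakeLockRedis.new
puts with_unique_lock(redis, "Inventory:42") { }  # => ran

# While a lock is held, a second instance of the same job is skipped:
redis.set_nx("lock:Inventory:42", "1")
puts with_unique_lock(redis, "Inventory:42") { }  # => skipped
```

Each job run costs at least two extra Redis round trips (acquire and release), which is why this pattern became a significant CPU overhead during flash sales.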
A key thing we found was that many background jobs generated during flash sales used this pattern. Because of that, these locking operations carry a significant CPU overhead during flash sales.
We decided to dedicate entirely separate instances for these locking operations.
We were able to come up with a zero-downtime migration scheme that allowed us to safely transition our locking operations to a separate Redis instance. This means the Redis instance persisting queues is now entirely free of locking operations.
Ultimately, no matter how well we optimize our CPU usage of Redis, we also anticipate that the flash sales on our platforms are only going to get bigger. This means that performing all enqueuing and dequeueing operations on a single Redis is eventually going to overwhelm that instance entirely.
For that reason, we are exploring ways of distributing the job queues themselves across multiple Redis instances.
This first and easy way is to assign each job queue a separate instance.
The downside of this approach is that, depending on the number of queues (currently about a dozen), it could carry a huge operational overhead, since we would have to manage an order of magnitude more Redis instances.
Another approach is to horizontally distribute every single queue across a fixed number of Redis instances per pod.
This is nice because we can achieve equal load between all Redis instances. Selecting an instance can either be done at the worker-level by making Ruby workers partition-aware, or we can leverage our Envoy proxy to distribute commands across our partitions.
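One way to make workers partition-aware, sketched here purely as an illustration (the constant and function names are hypothetical, and Shopify's eventual scheme may differ), is to hash each job onto one of a fixed number of partitions per pod:

```ruby
# Illustrative partition-aware routing: each enqueue deterministically
# picks one of a fixed number of Redis partitions for the queue.
require "zlib"

PARTITIONS_PER_POD = 4

# Hashing by job id spreads load roughly evenly across partitions,
# while any given job always maps to the same partition.
def partition_for(job_id)
  Zlib.crc32(job_id.to_s) % PARTITIONS_PER_POD
end

p1 = partition_for("job-123")
puts p1 == partition_for("job-123")       # => true (stable)
puts (0...PARTITIONS_PER_POD).cover?(p1)  # => true (in range)
```

Workers would then pop from all partitions of a queue (for example, round-robin), so no single Redis instance has to absorb an entire pod's enqueue and dequeue traffic on its own.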
In conclusion, we think that scaling your Redis infrastructure is really about knowing your usage patterns well. In our case, the driving factor has been single-tenant traffic during flash sales.
This has forced us to look deeper into our Redis use cases, and evaluate the way forward in each case.
In the case of asynchronous error reporting through HTTP, we leveraged Kafka.
To deal with an overload of connections from a growing number of workers, we decided to employ a proxy.
For locking, we provisioned more Redis instances for that specific use case.
Finally we are in the early stages of horizontally scaling the queueing operations themselves across multiple Redis instances.
If you have any questions or would like to chat, I’ll be around for the rest of the day and would be happy to talk!
Thanks!