Continuous Deployment with Cassandra

Continuous deployment of Cassandra, plus general best practices for deploying, maintaining, and upgrading Cassandra in production

Continuous Deployment with C*:
Treating C* as First-Class Code
Michael Kjellman
@mkjellman
Software Engineer, Barracuda Networks

C* At Barracuda
• Powers 100% of our Spam and Webfilter Backend
• 48 Node Cluster
• 2 Datacenters
• Requests: 20k writes/sec, 30k reads/sec
• Latency: 1 ms/write, 1.6 ms/read
• > 30TB of Data
• Almost entirely native protocol/CQL3

Hardware Configuration
• 32GB of RAM
• 1x SSD
• 2x Spinning Disks
• 2x 6 Core AMD

Key Configuration Options
• key_cache_size_in_mb: 1024
• row_cache_size_in_mb: 0
• memtable_total_space_in_mb: 2048
• HEAP_NEWSIZE="1200M" (-Xmn)
• MAX_HEAP_SIZE="8G" (-Xmx)
• -XX:SurvivorRatio=6
• Sidenote: Java 7u40 is out!
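
As a rough illustration of what the JVM flags above imply, the arithmetic below works out how a 1200 MB young generation splits under -XX:SurvivorRatio=6. It assumes the standard HotSpot layout (one eden space plus two survivor spaces, with SurvivorRatio being the ratio of eden to a single survivor space). The function name is made up for this sketch, and real sizes may differ slightly because the JVM aligns regions.

```python
def young_gen_layout(new_size_mb: float, survivor_ratio: int) -> dict:
    """Split the young generation into eden and two survivor spaces.

    HotSpot defines SurvivorRatio as eden / one survivor space, so the
    young generation consists of (survivor_ratio + 2) equal parts.
    """
    survivor_mb = new_size_mb / (survivor_ratio + 2)
    eden_mb = survivor_mb * survivor_ratio
    return {"eden_mb": eden_mb, "survivor_mb": survivor_mb}


if __name__ == "__main__":
    # Values from the slide: HEAP_NEWSIZE="1200M", -XX:SurvivorRatio=6
    print(young_gen_layout(1200, 6))
```

With the slide's values this gives roughly 900 MB of eden and 150 MB per survivor space, out of an 8 GB heap.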

How do I keep my graphs pretty during
a C* upgrade?
September 18th 2013

Make a C* Build
$> git clone http://git-wip-us.apache.org/repos/asf/cassandra.git
$> cd cassandra
$> git checkout -t origin/cassandra-1.2
$> git log
$> vim build.xml (change the version number every time you make a build!)
$> ant clean release
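
The manual build.xml edit is easy to forget, so it is a natural candidate for scripting. The sketch below is one possible way to do it, not the process used in the talk. It assumes the version lives in a `<property name="base.version" value="..."/>` element in build.xml; check your checkout before relying on that. The function name and the version suffix in the usage comment are hypothetical.

```python
import re

# Assumption: build.xml declares the version as
#   <property name="base.version" value="..."/>
_VERSION_PROPERTY = re.compile(
    r'(<property\s+name="base\.version"\s+value=")[^"]*(")'
)


def bump_base_version(build_xml: str, new_version: str) -> str:
    """Return the build.xml text with base.version set to new_version."""
    updated, count = _VERSION_PROPERTY.subn(
        # A function replacement avoids backreference clashes with digits
        # in the version string.
        lambda m: m.group(1) + new_version + m.group(2),
        build_xml,
    )
    if count != 1:
        raise ValueError(
            f"expected exactly one base.version property, found {count}"
        )
    return updated


# Hypothetical usage:
#   text = open("build.xml").read()
#   open("build.xml", "w").write(bump_base_version(text, "1.2.9-custom1"))
```

Failing loudly when the property is missing or duplicated keeps a silent no-op from producing two builds with the same version number.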

Deployment
• Make release
• Test release with CCM
• Push release to Puppet (deals with config, etc)
• Run a controlled, scripted rolling restart, one datacenter
at a time
– flush
– stop
– start
– validate node
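
The per-node loop above (flush, stop, start, validate) can be sketched as follows. This is a minimal illustration under stated assumptions, not the script used at Barracuda: the `service cassandra` commands, the use of ssh, and the host and datacenter names are all hypothetical, and the validation check is left to the caller.

```python
import subprocess
from typing import Callable, Dict, List, Sequence

Runner = Callable[[str, Sequence[str]], None]


def ssh_runner(host: str, command: Sequence[str]) -> None:
    """Run a command on a host over ssh and raise if it exits non-zero."""
    subprocess.run(["ssh", host, *command], check=True)


# Per-node steps from the slide. The service commands are assumptions and
# depend on how Cassandra is installed on your hosts.
NODE_STEPS: List[Sequence[str]] = [
    ("nodetool", "flush"),
    ("sudo", "service", "cassandra", "stop"),
    ("sudo", "service", "cassandra", "start"),
]


def rolling_restart(
    datacenters: Dict[str, List[str]],
    validate: Callable[[str], bool],
    run: Runner = ssh_runner,
) -> None:
    """Restart nodes one at a time, finishing one datacenter before the next.

    Stops at the first node that fails validation so a bad build does not
    roll across the rest of the cluster.
    """
    for dc_name, hosts in datacenters.items():
        for host in hosts:
            for step in NODE_STEPS:
                run(host, step)
            if not validate(host):
                raise RuntimeError(
                    f"node {host} in {dc_name} failed validation; aborting"
                )


# Hypothetical usage:
#   rolling_restart({"dc1": ["cass01", "cass02"], "dc2": ["cass25"]},
#                   validate=my_health_check)
```

Passing the runner and the validation check in as parameters keeps the ordering logic testable without touching a real cluster.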

So, why not just
apt-get install cassandra?
• Makes running a custom release in the future a
complete nightmare
• You lose visibility into the changes in the release
• WHY are you upgrading?
• Treat a C* build just as if it were a release of your
code. What commits did you put into your own
release?

Simply Put:
MY CODE DOESN’T WORK WITHOUT A
STABLE C* CLUSTER

When things go wrong
• Every commit (by C* committers or by me)
comes with potential bugs and regressions
• Gossip Bugs Can Bite Hard:
– CASSANDRA-5665: Gossiper.handleMajorStateChange
can lose existing node ApplicationState
• At 48 nodes, even small mistakes are massive

Writing your code to deal with node
failure
• Upgrading a C* cluster means constant node
failures for the duration of the rolling restart
• How does your code deal with read latency and
retries?
– CASSANDRA-4705: Eager Retries for reads for 2.0+
• The mythical “constantly failing” code != stability.
– Handle exceptions (and node/read failures) gracefully!

Why treat C* like your own code
• Using C* will move much of your own
application logic to C*
• The bugs have to go somewhere!
• Data replication at database layer or at
application layer


9. Automate, Automate, Automate

15. QUESTIONS? Thanks for Listening!
