Webinar | Building Apps with the Cassandra Python Driver

•Download as PPTX, PDF•

1 like•2,856 views

With the new Python driver for Cassandra it is easy to build integrations and apps that use Cassandra seamlessly as a back in. This session will explore what it takes to build the app and the features available with the new Python drivers.

Technology

Building Apps with the Cassandra Python Driver
Eddie Satterly–
CTO Big Data & Analytics at CSC
Dial In: 1-877-668-4493
Access Code: 807 224 168

Where is the Driver
https://github.com/datastax/python-driver

Key Features
The driver is a connection handler for the Cassandra system
underneath your app with a low-level API. The key features which
really helped simplify the python code from the earlier version of the
app are:
Connection Pooling & Node Discovery – This lets you
connect to the whole set of nodes providing only the seed nodes
in your list. With my old driver you had to provide the list of all
nodes and make the python code decide how to connect.
You give it this set of nodes 192.168.1.1 & 192.168.1.2 and the
driver makes a connection and automatically discovers all other
nodes in the cluster instance.

Key Features Cont.
Cluster Attributes – There are several cluster object attributes you can
set but some of the key ones are the ability to set a default keyspace via
the method cluster.connect(‘mykeyspace’) as well as setting the CQL
version for cluster that run in mixed mode due to different timing of data
models being built also metrics_enabled which controls metrics collection
SSL_Options – This attribute is called out separately due to the high
value of this in environments where client to node communication needs
to be encrypted and that feature is turned on cluster side. While this is not
turned on by default in my app it is needed for many of the customers that
are using it.
Load balancing – This is a great added feature that really helps to avoid
hotspot nodes in the older driver approach as now you set the policy in an
attribute (roundrobin is the default) and the driver controls connection. In
early test with the old driver even though the code was supposed to pick a
pseudo-random node affinity seemed to happen and creat hotspot nodes
for queries.

Key Features Cont.
default_timeout– Setting a timeout so that the app can detect failures and respond
without leaving the client hanging is key
row_factory – This lets you determine what format to return the results in. This is
super valuable to make sure your app has the data returned in the optimal way for
analysis and manipulation. There were over 50 lines on code in my old python scripts
to handle one-offs that are now gone since this feature exists. Below are the options:
execute_async() – This is one of the best features in the new driver and
makes the processing time for requests much faster from the client PoV.
There is a method to call to force blocking for results to this if needed
but in most cases doing other work while waiting on results providers
speeds up the response times by many milliseconds.

Take a Look at Docs
There are many other features I did not call out so take a look
at:
http://datastax.github.io/python-driver/index.html
http://datastax.github.io/python-driver/api/index.html
For high throughput operations like remote lookups I
highly suggest using multiprocessing module instead of
using multithreading, but make sure you understand the
implication with object passing.

How I Use It
Take a look at my github in a couple of weeks the new version of the
app will be there using this driver once all the final testing is done.
The current version there is using the old driver and approach so
look for v2.0
https://github.com/esatterly/splunk-cassandra
Build your own playgrounds and figure out the right options and configuration
settings to return data and do analysis and manipulation on it. I will be putting two
other apps out in the next few months for other non-Splunk use cases as well so
stay tuned.

What's hot

Hardening cassandra for compliance or paranoiazznate

Transforms Document Management at Scale with Distributed Database Solution wi...DataStax Academy

Oracle to Cassandra Core Concepts Guid Part 1: A new hopeDataStax

From PoCs to ProductionDataStax

Webinar: Eventual Consistency != Hopeful ConsistencyDataStax

Cassandra Development Nirvana DataStax

Webinar : Nouveautés de MongoDB 3.2MongoDB

Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandraaaronmorton

Reporting from the Trenches: Intuit & CassandraDataStax

Azure + DataStax Enterprise Powers Office 365 Per User StoreDataStax Academy

Making Every Drop Count: How i20 Addresses the Water Crisis with the IoT and ...DataStax

Ruby Driver Explained: DataStax Webinar May 5th 2015DataStax

Shift: Real World Migration from MongoDB to CassandraDataStax

Target: Performance Tuning Cassandra at TargetDataStax Academy

RedisConf18 - Remote Monitoring & Controlling Scienific InstrumentsRedis Labs

DataStax: How to Roll Cassandra into Production Without Losing your Health, M...DataStax Academy

Capital One: Using Cassandra In Building A Reporting PlatformDataStax Academy

Real-time personal trainer on the SMACK stackAnirvan Chakraborty

Datadog: a Real-Time Metrics Database for One Quadrillion Points/DayC4Media

RedisConf18 - Open Source Built for Scale: Redis in Amazon ElastiCache ServiceRedis Labs

What's hot (20)

Hardening cassandra for compliance or paranoia

Transforms Document Management at Scale with Distributed Database Solution wi...

Oracle to Cassandra Core Concepts Guid Part 1: A new hope

From PoCs to Production

Webinar: Eventual Consistency != Hopeful Consistency

Cassandra Development Nirvana

Webinar : Nouveautés de MongoDB 3.2

Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandra

Reporting from the Trenches: Intuit & Cassandra

Azure + DataStax Enterprise Powers Office 365 Per User Store

Making Every Drop Count: How i20 Addresses the Water Crisis with the IoT and ...

Ruby Driver Explained: DataStax Webinar May 5th 2015

Shift: Real World Migration from MongoDB to Cassandra

Target: Performance Tuning Cassandra at Target

RedisConf18 - Remote Monitoring & Controlling Scienific Instruments

DataStax: How to Roll Cassandra into Production Without Losing your Health, M...

Capital One: Using Cassandra In Building A Reporting Platform

Real-time personal trainer on the SMACK stack

Datadog: a Real-Time Metrics Database for One Quadrillion Points/Day

RedisConf18 - Open Source Built for Scale: Redis in Amazon ElastiCache Service

Viewers also liked

How much money do you lose every time your ecommerce site goes down?DataStax

Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...DataStax

Cassandra Community Webinar: Back to Basics with CQL3DataStax

Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...DataStax

Cassandra Community Webinar | In Case of Emergency Break GlassDataStax

Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...DataStax

Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerceDataStax

Webinar: Don't Leave Your Data in the DarkDataStax

Webinar | Introducing DataStax Enterprise 4.6DataStax

Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...DataStax

Cassandra TK 2014 - Large Nodesaaronmorton

Webinar: Getting Started with Apache CassandraDataStax

Webinar | From Zero to 1 Million with Google Cloud Platform and DataStaxDataStax

Webinar: 2 Billion Data Points Each DayDataStax

Cassandra Community Webinar | Make Life Easier - An Introduction to Cassandra...DataStax

Webinar: Building Blocks for the Future of TelevisionDataStax

Webinar: DataStax Training - Everything you need to become a Cassandra RockstarDataStax

Webinar: Diagnosing Apache Cassandra Problems in ProductionDataStax Academy

DataStax Training – Everything you need to become a Cassandra RockstarDataStax

How To Tell if Your Business Needs NoSQLDataStax

Viewers also liked (20)

How much money do you lose every time your ecommerce site goes down?

Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...

Cassandra Community Webinar: Back to Basics with CQL3

Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...

Cassandra Community Webinar | In Case of Emergency Break Glass

Webinar | How Clear Capital Delivers Always-on Appraisals on 122 Million Prop...

Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce

Webinar: Don't Leave Your Data in the Dark

Webinar | Introducing DataStax Enterprise 4.6

Cassandra Community Webinar | Practice Makes Perfect: Extreme Cassandra Optim...

Cassandra TK 2014 - Large Nodes

Webinar: Getting Started with Apache Cassandra

Webinar | From Zero to 1 Million with Google Cloud Platform and DataStax

Webinar: 2 Billion Data Points Each Day

Cassandra Community Webinar | Make Life Easier - An Introduction to Cassandra...

Webinar: Building Blocks for the Future of Television

Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar

Webinar: Diagnosing Apache Cassandra Problems in Production

DataStax Training – Everything you need to become a Cassandra Rockstar

How To Tell if Your Business Needs NoSQL

Similar to Webinar | Building Apps with the Cassandra Python Driver

Google Cloud Next '22 Recap: Serverless & Data editionDaniel Zivkovic

Presentación11.pdfPabloCanesta

Surekha_haoop_expsurekhakadi

Strategies and Tips for Building Enterprise Drupal Applications - PNWDS 2013Mack Hardy

Operator SDK for K8s using GoCloudOps2005

Care and feeding notesPerrin Harkins

Cisco project ideasVIT University

Running Apache Spark on Kubernetes: Best Practices and PitfallsDatabricks

Build cloud native solution using open source Nitesh Jadhav

Function as a Servicerich fernandez

Multi-tenancy with RailsPaul Gallagher

Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)QAware GmbH

Open shift and docker - october,2014Hojoong Kim

System design for Web ApplicationMichael Choi

MuleSoft Manchester Meetup #3 slides 31st March 2020Ieva Navickaite

Continuous integration / continuous deliveryEatDog

PWA - The Future of eCommerce - Magento Meetup Ahmedabad 2018Bhavesh Surani

Challenges In Modern ApplicationRahul Kumar Gupta

What's New in Docker - February 2017Patrick Chanezon

AtoZ about TYPO3 v8 CMSNITSAN Technologies Pvt Ltd

Similar to Webinar | Building Apps with the Cassandra Python Driver (20)

Google Cloud Next '22 Recap: Serverless & Data edition

Presentación11.pdf

Surekha_haoop_exp

Strategies and Tips for Building Enterprise Drupal Applications - PNWDS 2013

Operator SDK for K8s using Go

Care and feeding notes

Cisco project ideas

Running Apache Spark on Kubernetes: Best Practices and Pitfalls

Build cloud native solution using open source

Function as a Service

Multi-tenancy with Rails

Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)

Open shift and docker - october,2014

System design for Web Application

MuleSoft Manchester Meetup #3 slides 31st March 2020

Continuous integration / continuous delivery

PWA - The Future of eCommerce - Magento Meetup Ahmedabad 2018

Challenges In Modern Application

What's New in Docker - February 2017

AtoZ about TYPO3 v8 CMS

Recently uploaded

TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc

Architecting Cloud Native ApplicationsWSO2

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays

Why Teams call analytics are critical to your entire businesspanagenda

Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services

Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays

CNIC Information System with Pakdata Cf In Pakistandanishmna97

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software

Elevate Developer Efficiency & build GenAI Application with Amazon QBhuvaneswari Subramani

Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@

Platformless Horizons for Digital AdaptabilityWSO2

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea

Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services

Apidays New York 2024 - The value of a flexible API Management solution for O...apidays

Recently uploaded (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

Architecting Cloud Native Applications

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...

Why Teams call analytics are critical to your entire business

Strategies for Landing an Oracle DBA Job as a Fresher

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...

CNIC Information System with Pakdata Cf In Pakistan

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME

Elevate Developer Efficiency & build GenAI Application with Amazon Q

Six Myths about Ontologies: The Basics of Formal Ontology

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

Platformless Horizons for Digital Adaptability

presentation ICT roal in 21st century education

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024

Vector Search -An Introduction in Oracle Database 23ai.pptx

Apidays New York 2024 - The value of a flexible API Management solution for O...

Webinar | Building Apps with the Cassandra Python Driver

1. Building Apps with the Cassandra Python Driver Eddie Satterly– CTO Big Data & Analytics at CSC Dial In: 1-877-668-4493 Access Code: 807 224 168

2. Where is the Driver https://github.com/datastax/python-driver

3. Key Features The driver is a connection handler for the Cassandra system underneath your app with a low-level API. The key features which really helped simplify the python code from the earlier version of the app are: Connection Pooling & Node Discovery – This lets you connect to the whole set of nodes providing only the seed nodes in your list. With my old driver you had to provide the list of all nodes and make the python code decide how to connect. You give it this set of nodes 192.168.1.1 & 192.168.1.2 and the driver makes a connection and automatically discovers all other nodes in the cluster instance.

4. Key Features Cont. Cluster Attributes – There are several cluster object attributes you can set but some of the key ones are the ability to set a default keyspace via the method cluster.connect(‘mykeyspace’) as well as setting the CQL version for cluster that run in mixed mode due to different timing of data models being built also metrics_enabled which controls metrics collection SSL_Options – This attribute is called out separately due to the high value of this in environments where client to node communication needs to be encrypted and that feature is turned on cluster side. While this is not turned on by default in my app it is needed for many of the customers that are using it. Load balancing – This is a great added feature that really helps to avoid hotspot nodes in the older driver approach as now you set the policy in an attribute (roundrobin is the default) and the driver controls connection. In early test with the old driver even though the code was supposed to pick a pseudo-random node affinity seemed to happen and creat hotspot nodes for queries.

5. Key Features Cont. default_timeout– Setting a timeout so that the app can detect failures and respond without leaving the client hanging is key row_factory – This lets you determine what format to return the results in. This is super valuable to make sure your app has the data returned in the optimal way for analysis and manipulation. There were over 50 lines on code in my old python scripts to handle one-offs that are now gone since this feature exists. Below are the options: execute_async() – This is one of the best features in the new driver and makes the processing time for requests much faster from the client PoV. There is a method to call to force blocking for results to this if needed but in most cases doing other work while waiting on results providers speeds up the response times by many milliseconds.

6. Take a Look at Docs There are many other features I did not call out so take a look at: http://datastax.github.io/python-driver/index.html http://datastax.github.io/python-driver/api/index.html For high throughput operations like remote lookups I highly suggest using multiprocessing module instead of using multithreading, but make sure you understand the implication with object passing.

7. How I Use It Take a look at my github in a couple of weeks the new version of the app will be there using this driver once all the final testing is done. The current version there is using the old driver and approach so look for v2.0 https://github.com/esatterly/splunk-cassandra Build your own playgrounds and figure out the right options and configuration settings to return data and do analysis and manipulation on it. I will be putting two other apps out in the next few months for other non-Splunk use cases as well so stay tuned.

8. Thank You Questions?

Webinar | Building Apps with the Cassandra Python Driver

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to Webinar | Building Apps with the Cassandra Python Driver

Similar to Webinar | Building Apps with the Cassandra Python Driver (20)

More from DataStax Academy

More from DataStax Academy (20)

Recently uploaded

Recently uploaded (20)

Webinar | Building Apps with the Cassandra Python Driver