The Use Case for Cassandra at Ping Identity
How and why Ping Identity uses the Cassandra database inside PingOne.
By
Michael Ward, Site Reliability Engineer, On-Demand
Ping Identity
mward@pingidentity.com
@devoperandi
History: Cassandra, like most things at Ping, started out as a trial run. We implemented reporting for PingOne on Cassandra and let it bake. We wanted to see what direction it was going, get our feet wet, and see how it fit in with existing and future projects. We were also experimenting with MongoDB, and there was a great debate between Cassandra and MongoDB. Cassandra won due to:
- Write-anywhere technology
- More servers, each with smaller capacity
- Geographic distribution for data redundancy, availability, and performance
- Horizontal scalability
- No single point of failure
Remember to mention our migration from Mongo by year end. We haven't performed this migration yet.
Why? Built to provide insight into PingOne.
Why? SaaS applications are known for not providing logging and reporting information to their customers. We wanted to change that, and we continue to build this functionality out.
Reports range from the number of successful and failed SSOs to unique user access per application over any period of time, going back up to a year (see the sketch below).
Same schema – the use case still fits.
Client = Hector on the Thrift API.
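As a rough illustration only, here is a minimal CQL sketch of what such a reporting table could look like. The table and column names are hypothetical; our production schema is accessed through Hector over the Thrift API, and the CQL3 syntax shown here only became available to us later.

    -- Hypothetical column family in the reporting keyspace:
    -- SSO events partitioned by application, clustered by time,
    -- so "last N days/months" report queries read one contiguous slice.
    CREATE TABLE sso_events (
        app_id     text,
        event_time timestamp,
        user_id    text,
        success    boolean,
        PRIMARY KEY (app_id, event_time)
    );

    -- Example report query: every SSO event for one application
    -- over a time window (up to a year back).
    SELECT user_id, success
      FROM sso_events
     WHERE app_id = 'app-123'
       AND event_time >= '2013-01-01'
       AND event_time <  '2014-01-01';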
Requirements:
- Geographically distributed
- Respectable performance
- No updates or deletes (repairs suck)
Benefits:
- Easy management, due to the requirements for the cluster
Limitations:
- One big ring
- Writes could start in DC1 and actually be written to DC2 (see the sketch below)
- Lopsided data
- No compression
- Reads were slow
- Nodes recovered over the WAN
- Lack of security
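For illustration, this is roughly what the "one big ring" layout implies at the keyspace level. The keyspace name and replication factor are hypothetical, and the example uses CQL3 syntax for readability even though this cluster predates it. With SimpleStrategy, replicas are placed by walking the single ring with no datacenter awareness, which is why a write coordinated in DC1 could end up stored in DC2.

    -- One big ring: SimpleStrategy ignores datacenters entirely.
    -- Replicas go to the next node(s) on the ring, wherever they live,
    -- so a row written in DC1 may have its replica(s) land in DC2.
    CREATE KEYSPACE reporting
      WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 2};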
This upgrade happened in two parts:
- First to v0.8
- Second to v1.1.2
After upgrading the cluster in place, we found out this wasn't a good idea:
- We missed out on compression
- Our data was still not evenly distributed
- Replication was set to one per DC
Started with 9 nodes in the cluster, with the intent to horizontally scale:
- 25-35% performance improvement on reads
- 5-10% performance improvement on writes
- Compression enabled a 50% reduction in data size
- Token offsets gave better data distribution
- Node recovery happened locally
- Multiple replicas per datacenter means we always read locally (see the sketch below)
- The first write always happens locally, so the application gets a faster response
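A sketch of the datacenter-aware layout this bought us, again in CQL3 syntax with hypothetical names and replica counts. NetworkTopologyStrategy places a full set of replicas in each datacenter, so reads and the first write can always be served locally, and node recovery streams from local peers instead of over the WAN.

    -- Replicas per datacenter (names and counts are illustrative).
    -- Each DC holds its own copies, so local reads/writes never have
    -- to cross the WAN, and rebuilds stream from local replicas.
    CREATE KEYSPACE reporting
      WITH replication = {'class': 'NetworkTopologyStrategy',
                          'DC1': 2,
                          'DC2': 2};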
Migration steps:
- Traffic first directed at the old cluster
- Take a snapshot of the cluster
- Push it to the new cluster
- Copy the schema from the old cluster to the new cluster
- Add Snappy compression (see the sketch below)
- Bulk load into the new cluster
- Switch traffic to the new cluster
- Replay logs from the central log server from the bulk-load time onward
Compression: we chose to stream the data into a new cluster to allow for compression. Steps: tar up the snapshot, push it to the new cluster, and stream it in using the bulk loader. Because we did this during the day, we knew consistency between the clusters would fall behind. We allowed this because we were able to replay the missed traffic into the cluster after the switch.
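The "copy schema and add Snappy compression" step amounts to recreating each column family on the new cluster with compression turned on before bulk loading. A hedged CQL3 sketch, reusing the same hypothetical table as above:

    -- Recreate the schema on the new cluster with Snappy enabled,
    -- then stream the snapshot SSTables in with the bulk loader;
    -- the loaded data lands in this compressed column family.
    CREATE TABLE sso_events (
        app_id     text,
        event_time timestamp,
        user_id    text,
        success    boolean,
        PRIMARY KEY (app_id, event_time)
    ) WITH compression = {'sstable_compression': 'SnappyCompressor'};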
Here is what our Reporting Cluster looks like on the front end
New cluster:
- Much easier to implement
- No manual token generation
- More efficient memory utilization
- Implemented secondary indexes
- Better data distribution via vnodes
- Devs wanted to take advantage of CQL3, implement the Astyanax client, and use atomic batches
- Ops wanted internal auth
Performance boosts in v1.2:
- Reduced memory footprint via the partition summary (the last on-heap memory structure)
- 15% read performance increase by including the 'UseTLAB' JVM flag (localizes object allocation in memory): https://blogs.oracle.com/jonthecollector/entry/the_real_thing
- Auto token generation: just set the number of token ranges you want per server
- Data distribution: more token ranges means the cluster is less likely to be unbalanced
- Memory utilization: compression metadata and bloom filters moved off-heap
- Atomic batches: if one write in the batch is successful, they all are (see the sketch below)
- Request tracing: allows for performance testing of individual queries against the database
- Authentication/authorization: hey, security around the cluster. Go figure.
- Less manual cluster rebalancing when using something other than the random partitioner
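A few of these v1.2-era features in hedged CQL3 form. Table, column, index, and user names are hypothetical, and vnodes themselves are enabled with num_tokens in cassandra.yaml rather than through CQL.

    -- Secondary index: query events by user as well as by application.
    CREATE INDEX sso_events_user_idx ON sso_events (user_id);

    -- Atomic (logged) batch: either every statement in the batch is
    -- applied or none of them are.
    BEGIN BATCH
      INSERT INTO sso_events (app_id, event_time, user_id, success)
        VALUES ('app-123', '2013-06-01 12:00:00', 'alice', true);
      INSERT INTO sso_events (app_id, event_time, user_id, success)
        VALUES ('app-456', '2013-06-01 12:00:00', 'alice', false);
    APPLY BATCH;

    -- Internal authentication/authorization.
    CREATE USER reporting_app WITH PASSWORD 'changeme' NOSUPERUSER;
    GRANT SELECT ON TABLE sso_events TO reporting_app;

    -- Request tracing is enabled per query (e.g. TRACING ON in cqlsh)
    -- to see where an individual read or write spends its time.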
We aren't currently using any row cache. The number of replicas per datacenter can actually reduce the effectiveness of row caching, since requests for the same row are spread across more replicas and each node's cache sees fewer repeat hits.
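Row caching is configured per column family; a hedged sketch of keeping it off (key cache only) for the hypothetical table above, using the 1.2-era string form of the caching property:

    -- Key cache only, no row cache: with several replicas per DC,
    -- reads for the same row are spread across replicas, so a row
    -- cache returns fewer repeat hits for the memory it consumes.
    ALTER TABLE sso_events WITH caching = 'keys_only';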