SlideShare une entreprise Scribd logo
1  sur  33
© 2016 24/7 CUSTOMER, INC.
BIG DATA BANGALORE JAN MEETUP - 24/7 CUSTOMER, INC.
Recipes for building resilient cross-
DC data pipeline with Kafka
Sr. Engineering Manager - Big Data
Platform
Suneet Grover
© 2016 24/7 CUSTOMER, INC. 2
About [24]7
© 2016 24/7 CUSTOMER, INC.
Today’s engagement is not driving successful moments
3
Q&A
IVR
© 2016 24/7 CUSTOMER, INC.
Smart Customer Engagement
4
Data-Driven
Reflecting All
Available Data
Click here to see [24]7 in action
Video available at http://player.vimeo.com/video/85280070
Predictive
Real-time
Decisions
Omni-channel
Across Digital
& Voice
Personalized
User Experience
© 2016 24/7 CUSTOMER, INC.
Intent-driven
engagement
Anticipate consumer intent
Holistic experience across channels
Delivering the right moments
to
Move from
Channel-centric
engagement
Reacting to consumer behavior
Disconnected, fragmented channels
Too many failed experiences
5
© 2016 24/7 CUSTOMER, INC.
[24]7 by the numbers
6
1.2bsmart speech
calls/year
127mvirtual agent
inquiries/year
30magent
chats/year
341mweb visitors
/month
5000+digital chat agents
(#1 WW)
70+data scientists
(most in industry)
100+patents
300+software engineers &
designers
© 2016 24/7 CUSTOMER, INC.
Agenda
• Introduction to Kafka
• Kafka at [24]7
• From problems to solutions
• Transparency and Resiliency
• Metrics Demo
• Design for multiple data centers
7
© 2016 24/7 CUSTOMER, INC. 8
Introduction to Kafka
© 2016 24/7 CUSTOMER, INC.
Apache Kafka
• Distributed
• High performance and throughput
• Streaming platform, pub/sub system
9
© 2016 24/7 CUSTOMER, INC.
Topic and Partitions
10
© 2016 24/7 CUSTOMER, INC.
Producers Consumers
11
© 2016 24/7 CUSTOMER, INC.
Kafka setup across DCs
12
Brokers
Region1 Region 2
Mirrormakers
Zookeepers
Brokers
Mirrormakers
Zookeepers
© 2016 24/7 CUSTOMER, INC. 13
Kafka at [24]7
© 2016 24/7 CUSTOMER, INC. 14
Intent Prediction
Data Analytics
Business Intelligence
© 2016 24/7 CUSTOMER, INC. 15
From problems to solutions
© 2016 24/7 CUSTOMER, INC.
Challenges with Kafka 0.8.0
• Broker partition stickiness does not allow to scale
• ZK load and latencies keep increasing
• Range based mirror-maker algorithm not optimal
• Stale topics cannot be deleted
• Controller can get into a stuck state
• Conflict errors in mirror-makers
• Socket leaks leading to open file descriptors
16
© 2016 24/7 CUSTOMER, INC.
Learnings from Kafka 0.8.0
• If the controller gets into a stuck state, delete the “/controller”
node from zookeeper
• Always do clean shutdown and restart of brokers
• Some issues are not always visible as errors or warnings
• Run ZK on SSD
17
© 2016 24/7 CUSTOMER, INC.
Kafka 0.10
18
• Very stable release
• Easy to do in-place from 0.8.2 onwards
• Better client APIs
• Richer admin operations
© 2016 24/7 CUSTOMER, INC.
Broker configurations that worked for us
19
• default.replication.factor = 3
• num.partitions = 2
• delete.topic.enable = true
• auto.leader.rebalance.enable = true
• controlled.shutdown.enable = true
• queued.max.requests = 1000
© 2016 24/7 CUSTOMER, INC.© 2016 24/7 CUSTOMER, INC. 20
Transparency and
Resiliency
© 2016 24/7 CUSTOMER, INC.
Metrics flow
21
Grafana
Graphite
Kafka Broker
Metrics Reporter
Kafka MM JMXTrans
Zookeeper
Host level
Metrics & Alerts
Lag monitor
ELK
© 2016 24/7 CUSTOMER, INC.
Essential Broker Metrics
• Disk, CPU and throughput utilization
• Ingress, egress volume per broker and topic
• Active controller count
• Offline partitions
• Under replicated partitions
• Partitions per broker
• Log flush rate
22
© 2016 24/7 CUSTOMER, INC.
Basic Alerts
23
• Disk, CPU utilization
• Open file handles
• Controller count
• Controller re-elections
• Under replicated partitions
• Offline partitions
• Stuck pending commands in zookeeper
• Conflicts in mirror-makers
© 2016 24/7 CUSTOMER, INC.
JMXTrans
24
• Push mirror-maker metrics to graphite
• Throughput per topic, per thread, per instance etc.
• WaitOnTake, WaitOnPut
• Push zookeeper metrics to graphite
• Latency, quorum, connections etc.
© 2016 24/7 CUSTOMER, INC.
Data Lag Monitoring
25
• Measures the event level time delay
• Monitors data latencies per cluster, per topic, per partition
• Latencies between multiple steps in Kafka pipeline
• Optimize and configure sampling ratio
• Supports multiple message formats json, avro etc.
• Alerts based on pre-defined thresholds
© 2016 24/7 CUSTOMER, INC.
Indicative Broker Metrics
• Request Metrics
• Local Time
• Remote Time
• Queue Time
• Request Handler Idle Percent
• Network Processor Idle Percent
26
© 2016 24/7 CUSTOMER, INC. 27
Now some demo
© 2016 24/7 CUSTOMER, INC.© 2016 24/7 CUSTOMER, INC. 28
Design for Multiple Data
Centers
© 2016 24/7 CUSTOMER, INC.
Range Based Mirror Makers
1000
181
14
5
1
10
100
1000
Consumer 1 Consumer 2 Consumer 3 Consumer 4
Skewed Partition Assignment
Num Partitions
29
© 2016 24/7 CUSTOMER, INC.
Round Robin Mirror Makers
0
50
100
150
200
250
300
350
Consumer 1 Consumer 2 Consumer 3 Consumer 4
Uniform Partition Assignment
Num Partitions
30
© 2016 24/7 CUSTOMER, INC.
Mirror-maker fine tuning
• Round Robin works better than Range based in most cases
• Spread out the topics in multiple MM consumer groups
• If you have a few large volume topics
• Negative regex works with whitelist parameter
• Doesn’t help to have too many MM consumer threads
• Tune socket buffer size (doesn’t apply unless OS allows)
• MM - socket.receive.buffer.bytes = 1048576
• Broker - socket.send.buffer.bytes = 1048576
31
© 2016 24/7 CUSTOMER, INC. 32
We are hiring!!!
For current open positions, please log onto our careers web page
http://www.247-inc.com/
Company>Careers>Location
For further details, Please reach out to:
Achappa C B - achappa.cb@247-inc.com, M: +91-7338458247
© 2016 24/7 CUSTOMER, INC. 33

Contenu connexe

Tendances

Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
HostedbyConfluent
 

Tendances (20)

Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...
Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...
Testing Event Driven Architectures: How to Broker the Complexity | Frank Kilc...
 
A guide through the Azure Messaging services - Update Conference
A guide through the Azure Messaging services - Update ConferenceA guide through the Azure Messaging services - Update Conference
A guide through the Azure Messaging services - Update Conference
 
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Ka...
 
Microservices in the Apache Kafka Ecosystem
Microservices in the Apache Kafka EcosystemMicroservices in the Apache Kafka Ecosystem
Microservices in the Apache Kafka Ecosystem
 
Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?Agile Data Integration: How is it possible?
Agile Data Integration: How is it possible?
 
The Data Dichotomy- Rethinking the Way We Treat Data and Services
The Data Dichotomy- Rethinking the Way We Treat Data and ServicesThe Data Dichotomy- Rethinking the Way We Treat Data and Services
The Data Dichotomy- Rethinking the Way We Treat Data and Services
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics
 
Microservices with Kafka Ecosystem
Microservices with Kafka EcosystemMicroservices with Kafka Ecosystem
Microservices with Kafka Ecosystem
 
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluent
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, ConfluentJay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluent
Jay Kreps | Kafka Summit NYC 2019 Keynote (Events Everywhere) | CEO, Confluent
 
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...
Live Event Debugging With ksqlDB at Reddit | Hannah Hagen and Paul Kiernan, R...
 
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...
James Watters, Pivotal | Kafka Summit NYC 2019 Keynote (Spring Boot+Kafka: Th...
 
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
Data Mess to Data Mesh | Jay Kreps, CEO, Confluent | Kafka Summit Americas 20...
 
Removing performance bottlenecks with Kafka Monitoring and topic configuration
Removing performance bottlenecks with Kafka Monitoring and topic configurationRemoving performance bottlenecks with Kafka Monitoring and topic configuration
Removing performance bottlenecks with Kafka Monitoring and topic configuration
 
Building Microservices with Apache Kafka
Building Microservices with Apache KafkaBuilding Microservices with Apache Kafka
Building Microservices with Apache Kafka
 
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
Transformation During a Global Pandemic | Ashish Pandit and Scott Lee, Univer...
 
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful StreamsKafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
Kafka Summit SF 2017 - Building Event-Driven Services with Stateful Streams
 
Introducing Confluent Cloud: Apache Kafka as a Service
Introducing Confluent Cloud: Apache Kafka as a Service Introducing Confluent Cloud: Apache Kafka as a Service
Introducing Confluent Cloud: Apache Kafka as a Service
 
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, ClouderaLessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
 
Developing custom transformation in the Kafka connect to minimize data redund...
Developing custom transformation in the Kafka connect to minimize data redund...Developing custom transformation in the Kafka connect to minimize data redund...
Developing custom transformation in the Kafka connect to minimize data redund...
 
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...
Fan-out, fan-in & the multiplexer: Replication recipes for global platform di...
 

En vedette

Building Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Building Reactive Fast Data & the Data Lake with Akka, Kafka, SparkBuilding Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Building Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Todd Fritz
 
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Spark Summit
 
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Kevin Mao
 

En vedette (20)

Getting started with Azure Event Hubs and Stream Analytics services
Getting started with Azure Event Hubs and Stream Analytics servicesGetting started with Azure Event Hubs and Stream Analytics services
Getting started with Azure Event Hubs and Stream Analytics services
 
Storm over gearpump
Storm over gearpumpStorm over gearpump
Storm over gearpump
 
London Apache Kafka Meetup (Jan 2017)
London Apache Kafka Meetup (Jan 2017)London Apache Kafka Meetup (Jan 2017)
London Apache Kafka Meetup (Jan 2017)
 
Kafka connect
Kafka connectKafka connect
Kafka connect
 
Not Only Streams for Akademia JLabs
Not Only Streams for Akademia JLabsNot Only Streams for Akademia JLabs
Not Only Streams for Akademia JLabs
 
Processing IoT Data with Apache Kafka
Processing IoT Data with Apache KafkaProcessing IoT Data with Apache Kafka
Processing IoT Data with Apache Kafka
 
IoT Connected Brewery
IoT Connected BreweryIoT Connected Brewery
IoT Connected Brewery
 
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...
Strata+Hadoop 2017 San Jose - The Rise of Real Time: Apache Kafka and the Str...
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platform
 
Extracting Insights from Data at Twitter
Extracting Insights from Data at TwitterExtracting Insights from Data at Twitter
Extracting Insights from Data at Twitter
 
Building Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Building Reactive Fast Data & the Data Lake with Akka, Kafka, SparkBuilding Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Building Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
 
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...
Big Data Day LA 2015 - Always-on Ingestion for Data at Scale by Arvind Prabha...
 
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...
IoT Innovation Lab Berlin @relayr - Kay Lerch on Getting basics right for you...
 
Introduction to Structured Streaming
Introduction to Structured StreamingIntroduction to Structured Streaming
Introduction to Structured Streaming
 
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San JoseDataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
 
Comparison of various streaming technologies
Comparison of various streaming technologiesComparison of various streaming technologies
Comparison of various streaming technologies
 
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
Learnings Using Spark Streaming and DataFrames for Walmart Search: Spark Summ...
 
Real-Time Event & Stream Processing on MS Azure
Real-Time Event & Stream Processing on MS AzureReal-Time Event & Stream Processing on MS Azure
Real-Time Event & Stream Processing on MS Azure
 
Reactive integrations with Akka Streams
Reactive integrations with Akka StreamsReactive integrations with Akka Streams
Reactive integrations with Akka Streams
 
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
Achieving Real-time Ingestion and Analysis of Security Events through Kafka a...
 

Similaire à Blr hadoop meetup

SRE and GitOps for Building Robust Kubernetes Platforms.pdf
SRE and GitOps for Building Robust Kubernetes Platforms.pdfSRE and GitOps for Building Robust Kubernetes Platforms.pdf
SRE and GitOps for Building Robust Kubernetes Platforms.pdf
Weaveworks
 
Highly configurable and extensible data processing framework at PubMatic
Highly configurable and extensible data processing framework at PubMaticHighly configurable and extensible data processing framework at PubMatic
Highly configurable and extensible data processing framework at PubMatic
DataWorks Summit
 
Martin Anderson - Fast We Promise
Martin Anderson - Fast We PromiseMartin Anderson - Fast We Promise
Martin Anderson - Fast We Promise
bloodredsun
 
Why Generic Configurators dont work in the valve Industry
Why Generic Configurators dont work in the valve IndustryWhy Generic Configurators dont work in the valve Industry
Why Generic Configurators dont work in the valve Industry
Sanjeev Nadkarni
 

Similaire à Blr hadoop meetup (20)

Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.
Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.
Apache Kafka Bay Area Sep Meetup - 24/7 Customer, Inc.
 
Tokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdfTokyo AK Meetup Speedtest - Share.pdf
Tokyo AK Meetup Speedtest - Share.pdf
 
FreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBCFreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBC
 
Wavefront-by-VMware-April-2019
Wavefront-by-VMware-April-2019Wavefront-by-VMware-April-2019
Wavefront-by-VMware-April-2019
 
FreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBCFreeSBC - A New Approach to the SBC
FreeSBC - A New Approach to the SBC
 
Getting Started With ThousandEyes Proof of Concepts: End User Digital Experience
Getting Started With ThousandEyes Proof of Concepts: End User Digital ExperienceGetting Started With ThousandEyes Proof of Concepts: End User Digital Experience
Getting Started With ThousandEyes Proof of Concepts: End User Digital Experience
 
VMware Certified Professional 5 - Data Center Virtualization Delta Exam
VMware Certified Professional 5 - Data Center Virtualization Delta ExamVMware Certified Professional 5 - Data Center Virtualization Delta Exam
VMware Certified Professional 5 - Data Center Virtualization Delta Exam
 
Praxistaugliche notes strategien 4 cloud
Praxistaugliche notes strategien 4 cloudPraxistaugliche notes strategien 4 cloud
Praxistaugliche notes strategien 4 cloud
 
SRE and GitOps for Building Robust Kubernetes Platforms.pdf
SRE and GitOps for Building Robust Kubernetes Platforms.pdfSRE and GitOps for Building Robust Kubernetes Platforms.pdf
SRE and GitOps for Building Robust Kubernetes Platforms.pdf
 
PEnDAR webinar 2 with notes
PEnDAR webinar 2 with notesPEnDAR webinar 2 with notes
PEnDAR webinar 2 with notes
 
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...
3 Critical Keys to DevOps Success: Lessons from Forrester Research, Intel, an...
 
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaS
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaSFlexiant and 451 Research Discuss the Service Provider Path from VPS to IaaS
Flexiant and 451 Research Discuss the Service Provider Path from VPS to IaaS
 
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdfCNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
CNCF On-Demand Webinar_ LitmusChaos Project Updates.pdf
 
Getting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of ConceptsGetting Started with ThousandEyes Proof of Concepts
Getting Started with ThousandEyes Proof of Concepts
 
Near real-time statistical modeling and anomaly detection using Flink!
Near real-time statistical modeling and anomaly detection using Flink!Near real-time statistical modeling and anomaly detection using Flink!
Near real-time statistical modeling and anomaly detection using Flink!
 
Highly configurable and extensible data processing framework at PubMatic
Highly configurable and extensible data processing framework at PubMaticHighly configurable and extensible data processing framework at PubMatic
Highly configurable and extensible data processing framework at PubMatic
 
Valve CPQ challenges & solutions
Valve CPQ challenges & solutionsValve CPQ challenges & solutions
Valve CPQ challenges & solutions
 
Martin Anderson - Fast We Promise
Martin Anderson - Fast We PromiseMartin Anderson - Fast We Promise
Martin Anderson - Fast We Promise
 
Why Generic Configurators dont work in the valve Industry
Why Generic Configurators dont work in the valve IndustryWhy Generic Configurators dont work in the valve Industry
Why Generic Configurators dont work in the valve Industry
 
How to Get the Fastest Possible Citrix Logon Times?
How to Get the Fastest Possible Citrix Logon Times?How to Get the Fastest Possible Citrix Logon Times?
How to Get the Fastest Possible Citrix Logon Times?
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

Blr hadoop meetup

  • 1. © 2016 24/7 CUSTOMER, INC. BIG DATA BANGALORE JAN MEETUP - 24/7 CUSTOMER, INC. Recipes for building resilient cross- DC data pipeline with Kafka Sr. Engineering Manager - Big Data Platform Suneet Grover
  • 2. © 2016 24/7 CUSTOMER, INC. 2 About [24]7
  • 3. © 2016 24/7 CUSTOMER, INC. Today’s engagement is not driving successful moments 3 Q&A IVR
  • 4. © 2016 24/7 CUSTOMER, INC. Smart Customer Engagement 4 Data-Driven Reflecting All Available Data Click here to see [24]7 in action Video available at http://player.vimeo.com/video/85280070 Predictive Real-time Decisions Omni-channel Across Digital & Voice Personalized User Experience
  • 5. © 2016 24/7 CUSTOMER, INC. Intent-driven engagement Anticipate consumer intent Holistic experience across channels Delivering the right moments to Move from Channel-centric engagement Reacting to consumer behavior Disconnected, fragmented channels Too many failed experiences 5
  • 6. © 2016 24/7 CUSTOMER, INC. [24]7 by the numbers 6 1.2bsmart speech calls/year 127mvirtual agent inquiries/year 30magent chats/year 341mweb visitors /month 5000+digital chat agents (#1 WW) 70+data scientists (most in industry) 100+patents 300+software engineers & designers
  • 7. © 2016 24/7 CUSTOMER, INC. Agenda • Introduction to Kafka • Kafka at [24]7 • From problems to solutions • Transparency and Resiliency • Metrics Demo • Design for multiple data centers 7
  • 8. © 2016 24/7 CUSTOMER, INC. 8 Introduction to Kafka
  • 9. © 2016 24/7 CUSTOMER, INC. Apache Kafka • Distributed • High performance and throughput • Streaming platform, pub/sub system 9
  • 10. © 2016 24/7 CUSTOMER, INC. Topic and Partitions 10
  • 11. © 2016 24/7 CUSTOMER, INC. Producers Consumers 11
  • 12. © 2016 24/7 CUSTOMER, INC. Kafka setup across DCs 12 Brokers Region1 Region 2 Mirrormakers Zookeepers Brokers Mirrormakers Zookeepers
  • 13. © 2016 24/7 CUSTOMER, INC. 13 Kafka at [24]7
  • 14. © 2016 24/7 CUSTOMER, INC. 14 Intent Prediction Data Analytics Business Intelligence
  • 15. © 2016 24/7 CUSTOMER, INC. 15 From problems to solutions
  • 16. © 2016 24/7 CUSTOMER, INC. Challenges with Kafka 0.8.0 • Broker partition stickiness does not allow to scale • ZK load and latencies keep increasing • Range based mirror-maker algorithm not optimal • Stale topics cannot be deleted • Controller can get into a stuck state • Conflict errors in mirror-makers • Socket leaks leading to open file descriptors 16
  • 17. © 2016 24/7 CUSTOMER, INC. Learnings from Kafka 0.8.0 • If the controller gets into a stuck state, delete the “/controller” node from zookeeper • Always do clean shutdown and restart of brokers • Some issues are not always visible as errors or warnings • Run ZK on SSD 17
  • 18. © 2016 24/7 CUSTOMER, INC. Kafka 0.10 18 • Very stable release • Easy to do in-place from 0.8.2 onwards • Better client APIs • Richer admin operations
  • 19. © 2016 24/7 CUSTOMER, INC. Broker configurations that worked for us 19 • default.replication.factor = 3 • num.partitions = 2 • delete.topic.enable = true • auto.leader.rebalance.enable = true • controlled.shutdown.enable = true • queued.max.requests = 1000
  • 20. © 2016 24/7 CUSTOMER, INC.© 2016 24/7 CUSTOMER, INC. 20 Transparency and Resiliency
  • 21. © 2016 24/7 CUSTOMER, INC. Metrics flow 21 Grafana Graphite Kafka Broker Metrics Reporter Kafka MM JMXTrans Zookeeper Host level Metrics & Alerts Lag monitor ELK
  • 22. © 2016 24/7 CUSTOMER, INC. Essential Broker Metrics • Disk, CPU and throughput utilization • Ingress, egress volume per broker and topic • Active controller count • Offline partitions • Under replicated partitions • Partitions per broker • Log flush rate 22
  • 23. © 2016 24/7 CUSTOMER, INC. Basic Alerts 23 • Disk, CPU utilization • Open file handles • Controller count • Controller re-elections • Under replicated partitions • Offline partitions • Stuck pending commands in zookeeper • Conflicts in mirror-makers
  • 24. © 2016 24/7 CUSTOMER, INC. JMXTrans 24 • Push mirror-maker metrics to graphite • Throughput per topic, per thread, per instance etc. • WaitOnTake, WaitOnPut • Push zookeeper metrics to graphite • Latency, quorum, connections etc.
  • 25. © 2016 24/7 CUSTOMER, INC. Data Lag Monitoring 25 • Measures the event level time delay • Monitors data latencies per cluster, per topic, per partition • Latencies between multiple steps in Kafka pipeline • Optimize and configure sampling ratio • Supports multiple message formats json, avro etc. • Alerts based on pre-defined thresholds
  • 26. © 2016 24/7 CUSTOMER, INC. Indicative Broker Metrics • Request Metrics • Local Time • Remote Time • Queue Time • Request Handler Idle Percent • Network Processor Idle Percent 26
  • 27. © 2016 24/7 CUSTOMER, INC. 27 Now some demo
  • 28. © 2016 24/7 CUSTOMER, INC.© 2016 24/7 CUSTOMER, INC. 28 Design for Multiple Data Centers
  • 29. © 2016 24/7 CUSTOMER, INC. Range Based Mirror Makers 1000 181 14 5 1 10 100 1000 Consumer 1 Consumer 2 Consumer 3 Consumer 4 Skewed Partition Assignment Num Partitions 29
  • 30. © 2016 24/7 CUSTOMER, INC. Round Robin Mirror Makers 0 50 100 150 200 250 300 350 Consumer 1 Consumer 2 Consumer 3 Consumer 4 Uniform Partition Assignment Num Partitions 30
  • 31. © 2016 24/7 CUSTOMER, INC. Mirror-maker fine tuning • Round Robin works better than Range based in most cases • Spread out the topics in multiple MM consumer groups • If you have a few large volume topics • Negative regex works with whitelist parameter • Doesn’t help to have too many MM consumer threads • Tune socket buffer size (doesn’t apply unless OS allows) • MM - socket.receive.buffer.bytes = 1048576 • Broker - socket.send.buffer.bytes = 1048576 31
  • 32. © 2016 24/7 CUSTOMER, INC. 32 We are hiring!!! For current open positions, please log onto our careers web page http://www.247-inc.com/ Company>Careers>Location For further details, Please reach out to: Achappa C B - achappa.cb@247-inc.com, M: +91-7338458247
  • 33. © 2016 24/7 CUSTOMER, INC. 33

Notes de l'éditeur

  1. Intro to yourself Credit to the team
  2. Click and let this do the slow build. The key points are: Consumers find it frustrating to cross channels (web, phone, IVR, etc.) because their content is not preserved. So they have to do things like authenticate (user ID, password) multiple times in the same interaction. These types of experiences turn potential Brand Advocates into Detractors who will move to other brands.
  3. This a KEY slide. Emphasize that today’s leading-edge companies – those that consumers love to engage with and have strong brands – are moving to Intent Driven Engagement
  4. Would like viewers read the slide. Then focus on bottom row: 300+ software engineers and designers Most data scientists in the industry 100+ patents point made earlier We are the #1 provider of digital chat agents in the world.
  5. Other features which we haven’t tried Security, Streams etc.