SlideShare une entreprise Scribd logo
1  sur  34
Using Docker to Scale
Operational Intelligence at Splunk
Mike Dickey
Sr. Director, Splunk
@mike_dickey
Harish Jayakumar
Sr.Solution Engineer, Docker
@harish_jkumar
Forward-Looking Statements
During the course of this presentation, we may make forward-looking statements regarding future events or the expected
performance of the company. We caution you that such statements reflect our current expectations and estimates based on factors
currently known to us and that actual events or results could differ material. For important factors that may cause actual results to
differ from those captained in our forward-looking statements, please review our fillings with the SEC.
The forward-looking statements made in this presentation are being made as of the time
and data of its live presentation. If reviewed after its live presentation, this presentation
may not contain current or accurate information. We do not assume any obligation to update
any forward-looking statements we may make. In addition, any information about our roadmap
outlines our general product direction and is subject to change at any time without notice.
It is for informational purposes only and shall not be incorporated into any contract or other
commitment. Splunk undertakes no obligation either to develop the features or functionality
described or to include any such feature or functionality in a future release.
Splunk, Splunk>, Listen to Your Data, The Engine for Machine Data, Splunk Cloud, Splunk Light and SPL are trademarks and registered trademarks
of Splunk Inc. in the United States and other countries. All other brand names, product names, or trademarks belong to their respective owners.
© 2017 Splunk Inc. All rights reserved.
Agenda
● Splunk’s challenges
● Docker Enterprise Edition
● Splunk + Docker EE
● Demo
● Before & After Docker Metrics
● Lessons Learned
Infrastructure
● Large Scale Scrum
“LeSS Huge” Model
● Engineering
Infrastructure Area
○ 13 teams (about 100 engineers)
in California & Shanghai
○ Test quality, automation, common
tools, frameworks, build systems
○ Working on an overhaul of all test
infrastructure, including CI/CD
Multiple CI EnvironmentsTest Server Sprawl
100+ bare-metal servers for functional
and performance testing
Too many frameworks, manual work
Many days to setup a single test
Splunk Challenges
Bamboo vs Jenkins
Plans & agents managed by hand
Physical server agents, very
poor scalability
LONG wait times, build bottleneck, not
enough capacity
Wiki-Managed Infrastructure
soln-perf66.sv.splunk.com
(root)
soln-
perf66.ilo.sv.splunk.com
(AD Credential)
2x 8-core Xeon 2.40Ghz, 32 Gb RAM, 2x
1Tb SATA 6G HDD (RAID 1)
Allan
(Large ES env)
forever
soln-perf67.sv.splunk.com
(root)
soln-
perf67.ilo.sv.splunk.com
(AD Credential)
2x 8-core Xeon 2.40Ghz, 32 Gb RAM, 2x
1Tb SATA 6G HDD (RAID 1)
Mike –
UCP dev cluster
forever
soln-perf68.sv.splunk.com
(root)
soln-
perf68.ilo.sv.splunk.com
(AD Credential)
2x 8-core Xeon 2.40Ghz, 32 Gb RAM, 2x
1Tb SATA 6G HDD (RAID 1)
Dhananjay 4/26/2017
soln-perf69.sv.splunk.com
(root)
soln-
perf69.ilo.sv.splunk.com
(AD Credential)
2x 8-core Xeon 2.40Ghz, 32 Gb RAM, 2x
1Tb SATA 6G HDD (RAID 1)
Stream –
Manan/Vladimir forever
Why Docker
● Fun
● De-facto
● Bare-metal performance
● Consistency
● Efficient
● Successful POCs
Selling Docker Internally
● Why?
○ Reduce waste of underutilized servers
○ Automate manual testing work
○ Enables more testing, higher quality
● Start small, deliver value quickly with each
iteration
● CPO, CFO & CEO as Scale & Scope grew
Why Docker EE
● Windows Server 2016 & Linux
● Role Based Access Control
● Compose & common Docker API
● End user experience : Docker on
Desktop and Docker on Infrastructure
● End to end support from vendor -
■ Around the corner (literally)
Docker EE components
Public Cloud Virtual Physical
docker enterprise edition ADVANCED
INTEGRATED SECURITY
docker trusted registry
image management
docker universal control plane
app & cluster management
docker engine
container runtime, orchestration, networking, volumes, plugins
CI/CD Images Operating Systems Volumes Monitoring Logging more...
UCP ( Windows & Linux)
Build a Secure Software Supply Chain (CaaS)
Image RegistrySecurity scan
& sign
Traditional
Third Party
Microservices
docker store
DEVELOPERS IT
OPERATIONS
Control Plane
services:
database:
image: sixeyed/atsea-db:mssql
ports:
- mode: host
target: 1433
networks:
- atsea
deploy:
endpoint_mode: dnsrr
placement:
constraints:
- 'node.platform.os == windows'
appserver:
image: sixeyed/atsea-app:mssql
ports:
- target: 8080
published: 8080
networks:
- atsea
deploy:
placement:
constraints:
- 'node.platform.os == linux'
networks:
atsea:
node.platform.os is a built in label
that can be used for workload
placement
Windows and Linux nodes can share
a common overlay network
Hybrid Applications
1:1:1 mapping of
subject to role to
collection.
Grant Subject Role Collection
Who (orgs, teams,
users) can perform an
action.
What they
can do
Where work can
be done
Enhanced RBAC
SUPPORT SECURE MULTI-TENANCY ACROSS
MULTIPLE TEAMS THROUGH NODE-BASED
ISOLATION AND SEGREGATION
KEY FEATURES
BENEFITS
• Enforce node affinity and node anti-affinity rules to
allow certain users/teams/orgs to deploy within a
subset of nodes or outside certain nodes (eg.
Production nodes)
• Set up different security zones within the same cluster
to isolate and segregate access of protected
information
• Support multiple teams within the same cluster while
providing physical separation and isolation
• Prevent “noisy neighbors” by limiting a team’s resources
to approved nodes
• Meet compliance and regulatory requirements by isolating
sensitive workloads to certain nodes and limiting access to
those nodes
Prod
Dev
Dev Team A Dev Team B SecOpsOps Team
PHI
RBAC for Nodes
Node
Worker
Node
Worker
Node
Worker
Node
Worker
Docker EE :Swarm + Kub ( Coming Soon)
Demo
● Windows Server 2016 & Linux
● Role Based Access Control
● Compose & common Docker API
● End user experience : Docker on
Desktop and Docker on Infrastructure
● End to end support from vendor -
■ Around the corner (literally)
Container Native Test Lab
Container Native Test Lab
Container Native Test Lab
Container Native Test Lab
New Tooling to Test Splunk
New CI/CD Platform
Volume
3. Stage Build
Runner
4. Stage Build
Runner
App Cert
6. Stage Publish5. Stage AppCert
Runner Runner
Splunkins as
report service
Notify build
status
Notify
build
status
2. Notify Jenkins
to build
UCP
1. Git Push
Artifactory
Results
● Test setup time reduced from days to
minutes
○ Far more efficient use of the
hardware we have
○ Enables us to run more tests,
more frequently, earlier in
release cycles
● Eliminated CI/CD Bottlenecks
● Example: DMA performance
improvements in Splunk 7.0
Growth & Adoption
Unique Users
Logging bug
4x Growth
150 Servers
265 Servers
300 Servers
400 Servers
Server Utilization 600 Servers
Number of
Servers
600
Containers
1 Mil.
>10,000/day
CI Jobs
executed
30,000
Average
Utilization
75%
Unique users
385
Splunk’s Docker Growth & Adoption
Year to Date
As of October 3, 2017
Challenges &
Journey
Internal:
○ Basic container knowledge
■ Trained Developers and
Ops
○ Let’s blame the new guy
■ Real source: Bad drives,
memory chips, networking
Challenges & Journey
Bugs found & fixed:
○ Placement strategies
(anti-affinity, binpacking)
○ LDAP (AD) & RBAC
○ Performance & Timeouts at
Scale of 300-600 servers
○ Worked with support
to prioritize
Lessons Learned
Test things before you
update production
Perform root cause
analysis of problems
Start small, over-
communicate & set
realistic expectations
Monitor & respond
quickly to problems
Deliver what your
customers ask for
Invite contributions &
participation
Thanks
Questions?
@mike_dickey
@harish_jkumar
Challenges & Journey
Outstanding
Performance & Accuracy
of Metrics
Errors related to state
inconsistency
Kernel & docker engine
upgrades
Docker EE :Swarm + Kub ( Coming Soon)
Docker EE :Swarm + Kub ( Coming Soon)

Contenu connexe

Tendances

Packaging software for the distribution on the edge
Packaging software for the distribution on the edgePackaging software for the distribution on the edge
Packaging software for the distribution on the edge
Docker, Inc.
 

Tendances (20)

DCEU 18: Docker for Windows Containers and Kubernetes
DCEU 18: Docker for Windows Containers and KubernetesDCEU 18: Docker for Windows Containers and Kubernetes
DCEU 18: Docker for Windows Containers and Kubernetes
 
DCEU 18: State of the Docker Engine
DCEU 18: State of the Docker EngineDCEU 18: State of the Docker Engine
DCEU 18: State of the Docker Engine
 
Multi-Container Apps spanning Docker, Mesos and OpenStack
Multi-Container Apps spanning Docker, Mesos and OpenStackMulti-Container Apps spanning Docker, Mesos and OpenStack
Multi-Container Apps spanning Docker, Mesos and OpenStack
 
DCEU 18: Docker Enterprise Platform and Architecture
DCEU 18: Docker Enterprise Platform and ArchitectureDCEU 18: Docker Enterprise Platform and Architecture
DCEU 18: Docker Enterprise Platform and Architecture
 
Enabling Production Grade Containerized Applications through Policy Based Inf...
Enabling Production Grade Containerized Applications through Policy Based Inf...Enabling Production Grade Containerized Applications through Policy Based Inf...
Enabling Production Grade Containerized Applications through Policy Based Inf...
 
Packaging software for the distribution on the edge
Packaging software for the distribution on the edgePackaging software for the distribution on the edge
Packaging software for the distribution on the edge
 
DCEU 18: Building Your Development Pipeline
DCEU 18: Building Your Development PipelineDCEU 18: Building Your Development Pipeline
DCEU 18: Building Your Development Pipeline
 
Docker Online Meetup: Infrakit update and Q&A
Docker Online Meetup: Infrakit update and Q&ADocker Online Meetup: Infrakit update and Q&A
Docker Online Meetup: Infrakit update and Q&A
 
The state of containerd
The state of containerdThe state of containerd
The state of containerd
 
Dockers and kubernetes
Dockers and kubernetesDockers and kubernetes
Dockers and kubernetes
 
Docker HK Meetup - 201707
Docker HK Meetup - 201707Docker HK Meetup - 201707
Docker HK Meetup - 201707
 
The Fairy Tale of the One Command Build Script
The Fairy Tale of the One Command Build ScriptThe Fairy Tale of the One Command Build Script
The Fairy Tale of the One Command Build Script
 
LlinuxKit security, Security Scanning and Notary
LlinuxKit security, Security Scanning and NotaryLlinuxKit security, Security Scanning and Notary
LlinuxKit security, Security Scanning and Notary
 
DockerCon EU 2015: Shipping Manifests, Bill of Lading and Docker Metadata and...
DockerCon EU 2015: Shipping Manifests, Bill of Lading and Docker Metadata and...DockerCon EU 2015: Shipping Manifests, Bill of Lading and Docker Metadata and...
DockerCon EU 2015: Shipping Manifests, Bill of Lading and Docker Metadata and...
 
DCSF 19 Data Center Networking with Containers
DCSF 19 Data Center Networking with ContainersDCSF 19 Data Center Networking with Containers
DCSF 19 Data Center Networking with Containers
 
DCEU 18: Continuous Delivery with Docker Containers and Java: The Good, the B...
DCEU 18: Continuous Delivery with Docker Containers and Java: The Good, the B...DCEU 18: Continuous Delivery with Docker Containers and Java: The Good, the B...
DCEU 18: Continuous Delivery with Docker Containers and Java: The Good, the B...
 
Docker for Java Developers - Fabiane Nardon and Arun gupta
Docker for Java Developers - Fabiane Nardon and Arun guptaDocker for Java Developers - Fabiane Nardon and Arun gupta
Docker for Java Developers - Fabiane Nardon and Arun gupta
 
DCEU 18: Docker Container Security
DCEU 18: Docker Container SecurityDCEU 18: Docker Container Security
DCEU 18: Docker Container Security
 
DCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker ContainersDCEU 18: Developing with Docker Containers
DCEU 18: Developing with Docker Containers
 
"[WORKSHOP] K8S for developers", Denis Romanuk
"[WORKSHOP] K8S for developers", Denis Romanuk"[WORKSHOP] K8S for developers", Denis Romanuk
"[WORKSHOP] K8S for developers", Denis Romanuk
 

Similaire à Using Docker EE to Scale Operational Intelligence at Splunk

Oracle DBA Trainer
Oracle DBA TrainerOracle DBA Trainer
Oracle DBA Trainer
P S Rani
 
Backroll: Production Grade KVM Backup Solution Integrated in CloudStack
Backroll: Production Grade KVM Backup Solution Integrated in CloudStackBackroll: Production Grade KVM Backup Solution Integrated in CloudStack
Backroll: Production Grade KVM Backup Solution Integrated in CloudStack
ShapeBlue
 

Similaire à Using Docker EE to Scale Operational Intelligence at Splunk (20)

DCSF19 CMD and Conquer: Containerizing the Monolith
DCSF19 CMD and Conquer: Containerizing the Monolith  DCSF19 CMD and Conquer: Containerizing the Monolith
DCSF19 CMD and Conquer: Containerizing the Monolith
 
What's New in Docker - February 2017
What's New in Docker - February 2017What's New in Docker - February 2017
What's New in Docker - February 2017
 
DCSF 19 Building Your Development Pipeline
DCSF 19 Building Your Development Pipeline  DCSF 19 Building Your Development Pipeline
DCSF 19 Building Your Development Pipeline
 
DevOps Fest 2020. immutable infrastructure as code. True story.
DevOps Fest 2020. immutable infrastructure as code. True story.DevOps Fest 2020. immutable infrastructure as code. True story.
DevOps Fest 2020. immutable infrastructure as code. True story.
 
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
 
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
 
Webinar : Docker in Production
Webinar : Docker in ProductionWebinar : Docker in Production
Webinar : Docker in Production
 
Extensible dev secops pipelines with Jenkins, Docker, Terraform, and a kitche...
Extensible dev secops pipelines with Jenkins, Docker, Terraform, and a kitche...Extensible dev secops pipelines with Jenkins, Docker, Terraform, and a kitche...
Extensible dev secops pipelines with Jenkins, Docker, Terraform, and a kitche...
 
Drupal 8 DevOps . Profile and SQL flows.
Drupal 8 DevOps . Profile and SQL flows.Drupal 8 DevOps . Profile and SQL flows.
Drupal 8 DevOps . Profile and SQL flows.
 
Devops with Python by Yaniv Cohen DevopShift
Devops with Python by Yaniv Cohen DevopShiftDevops with Python by Yaniv Cohen DevopShift
Devops with Python by Yaniv Cohen DevopShift
 
Splunk n-box-splunk conf-2017
Splunk n-box-splunk conf-2017Splunk n-box-splunk conf-2017
Splunk n-box-splunk conf-2017
 
Tech Talk: DevOps at LeanIX @ Startup Camp Berlin
Tech Talk: DevOps at LeanIX @ Startup Camp BerlinTech Talk: DevOps at LeanIX @ Startup Camp Berlin
Tech Talk: DevOps at LeanIX @ Startup Camp Berlin
 
Full stack development best practice and toolset
Full stack development best practice and toolsetFull stack development best practice and toolset
Full stack development best practice and toolset
 
What HPC can learn from DevOps?
What HPC can learn from DevOps?What HPC can learn from DevOps?
What HPC can learn from DevOps?
 
Oracle DBA Trainer
Oracle DBA TrainerOracle DBA Trainer
Oracle DBA Trainer
 
Chugging Our Own "Craft Brew” – HPE’s Journey Towards Containers-as-a-Service...
Chugging Our Own "Craft Brew” – HPE’s Journey Towards Containers-as-a-Service...Chugging Our Own "Craft Brew” – HPE’s Journey Towards Containers-as-a-Service...
Chugging Our Own "Craft Brew” – HPE’s Journey Towards Containers-as-a-Service...
 
Using Rancher and Docker with RightScale at Industrie IT
Using Rancher and Docker with RightScale at Industrie IT Using Rancher and Docker with RightScale at Industrie IT
Using Rancher and Docker with RightScale at Industrie IT
 
Backroll: Production Grade KVM Backup Solution Integrated in CloudStack
Backroll: Production Grade KVM Backup Solution Integrated in CloudStackBackroll: Production Grade KVM Backup Solution Integrated in CloudStack
Backroll: Production Grade KVM Backup Solution Integrated in CloudStack
 
AWS Webcast - Build Agile Applications in AWS Cloud for Government
AWS Webcast - Build Agile Applications in AWS Cloud for GovernmentAWS Webcast - Build Agile Applications in AWS Cloud for Government
AWS Webcast - Build Agile Applications in AWS Cloud for Government
 
Vikash_Kr_Singh_CV
Vikash_Kr_Singh_CVVikash_Kr_Singh_CV
Vikash_Kr_Singh_CV
 

Plus de Docker, Inc.

Build & Deploy Multi-Container Applications to AWS
Build & Deploy Multi-Container Applications to AWSBuild & Deploy Multi-Container Applications to AWS
Build & Deploy Multi-Container Applications to AWS
Docker, Inc.
 
Build & Deploy Multi-Container Applications to AWS
Build & Deploy Multi-Container Applications to AWSBuild & Deploy Multi-Container Applications to AWS
Build & Deploy Multi-Container Applications to AWS
Docker, Inc.
 

Plus de Docker, Inc. (20)

Containerize Your Game Server for the Best Multiplayer Experience
Containerize Your Game Server for the Best Multiplayer Experience Containerize Your Game Server for the Best Multiplayer Experience
Containerize Your Game Server for the Best Multiplayer Experience
 
How to Improve Your Image Builds Using Advance Docker Build
How to Improve Your Image Builds Using Advance Docker BuildHow to Improve Your Image Builds Using Advance Docker Build
How to Improve Your Image Builds Using Advance Docker Build
 
Build & Deploy Multi-Container Applications to AWS
Build & Deploy Multi-Container Applications to AWSBuild & Deploy Multi-Container Applications to AWS
Build & Deploy Multi-Container Applications to AWS
 
Securing Your Containerized Applications with NGINX
Securing Your Containerized Applications with NGINXSecuring Your Containerized Applications with NGINX
Securing Your Containerized Applications with NGINX
 
How To Build and Run Node Apps with Docker and Compose
How To Build and Run Node Apps with Docker and ComposeHow To Build and Run Node Apps with Docker and Compose
How To Build and Run Node Apps with Docker and Compose
 
Hands-on Helm
Hands-on Helm Hands-on Helm
Hands-on Helm
 
Distributed Deep Learning with Docker at Salesforce
Distributed Deep Learning with Docker at SalesforceDistributed Deep Learning with Docker at Salesforce
Distributed Deep Learning with Docker at Salesforce
 
The First 10M Pulls: Building The Official Curl Image for Docker Hub
The First 10M Pulls: Building The Official Curl Image for Docker HubThe First 10M Pulls: Building The Official Curl Image for Docker Hub
The First 10M Pulls: Building The Official Curl Image for Docker Hub
 
Monitoring in a Microservices World
Monitoring in a Microservices WorldMonitoring in a Microservices World
Monitoring in a Microservices World
 
COVID-19 in Italy: How Docker is Helping the Biggest Italian IT Company Conti...
COVID-19 in Italy: How Docker is Helping the Biggest Italian IT Company Conti...COVID-19 in Italy: How Docker is Helping the Biggest Italian IT Company Conti...
COVID-19 in Italy: How Docker is Helping the Biggest Italian IT Company Conti...
 
Predicting Space Weather with Docker
Predicting Space Weather with DockerPredicting Space Weather with Docker
Predicting Space Weather with Docker
 
Become a Docker Power User With Microsoft Visual Studio Code
Become a Docker Power User With Microsoft Visual Studio CodeBecome a Docker Power User With Microsoft Visual Studio Code
Become a Docker Power User With Microsoft Visual Studio Code
 
How to Use Mirroring and Caching to Optimize your Container Registry
How to Use Mirroring and Caching to Optimize your Container RegistryHow to Use Mirroring and Caching to Optimize your Container Registry
How to Use Mirroring and Caching to Optimize your Container Registry
 
Monolithic to Microservices + Docker = SDLC on Steroids!
Monolithic to Microservices + Docker = SDLC on Steroids!Monolithic to Microservices + Docker = SDLC on Steroids!
Monolithic to Microservices + Docker = SDLC on Steroids!
 
Kubernetes at Datadog Scale
Kubernetes at Datadog ScaleKubernetes at Datadog Scale
Kubernetes at Datadog Scale
 
Labels, Labels, Labels
Labels, Labels, Labels Labels, Labels, Labels
Labels, Labels, Labels
 
Using Docker Hub at Scale to Support Micro Focus' Delivery and Deployment Model
Using Docker Hub at Scale to Support Micro Focus' Delivery and Deployment ModelUsing Docker Hub at Scale to Support Micro Focus' Delivery and Deployment Model
Using Docker Hub at Scale to Support Micro Focus' Delivery and Deployment Model
 
Build & Deploy Multi-Container Applications to AWS
Build & Deploy Multi-Container Applications to AWSBuild & Deploy Multi-Container Applications to AWS
Build & Deploy Multi-Container Applications to AWS
 
From Fortran on the Desktop to Kubernetes in the Cloud: A Windows Migration S...
From Fortran on the Desktop to Kubernetes in the Cloud: A Windows Migration S...From Fortran on the Desktop to Kubernetes in the Cloud: A Windows Migration S...
From Fortran on the Desktop to Kubernetes in the Cloud: A Windows Migration S...
 
Developing with Docker for the Arm Architecture
Developing with Docker for the Arm ArchitectureDeveloping with Docker for the Arm Architecture
Developing with Docker for the Arm Architecture
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

Using Docker EE to Scale Operational Intelligence at Splunk

  • 1. Using Docker to Scale Operational Intelligence at Splunk Mike Dickey Sr. Director, Splunk @mike_dickey Harish Jayakumar Sr.Solution Engineer, Docker @harish_jkumar
  • 2. Forward-Looking Statements During the course of this presentation, we may make forward-looking statements regarding future events or the expected performance of the company. We caution you that such statements reflect our current expectations and estimates based on factors currently known to us and that actual events or results could differ material. For important factors that may cause actual results to differ from those captained in our forward-looking statements, please review our fillings with the SEC. The forward-looking statements made in this presentation are being made as of the time and data of its live presentation. If reviewed after its live presentation, this presentation may not contain current or accurate information. We do not assume any obligation to update any forward-looking statements we may make. In addition, any information about our roadmap outlines our general product direction and is subject to change at any time without notice. It is for informational purposes only and shall not be incorporated into any contract or other commitment. Splunk undertakes no obligation either to develop the features or functionality described or to include any such feature or functionality in a future release. Splunk, Splunk>, Listen to Your Data, The Engine for Machine Data, Splunk Cloud, Splunk Light and SPL are trademarks and registered trademarks of Splunk Inc. in the United States and other countries. All other brand names, product names, or trademarks belong to their respective owners. © 2017 Splunk Inc. All rights reserved.
  • 3. Agenda ● Splunk’s challenges ● Docker Enterprise Edition ● Splunk + Docker EE ● Demo ● Before & After Docker Metrics ● Lessons Learned
  • 4. Infrastructure ● Large Scale Scrum “LeSS Huge” Model ● Engineering Infrastructure Area ○ 13 teams (about 100 engineers) in California & Shanghai ○ Test quality, automation, common tools, frameworks, build systems ○ Working on an overhaul of all test infrastructure, including CI/CD
  • 5. Multiple CI EnvironmentsTest Server Sprawl 100+ bare-metal servers for functional and performance testing Too many frameworks, manual work Many days to setup a single test Splunk Challenges Bamboo vs Jenkins Plans & agents managed by hand Physical server agents, very poor scalability LONG wait times, build bottleneck, not enough capacity
  • 6. Wiki-Managed Infrastructure soln-perf66.sv.splunk.com (root) soln- perf66.ilo.sv.splunk.com (AD Credential) 2x 8-core Xeon 2.40Ghz, 32 Gb RAM, 2x 1Tb SATA 6G HDD (RAID 1) Allan (Large ES env) forever soln-perf67.sv.splunk.com (root) soln- perf67.ilo.sv.splunk.com (AD Credential) 2x 8-core Xeon 2.40Ghz, 32 Gb RAM, 2x 1Tb SATA 6G HDD (RAID 1) Mike – UCP dev cluster forever soln-perf68.sv.splunk.com (root) soln- perf68.ilo.sv.splunk.com (AD Credential) 2x 8-core Xeon 2.40Ghz, 32 Gb RAM, 2x 1Tb SATA 6G HDD (RAID 1) Dhananjay 4/26/2017 soln-perf69.sv.splunk.com (root) soln- perf69.ilo.sv.splunk.com (AD Credential) 2x 8-core Xeon 2.40Ghz, 32 Gb RAM, 2x 1Tb SATA 6G HDD (RAID 1) Stream – Manan/Vladimir forever
  • 7. Why Docker ● Fun ● De-facto ● Bare-metal performance ● Consistency ● Efficient ● Successful POCs
  • 8. Selling Docker Internally ● Why? ○ Reduce waste of underutilized servers ○ Automate manual testing work ○ Enables more testing, higher quality ● Start small, deliver value quickly with each iteration ● CPO, CFO & CEO as Scale & Scope grew
  • 9. Why Docker EE ● Windows Server 2016 & Linux ● Role Based Access Control ● Compose & common Docker API ● End user experience : Docker on Desktop and Docker on Infrastructure ● End to end support from vendor - ■ Around the corner (literally)
  • 10.
  • 11. Docker EE components Public Cloud Virtual Physical docker enterprise edition ADVANCED INTEGRATED SECURITY docker trusted registry image management docker universal control plane app & cluster management docker engine container runtime, orchestration, networking, volumes, plugins CI/CD Images Operating Systems Volumes Monitoring Logging more...
  • 12. UCP ( Windows & Linux)
  • 13. Build a Secure Software Supply Chain (CaaS) Image RegistrySecurity scan & sign Traditional Third Party Microservices docker store DEVELOPERS IT OPERATIONS Control Plane
  • 14. services: database: image: sixeyed/atsea-db:mssql ports: - mode: host target: 1433 networks: - atsea deploy: endpoint_mode: dnsrr placement: constraints: - 'node.platform.os == windows' appserver: image: sixeyed/atsea-app:mssql ports: - target: 8080 published: 8080 networks: - atsea deploy: placement: constraints: - 'node.platform.os == linux' networks: atsea: node.platform.os is a built in label that can be used for workload placement Windows and Linux nodes can share a common overlay network Hybrid Applications
  • 15. 1:1:1 mapping of subject to role to collection. Grant Subject Role Collection Who (orgs, teams, users) can perform an action. What they can do Where work can be done Enhanced RBAC
  • 16. SUPPORT SECURE MULTI-TENANCY ACROSS MULTIPLE TEAMS THROUGH NODE-BASED ISOLATION AND SEGREGATION KEY FEATURES BENEFITS • Enforce node affinity and node anti-affinity rules to allow certain users/teams/orgs to deploy within a subset of nodes or outside certain nodes (eg. Production nodes) • Set up different security zones within the same cluster to isolate and segregate access of protected information • Support multiple teams within the same cluster while providing physical separation and isolation • Prevent “noisy neighbors” by limiting a team’s resources to approved nodes • Meet compliance and regulatory requirements by isolating sensitive workloads to certain nodes and limiting access to those nodes Prod Dev Dev Team A Dev Team B SecOpsOps Team PHI RBAC for Nodes Node Worker Node Worker Node Worker Node Worker
  • 17. Docker EE :Swarm + Kub ( Coming Soon)
  • 18. Demo ● Windows Server 2016 & Linux ● Role Based Access Control ● Compose & common Docker API ● End user experience : Docker on Desktop and Docker on Infrastructure ● End to end support from vendor - ■ Around the corner (literally)
  • 23. New Tooling to Test Splunk
  • 24. New CI/CD Platform Volume 3. Stage Build Runner 4. Stage Build Runner App Cert 6. Stage Publish5. Stage AppCert Runner Runner Splunkins as report service Notify build status Notify build status 2. Notify Jenkins to build UCP 1. Git Push Artifactory
  • 25. Results ● Test setup time reduced from days to minutes ○ Far more efficient use of the hardware we have ○ Enables us to run more tests, more frequently, earlier in release cycles ● Eliminated CI/CD Bottlenecks ● Example: DMA performance improvements in Splunk 7.0
  • 26. Growth & Adoption Unique Users Logging bug 4x Growth 150 Servers 265 Servers 300 Servers 400 Servers Server Utilization 600 Servers
  • 27. Number of Servers 600 Containers 1 Mil. >10,000/day CI Jobs executed 30,000 Average Utilization 75% Unique users 385 Splunk’s Docker Growth & Adoption Year to Date As of October 3, 2017
  • 28. Challenges & Journey Internal: ○ Basic container knowledge ■ Trained Developers and Ops ○ Let’s blame the new guy ■ Real source: Bad drives, memory chips, networking
  • 29. Challenges & Journey Bugs found & fixed: ○ Placement strategies (anti-affinity, binpacking) ○ LDAP (AD) & RBAC ○ Performance & Timeouts at Scale of 300-600 servers ○ Worked with support to prioritize
  • 30. Lessons Learned Test things before you update production Perform root cause analysis of problems Start small, over- communicate & set realistic expectations Monitor & respond quickly to problems Deliver what your customers ask for Invite contributions & participation
  • 32. Challenges & Journey Outstanding Performance & Accuracy of Metrics Errors related to state inconsistency Kernel & docker engine upgrades
  • 33. Docker EE :Swarm + Kub ( Coming Soon)
  • 34. Docker EE :Swarm + Kub ( Coming Soon)