SlideShare une entreprise Scribd logo
1  sur  33
Strata 2017
Creating a DevOps
Practice for Analytics
Bob Eilbacher
September 28, 2017
Agenda
 About Caserta
 DevOps
 DevOps for Analytics
 Organization and Teams
 Questions
About Caserta
 Data Intelligence Consulting and Modern Data Engineering
 Award-winning data innovation
 Internationally recognized work force
 Strategy, Architecture, Governance, Implementation
About Caserta
 Architecture & Design
 Implementation Services
 Disruption Management
 Strategic Technical Consulting
 Training & Education
 Application Innovation
 Cloud Management
What is DevOps for Analytics?
First some terminology…
 DevOps
 Associated with movement primarily in application
development space for last 5-10 years
 Focused on very fast and continuous software product
releases
 Think intra-day Prod releases at Netflix, Amazon, etc.
 Convergence of development and operations
methodologies to minimize TTR
 Tons of resources – devops.com, DZone
What is DevOps for Analytics?
Some more terminology…
 DataOps
 Re-emergent term
 Seems to have a broader context
 Applying DevOps to data management or to handling
backend databases
 Also tends to carry real legacy connotation
 Manual operations of database backups and restores,
What is DevOps for Analytics?
And finally…
 AnalyticsOps
 This is a term that we see starting to be used more
 Its focused on applying DevOps practices within a data
analytics and data science context
 This is the area we’re interested in for this talk
 We’ll use the terms AnalyticsOps or the more explicit
DevOps for Analytics interchangeably
DevOps…
 Speak with anyone and they will tell you first that DevOps
is a culture
 Based primarily on teamwork
DevOps…
DevOps…
 Speak with anyone and they will tell you first that DevOps is a
“culture”
 Based primarily on teamwork
 Aims to address the underlying conflict between
development and operations objectives
Innovation @ speed vs. Performance @ quality
Change vs. Stability
 Culture is not “implemented”
 It needs to evolve
 Good news is it can be seeded
DevOps…
 It works!
 75% of IT and product dev organizations were successfully
using DevOps to some extent
– Source: RightScale 2016 State of the Cloud Report
 It’s flexible
 No two companies’ DevOps approaches will look the same
 Infinite number of ways to create teamwork
 A reflection of the organization itself
DevOps…
 DevOps tenets
 Continuous Integration
 Test Automation
 Continuous Delivery
 Continuous Deployment
 End-to-end automation is still aspirational for most
companies
 Justify how much automation you need based on business
requirements.
DevOps…
 What DevOps is not is a toolchain implementation
 Tools help the team execute within the culture
 Don’t run out and put an end to end chain in place and then
expect adoption
 Lets talk about tools for a minute …
 Explosion of both open-source and commercial DevOps
tooling
 Serve every discrete need
 requirements management, SCM, test automation, defect
tracking, build, deployment, monitoring and more
 1,500+ tools available
DevOps…
 Tooling categories:
 Code : Code development, version control tools, code merging
 Build : Continuous integration tools, build status
 Test : Test and results determine performance
 Package : Artifact repository, application pre-deployment
staging
 Release : Change management, release automation
 Configure : Infrastructure configuration and management,
Infrastructure as Code tools
 Monitor : Applications performance monitoring, end user
experience
DevOps…
Source: XebiaLabs
Why DevOps for Analytics?
“The fact is that analytic teams are
being compared by their businesses to
Amazon Prime – 2-day delivery of
almost anything”
Source: Unknown
Why DevOps for Analytics?
Why DevOps for Analytics?
 A couple of recent real world examples…
Data Science Rock Star Process Overengineering
Why DevOps for Analytics?
 Analytics and data science projects, what used to take
months to achieve is happening in days or hours
 Businesses typically like that and want more…
 Enabled by the strong trend toward cloud analytic
platforms/services
 Infrastructure as code (IaC) allows extension of software
development practices to servers and infrastructure
 We can automate the build of complex analytic pipelines -
storage, processing engines, etc. with relative ease
DevOps for Analytics
 DevOps for Analytics combines the development and
operations teams and establishes best practices that
improve coordination between data science and operations
 BUT… Data Science and Analytics are different from
application development
 Especially in a Big Data environments - need big data to test big
data applications
 Much more diverse mix of tools and technologies – not just java
 Some differences in approach are needed
DevOps for Analytics
 AnalyticsOps this is still in its early days
 There aren’t any real solid industry success stories published
 People are still trying to figure out what works and aren’t’ open
kimono and sharing experiences just yet
 Not a lot of experienced practitioners
 But there are some early themes and guidelines emerging
DevOps for Analytics
 Environments
 Separate DEV and PROD environments
 Should you reuse any of the PROD data assets?
 Separate landing area, destination area (Data Lake), etc.
 Trickier with increasing data volumes – do it smart to avoid
double costs
 Sharing compute cluster resources is OK
 Make all job inputs and outputs configuration driven (PROD
and DEV code doesn’t change) – for CI
DevOps for Analytics
 Automated Testing
 It’s almost impossible to get full code coverage
 How do you unit test SPARK SQL scripts? Regression tests?
Data validation?
 Test data is a complex problem – handle as a cross-functional
initiative.
 Analytic results are often buried in complex outputs, QA
becomes forensic data analysis
 Automate what you can, supplement with community based
real-world data testing in a parallel Dev/Test environment
 The role of the Test/QA Engineer is still really important
 Test/QA Engineers need Data Engineering experience
DevOps for Analytics
 Monitoring
 Tracking and analyzing intra-day demand and longer term trends
in infrastructure performance (standard DevOps)
 But then…
 By their nature analytics processes require monitoring and
tuning over time with real-world inputs
 Data drifts; Predictive models have a finite lifetime
 Silent failures
 Feedback to developers so they can see how their code is
performing and affecting the Prod environment
 Continuous improvement
 The next wave is analytics on analytics…
DevOps for Analytics
 Emerging DevOps for Analytics environment usually contain
 SCM
 CI
 Repo to store analytics app
 Repo to store configuration
 An API to deploy to the cluster
 Mechanism to monitor behavior and performance
DevOps for Analytics Organization
 Building a DevOps for Analytics culture is not an easy
undertaking
 Should fall under the purview of a dedicated data organization
 These organizations are typically lead by the Chief Data
Officer
 More recently by Chief Data Scientist a Chief Analytics Officer
 Key responsibilities include
 Fostering adoption
 Clarifying and aligning to the business' vision
 Securing reasonable funding
DevOps for Analytics Organization
 The goal over time is to create lean, highly performant, cross-
functional, extremely effective teams
 Business Stakeholders
 Data Engineers
 Data Analysts & Data Scientists
 QA
 Operations
 All of these skills are important - but when in doubt get more Data
Engineers!
 Everyone on team has an equal voice
 Everyone codes & Everyone needs to know what Prod looks like
DevOps for Analytics Organization
 Start-up Condition: Bring in an experienced set of DevOps for
Analytics Engineers
 Help define the culture, lead by example
 Identify the Innovators and get them involved and leading
 The DevOps Engineers job is to ultimately engineer themselves out
of the equation
Source: Matthew Skelton, DevOps Patterns - Team Topologies
Final Thoughts
“We aim to engineer systems and processes
to better integrate development and
operations, resulting in decreased time to
market and an application infrastructure
that is instrumented, scalable and fault
tolerant… and immortal!”
- Will Liu, Equinox Data Team
Final Thoughts
 There are plenty of benefits in establishing a DevOps
for Analytics culture for your organization
 For the business: Speed to insight
 For the teams: Professional and personal satisfaction
 Be Fearless –
go build your own DevOps for Analytics culture!
Questions
Happy Birthday Joe Caserta!
Thank You
 Bob Eilbacher
 Vice President Operations, Caserta
 bob@casertaconcepts.com
Upcoming Training Opportunity:
Caserta is hosting 3 Days of Training Courses October 18-20th in NYC,
taught by Joe Caserta, co-author of The Data Warehouse ETL Toolkit:
Day 1: Agile Data Warehouse Design & Dimensional Modeling
Day 2: ETL Architecture & Design
Day 3: Big Data for Data Warehouse Practitioners
More info at casertaconcepts.com/event/

Contenu connexe

Tendances

AWS vs Azure vs Google (GCP) - Slides
AWS vs Azure vs Google (GCP) - SlidesAWS vs Azure vs Google (GCP) - Slides
AWS vs Azure vs Google (GCP) - SlidesTobyWilman
 
The Basics of Getting Started With Microsoft Azure
The Basics of Getting Started With Microsoft AzureThe Basics of Getting Started With Microsoft Azure
The Basics of Getting Started With Microsoft AzureMicrosoft Azure
 
1- Introduction of Azure data factory.pptx
1- Introduction of Azure data factory.pptx1- Introduction of Azure data factory.pptx
1- Introduction of Azure data factory.pptxBRIJESH KUMAR
 
Microsoft M365 Cross Tenant Migration Book
Microsoft M365 Cross Tenant Migration BookMicrosoft M365 Cross Tenant Migration Book
Microsoft M365 Cross Tenant Migration BookThomas Poett
 
Introduction to AWS Storage Services
Introduction to AWS Storage ServicesIntroduction to AWS Storage Services
Introduction to AWS Storage ServicesAmazon Web Services
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake OverviewJames Serra
 
CAF presentation 09 16-2020
CAF presentation 09 16-2020CAF presentation 09 16-2020
CAF presentation 09 16-2020Michael Nichols
 
Azure Stack Fundamentals
Azure Stack FundamentalsAzure Stack Fundamentals
Azure Stack FundamentalsCenk Ersoy
 
Mastering SharePoint Migration Planning
Mastering SharePoint Migration PlanningMastering SharePoint Migration Planning
Mastering SharePoint Migration PlanningChristian Buckley
 
Designing and Building Next Generation Data Pipelines at Scale with Structure...
Designing and Building Next Generation Data Pipelines at Scale with Structure...Designing and Building Next Generation Data Pipelines at Scale with Structure...
Designing and Building Next Generation Data Pipelines at Scale with Structure...Databricks
 
Modern Data Flow
Modern Data FlowModern Data Flow
Modern Data Flowconfluent
 
Modernize Solutions with SharePoint & the Power Platform
Modernize Solutions with SharePoint & the Power PlatformModernize Solutions with SharePoint & the Power Platform
Modernize Solutions with SharePoint & the Power PlatformJonathan Schultz
 
Accelerate Cloud Migration to AWS Cloud with Cognizant Cloud Steps
Accelerate Cloud Migration to AWS Cloud with Cognizant Cloud StepsAccelerate Cloud Migration to AWS Cloud with Cognizant Cloud Steps
Accelerate Cloud Migration to AWS Cloud with Cognizant Cloud StepsAmazon Web Services
 
Cloud Has Become the New Normal: TCS
Cloud Has Become the New Normal: TCS Cloud Has Become the New Normal: TCS
Cloud Has Become the New Normal: TCS Amazon Web Services
 
Introduction to Azure Databricks
Introduction to Azure DatabricksIntroduction to Azure Databricks
Introduction to Azure DatabricksJames Serra
 
Data Center Migration to the AWS Cloud
Data Center Migration to the AWS CloudData Center Migration to the AWS Cloud
Data Center Migration to the AWS CloudTom Laszewski
 

Tendances (20)

AWS vs Azure vs Google (GCP) - Slides
AWS vs Azure vs Google (GCP) - SlidesAWS vs Azure vs Google (GCP) - Slides
AWS vs Azure vs Google (GCP) - Slides
 
The Basics of Getting Started With Microsoft Azure
The Basics of Getting Started With Microsoft AzureThe Basics of Getting Started With Microsoft Azure
The Basics of Getting Started With Microsoft Azure
 
1- Introduction of Azure data factory.pptx
1- Introduction of Azure data factory.pptx1- Introduction of Azure data factory.pptx
1- Introduction of Azure data factory.pptx
 
Lift & Shift to Azure
Lift & Shift to AzureLift & Shift to Azure
Lift & Shift to Azure
 
Introduction to AWS Glue
Introduction to AWS Glue Introduction to AWS Glue
Introduction to AWS Glue
 
Microsoft M365 Cross Tenant Migration Book
Microsoft M365 Cross Tenant Migration BookMicrosoft M365 Cross Tenant Migration Book
Microsoft M365 Cross Tenant Migration Book
 
Introduction to AWS Storage Services
Introduction to AWS Storage ServicesIntroduction to AWS Storage Services
Introduction to AWS Storage Services
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
CAF presentation 09 16-2020
CAF presentation 09 16-2020CAF presentation 09 16-2020
CAF presentation 09 16-2020
 
Azure Stack Fundamentals
Azure Stack FundamentalsAzure Stack Fundamentals
Azure Stack Fundamentals
 
Mastering SharePoint Migration Planning
Mastering SharePoint Migration PlanningMastering SharePoint Migration Planning
Mastering SharePoint Migration Planning
 
Designing and Building Next Generation Data Pipelines at Scale with Structure...
Designing and Building Next Generation Data Pipelines at Scale with Structure...Designing and Building Next Generation Data Pipelines at Scale with Structure...
Designing and Building Next Generation Data Pipelines at Scale with Structure...
 
Data Migration to Azure
Data Migration to AzureData Migration to Azure
Data Migration to Azure
 
Modern Data Flow
Modern Data FlowModern Data Flow
Modern Data Flow
 
Azure storage
Azure storageAzure storage
Azure storage
 
Modernize Solutions with SharePoint & the Power Platform
Modernize Solutions with SharePoint & the Power PlatformModernize Solutions with SharePoint & the Power Platform
Modernize Solutions with SharePoint & the Power Platform
 
Accelerate Cloud Migration to AWS Cloud with Cognizant Cloud Steps
Accelerate Cloud Migration to AWS Cloud with Cognizant Cloud StepsAccelerate Cloud Migration to AWS Cloud with Cognizant Cloud Steps
Accelerate Cloud Migration to AWS Cloud with Cognizant Cloud Steps
 
Cloud Has Become the New Normal: TCS
Cloud Has Become the New Normal: TCS Cloud Has Become the New Normal: TCS
Cloud Has Become the New Normal: TCS
 
Introduction to Azure Databricks
Introduction to Azure DatabricksIntroduction to Azure Databricks
Introduction to Azure Databricks
 
Data Center Migration to the AWS Cloud
Data Center Migration to the AWS CloudData Center Migration to the AWS Cloud
Data Center Migration to the AWS Cloud
 

Similaire à Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017

Introduction to DevOps slides-converted (1).pptx
Introduction to DevOps slides-converted (1).pptxIntroduction to DevOps slides-converted (1).pptx
Introduction to DevOps slides-converted (1).pptxaasssss1
 
Innovate Better Through Machine data Analytics
Innovate Better Through Machine data AnalyticsInnovate Better Through Machine data Analytics
Innovate Better Through Machine data AnalyticsHal Rottenberg
 
Integrating SAP into DevOps Pipelines: Why and How
Integrating SAP into DevOps Pipelines: Why and HowIntegrating SAP into DevOps Pipelines: Why and How
Integrating SAP into DevOps Pipelines: Why and HowDevOps.com
 
Introduction to DevOps slides.pdf
Introduction to DevOps slides.pdfIntroduction to DevOps slides.pdf
Introduction to DevOps slides.pdfBoreVishnusai
 
DevOps-driving-blind
DevOps-driving-blindDevOps-driving-blind
DevOps-driving-blindPaul Peissner
 
ITpreneurs’ DevOps Portfolio- Professionalizing DevOps Skills
ITpreneurs’ DevOps Portfolio- Professionalizing DevOps SkillsITpreneurs’ DevOps Portfolio- Professionalizing DevOps Skills
ITpreneurs’ DevOps Portfolio- Professionalizing DevOps SkillsITpreneurs
 
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...SlideTeam
 
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...Amazon Web Services
 
Join Us to Explore DevOps on AWS with REAN Cloud
Join Us to Explore DevOps on AWS with REAN CloudJoin Us to Explore DevOps on AWS with REAN Cloud
Join Us to Explore DevOps on AWS with REAN CloudAmazon Web Services
 
DevOps for the Discouraged
DevOps for the Discouraged DevOps for the Discouraged
DevOps for the Discouraged James Wickett
 
Meetup DevOps - Accelerate
Meetup DevOps - AccelerateMeetup DevOps - Accelerate
Meetup DevOps - AccelerateDelta-N
 
DevOps Culture transformation in Modern Software Delivery
DevOps Culture transformation in Modern Software DeliveryDevOps Culture transformation in Modern Software Delivery
DevOps Culture transformation in Modern Software DeliveryNajib Radzuan
 
Continuous Security / DevSecOps- Why How and What
Continuous Security /  DevSecOps- Why How and WhatContinuous Security /  DevSecOps- Why How and What
Continuous Security / DevSecOps- Why How and WhatMarc Hornbeek
 
DevOps culture, concepte , philosophie and practices
DevOps culture, concepte , philosophie and practicesDevOps culture, concepte , philosophie and practices
DevOps culture, concepte , philosophie and practicesayoubbahaddouayoub
 
apidays LIVE India 2022_Achieving High DevOps Practice Maturity.pptx
apidays LIVE India 2022_Achieving High DevOps Practice Maturity.pptxapidays LIVE India 2022_Achieving High DevOps Practice Maturity.pptx
apidays LIVE India 2022_Achieving High DevOps Practice Maturity.pptxapidays
 

Similaire à Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017 (20)

DevOps 1 (1).pptx
DevOps 1 (1).pptxDevOps 1 (1).pptx
DevOps 1 (1).pptx
 
Introduction to DevOps slides-converted (1).pptx
Introduction to DevOps slides-converted (1).pptxIntroduction to DevOps slides-converted (1).pptx
Introduction to DevOps slides-converted (1).pptx
 
Innovate Better Through Machine data Analytics
Innovate Better Through Machine data AnalyticsInnovate Better Through Machine data Analytics
Innovate Better Through Machine data Analytics
 
DevOps
DevOpsDevOps
DevOps
 
Dev ops
Dev opsDev ops
Dev ops
 
Integrating SAP into DevOps Pipelines: Why and How
Integrating SAP into DevOps Pipelines: Why and HowIntegrating SAP into DevOps Pipelines: Why and How
Integrating SAP into DevOps Pipelines: Why and How
 
Introduction to DevOps slides.pdf
Introduction to DevOps slides.pdfIntroduction to DevOps slides.pdf
Introduction to DevOps slides.pdf
 
Breaking DevOps Illusion
Breaking DevOps IllusionBreaking DevOps Illusion
Breaking DevOps Illusion
 
DevOps-driving-blind
DevOps-driving-blindDevOps-driving-blind
DevOps-driving-blind
 
ITpreneurs’ DevOps Portfolio- Professionalizing DevOps Skills
ITpreneurs’ DevOps Portfolio- Professionalizing DevOps SkillsITpreneurs’ DevOps Portfolio- Professionalizing DevOps Skills
ITpreneurs’ DevOps Portfolio- Professionalizing DevOps Skills
 
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
 
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
AWS re:Invent 2016: Lift and Evolve – Saving Money in the Cloud is Easy, Maki...
 
Join Us to Explore DevOps on AWS with REAN Cloud
Join Us to Explore DevOps on AWS with REAN CloudJoin Us to Explore DevOps on AWS with REAN Cloud
Join Us to Explore DevOps on AWS with REAN Cloud
 
DevOps for the Discouraged
DevOps for the Discouraged DevOps for the Discouraged
DevOps for the Discouraged
 
Dev ops concept
Dev ops conceptDev ops concept
Dev ops concept
 
Meetup DevOps - Accelerate
Meetup DevOps - AccelerateMeetup DevOps - Accelerate
Meetup DevOps - Accelerate
 
DevOps Culture transformation in Modern Software Delivery
DevOps Culture transformation in Modern Software DeliveryDevOps Culture transformation in Modern Software Delivery
DevOps Culture transformation in Modern Software Delivery
 
Continuous Security / DevSecOps- Why How and What
Continuous Security /  DevSecOps- Why How and WhatContinuous Security /  DevSecOps- Why How and What
Continuous Security / DevSecOps- Why How and What
 
DevOps culture, concepte , philosophie and practices
DevOps culture, concepte , philosophie and practicesDevOps culture, concepte , philosophie and practices
DevOps culture, concepte , philosophie and practices
 
apidays LIVE India 2022_Achieving High DevOps Practice Maturity.pptx
apidays LIVE India 2022_Achieving High DevOps Practice Maturity.pptxapidays LIVE India 2022_Achieving High DevOps Practice Maturity.pptx
apidays LIVE India 2022_Achieving High DevOps Practice Maturity.pptx
 

Plus de Caserta

Using Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingUsing Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingCaserta
 
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...Caserta
 
General Data Protection Regulation - BDW Meetup, October 11th, 2017
General Data Protection Regulation - BDW Meetup, October 11th, 2017General Data Protection Regulation - BDW Meetup, October 11th, 2017
General Data Protection Regulation - BDW Meetup, October 11th, 2017Caserta
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Caserta
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteCaserta
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Caserta
 
Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017
Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017
Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017Caserta
 
The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseCaserta
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Caserta
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Caserta
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?Caserta
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation Caserta
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for EveryoneCaserta
 
Benefits of the Azure Cloud
Benefits of the Azure CloudBenefits of the Azure Cloud
Benefits of the Azure CloudCaserta
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the CloudCaserta
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on HadoopCaserta
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data LakeCaserta
 
Not Your Father's Database by Databricks
Not Your Father's Database by DatabricksNot Your Father's Database by Databricks
Not Your Father's Database by DatabricksCaserta
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkCaserta
 
Moving Past Infrastructure Limitations
Moving Past Infrastructure LimitationsMoving Past Infrastructure Limitations
Moving Past Infrastructure LimitationsCaserta
 

Plus de Caserta (20)

Using Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven MarketingUsing Machine Learning & Spark to Power Data-Driven Marketing
Using Machine Learning & Spark to Power Data-Driven Marketing
 
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
Data Intelligence: How the Amalgamation of Data, Science, and Technology is C...
 
General Data Protection Regulation - BDW Meetup, October 11th, 2017
General Data Protection Regulation - BDW Meetup, October 11th, 2017General Data Protection Regulation - BDW Meetup, October 11th, 2017
General Data Protection Regulation - BDW Meetup, October 11th, 2017
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
 
Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017
Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017
Looker Data Modeling in the Age of Cloud - BDW Meetup May 2, 2017
 
The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's Enterprise
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Benefits of the Azure Cloud
Benefits of the Azure CloudBenefits of the Azure Cloud
Benefits of the Azure Cloud
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the Cloud
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on Hadoop
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
 
Not Your Father's Database by Databricks
Not Your Father's Database by DatabricksNot Your Father's Database by Databricks
Not Your Father's Database by Databricks
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
Moving Past Infrastructure Limitations
Moving Past Infrastructure LimitationsMoving Past Infrastructure Limitations
Moving Past Infrastructure Limitations
 

Dernier

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 

Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017

  • 1. Strata 2017 Creating a DevOps Practice for Analytics Bob Eilbacher September 28, 2017
  • 2. Agenda  About Caserta  DevOps  DevOps for Analytics  Organization and Teams  Questions
  • 3. About Caserta  Data Intelligence Consulting and Modern Data Engineering  Award-winning data innovation  Internationally recognized work force  Strategy, Architecture, Governance, Implementation
  • 4. About Caserta  Architecture & Design  Implementation Services  Disruption Management  Strategic Technical Consulting  Training & Education  Application Innovation  Cloud Management
  • 5. What is DevOps for Analytics? First some terminology…  DevOps  Associated with movement primarily in application development space for last 5-10 years  Focused on very fast and continuous software product releases  Think intra-day Prod releases at Netflix, Amazon, etc.  Convergence of development and operations methodologies to minimize TTR  Tons of resources – devops.com, DZone
  • 6. What is DevOps for Analytics? Some more terminology…  DataOps  Re-emergent term  Seems to have a broader context  Applying DevOps to data management or to handling backend databases  Also tends to carry real legacy connotation  Manual operations of database backups and restores,
  • 7. What is DevOps for Analytics? And finally…  AnalyticsOps  This is a term that we see starting to be used more  Its focused on applying DevOps practices within a data analytics and data science context  This is the area we’re interested in for this talk  We’ll use the terms AnalyticsOps or the more explicit DevOps for Analytics interchangeably
  • 8. DevOps…  Speak with anyone and they will tell you first that DevOps is a culture  Based primarily on teamwork
  • 10. DevOps…  Speak with anyone and they will tell you first that DevOps is a “culture”  Based primarily on teamwork  Aims to address the underlying conflict between development and operations objectives Innovation @ speed vs. Performance @ quality Change vs. Stability  Culture is not “implemented”  It needs to evolve  Good news is it can be seeded
  • 11. DevOps…  It works!  75% of IT and product dev organizations were successfully using DevOps to some extent – Source: RightScale 2016 State of the Cloud Report  It’s flexible  No two companies’ DevOps approaches will look the same  Infinite number of ways to create teamwork  A reflection of the organization itself
  • 12. DevOps…  DevOps tenets  Continuous Integration  Test Automation  Continuous Delivery  Continuous Deployment  End-to-end automation is still aspirational for most companies  Justify how much automation you need based on business requirements.
  • 13. DevOps…  What DevOps is not is a toolchain implementation  Tools help the team execute within the culture  Don’t run out and put an end to end chain in place and then expect adoption  Lets talk about tools for a minute …  Explosion of both open-source and commercial DevOps tooling  Serve every discrete need  requirements management, SCM, test automation, defect tracking, build, deployment, monitoring and more  1,500+ tools available
  • 14. DevOps…  Tooling categories:  Code : Code development, version control tools, code merging  Build : Continuous integration tools, build status  Test : Test and results determine performance  Package : Artifact repository, application pre-deployment staging  Release : Change management, release automation  Configure : Infrastructure configuration and management, Infrastructure as Code tools  Monitor : Applications performance monitoring, end user experience
  • 16. Why DevOps for Analytics? “The fact is that analytic teams are being compared by their businesses to Amazon Prime – 2-day delivery of almost anything” Source: Unknown
  • 17. Why DevOps for Analytics?
  • 18. Why DevOps for Analytics?  A couple of recent real world examples… Data Science Rock Star Process Overengineering
  • 19. Why DevOps for Analytics?  Analytics and data science projects, what used to take months to achieve is happening in days or hours  Businesses typically like that and want more…  Enabled by the strong trend toward cloud analytic platforms/services  Infrastructure as code (IaC) allows extension of software development practices to servers and infrastructure  We can automate the build of complex analytic pipelines - storage, processing engines, etc. with relative ease
  • 20. DevOps for Analytics  DevOps for Analytics combines the development and operations teams and establishes best practices that improve coordination between data science and operations  BUT… Data Science and Analytics are different from application development  Especially in a Big Data environments - need big data to test big data applications  Much more diverse mix of tools and technologies – not just java  Some differences in approach are needed
  • 21. DevOps for Analytics  AnalyticsOps this is still in its early days  There aren’t any real solid industry success stories published  People are still trying to figure out what works and aren’t’ open kimono and sharing experiences just yet  Not a lot of experienced practitioners  But there are some early themes and guidelines emerging
  • 22. DevOps for Analytics  Environments  Separate DEV and PROD environments  Should you reuse any of the PROD data assets?  Separate landing area, destination area (Data Lake), etc.  Trickier with increasing data volumes – do it smart to avoid double costs  Sharing compute cluster resources is OK  Make all job inputs and outputs configuration driven (PROD and DEV code doesn’t change) – for CI
  • 23. DevOps for Analytics  Automated Testing  It’s almost impossible to get full code coverage  How do you unit test SPARK SQL scripts? Regression tests? Data validation?  Test data is a complex problem – handle as a cross-functional initiative.  Analytic results are often buried in complex outputs, QA becomes forensic data analysis  Automate what you can, supplement with community based real-world data testing in a parallel Dev/Test environment  The role of the Test/QA Engineer is still really important  Test/QA Engineers need Data Engineering experience
  • 24. DevOps for Analytics  Monitoring  Tracking and analyzing intra-day demand and longer term trends in infrastructure performance (standard DevOps)  But then…  By their nature analytics processes require monitoring and tuning over time with real-world inputs  Data drifts; Predictive models have a finite lifetime  Silent failures  Feedback to developers so they can see how their code is performing and affecting the Prod environment  Continuous improvement  The next wave is analytics on analytics…
  • 25. DevOps for Analytics  Emerging DevOps for Analytics environment usually contain  SCM  CI  Repo to store analytics app  Repo to store configuration  An API to deploy to the cluster  Mechanism to monitor behavior and performance
  • 26. DevOps for Analytics Organization  Building a DevOps for Analytics culture is not an easy undertaking  Should fall under the purview of a dedicated data organization  These organizations are typically lead by the Chief Data Officer  More recently by Chief Data Scientist a Chief Analytics Officer  Key responsibilities include  Fostering adoption  Clarifying and aligning to the business' vision  Securing reasonable funding
  • 27. DevOps for Analytics Organization  The goal over time is to create lean, highly performant, cross- functional, extremely effective teams  Business Stakeholders  Data Engineers  Data Analysts & Data Scientists  QA  Operations  All of these skills are important - but when in doubt get more Data Engineers!  Everyone on team has an equal voice  Everyone codes & Everyone needs to know what Prod looks like
  • 28. DevOps for Analytics Organization  Start-up Condition: Bring in an experienced set of DevOps for Analytics Engineers  Help define the culture, lead by example  Identify the Innovators and get them involved and leading  The DevOps Engineers job is to ultimately engineer themselves out of the equation Source: Matthew Skelton, DevOps Patterns - Team Topologies
  • 29. Final Thoughts “We aim to engineer systems and processes to better integrate development and operations, resulting in decreased time to market and an application infrastructure that is instrumented, scalable and fault tolerant… and immortal!” - Will Liu, Equinox Data Team
  • 30. Final Thoughts  There are plenty of benefits in establishing a DevOps for Analytics culture for your organization  For the business: Speed to insight  For the teams: Professional and personal satisfaction  Be Fearless – go build your own DevOps for Analytics culture!
  • 32. Happy Birthday Joe Caserta!
  • 33. Thank You  Bob Eilbacher  Vice President Operations, Caserta  bob@casertaconcepts.com Upcoming Training Opportunity: Caserta is hosting 3 Days of Training Courses October 18-20th in NYC, taught by Joe Caserta, co-author of The Data Warehouse ETL Toolkit: Day 1: Agile Data Warehouse Design & Dimensional Modeling Day 2: ETL Architecture & Design Day 3: Big Data for Data Warehouse Practitioners More info at casertaconcepts.com/event/