How Starbucks Forecasts Demand
at Scale with Facebook Prophet
and Databricks
Rob Saker, Retail Industry Leader at Databricks
Brendan O’Shaughnessy, Data Science Manager at Starbucks
Bilal Obeidat, Solutions Architect, Databricks
Housekeeping
• Your connection will be muted
• Submit questions via the Q&A panel
• Questions will be answered at the end of the webinar
• Any outstanding questions will be answered in the Databricks Forum
(https://forums.databricks.com)
• Webinar will be recorded and attachments will be made available via
www.databricks.com
Introducing Our Speakers
Bilal Obeidat, Lead Solutions Architect
● 15+ years of Software Development at Microsoft, Hitachi
● MS, Computer Science, Bradley University
Brendan O’Shaughnessy, Data Science Manager
● 10+ years of Analytics and Data Science at Starbucks, NRPC
● MS, Spatial Information Science and Engineering, University of Maine
Rob Saker, Retail Industry Leader
● Formerly Retail Industry Chief Data Officer (CDO)
● 15+ years in Data & Analytics leadership across Retail & CPG
● MBA, University of Nebraska
Unified data analytics platform for accelerating innovation across
data science, data engineering, and business analytics
Original creators of popular data and machine learning open source projects
Global company with 5,000 customers and 450+ partners
Databricks Customers - Over Half of Top 25 Retailers by Revenue
Media & Entertainment, Technology, Public Sector, Retail & CPG, Consumer Services, Energy & Industrial, Marketing & AdTech, Data & Analytics Services, Healthcare & Pharma, Financial Services
Agenda
➔ A Consumer-Driven Supply Chain
◆ Why Is Demand Forecasting Relevant?
◆ The Need for Granular Demand Forecasting at Scale
◆ How Does Starbucks Do Demand Forecasting?
➔ Demo: ML-based Forecasting with Facebook Prophet
➔ Conclusion
Customers want what they want,
where they want it, when they want it
Consumer Behavior is Changing Supply Networks
DIRECT TO CONSUMER
PERSONALIZATION
40% of Best Buy’s online purchases are picked up at the store
50% of all restaurant-prepared food is consumed away from the restaurant
CUSTOMER ENGAGEMENT FROM ANYWHERE
CONVENIENCE IS KEY
Your Supply Chain Needs to be Consumer-Driven
A Consumer-Driven Supply Chain Starts with Demand Forecasting
What is Demand Forecasting?
Demand forecasting is the process of using historical sales, promotions, shopper, and causal data to understand and predict customer demand.
Retail Supply Chain (diagram)
Stages: CPG Manufacturers, Distribution Centers, Retail Stores, Shoppers
Flows: Orders, Shipments, Store shipments, Direct-to-consumer shipments, E-commerce shipments to consumers, Sales
Shopper channels: In-store purchases, Buy-online pickup in store (BOPIS), E-commerce delivery, Subscription
Demand Forecast: Intrastore shipments, Store shipments, Inventory levels, Reorder quantity, DC shipments, Production plan, Home delivery logistics
Retail Capability Model
Columns: Supply Chain Management, Merchandising, Store Operations, Customer Management
Sub-columns: Planning, Freight & Logistics, Planning, Operations
• Supplier Management, Distribution, Replenishment, Inventory, Purchasing, Sourcing, Advanced Planning, Demand Forecasting
• Warehouse Management, Reverse Logistics, Transportation, Procurement, Logistics, Allocation
• Demand Planning, Revenue/Price Management, Promotions Planning, Assortment, Category Management, Space Planning, Fresh Planning, Demand Planning, Joint planning, Trade management
• Merchandising, Store Channel, Promotions, Digital, Store Auditing, Broker Management
• Store Operations, Workforce, Point-of-Sale, Shelf tags/Signage, Smart Store Devices, Store Inventory Management, Returns Management, Fresh Production, Loss Prevention
• Loyalty, Digital/Ecommerce, Mobile, Customer Support, Customer Segmentation/CRM, Voice AI/Call Center
Back Office: People Management, Recruiting, Finance, Asset Management, Real Estate, Energy & Utilities, IT, Industry Standards & Compliance, Legal, Master Data Management
Using Data and ML Across the Supply Chain is Challenging
• FORECASTING NOT ACCURATE OR GRANULAR: Managers unable to get per day/store/SKU forecast
• LARGE VOLUMES OF RAPIDLY CHANGING DATA: Data is constantly shifting and changing, e.g. revised data to account for shoplifting
• LIMITED REAL-TIME AND CAUSAL DATA: Omnichannel is making local (weather), real-time (IoT), and causal (competitor pricing) data more important
• NOT EASY TO GET TO ACTIONABLE INSIGHTS: Store/Distribution managers get BI tools with lots of data that they have no time to explore
Traditional Demand Forecasting with Allocations
Forecast level: Market Area / Promo Group / Week, allocated to Day
Traditional tools can’t scale to fine-grain
Granularity | Count
DC / Promo Group / Week | 105,000
DMA / Promo Group / Week | 525,000
Store / Promo Group / Week | 11,922,500
Store / SKU / Week | 238,450,000
Store / SKU / Day | 1,669,150,000
Chart axis: 500 million to 2 billion
Chart labels: Traditional Forecasting Tools; Fine-grained forecast with Databricks; Special run/few times a year
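The jump between granularity levels can be read directly off the counts on this slide. The sketch below uses only those figures and computes how much each step multiplies the count. It assumes that "11.922,500" on the slide is a typo for 11,922,500; the function and variable names are illustrative.

```python
# Counts as shown on the slide; level names are the slide's own labels.
# Assumption: "11.922,500" on the slide is a typo for 11,922,500.
levels = [
    ("DC / Promo Group / Week", 105_000),
    ("DMA / Promo Group / Week", 525_000),
    ("Store / Promo Group / Week", 11_922_500),
    ("Store / SKU / Week", 238_450_000),
    ("Store / SKU / Day", 1_669_150_000),
]


def growth_factors(levels):
    """Return (from_level, to_level, ratio) for each consecutive pair of levels."""
    out = []
    for (name_a, n_a), (name_b, n_b) in zip(levels, levels[1:]):
        out.append((name_a, name_b, n_b / n_a))
    return out


factors = growth_factors(levels)
for name_a, name_b, ratio in factors:
    print(f"{name_a} -> {name_b}: x{ratio:.2f}")
```

The last step (week to day) is a factor of 7, and the finest level is roughly 15,900 times the coarsest one.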
Demand Forecasting is Complicated
You Need Forecasts by Product by Store
Stores SKUs
You Have a Large # of Store-SKU Combinations
Processing each model sequentially is slow
Each model may have local causal influences
Distribute Model Training with Apache Spark™
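One way to train one model per store-SKU combination in parallel is Spark's grouped pandas API. The following is a minimal sketch, not the presenters' code: the column names (`store`, `sku`, `ds`, `y`), the 7-day horizon, and the "mean of the last week" model are all assumptions, with the simple model standing in for Prophet or any other algorithm.

```python
import pandas as pd

# Output schema for Spark as a DDL string (column names are illustrative).
RESULT_SCHEMA = "store string, sku string, ds date, yhat double"
HORIZON_DAYS = 7  # assumed forecast horizon


def forecast_group(pdf: pd.DataFrame) -> pd.DataFrame:
    """Forecast one store-SKU series.

    Stand-in model: repeats the mean of the last 7 observations.
    A real job would fit Prophet or another algorithm here.
    """
    pdf = pdf.sort_values("ds")
    last_day = pd.to_datetime(pdf["ds"]).max()
    level = float(pdf["y"].tail(7).mean())
    future = pd.date_range(
        last_day + pd.Timedelta(days=1), periods=HORIZON_DAYS, freq="D"
    )
    return pd.DataFrame(
        {
            "store": pdf["store"].iloc[0],
            "sku": pdf["sku"].iloc[0],
            "ds": future.date,
            "yhat": level,
        }
    )


def forecast_all(sales_sdf):
    """Run forecast_group once per (store, sku) group of a Spark DataFrame."""
    return sales_sdf.groupBy("store", "sku").applyInPandas(
        forecast_group, schema=RESULT_SCHEMA
    )
```

Because `forecast_group` is a plain pandas function, it can be tested on a single series locally before `forecast_all` is run on a cluster.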
Apache Spark: De-Facto Unified Analytics Engine
Uniquely combines Data & AI technologies
• Big Data Processing: ETL + SQL + Streaming
• Machine Learning: MLlib + SparkR
• Runtime: Delta, Spark Core Engine
• Languages: R, Python, Scala, Java
Use a Unified Data Analytics Platform Across the Data and ML Lifecycle
• DO GRANULAR AND ACCURATE FORECASTS: Use and track 100s of ML models to forecast demand by day/store/SKU using MLflow
• KEEP UP WITH CHANGING DATA: Use Delta Lake UPSERTS to keep data consistent
• USE REAL-TIME AND CAUSAL DATA: Single streamlined pipeline for real-time and streaming data with Delta Lake and Apache Spark™
• ACTIONABLE AND EASY INSIGHTS FOR MANAGERS: Point BI tools that store managers use directly at ML insights stored on Delta Lake tables
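The Delta Lake upsert mentioned on this slide corresponds to a MERGE operation. Below is a rough sketch using the `delta-spark` Python API; the helper names, the aliases `t` and `s`, and the key columns are illustrative assumptions, not part of the presentation.

```python
def merge_condition(keys, target="t", source="s"):
    """Build the MERGE join condition, e.g. 't.store = s.store AND t.sku = s.sku'."""
    return " AND ".join(f"{target}.{k} = {source}.{k}" for k in keys)


def upsert(spark, updates_sdf, table_name, keys):
    """Upsert updates_sdf into an existing Delta table.

    Rows matching on all key columns are updated; the rest are inserted.
    Requires the delta-spark package and a Spark session configured for Delta.
    """
    # Imported lazily so the module loads without Spark installed.
    from delta.tables import DeltaTable

    (
        DeltaTable.forName(spark, table_name)
        .alias("t")
        .merge(updates_sdf.alias("s"), merge_condition(keys))
        .whenMatchedUpdateAll()
        .whenNotMatchedInsertAll()
        .execute()
    )
```

A call such as `upsert(spark, revised_sales, "sales_daily", ["store", "sku", "ds"])` (table name hypothetical) would overwrite revised rows and append new ones in a single transaction.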
Unified Data Analytics for Consumer-Driven Supply Chain
Data sources: Inventory, IoT, Competitor, Pricing, SKU, Demographics, Geo-location, POS, Video, and Shipment data
Outcomes: Forecast Demand, Optimize Inventory, Faster Freight and Logistics
Databricks Delivers Fine-Grained Demand Forecasting
Capability | Traditional Analysis Suites | Databricks
Fine-grained forecasting | Aggregate level | Day, store & SKU
Real-time data | No | Streaming data
Custom causal data | Limited | Integrate weather, online & mobile interactions
Multi-modal data for training | No | Structured, unstructured, image, video, sensor data
Localize models for greater accuracy | No | Yes
Push predictions to the edge | No | Yes
Starbucks Data Science
Forecasting Framework
“Flexible forecasting at scale”
Brendan O’Shaughnessy
Data Science Manager at
Starbucks
Forecasting has utility across business functions
▪ Delivery
▪ Promotions
▪ Market Planning
▪ Operations
▪ Equipment Usage
▪ Labor
Challenges
▪ Many metrics and granularities
▪ Timely delivery
▪ Easy accessibility for business stakeholders
▪ Iterative nature of data science
▪ Accuracy
▪ Quick diagnostics
Forecasting framework
Robust and reliable
▪ Built-in diagnostics
▪ Status monitoring
▪ Leverages Delta Lake and ADLS for data ingress and egress
▪ Executable from Databricks CLI
Flexible
▪ Agnostic to metric and granularity
▪ Leverages diverse forecasting algorithms using grouped pandas UDFs
▪ Modular structure for straightforward additions and enhancements
▪ Apache Spark™ enables quick runtimes regardless of the number of individual forecasts
Example structure
Create custom forecasts for any use case with relative ease
Inputs: Enterprise data (DB Delta), Historical metric, Custom data (ADLS)
Steps: Build dataset, then Model 1, Model 2, …, Model n, then Compile results
Outputs: To stakeholders, To data products
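The example structure (build a dataset, run several models, compile results) can be sketched in plain pandas. This is only an illustration of the modular, metric-agnostic idea: the registry, the two toy models, and the column names `group` and `ds` are hypothetical and not taken from the Starbucks framework.

```python
import pandas as pd


def mean_model(y: pd.Series) -> float:
    """Toy model: forecast the historical mean."""
    return float(y.mean())


def last_value_model(y: pd.Series) -> float:
    """Toy model: forecast the most recent value."""
    return float(y.iloc[-1])


# Hypothetical registry: adding "Model n" means adding one entry here.
MODELS = {"mean": mean_model, "last_value": last_value_model}


def build_dataset(enterprise: pd.DataFrame, custom: pd.DataFrame) -> pd.DataFrame:
    """Join the historical metric with custom attributes on the group key."""
    return enterprise.merge(custom, on="group", how="left")


def compile_results(dataset: pd.DataFrame, metric: str) -> pd.DataFrame:
    """Run every registered model on every group and stack the results."""
    rows = []
    for group, pdf in dataset.groupby("group"):
        series = pdf.sort_values("ds")[metric]
        for name, model in MODELS.items():
            rows.append({"group": group, "model": name, "forecast": model(series)})
    return pd.DataFrame(rows)
```

Passing the metric as a parameter and grouping on an arbitrary key is one way to keep such a pipeline agnostic to metric and granularity; on Spark, the inner loop would typically become a grouped pandas UDF.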
https://www.linkedin.com/company/starbucks/jobs/
Demand Forecasting
Demo
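The demo itself is not captured in the slide text. As a rough illustration of what a single-series Prophet forecast involves, the sketch below prepares a frame in the `ds`/`y` layout Prophet expects and fits a model. The helper names, the 90-day horizon, and the 95% interval width are assumptions for illustration.

```python
import pandas as pd


def to_prophet_frame(sales: pd.DataFrame, date_col: str, value_col: str) -> pd.DataFrame:
    """Rename to the two columns Prophet expects: 'ds' (dates) and 'y' (values)."""
    out = sales[[date_col, value_col]].rename(columns={date_col: "ds", value_col: "y"})
    out["ds"] = pd.to_datetime(out["ds"])
    return out.sort_values("ds").reset_index(drop=True)


def fit_and_forecast(history: pd.DataFrame, periods: int = 90) -> pd.DataFrame:
    """Fit Prophet on one series and return forecasts with uncertainty bounds."""
    # Lazy import; the package was named fbprophet before version 1.0.
    from prophet import Prophet

    model = Prophet(interval_width=0.95)
    model.fit(history)
    future = model.make_future_dataframe(
        periods=periods, freq="D", include_history=False
    )
    return model.predict(future)[["ds", "yhat", "yhat_lower", "yhat_upper"]]
```

For many store-SKU series, a function like `fit_and_forecast` would be applied per group rather than in a sequential loop.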
Key Takeaways
✓ A Consumer-driven supply chain is indispensable
✓ Start with demand forecasting at scale
✓ Localization of models is slow and tedious
✓ Using Data and ML across supply chain is challenging
Databricks provides Unified Data Analytics to bring Data and ML together for accurate and granular demand forecasting
Thank you! Questions?
Sign up for a Free Trial: databricks.com/trial
Register For Spark + AI Summit: https://databricks.com/sparkaisummit/
Read Blog: https://dbricks.co/DemFcst
