SlideShare a Scribd company logo
1 of 26
1© 2015 Pivotal Software, Inc. All rights reserved.
2© 2015 Pivotal Software, Inc. All rights reserved. 2© 2015 Pivotal Software, Inc. All rights reserved.
The Science of Segmentation:
What Questions Should You Be Asking Your Data?
April 14, 2015
Jarrod Vawdrey, Data Scientist @ Pivotal
Grace Gee, Data Scientist @ Pivotal
3© 2015 Pivotal Software, Inc. All rights reserved.
Agenda
• Typical State Of Companies New To Big Data Analytics
– Benefits of Big Data technologies
• When to Use Segmentation
– Common business problems
– Types of available data
• Use Cases & Approaches To Segmentation
– Common approaches
– Best practices
4© 2015 Pivotal Software, Inc. All rights reserved. 4© 2015 Pivotal Software, Inc. All rights reserved.
Typical State of Companies
New to Big Data Analytics
5© 2015 Pivotal Software, Inc. All rights reserved.
Typical State of Companies New to Analytics
• Companies in the process of transforming into a data-
driven organization often ask similar questions about
where to start:
How do I make data available for
my analysts?What tools are needed to efficiently
process and build models on my big data
sets?
What data should I be collecting and
archiving?
Where and how can I start to use all my
data to quickly gain actionable insights
and begin integrating data science into our
organization’s practices?
How do I leverage data to generate
value for stakeholders? How do I enable analysts and data
scientists to be more effective?
6© 2015 Pivotal Software, Inc. All rights reserved.
Common Business Challenges
Data Availability
• Disparate data sources
• No integration of data across lines of businesses
• Insufficient data
• Unknown single source of truth
Slow Time-to-Insight
• Often outdated analytics architectures focused on operational
processes hamper experimental nature of big data analytics
• Lack of knowledge about analytics software for in-place
processing of and computation on Big Data
• Company organizational structure inhibits fast acquisition of
data and communication of insights
7© 2015 Pivotal Software, Inc. All rights reserved.
Big Data Technologies for Data-Driven
Organizations
• Data Lake: efficient, massively scalable Big Data storage platform
– Store all data: we don’t want to inhibit the ability to answer future
questions
– Save all (structured, unstructured, and semi-structured) types of
data: we may not immediately know “optimal” form to store data
for analysis
– Work with multiple types of data from one location
– Centralized location of data accessible to all organizations
• Agile Analytics Platform: purpose-built architecture for getting
results and gaining insights quickly through parallel, in-place data
analytics
– No required sampling due to limited memory
– No data movement
– Scalable analytics
8© 2015 Pivotal Software, Inc. All rights reserved.
Big Data Technologies for Data-Driven
Organizations
Enterprise
Apps
Reporting
Prioritized
Operational
Processes
Data Sources
Inventory
Optimization
Demand
Forecasting
Proprietary
Structured Data
Proprietary
Unstructured Data
Partner Data
Self Reporting:
Google, Weblogs,
Twitter
External Sources:
Census, Nielsen,
Weather, etc…
Sensors
HAWQGreenplum DB
Pivotal HD (HDFS)
GemFire XD
MADlib, PL/R,
PL/Python, etc.
Platform-Driven Data Science
1 0
0 1 01 0
0 1
0 1
1 1 0
Fraud
Detection
9© 2015 Pivotal Software, Inc. All rights reserved.
Segmentation:
An important step for understanding data
• What is segmentation?
– Automatic grouping of entities based on a common set of
features
– Identification of patterns amongst similar entities
• What is segmentation good for?
– Identifying select features that greatly differentiate groups
of entities
• E.g. Identifying behaviors of high-profit suppliers and low-profit
suppliers
– Identifying similar characteristics amongst different groups
• E.g. Identifying similar market segments to target
– Predicting characteristics and behaviors of new or
unknown entities
• E.g. Inferring missing labels, predicting market response to new
products
10© 2015 Pivotal Software, Inc. All rights reserved.
Segmentation & Big Data Technologies
• Segmentation problems often deal with:
– Multiple data sources from multiple lines of business and external sources
– BIG DATA, particularly from sensor data or transactions/point of sales
– High-dimensional feature sets
• Big Data technologies help make segmentation problems become feasible
and bring faster time-to-insights through:
– Ability to leverage and integrate all relevant data sources, no matter how large
 Data Lake
– Using ALL data to train segmentation models and not rely on samples or a
subset of data that fits into memory
 MPP databases, Hadoop, HAWQ, MADlib, Spark, etc.
– Quickly building segmentation models and scoring new entities through
parallelized, in-place computation
 MPP databases, Hadoop, HAWQ, MADlib, Spark, etc.
How cutting edge Big Data technology enables faster insights
11© 2015 Pivotal Software, Inc. All rights reserved. 11© 2015 Pivotal Software, Inc. All rights reserved.
When To Use Segmentation
12© 2015 Pivotal Software, Inc. All rights reserved.
Common Business Problems
Customer Micro-targeting
Identifying market segments and their
purchasing behaviors
Operations & Logistics
Identifying behaviors of underperforming or
outperforming stores, suppliers, delivery
services, etc.
Fraud
Identifying normal and anomalous user
behaviors within networks
Domain Resolution
Inferring labels or groups of similar web domains
where segmentation can help
13© 2015 Pivotal Software, Inc. All rights reserved.
Data Used In Segmentation of Customers
Power in leveraging both internal and external datasets
Demographic
profiles
Sensor data
Product
metadata
Shipment data Store metadata
Transactions
and invoices
Delivery
information
Marketing plans
External data:
Census,
Nielsen, social
networks, etc.
14© 2015 Pivotal Software, Inc. All rights reserved.
Gaining Additional Value From External Data
Often companies do not or cannot collect
sufficient data about their customers to
construct a complete profile. Augmenting
internal data with external sources allows
companies to:
• Develop a 360 degree customer view
• Gain insights into how consumers are
interacting with competitors
• Improve accuracy of predictive models
• Increase the value of internal data
Point of
sales
Transaction
data
Web/Apps
logs
Investments
Market
basket
Loans
Traffic
Weather
IXI wealth
complete
Haver time
series
Dept. of
Labor
CRM
Internal External
Note: This list only represents a subset of data sources that should be considered.
15© 2015 Pivotal Software, Inc. All rights reserved.
Example: Using Census Data to Build
Family Profiles
Consumer Packaged Goods (CPG) companies are
often interested in building market profiles for
micro-targeting to improve marketing strategies and
supply chain planning.
Hypothesis:
• Not only are CPG companies interested in the
individual consumer, but in the family profile as
well
– E.g. Consumption of child products is
affected by family size
Approach:
• Census Public Use Microdata Sample (PUMS)
files include person records and housing
records which can be combined in
segmentation models to build rich family
profiles.
fraction of households
Households with Children*
*Children as defined by a certain age group
16© 2015 Pivotal Software, Inc. All rights reserved. 16© 2015 Pivotal Software, Inc. All rights reserved.
Use Cases &
Approaches to Segmentation
17© 2015 Pivotal Software, Inc. All rights reserved.
Common Approaches for Implementing
Segmentation
Data Step
• Identify join
relationships across
all data sources
• Aggregate data to
common granularity
Feature Step
• Identify and create
features that can
characterize the
entities you want to
segment, e.g. age,
gender, types of last
transactions,
average time
between visits,
average spend,
sensitivity to price
change, etc.
Model Step
• Candidate
algorithms: clustering
strategies like k-
means & hierarchical
clustering,
regression or
hierarchical modeling
and grouping by
similar coefficients,
ensemble methods,
etc.
Analysis of Results
• Look at average
features across
clusters
• Look at average
cluster features vs.
population average
(e.g. to find
anomalous behavior)
• Identify common
features amongst
segments (e.g.
opportunities for
cross-sell/up-sell)
18© 2015 Pivotal Software, Inc. All rights reserved.
• Objective:
Identify characteristics of consumers that prefer certain brands or
products
• Common business challenges:
– No integration of data amongst different lines of businesses
– Internal data is not sufficient for building profiles
– No information about which consumers are more/less profitable
• Data sources:
– Point of sales, demographic data, loyalty data,
product and store metadata, external data
Example: Profiling market and consumer
segments
19© 2015 Pivotal Software, Inc. All rights reserved.
• Identify relationships and joins amongst all data sources
• Clean data by removing outliers and imputing missing values if appropriate
– For example using the median or weighted average value for a state to
impute into a missing value for a county
• Aggregate or select data to common granularity that makes sense
– For example, demographic profiles can be built at the zip code or county level,
and store profiles can be built at the individual store or tier or region level
Step 1: Consolidate Data Sources
• Do gap analysis to determine the scope of data
sufficient for analysis
- For example, a certain subset of customers may
be missing data for a large time period and should
be scoped out
time
numberofstoresreporting
Using an MPP database like Greenplum, we can join tables with billions of rows
in a little over a minute.
20© 2015 Pivotal Software, Inc. All rights reserved.
Step 2: Feature Engineering & Selection
Transactions&PointofSales
Total sales
Change in sales
Price
Discount
Market basket
Store/Location
Geolocation
Weather
Product
Department/Type
Color
Size
Brand
Package
Demographic
Age
Gender
Income
Employment
Education
Family size
Marital status
Citizenship
Language
Loyalty
Status
Length of
membership
Activity
It’s common for data scientists to generate hundreds of thousands of features.
21© 2015 Pivotal Software, Inc. All rights reserved.
Step 2: Feature Engineering & Selection
In order to reduce feature dimensionality and
account for unwanted bias due to the
inclusion of highly correlated features, we can
filter features using approaches such as :
• Principal Component Analysis
• Reducing the dimensionality of the feature
space to a select number of principal
components
• Iterative pairwise correlation comparison
• Calculate NxN pairwise correlations, where
N is the number features
• Remove the feature existing in the greatest
number of correlated pairs (correlation
coefficient greater than some threshold)
• Iterate until no correlated pairs exist
Example: Subset of feature correlation matrix. The
large number of features requires an automated
approach to feature selection
22© 2015 Pivotal Software, Inc. All rights reserved.
• Example: K-means Clustering
1. Create single feature vector for each entity, e.g.
consumer
2. Use k-means clustering to identify k consumer
segments
i. Try multiple training trials for multiple values of k
ii. Use any one of a variety of techniques for selecting
optimal k, e.g. silhouette coefficient
3. Look at average features across segments to identify
segment characteristics
4. Look at purchasing behaviors of each segment to
identify segment preferences
Step 3: Build Models
23© 2015 Pivotal Software, Inc. All rights reserved.
• Segmentation models used to
identify and profile consumer groups
– Calculate descriptive statistics for each
segment and compare to uncover
previously hidden opportunities
• Cross-sell/up-sell opportunities
• Potential data issues or supply chain
execution opportunities regarding
unequal proportion of product shipment
or inventory to regional preference
• Rich set of reusable data assets
made available for ongoing analysis
& reporting
Step 4: Extract Business Value from Results
Cluster 1 Cluster 2 Cluster 3 Cluster 4
Feature 1
Feature 2
Feature 3
.
.
.
low value high value
compared across clusters
24© 2015 Pivotal Software, Inc. All rights reserved.
What Questions Should You Be Asking Your
Data?
• Are you collecting the right data & storing it in the right
fashion?
• Do you have the right technology to support your data
and data science endeavors?
• Where are the gaps in your data? How can external
sources fill those gaps?
• How can your data sources be joined or aggregated
together to build rich feature sets?
• How can you extract business value from your data?
Segmentation will help you answer all of these questions!
25© 2015 Pivotal Software, Inc. All rights reserved.
Thank You.
26© 2015 Pivotal Software, Inc. All rights reserved.

More Related Content

What's hot

Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Business Over Broadway
 
Predictive Analytics in Practice - Breakfast Club 11th May 2017
Predictive Analytics in Practice - Breakfast Club 11th May 2017Predictive Analytics in Practice - Breakfast Club 11th May 2017
Predictive Analytics in Practice - Breakfast Club 11th May 2017Bilot
 
Data Quality
Data QualityData Quality
Data Qualityjerdeb
 
Data Analytics and Business Intelligence
Data Analytics and Business IntelligenceData Analytics and Business Intelligence
Data Analytics and Business IntelligenceChris Ortega, MBA
 
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Optimus BT
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPMBob Samuels
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business IntelligenceHome
 
The essentials of business intelligence
The essentials of business intelligenceThe essentials of business intelligence
The essentials of business intelligenceShwetabh Jaiswal
 
Four Pillars of Business Analytics by Actuate
Four Pillars of Business Analytics by ActuateFour Pillars of Business Analytics by Actuate
Four Pillars of Business Analytics by ActuateEdgar Alejandro Villegas
 
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesData-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesDATAVERSITY
 
Business intelligence and analytics
Business intelligence and analyticsBusiness intelligence and analytics
Business intelligence and analyticsYogesh Supekar
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Erik Fransen
 
intro_to_business_analytics_and_data_science_ver 1.0
intro_to_business_analytics_and_data_science_ver 1.0intro_to_business_analytics_and_data_science_ver 1.0
intro_to_business_analytics_and_data_science_ver 1.0Anthony Paulus
 
Chief Data Officer (CDO) Organization Roles
Chief Data Officer (CDO) Organization RolesChief Data Officer (CDO) Organization Roles
Chief Data Officer (CDO) Organization RolesDave Getty
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationDATAVERSITY
 
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...Denodo
 
Understanding big data and data analytics-Business Intelligence
Understanding big data and data analytics-Business IntelligenceUnderstanding big data and data analytics-Business Intelligence
Understanding big data and data analytics-Business IntelligenceSeta Wicaksana
 

What's hot (20)

Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...Improving the customer experience using big data customer-centric measurement...
Improving the customer experience using big data customer-centric measurement...
 
Predictive Analytics in Practice - Breakfast Club 11th May 2017
Predictive Analytics in Practice - Breakfast Club 11th May 2017Predictive Analytics in Practice - Breakfast Club 11th May 2017
Predictive Analytics in Practice - Breakfast Club 11th May 2017
 
Data Quality
Data QualityData Quality
Data Quality
 
Data Analytics and Business Intelligence
Data Analytics and Business IntelligenceData Analytics and Business Intelligence
Data Analytics and Business Intelligence
 
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPM
 
Big data gaurav
Big data gauravBig data gaurav
Big data gaurav
 
Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
The essentials of business intelligence
The essentials of business intelligenceThe essentials of business intelligence
The essentials of business intelligence
 
Four Pillars of Business Analytics by Actuate
Four Pillars of Business Analytics by ActuateFour Pillars of Business Analytics by Actuate
Four Pillars of Business Analytics by Actuate
 
Data-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse StrategiesData-Ed Online Presents: Data Warehouse Strategies
Data-Ed Online Presents: Data Warehouse Strategies
 
Going MAD: A Framework For Delivering Pervasive BI Solutions
Going MAD: A Framework For Delivering Pervasive BI SolutionsGoing MAD: A Framework For Delivering Pervasive BI Solutions
Going MAD: A Framework For Delivering Pervasive BI Solutions
 
Business intelligence and analytics
Business intelligence and analyticsBusiness intelligence and analytics
Business intelligence and analytics
 
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
Best Practices: Datawarehouse Automation Conference September 20, 2012 - Amst...
 
intro_to_business_analytics_and_data_science_ver 1.0
intro_to_business_analytics_and_data_science_ver 1.0intro_to_business_analytics_and_data_science_ver 1.0
intro_to_business_analytics_and_data_science_ver 1.0
 
Chief Data Officer (CDO) Organization Roles
Chief Data Officer (CDO) Organization RolesChief Data Officer (CDO) Organization Roles
Chief Data Officer (CDO) Organization Roles
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
BI assessment template jr
BI assessment template jrBI assessment template jr
BI assessment template jr
 
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
TDWI Spotlight: Enabling Data Self-Service with Security, Governance, and Reg...
 
Understanding big data and data analytics-Business Intelligence
Understanding big data and data analytics-Business IntelligenceUnderstanding big data and data analytics-Business Intelligence
Understanding big data and data analytics-Business Intelligence
 

Viewers also liked

Five awesome django tutorials - Open Data Science
Five awesome django tutorials - Open Data ScienceFive awesome django tutorials - Open Data Science
Five awesome django tutorials - Open Data Scienceopendatascience
 
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYCDan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYCMLconf
 
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATLDan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATLMLconf
 
Idiots guide to setting up a data science team
Idiots guide to setting up a data science teamIdiots guide to setting up a data science team
Idiots guide to setting up a data science teamAshish Bansal
 
Challenges of managing Data Science Project
Challenges of managing Data Science ProjectChallenges of managing Data Science Project
Challenges of managing Data Science ProjectLamjed Ben Jabeur
 
The Other 99% of a Data Science Project
The Other 99% of a Data Science ProjectThe Other 99% of a Data Science Project
The Other 99% of a Data Science ProjectEugene Mandel
 
CRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsCRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsMichał Łopuszyński
 
CRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologyCRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologySergey Shelpuk
 
Back to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from ScratchBack to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from ScratchKlaas Bosteels
 
(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR
(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR
(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMRAmazon Web Services
 
Complete topshop presentation
Complete topshop presentation Complete topshop presentation
Complete topshop presentation Jessica Moore
 
CUSTOMER ANALYTICS & SEGMENTATION FOR CUSTOMER CENTRIC ORGANIZATION & MARKETI...
CUSTOMER ANALYTICS & SEGMENTATION FOR CUSTOMER CENTRIC ORGANIZATION & MARKETI...CUSTOMER ANALYTICS & SEGMENTATION FOR CUSTOMER CENTRIC ORGANIZATION & MARKETI...
CUSTOMER ANALYTICS & SEGMENTATION FOR CUSTOMER CENTRIC ORGANIZATION & MARKETI...Data Science Thailand
 
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...DATAVERSITY
 

Viewers also liked (14)

Five awesome django tutorials - Open Data Science
Five awesome django tutorials - Open Data ScienceFive awesome django tutorials - Open Data Science
Five awesome django tutorials - Open Data Science
 
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYCDan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
 
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATLDan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
 
Idiots guide to setting up a data science team
Idiots guide to setting up a data science teamIdiots guide to setting up a data science team
Idiots guide to setting up a data science team
 
Challenges of managing Data Science Project
Challenges of managing Data Science ProjectChallenges of managing Data Science Project
Challenges of managing Data Science Project
 
The Other 99% of a Data Science Project
The Other 99% of a Data Science ProjectThe Other 99% of a Data Science Project
The Other 99% of a Data Science Project
 
CRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsCRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining Projects
 
CRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologyCRISP-DM: a data science project methodology
CRISP-DM: a data science project methodology
 
Back to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from ScratchBack to Square One: Building a Data Science Team from Scratch
Back to Square One: Building a Data Science Team from Scratch
 
(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR
(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR
(BDT309) Data Science & Best Practices for Apache Spark on Amazon EMR
 
Complete topshop presentation
Complete topshop presentation Complete topshop presentation
Complete topshop presentation
 
CUSTOMER ANALYTICS & SEGMENTATION FOR CUSTOMER CENTRIC ORGANIZATION & MARKETI...
CUSTOMER ANALYTICS & SEGMENTATION FOR CUSTOMER CENTRIC ORGANIZATION & MARKETI...CUSTOMER ANALYTICS & SEGMENTATION FOR CUSTOMER CENTRIC ORGANIZATION & MARKETI...
CUSTOMER ANALYTICS & SEGMENTATION FOR CUSTOMER CENTRIC ORGANIZATION & MARKETI...
 
Agile Data Science
Agile Data ScienceAgile Data Science
Agile Data Science
 
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
 

Similar to Webinar - The Science of Segmentation: What Questions You Should be Asking Your Data?

Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptxLecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptxRATISHKUMAR32
 
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnWHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnRohitKumar639388
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...DATAVERSITY
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsAbhishek Sood
 
BIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptxBIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptxmuflehaljarrah
 
2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in PracticeAlejandro Jaramillo
 
Big Data Developer Career Path: Job & Interview Preparation
Big Data Developer Career Path: Job & Interview PreparationBig Data Developer Career Path: Job & Interview Preparation
Big Data Developer Career Path: Job & Interview PreparationIntellipaat
 
Enhancing Decision Making - Management Information System
Enhancing Decision Making - Management Information SystemEnhancing Decision Making - Management Information System
Enhancing Decision Making - Management Information SystemFaHaD .H. NooR
 
Big Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesBig Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesSAP Technology
 
The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as ProductDATAVERSITY
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaRadhika Kotecha
 
Five Critical Success Factors for Embedded Analytics
Five Critical Success Factors for Embedded AnalyticsFive Critical Success Factors for Embedded Analytics
Five Critical Success Factors for Embedded Analyticsibi
 
Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data IntegrationDATAVERSITY
 
Saama-POI Webinar Slides FINAL 04.27.2016 dm
Saama-POI Webinar Slides FINAL 04.27.2016 dmSaama-POI Webinar Slides FINAL 04.27.2016 dm
Saama-POI Webinar Slides FINAL 04.27.2016 dmDan Maxwell
 
Business intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and ApplicationsBusiness intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and Applicationsraj
 
Data as a Profit Driver – Emerging Techniques to Monetize Data as a Strategic...
Data as a Profit Driver – Emerging Techniques to Monetize Data as a Strategic...Data as a Profit Driver – Emerging Techniques to Monetize Data as a Strategic...
Data as a Profit Driver – Emerging Techniques to Monetize Data as a Strategic...DATAVERSITY
 
Google Analytics Training - full 2017
Google Analytics Training - full 2017Google Analytics Training - full 2017
Google Analytics Training - full 2017Nate Plaunt
 

Similar to Webinar - The Science of Segmentation: What Questions You Should be Asking Your Data? (20)

Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptxLecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
Lecture 1.13 & 1.14 &1.15_Business Profiles in Big Data.pptx
 
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjnWHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
WHAT IS BUSINESS ANALYTICS um hj mnjh nit 1 ppt only kjjn
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
 
Big data
Big dataBig data
Big data
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data Analytics
 
BIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptxBIG DATA CHAPTER 2 IN DSS.pptx
BIG DATA CHAPTER 2 IN DSS.pptx
 
2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice2013 ALPFA Leadership Submit, Data Analytics in Practice
2013 ALPFA Leadership Submit, Data Analytics in Practice
 
06 summary
06 summary06 summary
06 summary
 
Big Data Developer Career Path: Job & Interview Preparation
Big Data Developer Career Path: Job & Interview PreparationBig Data Developer Career Path: Job & Interview Preparation
Big Data Developer Career Path: Job & Interview Preparation
 
Enhancing Decision Making - Management Information System
Enhancing Decision Making - Management Information SystemEnhancing Decision Making - Management Information System
Enhancing Decision Making - Management Information System
 
Big Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesBig Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped Opportunities
 
The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as Product
 
Data warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika KotechaData warehousing - Dr. Radhika Kotecha
Data warehousing - Dr. Radhika Kotecha
 
Five Critical Success Factors for Embedded Analytics
Five Critical Success Factors for Embedded AnalyticsFive Critical Success Factors for Embedded Analytics
Five Critical Success Factors for Embedded Analytics
 
Data Modeling & Data Integration
Data Modeling & Data IntegrationData Modeling & Data Integration
Data Modeling & Data Integration
 
Saama-POI Webinar Slides FINAL 04.27.2016 dm
Saama-POI Webinar Slides FINAL 04.27.2016 dmSaama-POI Webinar Slides FINAL 04.27.2016 dm
Saama-POI Webinar Slides FINAL 04.27.2016 dm
 
Business intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and ApplicationsBusiness intelligence- Components, Tools, Need and Applications
Business intelligence- Components, Tools, Need and Applications
 
Data as a Profit Driver – Emerging Techniques to Monetize Data as a Strategic...
Data as a Profit Driver – Emerging Techniques to Monetize Data as a Strategic...Data as a Profit Driver – Emerging Techniques to Monetize Data as a Strategic...
Data as a Profit Driver – Emerging Techniques to Monetize Data as a Strategic...
 
Google Analytics Training - full 2017
Google Analytics Training - full 2017Google Analytics Training - full 2017
Google Analytics Training - full 2017
 
Data mining wrhousing-lec
Data mining wrhousing-lecData mining wrhousing-lec
Data mining wrhousing-lec
 

More from VMware Tanzu

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItVMware Tanzu
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023VMware Tanzu
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleVMware Tanzu
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023VMware Tanzu
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductVMware Tanzu
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready AppsVMware Tanzu
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And BeyondVMware Tanzu
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfVMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023VMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023VMware Tanzu
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptxVMware Tanzu
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchVMware Tanzu
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishVMware Tanzu
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVMware Tanzu
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - FrenchVMware Tanzu
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023VMware Tanzu
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootVMware Tanzu
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerVMware Tanzu
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeVMware Tanzu
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsVMware Tanzu
 

More from VMware Tanzu (20)

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About It
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at Scale
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a Product
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready Apps
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And Beyond
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptx
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - French
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - English
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - English
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - French
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software Engineer
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs Practice
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
 

Recently uploaded

RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...ttt fff
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一F sss
 

Recently uploaded (20)

RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
 

Webinar - The Science of Segmentation: What Questions You Should be Asking Your Data?

  • 1. 1© 2015 Pivotal Software, Inc. All rights reserved.
  • 2. 2© 2015 Pivotal Software, Inc. All rights reserved. 2© 2015 Pivotal Software, Inc. All rights reserved. The Science of Segmentation: What Questions Should You Be Asking Your Data? April 14, 2015 Jarrod Vawdrey, Data Scientist @ Pivotal Grace Gee, Data Scientist @ Pivotal
  • 3. 3© 2015 Pivotal Software, Inc. All rights reserved. Agenda • Typical State Of Companies New To Big Data Analytics – Benefits of Big Data technologies • When to Use Segmentation – Common business problems – Types of available data • Use Cases & Approaches To Segmentation – Common approaches – Best practices
  • 4. 4© 2015 Pivotal Software, Inc. All rights reserved. 4© 2015 Pivotal Software, Inc. All rights reserved. Typical State of Companies New to Big Data Analytics
  • 5. 5© 2015 Pivotal Software, Inc. All rights reserved. Typical State of Companies New to Analytics • Companies in the process of transforming into a data- driven organization often ask similar questions about where to start: How do I make data available for my analysts?What tools are needed to efficiently process and build models on my big data sets? What data should I be collecting and archiving? Where and how can I start to use all my data to quickly gain actionable insights and begin integrating data science into our organization’s practices? How do I leverage data to generate value for stakeholders? How do I enable analysts and data scientists to be more effective?
  • 6. 6© 2015 Pivotal Software, Inc. All rights reserved. Common Business Challenges Data Availability • Disparate data sources • No integration of data across lines of businesses • Insufficient data • Unknown single source of truth Slow Time-to-Insight • Often outdated analytics architectures focused on operational processes hamper experimental nature of big data analytics • Lack of knowledge about analytics software for in-place processing of and computation on Big Data • Company organizational structure inhibits fast acquisition of data and communication of insights
  • 7. 7© 2015 Pivotal Software, Inc. All rights reserved. Big Data Technologies for Data-Driven Organizations • Data Lake: efficient, massively scalable Big Data storage platform – Store all data: we don’t want to inhibit the ability to answer future questions – Save all (structured, unstructured, and semi-structured) types of data: we may not immediately know “optimal” form to store data for analysis – Work with multiple types of data from one location – Centralized location of data accessible to all organizations • Agile Analytics Platform: purpose-built architecture for getting results and gaining insights quickly through parallel, in-place data analytics – No required sampling due to limited memory – No data movement – Scalable analytics
  • 8. 8© 2015 Pivotal Software, Inc. All rights reserved. Big Data Technologies for Data-Driven Organizations Enterprise Apps Reporting Prioritized Operational Processes Data Sources Inventory Optimization Demand Forecasting Proprietary Structured Data Proprietary Unstructured Data Partner Data Self Reporting: Google, Weblogs, Twitter External Sources: Census, Nielsen, Weather, etc… Sensors HAWQGreenplum DB Pivotal HD (HDFS) GemFire XD MADlib, PL/R, PL/Python, etc. Platform-Driven Data Science 1 0 0 1 01 0 0 1 0 1 1 1 0 Fraud Detection
  • 9. 9© 2015 Pivotal Software, Inc. All rights reserved. Segmentation: An important step for understanding data • What is segmentation? – Automatic grouping of entities based on a common set of features – Identification of patterns amongst similar entities • What is segmentation good for? – Identifying select features that greatly differentiate groups of entities • E.g. Identifying behaviors of high-profit suppliers and low-profit suppliers – Identifying similar characteristics amongst different groups • E.g. Identifying similar market segments to target – Predicting characteristics and behaviors of new or unknown entities • E.g. Inferring missing labels, predicting market response to new products
  • 10. 10© 2015 Pivotal Software, Inc. All rights reserved. Segmentation & Big Data Technologies • Segmentation problems often deal with: – Multiple data sources from multiple lines of business and external sources – BIG DATA, particularly from sensor data or transactions/point of sales – High-dimensional feature sets • Big Data technologies help make segmentation problems become feasible and bring faster time-to-insights through: – Ability to leverage and integrate all relevant data sources, no matter how large  Data Lake – Using ALL data to train segmentation models and not rely on samples or a subset of data that fits into memory  MPP databases, Hadoop, HAWQ, MADlib, Spark, etc. – Quickly building segmentation models and scoring new entities through parallelized, in-place computation  MPP databases, Hadoop, HAWQ, MADlib, Spark, etc. How cutting edge Big Data technology enables faster insights
  • 11. 11© 2015 Pivotal Software, Inc. All rights reserved. 11© 2015 Pivotal Software, Inc. All rights reserved. When To Use Segmentation
  • 12. 12© 2015 Pivotal Software, Inc. All rights reserved. Common Business Problems Customer Micro-targeting Identifying market segments and their purchasing behaviors Operations & Logistics Identifying behaviors of underperforming or outperforming stores, suppliers, delivery services, etc. Fraud Identifying normal and anomalous user behaviors within networks Domain Resolution Inferring labels or groups of similar web domains where segmentation can help
  • 13. 13© 2015 Pivotal Software, Inc. All rights reserved. Data Used In Segmentation of Customers Power in leveraging both internal and external datasets Demographic profiles Sensor data Product metadata Shipment data Store metadata Transactions and invoices Delivery information Marketing plans External data: Census, Nielsen, social networks, etc.
  • 14. 14© 2015 Pivotal Software, Inc. All rights reserved. Gaining Additional Value From External Data Often companies do not or cannot collect sufficient data about their customers to construct a complete profile. Augmenting internal data with external sources allows companies to: • Develop a 360 degree customer view • Gain insights into how consumers are interacting with competitors • Improve accuracy of predictive models • Increase the value of internal data Point of sales Transaction data Web/Apps logs Investments Market basket Loans Traffic Weather IXI wealth complete Haver time series Dept. of Labor CRM Internal External Note: This list only represents a subset of data sources that should be considered.
  • 15. 15© 2015 Pivotal Software, Inc. All rights reserved. Example: Using Census Data to Build Family Profiles Consumer Packaged Goods (CPG) companies are often interested in building market profiles for micro-targeting to improve marketing strategies and supply chain planning. Hypothesis: • Not only are CPG companies interested in the individual consumer, but in the family profile as well – E.g. Consumption of child products is affected by family size Approach: • Census Public Use Microdata Sample (PUMS) files include person records and housing records which can be combined in segmentation models to build rich family profiles. fraction of households Households with Children* *Children as defined by a certain age group
  • 16. 16© 2015 Pivotal Software, Inc. All rights reserved. 16© 2015 Pivotal Software, Inc. All rights reserved. Use Cases & Approaches to Segmentation
  • 17. 17© 2015 Pivotal Software, Inc. All rights reserved. Common Approaches for Implementing Segmentation Data Step • Identify join relationships across all data sources • Aggregate data to common granularity Feature Step • Identify and create features that can characterize the entities you want to segment, e.g. age, gender, types of last transactions, average time between visits, average spend, sensitivity to price change, etc. Model Step • Candidate algorithms: clustering strategies like k- means & hierarchical clustering, regression or hierarchical modeling and grouping by similar coefficients, ensemble methods, etc. Analysis of Results • Look at average features across clusters • Look at average cluster features vs. population average (e.g. to find anomalous behavior) • Identify common features amongst segments (e.g. opportunities for cross-sell/up-sell)
  • 18. 18© 2015 Pivotal Software, Inc. All rights reserved. • Objective: Identify characteristics of consumers that prefer certain brands or products • Common business challenges: – No integration of data amongst different lines of businesses – Internal data is not sufficient for building profiles – No information about which consumers are more/less profitable • Data sources: – Point of sales, demographic data, loyalty data, product and store metadata, external data Example: Profiling market and consumer segments
  • 19. 19© 2015 Pivotal Software, Inc. All rights reserved. • Identify relationships and joins amongst all data sources • Clean data by removing outliers and imputing missing values if appropriate – For example using the median or weighted average value for a state to impute into a missing value for a county • Aggregate or select data to common granularity that makes sense – For example, demographic profiles can be built at the zip code or county level, and store profiles can be built at the individual store or tier or region level Step 1: Consolidate Data Sources • Do gap analysis to determine the scope of data sufficient for analysis - For example, a certain subset of customers may be missing data for a large time period and should be scoped out time numberofstoresreporting Using an MPP database like Greenplum, we can join tables with billions of rows in a little over a minute.
  • 20. 20© 2015 Pivotal Software, Inc. All rights reserved. Step 2: Feature Engineering & Selection Transactions&PointofSales Total sales Change in sales Price Discount Market basket Store/Location Geolocation Weather Product Department/Type Color Size Brand Package Demographic Age Gender Income Employment Education Family size Marital status Citizenship Language Loyalty Status Length of membership Activity It’s common for data scientists to generate hundreds of thousands of features.
  • 21. 21© 2015 Pivotal Software, Inc. All rights reserved. Step 2: Feature Engineering & Selection In order to reduce feature dimensionality and account for unwanted bias due to the inclusion of highly correlated features, we can filter features using approaches such as : • Principal Component Analysis • Reducing the dimensionality of the feature space to a select number of principal components • Iterative pairwise correlation comparison • Calculate NxN pairwise correlations, where N is the number features • Remove the feature existing in the greatest number of correlated pairs (correlation coefficient greater than some threshold) • Iterate until no correlated pairs exist Example: Subset of feature correlation matrix. The large number of features requires an automated approach to feature selection
  • 22. 22© 2015 Pivotal Software, Inc. All rights reserved. • Example: K-means Clustering 1. Create single feature vector for each entity, e.g. consumer 2. Use k-means clustering to identify k consumer segments i. Try multiple training trials for multiple values of k ii. Use any one of a variety of techniques for selecting optimal k, e.g. silhouette coefficient 3. Look at average features across segments to identify segment characteristics 4. Look at purchasing behaviors of each segment to identify segment preferences Step 3: Build Models
  • 23. 23© 2015 Pivotal Software, Inc. All rights reserved. • Segmentation models used to identify and profile consumer groups – Calculate descriptive statistics for each segment and compare to uncover previously hidden opportunities • Cross-sell/up-sell opportunities • Potential data issues or supply chain execution opportunities regarding unequal proportion of product shipment or inventory to regional preference • Rich set of reusable data assets made available for ongoing analysis & reporting Step 4: Extract Business Value from Results Cluster 1 Cluster 2 Cluster 3 Cluster 4 Feature 1 Feature 2 Feature 3 . . . low value high value compared across clusters
  • 24. 24© 2015 Pivotal Software, Inc. All rights reserved. What Questions Should You Be Asking Your Data? • Are you collecting the right data & storing it in the right fashion? • Do you have the right technology to support your data and data science endeavors? • Where are the gaps in your data? How can external sources fill those gaps? • How can your data sources be joined or aggregated together to build rich feature sets? • How can you extract business value from your data? Segmentation will help you answer all of these questions!
  • 25. 25© 2015 Pivotal Software, Inc. All rights reserved. Thank You.
  • 26. 26© 2015 Pivotal Software, Inc. All rights reserved.