SlideShare une entreprise Scribd logo
1  sur  38
Télécharger pour lire hors ligne
GEO @
Carl Anderson
Market Analytics Team
Growing Very Rapidly...
# Open
Locations
Today: 200
2010: 0
175,000+ Members Globally
200 Physical Locations
64 Cities
20 Countries
...and Globally
WHERE DO
PEOPLE WANT
TO WORK?
?
Where next? Why?
World Metro Neighborhood Building Floor
Multiple Scales
What’s a good city, neighborhood, or location?
What are the relevant metrics, trends?
What are the firmographics for Bangkok?
How do we determine and measure market potential and saturation?
When will be too late to enter a given market?
How do we determine comparable markets?
Questions, Questions
175,000+ Members Globally
200 Physical Locations
64 Cities
20 Countries
Space
People &
Business
primarily to serve Sales, Marketing, Executives, Real Estate
Location intelligence is the intersection of
people and space
Responsibilities of the Market Analytics Team:
● Geospatial strategy
● Data sourcing, provisioning
● Ad hoc analyses
● Develop predictive models
● Develop APIs
● Tooling
● Geo-expertise resource
MODELS / DATA SCIENCE
Esri Tapestry Segmentation
Caveat: you have to define an area not a point location to attach a segment.
There is no definitive segment for a location as it is a function of scale of the area being considered.
Portland
Zoomed out
coarser-grained
Portland
Zoomed in
finer-grained
500m radius around each location was the
smallest radius in which Esri would assign
a segment to each location (smaller radii
led to missing data).
There is no correct scale. Nevertheless, we
find some interesting findings.
Segments Vary With Scale
Enriched 140 WeWork US
locations with 500m and 1km
radius.
Almost all were in handful of
67 segments.
Goal:
understand distribution of segments on
current fleet
Value:
● Filter
● Score
Esri Tapestry Segmentation
WeWork (open)
Other
Goal:
predict which ZIP codes should have a
coworking space, and why
Coworking Model
As of June 2017
Score all ZIPs Opportunities:
Where is there no coworking but there should be?
Where is there coworking but there should not be?
Train on balanced dataset Interpretable models: Decision trees, Logistic, NaiveBayes, kNN
Enrich data Demographics, HHV, firmographics, daytime population...
All Coworking spaces in US 3,600 Coworking spaces
Aggregate by ZIP ~36,000 ZIP codes
Test on unbalanced dataset Optimize for precision not accuracy or F1
Filter by population density ~16,000 Coworking spaces
Goal:
Predict which ZIP codes should
have a coworking space, and why
Value:
● List of ZIPs we should
investigate
● Relative importance of
features
● Validation of current fleet
Coworking Model
Comps
Proforma: document set out case for a new location, including projected financial
performance
Given new Building, which are most similar “comps” in our fleet?
Predict “detrended occupancy”
All WeWork locations open >6 mo
Enrich data
Mine combinations of 1-10 features
kNN with leave one out cross-validation
Output ranked table for each WeWork
Goal:
Provide a ranked list of comparable
WeWork locations given some
non-WeWork location. Which
features are important?
Value:
● Better predictions
● Insights into drivers
Comps
Building Neighborhood Market
Building # of floors # of businesses within Xm # of colleges within MSA
Building total sq ft Distance to other WeWorks Undergrad, grad enrollment
Year built How far can one walk, bike, drive in
X seconds
Population who commute by
car, walk
Building Class #businesses of different sizes Household income
Building Rating Per capita income House values by ZIP
WW #floors, #offices,
#desks
Walk, bike, transit score Daytime population
Goal:
Provide a ranked list of comparable
WeWork locations given some
non-WeWork location. Which
features are important?
Value:
● Better predictions
● Insights into drivers
NY
LA
SF
DC
Goal:
Provide a ranked list of
comparable WeWork locations
given some non-WeWork
location. Which features are
important?
Chord diagram of nearest
neighbor of each location
Comps
Goal:
Determine whether # of amenities in
different classes predict WeWork
location success
Value:
● Location Score / Feature
● Key categories?
All WeWork locations open >6 mo
List of all US storefront businesses
For each business category &
distance, how predictive is that
category?
lm(occupancy ~ #chinese restaurants within 200m)
lm(occupancy ~ #pizza restaurants within 200m)
...
lm(gyms ~ #pizza restaurants within 800m)
Amenities
● Score every reasonable location in U.S.
○ Location attributes
■ Across street: Western Union , Blue Bottle ?
■ Neighborhood: food, transport, river/parks, fitness...
● Output: simple thumbs up / down
● Value:
○ Heatmaps
○ Monitor locations (buildings, other businesses)
Goal:
score every urban intersection in US with
thumbs up / thumbs down, and understand why
Work In Progress
topos.ai
“I think the best technologies, and Twitter is included in this,
disappear. They fade into the background, and they’re relevant when
you want to use them, and they get out of the way when you don’t”
JACK DORSEY
2012 Charlie Rose interview
CHALLENGES
Data Quality / Availability
Same for metros?
Metro Identifiers
HiPPOs
carl.anderson@wework.com
@leapingllamas
We Are Hiring!
https://www.wework.com/careers
Questions?

Contenu connexe

Similaire à Geo@Work, keynote from Carto Spatial Data Science conference

Webinar: Beyond KPIs: The People And Process Drivers In Location Analytics
Webinar: Beyond KPIs: The People And Process Drivers In Location Analytics Webinar: Beyond KPIs: The People And Process Drivers In Location Analytics
Webinar: Beyond KPIs: The People And Process Drivers In Location Analytics G3 Communications
 
How technology is changing opp danielson
How technology is changing opp danielsonHow technology is changing opp danielson
How technology is changing opp danielsonEDR
 
Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Xun Wang
 
WorkFromAnywhere H4R Stanford 2020
WorkFromAnywhere H4R Stanford 2020WorkFromAnywhere H4R Stanford 2020
WorkFromAnywhere H4R Stanford 2020Stanford University
 
Value Engineer your Products | Workshop | Agile Gurugram 2018 | 23 - 24 March
Value Engineer your Products | Workshop | Agile Gurugram 2018 | 23 - 24 MarchValue Engineer your Products | Workshop | Agile Gurugram 2018 | 23 - 24 March
Value Engineer your Products | Workshop | Agile Gurugram 2018 | 23 - 24 MarchAgileNetwork
 
Positioning for Retail Recovery: The Role of Predictive Analytics Fueled by M...
Positioning for Retail Recovery: The Role of Predictive Analytics Fueled by M...Positioning for Retail Recovery: The Role of Predictive Analytics Fueled by M...
Positioning for Retail Recovery: The Role of Predictive Analytics Fueled by M...Precisely
 
Agile Mumbai 2019 Conference | Defining and redefining value for Digital Tra...
Agile Mumbai 2019 Conference |  Defining and redefining value for Digital Tra...Agile Mumbai 2019 Conference |  Defining and redefining value for Digital Tra...
Agile Mumbai 2019 Conference | Defining and redefining value for Digital Tra...AgileNetwork
 
Location Analytics for Retail
Location Analytics for RetailLocation Analytics for Retail
Location Analytics for RetailEsri
 
JobMobile_Summmit_2014_Shankar_Mishra_Recruiting_Trends
JobMobile_Summmit_2014_Shankar_Mishra_Recruiting_TrendsJobMobile_Summmit_2014_Shankar_Mishra_Recruiting_Trends
JobMobile_Summmit_2014_Shankar_Mishra_Recruiting_TrendsTheLadders
 
Geomarketing method part__2.2
Geomarketing method part__2.2Geomarketing method part__2.2
Geomarketing method part__2.2CSR
 
Csr about
Csr aboutCsr about
Csr aboutCSR
 
Differentiate and Drive through Digital
Differentiate and Drive through DigitalDifferentiate and Drive through Digital
Differentiate and Drive through DigitalAtlas Integrated
 
The Role of Data Science in Real Estate
The Role of Data Science in Real EstateThe Role of Data Science in Real Estate
The Role of Data Science in Real EstateCARTO
 
Marketing Casestudy for WeWork India
Marketing Casestudy for WeWork IndiaMarketing Casestudy for WeWork India
Marketing Casestudy for WeWork IndiaVartika Verma
 
Nicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterNicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterDavid Garrison
 
Stepping up your Spatial Analysis with Mastercard Retail Location insights 3.0
Stepping up your Spatial Analysis with Mastercard Retail Location insights 3.0Stepping up your Spatial Analysis with Mastercard Retail Location insights 3.0
Stepping up your Spatial Analysis with Mastercard Retail Location insights 3.0CARTO
 
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]CARTO
 
Market analysis through Consumer Behavior Pattern Insights
Market analysis through Consumer Behavior Pattern InsightsMarket analysis through Consumer Behavior Pattern Insights
Market analysis through Consumer Behavior Pattern InsightsCARTO
 
GoTime Marketing [google analytics, market segment research]
GoTime Marketing  [google analytics, market segment research]GoTime Marketing  [google analytics, market segment research]
GoTime Marketing [google analytics, market segment research]GarciaKristoffer
 

Similaire à Geo@Work, keynote from Carto Spatial Data Science conference (20)

Webinar: Beyond KPIs: The People And Process Drivers In Location Analytics
Webinar: Beyond KPIs: The People And Process Drivers In Location Analytics Webinar: Beyond KPIs: The People And Process Drivers In Location Analytics
Webinar: Beyond KPIs: The People And Process Drivers In Location Analytics
 
How technology is changing opp danielson
How technology is changing opp danielsonHow technology is changing opp danielson
How technology is changing opp danielson
 
Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market
 
WorkFromAnywhere H4R Stanford 2020
WorkFromAnywhere H4R Stanford 2020WorkFromAnywhere H4R Stanford 2020
WorkFromAnywhere H4R Stanford 2020
 
Value Engineer your Products | Workshop | Agile Gurugram 2018 | 23 - 24 March
Value Engineer your Products | Workshop | Agile Gurugram 2018 | 23 - 24 MarchValue Engineer your Products | Workshop | Agile Gurugram 2018 | 23 - 24 March
Value Engineer your Products | Workshop | Agile Gurugram 2018 | 23 - 24 March
 
Positioning for Retail Recovery: The Role of Predictive Analytics Fueled by M...
Positioning for Retail Recovery: The Role of Predictive Analytics Fueled by M...Positioning for Retail Recovery: The Role of Predictive Analytics Fueled by M...
Positioning for Retail Recovery: The Role of Predictive Analytics Fueled by M...
 
Agile Mumbai 2019 Conference | Defining and redefining value for Digital Tra...
Agile Mumbai 2019 Conference |  Defining and redefining value for Digital Tra...Agile Mumbai 2019 Conference |  Defining and redefining value for Digital Tra...
Agile Mumbai 2019 Conference | Defining and redefining value for Digital Tra...
 
Retail site selection checklist
Retail site selection checklist Retail site selection checklist
Retail site selection checklist
 
Location Analytics for Retail
Location Analytics for RetailLocation Analytics for Retail
Location Analytics for Retail
 
JobMobile_Summmit_2014_Shankar_Mishra_Recruiting_Trends
JobMobile_Summmit_2014_Shankar_Mishra_Recruiting_TrendsJobMobile_Summmit_2014_Shankar_Mishra_Recruiting_Trends
JobMobile_Summmit_2014_Shankar_Mishra_Recruiting_Trends
 
Geomarketing method part__2.2
Geomarketing method part__2.2Geomarketing method part__2.2
Geomarketing method part__2.2
 
Csr about
Csr aboutCsr about
Csr about
 
Differentiate and Drive through Digital
Differentiate and Drive through DigitalDifferentiate and Drive through Digital
Differentiate and Drive through Digital
 
The Role of Data Science in Real Estate
The Role of Data Science in Real EstateThe Role of Data Science in Real Estate
The Role of Data Science in Real Estate
 
Marketing Casestudy for WeWork India
Marketing Casestudy for WeWork IndiaMarketing Casestudy for WeWork India
Marketing Casestudy for WeWork India
 
Nicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at TwitterNicholas Gorski: Real-time revenue science at Twitter
Nicholas Gorski: Real-time revenue science at Twitter
 
Stepping up your Spatial Analysis with Mastercard Retail Location insights 3.0
Stepping up your Spatial Analysis with Mastercard Retail Location insights 3.0Stepping up your Spatial Analysis with Mastercard Retail Location insights 3.0
Stepping up your Spatial Analysis with Mastercard Retail Location insights 3.0
 
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
 
Market analysis through Consumer Behavior Pattern Insights
Market analysis through Consumer Behavior Pattern InsightsMarket analysis through Consumer Behavior Pattern Insights
Market analysis through Consumer Behavior Pattern Insights
 
GoTime Marketing [google analytics, market segment research]
GoTime Marketing  [google analytics, market segment research]GoTime Marketing  [google analytics, market segment research]
GoTime Marketing [google analytics, market segment research]
 

Plus de Carl Anderson

Inspiring healthy habits: data science at WW
Inspiring healthy habits: data science at WWInspiring healthy habits: data science at WW
Inspiring healthy habits: data science at WWCarl Anderson
 
Leveraging an in-house modeling framework for fun and profit
Leveraging an in-house modeling framework for fun and profitLeveraging an in-house modeling framework for fun and profit
Leveraging an in-house modeling framework for fun and profitCarl Anderson
 
Data Quality: principles, approaches, and best practices
Data Quality: principles, approaches, and best practicesData Quality: principles, approaches, and best practices
Data Quality: principles, approaches, and best practicesCarl Anderson
 
Setting up Data Science for Success: The Data Layer
Setting up Data Science for Success: The Data LayerSetting up Data Science for Success: The Data Layer
Setting up Data Science for Success: The Data LayerCarl Anderson
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCarl Anderson
 
Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016Carl Anderson
 
Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015Carl Anderson
 
Creating a Data-Driven Organization (Data Day Seattle 2015)
Creating a Data-Driven Organization (Data Day Seattle 2015)Creating a Data-Driven Organization (Data Day Seattle 2015)
Creating a Data-Driven Organization (Data Day Seattle 2015)Carl Anderson
 
Creating a Data-Driven Organization: an executive summary
Creating a Data-Driven Organization: an executive summaryCreating a Data-Driven Organization: an executive summary
Creating a Data-Driven Organization: an executive summaryCarl Anderson
 

Plus de Carl Anderson (9)

Inspiring healthy habits: data science at WW
Inspiring healthy habits: data science at WWInspiring healthy habits: data science at WW
Inspiring healthy habits: data science at WW
 
Leveraging an in-house modeling framework for fun and profit
Leveraging an in-house modeling framework for fun and profitLeveraging an in-house modeling framework for fun and profit
Leveraging an in-house modeling framework for fun and profit
 
Data Quality: principles, approaches, and best practices
Data Quality: principles, approaches, and best practicesData Quality: principles, approaches, and best practices
Data Quality: principles, approaches, and best practices
 
Setting up Data Science for Success: The Data Layer
Setting up Data Science for Success: The Data LayerSetting up Data Science for Success: The Data Layer
Setting up Data Science for Success: The Data Layer
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetup
 
Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016
 
Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015
 
Creating a Data-Driven Organization (Data Day Seattle 2015)
Creating a Data-Driven Organization (Data Day Seattle 2015)Creating a Data-Driven Organization (Data Day Seattle 2015)
Creating a Data-Driven Organization (Data Day Seattle 2015)
 
Creating a Data-Driven Organization: an executive summary
Creating a Data-Driven Organization: an executive summaryCreating a Data-Driven Organization: an executive summary
Creating a Data-Driven Organization: an executive summary
 

Dernier

NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 

Dernier (20)

NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 

Geo@Work, keynote from Carto Spatial Data Science conference

  • 1. GEO @ Carl Anderson Market Analytics Team
  • 2.
  • 3.
  • 4.
  • 5.
  • 6. Growing Very Rapidly... # Open Locations Today: 200 2010: 0
  • 7. 175,000+ Members Globally 200 Physical Locations 64 Cities 20 Countries ...and Globally
  • 10. World Metro Neighborhood Building Floor Multiple Scales
  • 11.
  • 12.
  • 13. What’s a good city, neighborhood, or location? What are the relevant metrics, trends? What are the firmographics for Bangkok? How do we determine and measure market potential and saturation? When will be too late to enter a given market? How do we determine comparable markets? Questions, Questions
  • 14. 175,000+ Members Globally 200 Physical Locations 64 Cities 20 Countries
  • 15.
  • 16. Space People & Business primarily to serve Sales, Marketing, Executives, Real Estate Location intelligence is the intersection of people and space
  • 17. Responsibilities of the Market Analytics Team: ● Geospatial strategy ● Data sourcing, provisioning ● Ad hoc analyses ● Develop predictive models ● Develop APIs ● Tooling ● Geo-expertise resource
  • 18. MODELS / DATA SCIENCE
  • 19.
  • 20.
  • 22.
  • 23. Caveat: you have to define an area not a point location to attach a segment. There is no definitive segment for a location as it is a function of scale of the area being considered. Portland Zoomed out coarser-grained Portland Zoomed in finer-grained 500m radius around each location was the smallest radius in which Esri would assign a segment to each location (smaller radii led to missing data). There is no correct scale. Nevertheless, we find some interesting findings. Segments Vary With Scale
  • 24. Enriched 140 WeWork US locations with 500m and 1km radius. Almost all were in handful of 67 segments. Goal: understand distribution of segments on current fleet Value: ● Filter ● Score Esri Tapestry Segmentation
  • 25. WeWork (open) Other Goal: predict which ZIP codes should have a coworking space, and why Coworking Model As of June 2017
  • 26. Score all ZIPs Opportunities: Where is there no coworking but there should be? Where is there coworking but there should not be? Train on balanced dataset Interpretable models: Decision trees, Logistic, NaiveBayes, kNN Enrich data Demographics, HHV, firmographics, daytime population... All Coworking spaces in US 3,600 Coworking spaces Aggregate by ZIP ~36,000 ZIP codes Test on unbalanced dataset Optimize for precision not accuracy or F1 Filter by population density ~16,000 Coworking spaces Goal: Predict which ZIP codes should have a coworking space, and why Value: ● List of ZIPs we should investigate ● Relative importance of features ● Validation of current fleet Coworking Model
  • 27. Comps Proforma: document set out case for a new location, including projected financial performance Given new Building, which are most similar “comps” in our fleet? Predict “detrended occupancy” All WeWork locations open >6 mo Enrich data Mine combinations of 1-10 features kNN with leave one out cross-validation Output ranked table for each WeWork Goal: Provide a ranked list of comparable WeWork locations given some non-WeWork location. Which features are important? Value: ● Better predictions ● Insights into drivers
  • 28. Comps Building Neighborhood Market Building # of floors # of businesses within Xm # of colleges within MSA Building total sq ft Distance to other WeWorks Undergrad, grad enrollment Year built How far can one walk, bike, drive in X seconds Population who commute by car, walk Building Class #businesses of different sizes Household income Building Rating Per capita income House values by ZIP WW #floors, #offices, #desks Walk, bike, transit score Daytime population Goal: Provide a ranked list of comparable WeWork locations given some non-WeWork location. Which features are important? Value: ● Better predictions ● Insights into drivers
  • 29. NY LA SF DC Goal: Provide a ranked list of comparable WeWork locations given some non-WeWork location. Which features are important? Chord diagram of nearest neighbor of each location Comps
  • 30. Goal: Determine whether # of amenities in different classes predict WeWork location success Value: ● Location Score / Feature ● Key categories? All WeWork locations open >6 mo List of all US storefront businesses For each business category & distance, how predictive is that category? lm(occupancy ~ #chinese restaurants within 200m) lm(occupancy ~ #pizza restaurants within 200m) ... lm(gyms ~ #pizza restaurants within 800m) Amenities
  • 31. ● Score every reasonable location in U.S. ○ Location attributes ■ Across street: Western Union , Blue Bottle ? ■ Neighborhood: food, transport, river/parks, fitness... ● Output: simple thumbs up / down ● Value: ○ Heatmaps ○ Monitor locations (buildings, other businesses) Goal: score every urban intersection in US with thumbs up / thumbs down, and understand why Work In Progress
  • 33. “I think the best technologies, and Twitter is included in this, disappear. They fade into the background, and they’re relevant when you want to use them, and they get out of the way when you don’t” JACK DORSEY 2012 Charlie Rose interview
  • 35. Data Quality / Availability
  • 36. Same for metros? Metro Identifiers