SlideShare a Scribd company logo
1 of 25
Download to read offline
Geo-Spotting: Mining Online Location-based
Services for Optimal Retail Store Placement
Dmytro Karamshuk
King's College London

Based on the paper:
D. Karamshuk, A. Noulas, S. Scellato, V. Nicosia, C. Mascolo. Geo-Spotting: Mining Online

Location-based Services for Optimal Retail Store Placement. ACM SIGKDD Conference on
Knowledge Discovery and Data Mining (KDD), Chicago, 2013
Optimal Retail Location Problem

Among L possible locations in the city select one where
a new store would be most profitable/popular.
Optimal Retail Location Problem
The problem is not new
●

A. Athiyaman. Location decision making: the case of retail service development in a
closed population. In Academy of Marketing Studies, volume 15, page 13, 2010.

●

O. Berman and D. Krass. The generalized maximal covering location problem.
Computers & Operations Research, 29(6):563–581, 2002.

●

A. Kubis and M. Hartmann. Analysis of location of large-area shopping centres. a
probabilistic gravity model for the halle-leipzig area. Jahrbuch für
Regionalwissenschaft, 27(1):43–57, 2007.

●

Pablo Jensen. Network-based predictions of retail store commercial categories and
optimal locations. Phys. Rev. E, 74:035101, Sep 2006.

Our approach:
explore fine-grained and cheap data from LBSN
Location-based social networks

●

check-in at places

●

share with your friends

●

receive bonuses for check-ins

●

search for places

●

leave comments for others
Check-ins around the world

over 40M users

over 4.5B check-ins
Collecting the Data

Dataset collected in New York
●

37K venues

●

47K users

●

621K checkins

●

May – November, 2010
accounts for »25% of the original data
How popular is a venue?

The distance between the two places is only few hundred meters
How popular is a venue?
Distribution of check-ins per place

Geographic distribution of venues

size = #checkin
●

popularity can be several orders of magnitude different from place to place

●

probably it depends on the location and types of places
Popularity and type of venue

●

different types and chains of
venues have different usage
patterns

●

we cannot compare check-ins
across venues of different
chains but we can across
individual chains

Number of check-ins per place for
individual chains of restaurants
Co-location with other venues
How frequently we observe a Starbucks close to a
railway station?

Does it influence the popularity of a restaurant?
Pablo Jensen. Analyzing the localization of retail stores
with complex systems tools. IDA ’09, pages 10–20, Berlin,
Heidelberg, 2009. Springer-Verlag.
User mobility between places
How many users go to a Starbucks
after railway station?

●

there is correspondence between colocation and mobility patterns

●

but also many discrepancies
Optimal Retail Location Problem

Among L possible locations in the city select one where new store would be most popular.
Define the area
An area is defined as a disc of radius r around a point with geographical coordinates l

The area is described by a set of numeric features
check-ins at venues in the disk.

designed from
Geographic features of an area
●

density – number of venues in the area

●

neighbors entropy – heterogeneity of venue types

●

competitiveness – percentage of competing venues
Geographic features of an area
●

quality by Jensen
–

define inter-types attractiveness coefficients

–

weight surrounding venues by their attractiveness
Mobility features of an area
●

area popularity – total number of checkins in the area

●

transition density – intensity of transitions inside the area

●

incoming flows – intensity of transitions from outside areas
Mobility features of an area
●

transition quality
●

define transition coefficients for each type

●

weight venues according to the product of coefficient and
check-ins volume
Ranking problem
Use area features

to rank all areas in a given set L

according to their potential popularity.

Compare with the ground truth: ranking of places basing
on their actual popularity.
Evaluation metrics
Compare the predicted and ground truth rankings.
●

Top-K locations ranking – use NDCG@K

●

Accuracy of the best prediction – Accuracy@X% of having the
best predicted store in the Top-X% of ground truth ranking
We explore random cross-validation approach and report
average values across all experiments.
Performance of individual features
NDCG@10

●

some indicators are general across various chains while some are chain-specific

●

the lack of competitors in the area play positive role as do the existence of place attractors

●

performance of In.Flow is in accordance with the fact that McDonalds attract more users
from the remote areas
Considering fusion of factors
Explore the fusion of features in a supervised learning approach

●

regression for ranking – conduct regression using Linear Regression,
SVR or M5P and then rank according to regressed values

●

pair-wise ranking – learn on pair-wise comparison using neural
networks RankNet

Use the same evaluation methodology as for individual features.
Results of the supervised learning
NDCG@10

Individual features

Supervised learning

●

supervised learning has better performance than the the best individual feature

●

the combination of geographic features and mobility features yields better result than
the combination of geographic features alone

●

regression to rank with SVR is the best performing technique
The best location prediction

Supervised learning
Individual features
●

supervised learning yields reliable and significantly improved result

●

the best prediction lies in top-20% of the ground truth ranking with
probability over 80%
Implications
●

we show how fine-grained data from location-based social networks
can be effectively explored in geographic retail analysis

●

this can inspire further works in location-based advertising, developing
indexes of urban areas, provision of location-based services etc. etc.

●

particularly we see a lot of potential in the approach of measuring user
flows from check-ins in various applications

●

we also faced some challenges when scaling this approach to other
chains and cities
Thank you for your attention!
Dmytro Karamshuk
King's College London
follow me on Twitter: @karamshuk

More Related Content

Viewers also liked

Locations and Networks at scale: From insights to predictive models, workshop...
Locations and Networks at scale: From insights to predictive models, workshop...Locations and Networks at scale: From insights to predictive models, workshop...
Locations and Networks at scale: From insights to predictive models, workshop...Dima Karamshuk
 
Identifying Partisan Slant in News Articles and Twitter during Political Crises
Identifying Partisan Slant in News Articles and Twitter during Political CrisesIdentifying Partisan Slant in News Articles and Twitter during Political Crises
Identifying Partisan Slant in News Articles and Twitter during Political CrisesDima Karamshuk
 
ISP-friendly Peer-assisted On-demand Streaming of Long Duration Content in BB...
ISP-friendly Peer-assisted On-demand Streaming of Long Duration Content in BB...ISP-friendly Peer-assisted On-demand Streaming of Long Duration Content in BB...
ISP-friendly Peer-assisted On-demand Streaming of Long Duration Content in BB...Dima Karamshuk
 
CD-GAIN: Content Delivery Through the Analysis of Users' Access Patterns, ta...
CD-GAIN: Content Delivery  Through the Analysis of Users' Access Patterns, ta...CD-GAIN: Content Delivery  Through the Analysis of Users' Access Patterns, ta...
CD-GAIN: Content Delivery Through the Analysis of Users' Access Patterns, ta...Dima Karamshuk
 
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...Dima Karamshuk
 
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...Dima Karamshuk
 
Modeling the Social, Spatial, and Temporal dimensions of Human Mobility in a ...
Modeling the Social, Spatial, and Temporal dimensions of Human Mobility in a ...Modeling the Social, Spatial, and Temporal dimensions of Human Mobility in a ...
Modeling the Social, Spatial, and Temporal dimensions of Human Mobility in a ...Dima Karamshuk
 
Take-away TV: Recharging Work Commutes with Greedy and Predictive Preloading ...
Take-away TV: Recharging Work Commutes with Greedy and Predictive Preloading ...Take-away TV: Recharging Work Commutes with Greedy and Predictive Preloading ...
Take-away TV: Recharging Work Commutes with Greedy and Predictive Preloading ...Dima Karamshuk
 
GEOVISUALIZING SPATIO-TEMPORAL PATTERNS IN TENNIS
GEOVISUALIZING SPATIO-TEMPORAL PATTERNS IN TENNISGEOVISUALIZING SPATIO-TEMPORAL PATTERNS IN TENNIS
GEOVISUALIZING SPATIO-TEMPORAL PATTERNS IN TENNISDamien Demaj
 
Human mobility,urban structure analysis,and spatial community detection from ...
Human mobility,urban structure analysis,and spatial community detection from ...Human mobility,urban structure analysis,and spatial community detection from ...
Human mobility,urban structure analysis,and spatial community detection from ...Song Gao
 
AgensGraph: a Multi-model Graph Database based on PostgreSql
AgensGraph: a Multi-model Graph Database based on PostgreSqlAgensGraph: a Multi-model Graph Database based on PostgreSql
AgensGraph: a Multi-model Graph Database based on PostgreSqlKisung Kim
 

Viewers also liked (11)

Locations and Networks at scale: From insights to predictive models, workshop...
Locations and Networks at scale: From insights to predictive models, workshop...Locations and Networks at scale: From insights to predictive models, workshop...
Locations and Networks at scale: From insights to predictive models, workshop...
 
Identifying Partisan Slant in News Articles and Twitter during Political Crises
Identifying Partisan Slant in News Articles and Twitter during Political CrisesIdentifying Partisan Slant in News Articles and Twitter during Political Crises
Identifying Partisan Slant in News Articles and Twitter during Political Crises
 
ISP-friendly Peer-assisted On-demand Streaming of Long Duration Content in BB...
ISP-friendly Peer-assisted On-demand Streaming of Long Duration Content in BB...ISP-friendly Peer-assisted On-demand Streaming of Long Duration Content in BB...
ISP-friendly Peer-assisted On-demand Streaming of Long Duration Content in BB...
 
CD-GAIN: Content Delivery Through the Analysis of Users' Access Patterns, ta...
CD-GAIN: Content Delivery  Through the Analysis of Users' Access Patterns, ta...CD-GAIN: Content Delivery  Through the Analysis of Users' Access Patterns, ta...
CD-GAIN: Content Delivery Through the Analysis of Users' Access Patterns, ta...
 
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
 
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
On Factors Affecting the Usage and Adoption of a Nation-wide TV Streaming Ser...
 
Modeling the Social, Spatial, and Temporal dimensions of Human Mobility in a ...
Modeling the Social, Spatial, and Temporal dimensions of Human Mobility in a ...Modeling the Social, Spatial, and Temporal dimensions of Human Mobility in a ...
Modeling the Social, Spatial, and Temporal dimensions of Human Mobility in a ...
 
Take-away TV: Recharging Work Commutes with Greedy and Predictive Preloading ...
Take-away TV: Recharging Work Commutes with Greedy and Predictive Preloading ...Take-away TV: Recharging Work Commutes with Greedy and Predictive Preloading ...
Take-away TV: Recharging Work Commutes with Greedy and Predictive Preloading ...
 
GEOVISUALIZING SPATIO-TEMPORAL PATTERNS IN TENNIS
GEOVISUALIZING SPATIO-TEMPORAL PATTERNS IN TENNISGEOVISUALIZING SPATIO-TEMPORAL PATTERNS IN TENNIS
GEOVISUALIZING SPATIO-TEMPORAL PATTERNS IN TENNIS
 
Human mobility,urban structure analysis,and spatial community detection from ...
Human mobility,urban structure analysis,and spatial community detection from ...Human mobility,urban structure analysis,and spatial community detection from ...
Human mobility,urban structure analysis,and spatial community detection from ...
 
AgensGraph: a Multi-model Graph Database based on PostgreSql
AgensGraph: a Multi-model Graph Database based on PostgreSqlAgensGraph: a Multi-model Graph Database based on PostgreSql
AgensGraph: a Multi-model Graph Database based on PostgreSql
 

Similar to Geo-Spotting: Mining Online Location-based Services for Optimal Retail Store Placement

Introduction to Targomo
Introduction to TargomoIntroduction to Targomo
Introduction to TargomoTargomo GmbH
 
A Big Data Telco Solution by Dr. Laura Wynter
A Big Data Telco Solution by Dr. Laura WynterA Big Data Telco Solution by Dr. Laura Wynter
A Big Data Telco Solution by Dr. Laura Wynterwkwsci-research
 
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016MLconf
 
I ii x_slides_albakour_online
I ii x_slides_albakour_onlineI ii x_slides_albakour_online
I ii x_slides_albakour_onlineDyaa AlBakour
 
Volunteering assistance to online geocoding services through a distributed kn...
Volunteering assistance to online geocoding services through a distributed kn...Volunteering assistance to online geocoding services through a distributed kn...
Volunteering assistance to online geocoding services through a distributed kn...José Pablo Gómez Barrón S.
 
Multidimensional scaling & Conjoint Analysis
Multidimensional scaling & Conjoint AnalysisMultidimensional scaling & Conjoint Analysis
Multidimensional scaling & Conjoint AnalysisOmer Maroof
 
Using Spatial Analysis to Drive Post-Pandemic Site Selection in Retail
Using Spatial Analysis to Drive Post-Pandemic Site Selection in RetailUsing Spatial Analysis to Drive Post-Pandemic Site Selection in Retail
Using Spatial Analysis to Drive Post-Pandemic Site Selection in RetailCARTO
 
Exploratory_Analysis_of_Data_ppt.pdf
Exploratory_Analysis_of_Data_ppt.pdfExploratory_Analysis_of_Data_ppt.pdf
Exploratory_Analysis_of_Data_ppt.pdfRushikeshKulkarni71
 
Improving the Mobile Customer Experience by Combining User Behaviour Analysis...
Improving the Mobile Customer Experience by Combining User Behaviour Analysis...Improving the Mobile Customer Experience by Combining User Behaviour Analysis...
Improving the Mobile Customer Experience by Combining User Behaviour Analysis...Merlien Institute
 
Location, mobile,geofences, proximity targeting, ,location, audience targetin...
Location, mobile,geofences, proximity targeting, ,location, audience targetin...Location, mobile,geofences, proximity targeting, ,location, audience targetin...
Location, mobile,geofences, proximity targeting, ,location, audience targetin...Mike Maiorano
 
Maximizing Benefits via a Scoring Framework for Location-Based Services
Maximizing Benefits via a Scoring Framework for Location-Based ServicesMaximizing Benefits via a Scoring Framework for Location-Based Services
Maximizing Benefits via a Scoring Framework for Location-Based ServicesCognizant
 
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerceOff-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerceLadislav Peska
 
Orchestrating Collective Intelligence
Orchestrating Collective IntelligenceOrchestrating Collective Intelligence
Orchestrating Collective IntelligenceTuri, Inc.
 
Csr about
Csr aboutCsr about
Csr aboutCSR
 
New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...ijseajournal
 
Geo-Enablement of the Supply Chain Analytics
Geo-Enablement of the Supply Chain AnalyticsGeo-Enablement of the Supply Chain Analytics
Geo-Enablement of the Supply Chain AnalyticsNishant Sinha
 
Geomarketing analyses for business by GEPOL
Geomarketing analyses for business by GEPOLGeomarketing analyses for business by GEPOL
Geomarketing analyses for business by GEPOLGepol Sp. z o. o.
 

Similar to Geo-Spotting: Mining Online Location-based Services for Optimal Retail Store Placement (20)

Introduction to Targomo
Introduction to TargomoIntroduction to Targomo
Introduction to Targomo
 
A Big Data Telco Solution by Dr. Laura Wynter
A Big Data Telco Solution by Dr. Laura WynterA Big Data Telco Solution by Dr. Laura Wynter
A Big Data Telco Solution by Dr. Laura Wynter
 
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
Jonathan Lenaghan, VP of Science and Technology, PlaceIQ at MLconf ATL 2016
 
I ii x_slides_albakour_online
I ii x_slides_albakour_onlineI ii x_slides_albakour_online
I ii x_slides_albakour_online
 
GIS
GISGIS
GIS
 
Volunteering assistance to online geocoding services through a distributed kn...
Volunteering assistance to online geocoding services through a distributed kn...Volunteering assistance to online geocoding services through a distributed kn...
Volunteering assistance to online geocoding services through a distributed kn...
 
Multidimensional scaling & Conjoint Analysis
Multidimensional scaling & Conjoint AnalysisMultidimensional scaling & Conjoint Analysis
Multidimensional scaling & Conjoint Analysis
 
ch14.ppt
ch14.pptch14.ppt
ch14.ppt
 
ch14.ppt
ch14.pptch14.ppt
ch14.ppt
 
Using Spatial Analysis to Drive Post-Pandemic Site Selection in Retail
Using Spatial Analysis to Drive Post-Pandemic Site Selection in RetailUsing Spatial Analysis to Drive Post-Pandemic Site Selection in Retail
Using Spatial Analysis to Drive Post-Pandemic Site Selection in Retail
 
Exploratory_Analysis_of_Data_ppt.pdf
Exploratory_Analysis_of_Data_ppt.pdfExploratory_Analysis_of_Data_ppt.pdf
Exploratory_Analysis_of_Data_ppt.pdf
 
Improving the Mobile Customer Experience by Combining User Behaviour Analysis...
Improving the Mobile Customer Experience by Combining User Behaviour Analysis...Improving the Mobile Customer Experience by Combining User Behaviour Analysis...
Improving the Mobile Customer Experience by Combining User Behaviour Analysis...
 
Location, mobile,geofences, proximity targeting, ,location, audience targetin...
Location, mobile,geofences, proximity targeting, ,location, audience targetin...Location, mobile,geofences, proximity targeting, ,location, audience targetin...
Location, mobile,geofences, proximity targeting, ,location, audience targetin...
 
Maximizing Benefits via a Scoring Framework for Location-Based Services
Maximizing Benefits via a Scoring Framework for Location-Based ServicesMaximizing Benefits via a Scoring Framework for Location-Based Services
Maximizing Benefits via a Scoring Framework for Location-Based Services
 
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerceOff-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
Off-line vs. On-line Evaluation of Recommender Systems in Small E-commerce
 
Orchestrating Collective Intelligence
Orchestrating Collective IntelligenceOrchestrating Collective Intelligence
Orchestrating Collective Intelligence
 
Csr about
Csr aboutCsr about
Csr about
 
New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...
 
Geo-Enablement of the Supply Chain Analytics
Geo-Enablement of the Supply Chain AnalyticsGeo-Enablement of the Supply Chain Analytics
Geo-Enablement of the Supply Chain Analytics
 
Geomarketing analyses for business by GEPOL
Geomarketing analyses for business by GEPOLGeomarketing analyses for business by GEPOL
Geomarketing analyses for business by GEPOL
 

Recently uploaded

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 

Recently uploaded (20)

Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 

Geo-Spotting: Mining Online Location-based Services for Optimal Retail Store Placement

  • 1. Geo-Spotting: Mining Online Location-based Services for Optimal Retail Store Placement Dmytro Karamshuk King's College London Based on the paper: D. Karamshuk, A. Noulas, S. Scellato, V. Nicosia, C. Mascolo. Geo-Spotting: Mining Online Location-based Services for Optimal Retail Store Placement. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Chicago, 2013
  • 2. Optimal Retail Location Problem Among L possible locations in the city select one where a new store would be most profitable/popular.
  • 3. Optimal Retail Location Problem The problem is not new ● A. Athiyaman. Location decision making: the case of retail service development in a closed population. In Academy of Marketing Studies, volume 15, page 13, 2010. ● O. Berman and D. Krass. The generalized maximal covering location problem. Computers & Operations Research, 29(6):563–581, 2002. ● A. Kubis and M. Hartmann. Analysis of location of large-area shopping centres. a probabilistic gravity model for the halle-leipzig area. Jahrbuch für Regionalwissenschaft, 27(1):43–57, 2007. ● Pablo Jensen. Network-based predictions of retail store commercial categories and optimal locations. Phys. Rev. E, 74:035101, Sep 2006. Our approach: explore fine-grained and cheap data from LBSN
  • 4. Location-based social networks ● check-in at places ● share with your friends ● receive bonuses for check-ins ● search for places ● leave comments for others
  • 5. Check-ins around the world over 40M users over 4.5B check-ins
  • 6. Collecting the Data Dataset collected in New York ● 37K venues ● 47K users ● 621K checkins ● May – November, 2010 accounts for »25% of the original data
  • 7. How popular is a venue? The distance between the two places is only few hundred meters
  • 8. How popular is a venue? Distribution of check-ins per place Geographic distribution of venues size = #checkin ● popularity can be several orders of magnitude different from place to place ● probably it depends on the location and types of places
  • 9. Popularity and type of venue ● different types and chains of venues have different usage patterns ● we cannot compare check-ins across venues of different chains but we can across individual chains Number of check-ins per place for individual chains of restaurants
  • 10. Co-location with other venues How frequently we observe a Starbucks close to a railway station? Does it influence the popularity of a restaurant? Pablo Jensen. Analyzing the localization of retail stores with complex systems tools. IDA ’09, pages 10–20, Berlin, Heidelberg, 2009. Springer-Verlag.
  • 11. User mobility between places How many users go to a Starbucks after railway station? ● there is correspondence between colocation and mobility patterns ● but also many discrepancies
  • 12. Optimal Retail Location Problem Among L possible locations in the city select one where new store would be most popular.
  • 13. Define the area An area is defined as a disc of radius r around a point with geographical coordinates l The area is described by a set of numeric features check-ins at venues in the disk. designed from
  • 14. Geographic features of an area ● density – number of venues in the area ● neighbors entropy – heterogeneity of venue types ● competitiveness – percentage of competing venues
  • 15. Geographic features of an area ● quality by Jensen – define inter-types attractiveness coefficients – weight surrounding venues by their attractiveness
  • 16. Mobility features of an area ● area popularity – total number of checkins in the area ● transition density – intensity of transitions inside the area ● incoming flows – intensity of transitions from outside areas
  • 17. Mobility features of an area ● transition quality ● define transition coefficients for each type ● weight venues according to the product of coefficient and check-ins volume
  • 18. Ranking problem Use area features to rank all areas in a given set L according to their potential popularity. Compare with the ground truth: ranking of places basing on their actual popularity.
  • 19. Evaluation metrics Compare the predicted and ground truth rankings. ● Top-K locations ranking – use NDCG@K ● Accuracy of the best prediction – Accuracy@X% of having the best predicted store in the Top-X% of ground truth ranking We explore random cross-validation approach and report average values across all experiments.
  • 20. Performance of individual features NDCG@10 ● some indicators are general across various chains while some are chain-specific ● the lack of competitors in the area play positive role as do the existence of place attractors ● performance of In.Flow is in accordance with the fact that McDonalds attract more users from the remote areas
  • 21. Considering fusion of factors Explore the fusion of features in a supervised learning approach ● regression for ranking – conduct regression using Linear Regression, SVR or M5P and then rank according to regressed values ● pair-wise ranking – learn on pair-wise comparison using neural networks RankNet Use the same evaluation methodology as for individual features.
  • 22. Results of the supervised learning NDCG@10 Individual features Supervised learning ● supervised learning has better performance than the the best individual feature ● the combination of geographic features and mobility features yields better result than the combination of geographic features alone ● regression to rank with SVR is the best performing technique
  • 23. The best location prediction Supervised learning Individual features ● supervised learning yields reliable and significantly improved result ● the best prediction lies in top-20% of the ground truth ranking with probability over 80%
  • 24. Implications ● we show how fine-grained data from location-based social networks can be effectively explored in geographic retail analysis ● this can inspire further works in location-based advertising, developing indexes of urban areas, provision of location-based services etc. etc. ● particularly we see a lot of potential in the approach of measuring user flows from check-ins in various applications ● we also faced some challenges when scaling this approach to other chains and cities
  • 25. Thank you for your attention! Dmytro Karamshuk King's College London follow me on Twitter: @karamshuk