3 Easy Ways to Reach Financial Freedom:
How Twitter Uses Geo to Win Advertising
Sen Xu
SIGSpatial 2016
MELT Workshop
Mobile Entity Localization, Tracking and Analysis
• Step one
– Use a catchy title
Twitter
• Twitter has more than 284 million monthly active users. (October 2014)
• 500 million Tweets are sent per day, or 1 billion every ~2 days. (August 2013)
• More than 300 billion Tweets have been sent since company founding in 2006. (October 2013)
• TPS record: one-second peak of 143,199 Tweets per second, in Japan (August 2013)
• 80% of our active users are mobile users. (October 2014)
• 40% of our active users simply consume content on Twitter.
• Twitter supports 35 different languages. (March 2013)
• 77% of Twitter accounts are outside the U.S. (October 2013)
• Content Generation
– How to create features that make users want to share private information with us?
• How to get users to turn on location services?
• How to collect users' birthdays?
– Import third party data: data plumbing
– Data Correctness/Legal Issue/Disputed Territory
• Monetization
– Features for the other side: advertisers
– Targeting: Geo, Age, Interest, Behavior (Follow/Following)
• Service/Technology (AKA How to make your service faster)
– QA your data source
– Tech infra (Geohash-based Reverse-geocoding)
Content Generation
How to create features so attractive that users are willing to share data
Targeting criteria: Geo, Bio, Behavior
Ads Analytics
Features for the other side: advertisers
Nielsen DMA
Selection
Cleaning/Plumbing
Service
Dealing with third-party data
QA of vendor data is absolutely necessary
Spot the difference?
PlaceType: COUNTRY PlaceType: TOWN
More interesting (potentially dangerous) insights:
Pitney Bowes Geometry (conflated)
Zipcode:
United States Mexico
The same PlaceType may have different coverage in different countries
Territory in Dispute
Definition
Plumbing
Cleaning
Geo Data Pipeline Infra
In the Geo Stack…
Place:
– A place has id, names, attributes, parents, and geography
• place_id: unique u64 id
• name: one place may have multiple names, but only one preferred name
• attributes (annotations): open-ended key-value store for custom attributes. For a POI: address, phone, URL, Twitter handle, existence, etc.
• parents: the upper administrative level, e.g., in the US, a City's closest parent is a State (Admin1). Or determined by geometry containment, e.g., a POI can have a Neighborhood as parent if it is contained by it.
• geography: point (for POI), polygon/multi-polygon (for all other place types). line geometry
Glossary
POI: Point of Interest. Uses a point (lat, lon) as a simplified representation of a place. Common POIs are restaurants, landmarks, parks, and dentist offices*
*Although all POIs can be interesting/useful on certain occasions, some will be more interesting than others for geotagging purposes.
Pitney Bowes: Polygonal data vendor (188 countries)
Factual: POI data vendor (49 countries)
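The Place definition above could be sketched as a small record type. This is only an illustration: the field names other than place_id, the exactly-one-geometry rule, and the validation are assumptions made for the example, not the actual Geoduck schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

Point = Tuple[float, float]  # (lat, lon)


@dataclass
class Place:
    """Hypothetical record mirroring the slide: id, names, attributes, parents, geography."""
    place_id: int                                                # unique unsigned 64-bit id
    preferred_name: str                                          # exactly one preferred name
    alternate_names: List[str] = field(default_factory=list)     # any other names
    attributes: Dict[str, str] = field(default_factory=dict)     # open-ended key-value annotations
    parent_ids: List[int] = field(default_factory=list)          # admin or containment parents
    point: Optional[Point] = None                                # geography for a POI
    polygons: List[List[Point]] = field(default_factory=list)    # geography for other place types

    def __post_init__(self) -> None:
        if not 0 <= self.place_id < 2 ** 64:
            raise ValueError("place_id must fit in an unsigned 64-bit integer")
        # Assumption: a place carries either a point or polygon(s), never both or neither.
        if (self.point is None) == (not self.polygons):
            raise ValueError("a place needs exactly one of: point, polygons")


poi = Place(
    place_id=1,
    preferred_name="Joe's Pizza",
    attributes={"phone": "555-0100"},
    point=(40.7306, -74.0021),
)
```

Keeping attributes as a plain dictionary reflects the "open-ended key-value store" wording; a stricter schema would trade that flexibility for validation.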
What kind of data do we need for a fully-fledged Geo Service?
Service | Required Data Set | Rockdove | Geoduck
Geocoding (text to lat/lon), Reverse-Geocoding (lat/lon to text) | Popular geopolitical names and geometry (e.g., Neighborhood, City, State, Country) | Unresolved merge of 13 different data sources of various data quality | Pitney Bowes
Geocoding, Reverse-Geocoding | Polygonal data for specific marketing needs | Unlicensed simplified geometries | Nielsen
Geocoding, Reverse-Geocoding | Useful, high-quality POI | UGC… | Factual
IP reverse-lookup | IP blocks to lat/lon or Place (confidence) | NetAcuity | NetAcuity with user modeling
Historically… Rockdove allows
• User-generated places (e.g., Mom's basement)
• Overlaps within the same PlaceType (data bug!)
Requirements for Geoduck
• Geometries within each PlaceType do not overlap each other
• Keep Reverse-Geocoding (RGC) Trie sane
• Maintain Rockdove ID
• Historically geo-tagged Tweets will display correctly (deleted)
• Reuse Rockdove ID and update with geometry
• Historically geotagged "New York City" Tweets will be related to the same PlaceID, with updated geometry and attributes
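The no-overlap requirement lends itself to an automated QA check. The sketch below is a coarse pre-filter only: it uses bounding boxes as a stand-in for real geometries (an assumption for the example), so a flagged pair still has to be confirmed with a true polygon intersection test. The function names and the tuple layout are hypothetical.

```python
from itertools import combinations
from typing import Dict, List, Tuple

BBox = Tuple[float, float, float, float]  # (min_lat, min_lon, max_lat, max_lon)


def bboxes_intersect(a: BBox, b: BBox) -> bool:
    """True when the two boxes share interior area (touching edges do not count)."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def overlap_candidates(places: List[Tuple[int, str, BBox]]) -> List[Tuple[str, int, int]]:
    """Return (place_type, id_a, id_b) for same-type pairs whose boxes intersect."""
    by_type: Dict[str, List[Tuple[int, BBox]]] = {}
    for place_id, place_type, bbox in places:
        by_type.setdefault(place_type, []).append((place_id, bbox))

    flagged = []
    for place_type, members in by_type.items():
        # Only compare places of the same PlaceType, as the requirement states.
        for (id_a, box_a), (id_b, box_b) in combinations(members, 2):
            if bboxes_intersect(box_a, box_b):
                flagged.append((place_type, id_a, id_b))
    return flagged


sample = [
    (1, "CITY", (0.0, 0.0, 2.0, 2.0)),
    (2, "CITY", (1.0, 1.0, 3.0, 3.0)),
    (3, "CITY", (5.0, 5.0, 6.0, 6.0)),
    (4, "NEIGHBORHOOD", (0.5, 0.5, 1.0, 1.0)),
]
flagged_pairs = overlap_candidates(sample)
```

The pairwise loop is quadratic per PlaceType, which is fine for a sketch; a production check would likely use a spatial index.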
Geoduck Data Pipeline (v1)
Data Pipeline
Conflation Challenges
• Duplicate places come from different vendors with slightly different names and geometries
• Simple Solution: For each incoming place, find potential candidates (name-match, Levenshtein distance), then validate using geometry
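A minimal sketch of the "simple solution" above: shortlist candidates by Levenshtein distance on names, then validate with geometry. The geometry check here is a centroid distance, which is an assumption made to keep the example self-contained; the thresholds, the dictionary layout, and the function names are likewise hypothetical.

```python
import math
from typing import Dict, List, Tuple


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance, one row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def haversine_km(p: Tuple[float, float], q: Tuple[float, float]) -> float:
    """Great-circle distance between two (lat, lon) points in kilometres."""
    lat1, lon1, lat2, lon2 = map(math.radians, (p[0], p[1], q[0], q[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def find_duplicates(incoming: Dict, existing: List[Dict],
                    max_edits: int = 2, max_km: float = 1.0) -> List[int]:
    """Ids of existing places that match the incoming place by name and location."""
    matches = []
    for place in existing:
        # Step 1: name match, tolerant of small spelling differences.
        if levenshtein(incoming["name"].lower(), place["name"].lower()) > max_edits:
            continue
        # Step 2: geometry validation (centroid distance as a stand-in).
        if haversine_km(incoming["centroid"], place["centroid"]) <= max_km:
            matches.append(place["id"])
    return matches


existing_places = [
    {"id": 1, "name": "Joe's Pizza", "centroid": (40.7306, -74.0021)},
    {"id": 2, "name": "Joe's Pizza", "centroid": (34.0500, -118.2400)},
]
incoming_place = {"name": "Joes Pizza", "centroid": (40.7307, -74.0020)}
duplicates = find_duplicates(incoming_place, existing_places)
```

The second record shows why the geometry step matters: the names are identical, but the places are on opposite coasts.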
Reverse-Geocoding: from O(N log N) to O(1)
Geohash
Geometry/Geography Input -> Transform into Geohash with precision set arbitrarily (e.g., precision = 7) -> Output Data Structure
Geohash-based Reverse-Geocoding
Geohash-based Reverse geocoding
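One way to picture geohash-based reverse geocoding: each place's geometry is turned into the set of geohash cells it covers ahead of time, and a lookup becomes a single hash-table access on the query point's geohash. The sketch below uses the standard geohash encoding, but the bounding-box geometries, the plain dictionary (rather than a trie), and the function names are assumptions for the example, not the actual Twitter implementation.

```python
import math
from typing import Dict, Optional, Set, Tuple

_BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"
BBox = Tuple[float, float, float, float]  # (min_lat, min_lon, max_lat, max_lon)


def geohash_encode(lat: float, lon: float, precision: int = 7) -> str:
    """Standard geohash: interleave longitude and latitude bisection bits."""
    lat_lo, lat_hi, lon_lo, lon_hi = -90.0, 90.0, -180.0, 180.0
    chars, bits, bit_count, use_lon = [], 0, 0, True
    while len(chars) < precision:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            bit = lon >= mid
            lon_lo, lon_hi = (mid, lon_hi) if bit else (lon_lo, mid)
        else:
            mid = (lat_lo + lat_hi) / 2
            bit = lat >= mid
            lat_lo, lat_hi = (mid, lat_hi) if bit else (lat_lo, mid)
        bits = (bits << 1) | int(bit)
        use_lon = not use_lon
        bit_count += 1
        if bit_count == 5:  # every 5 bits become one base-32 character
            chars.append(_BASE32[bits])
            bits, bit_count = 0, 0
    return "".join(chars)


def cover_bbox(bbox: BBox, precision: int = 7) -> Set[str]:
    """All geohash cells at this precision that intersect the bounding box."""
    min_lat, min_lon, max_lat, max_lon = bbox
    lon_bits, lat_bits = (5 * precision + 1) // 2, (5 * precision) // 2
    lat_step, lon_step = 180.0 / (1 << lat_bits), 360.0 / (1 << lon_bits)
    first_row = math.floor((min_lat + 90.0) / lat_step)
    last_row = min(math.floor((max_lat + 90.0) / lat_step), (1 << lat_bits) - 1)
    first_col = math.floor((min_lon + 180.0) / lon_step)
    last_col = min(math.floor((max_lon + 180.0) / lon_step), (1 << lon_bits) - 1)
    cells = set()
    for row in range(first_row, last_row + 1):
        for col in range(first_col, last_col + 1):
            # Encode the cell centre so the result is unambiguous.
            cells.add(geohash_encode(-90.0 + (row + 0.5) * lat_step,
                                     -180.0 + (col + 0.5) * lon_step, precision))
    return cells


def build_index(places: Dict[int, BBox], precision: int = 7) -> Dict[str, int]:
    """Offline step: map every covered geohash cell to a place id (first writer wins)."""
    index: Dict[str, int] = {}
    for place_id, bbox in places.items():
        for cell in cover_bbox(bbox, precision):
            index.setdefault(cell, place_id)
    return index


def reverse_geocode(index: Dict[str, int], lat: float, lon: float,
                    precision: int = 7) -> Optional[int]:
    """Online step: one encode plus one dictionary lookup."""
    return index.get(geohash_encode(lat, lon, precision))


index = build_index({101: (42.58, -5.62, 42.62, -5.59)}, precision=5)
hit = reverse_geocode(index, 42.6, -5.6, precision=5)
miss = reverse_geocode(index, 10.0, 10.0, precision=5)
```

The precision controls the trade-off: a higher precision gives finer cells and a tighter fit to the geometry, at the cost of a larger index.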
• What happens when users don't share GPS?
– IP: NetAcuity, MaxMind, Neustar
– DIY?
• Blacklist
• Whitelist
• Requires polygons
Mapping Uber’s Future: Uber Maps is Hiring
*https://newsroom.uber.com/mapping-ubers-future/
“Over the past decade mapping innovation has disrupted industries and changed daily life in ways I couldn’t have imagined when I started. That progress will only accelerate in the coming years especially with technologies like self-driving cars. I remain excited by the prospect of how maps can put the world at our fingertips, improve everyday life, impact billions of people and enable innovations we can’t even imagine today.”
--Brian McClendon, VP of Engineering, Uber
Twitter: @alex_senxu
Wechat: senxu_alex
Email: sen@uber.com

Editor's notes

  1. In the Geo stack, the data pipeline is the part that covers vendor delivery, Rockdove data migration, database schema design, the conflation pipeline (import, normalize, conflate), various data quality reports, coverage maps, and data export. It is the process that precedes the service, and the data it provides determines what kind of data the service will serve. The primary goal of the data pipeline is to deliver sane, high-quality data downstream.