SlideShare une entreprise Scribd logo
1  sur  41
Télécharger pour lire hors ligne
Recommendation Architecture
Jeremy Schiff
MLConf 2015
03/27/2015
BEFORE	
   DURING	
  	
   AFTER	
  
DINERS	
  RESTAURANTS	
  
Understanding	
  
&	
  Evolving	
  
A2rac4ng	
  &	
  
Planning	
  
OpenTable: Deliver great experiences at 

every step, based on who you are
Proprietary	
   2	
  
OpenTable in Numbers
• Our network connects diners with more than
32,000 restaurants worldwide.
• Our diners have spent more than $30 billion
at our partner restaurants.
• OpenTable seats more than 16 million diners
each month.
• Every month, OpenTable diners write more
than 450,000 restaurant reviews

3	
  
Recommendations 
>>
Collaborative Filtering
4	
  
So what are recommendations?
5	
  
Building Recommendation Systems
• Importance of A/B
Testing
• Generating
Recommendations
• Recommendation
Explanations


6	
  
What’s the Goal

Minimizing Engineering Time to Improve The
Metric that Matters

• Make it Easy to Measure
• Make it Easy to Iterate
• Reduce Iteration Cycle Times
7	
  
Importance of A/B Testing
• If you don’t measure it,
you can’t improve it

 
 
 

• Metrics Drive Behavior
• Continued Forward
Progress
8	
  
Pick Your Business Metric
Revenue, Conversions 
• OpenTable 
• Amazon 

Engagement
• Netflix
• Pandora
• Spotify
9	
  
Measuring & The Iteration Loop
A/B	
  	
  
Tes4ng	
  
Weeks	
  
Measure	
  
10	
  
Measuring & The Iteration Loop
Op4mize	
  	
  
Models	
  
A/B	
  	
  
Tes4ng	
  
Days	
   Weeks	
  
Predict	
   Measure	
  
11	
  
Measuring & The Iteration Loop
Analyze	
  &	
  	
  
Introspect	
  
Op4mize	
  	
  
Models	
  
A/B	
  	
  
Tes4ng	
  
Hours	
   Days	
   Weeks	
  
Insights	
   Predict	
   Measure	
  
12	
  
Ranking Objectives
Objectives: 
• Training Error
- Minimize Loss Function
§  Often Convex
• Generalization Error
- Precision at K
• A/B Metric
- Conversion / Engagement

13	
  
Training, Generalization, and Online Error
• Training: Train on your specific dataset
- Dealing with Sparseness
• Test/Generalization: How does it generalize
to unseen data?
- Hyper-Parameter Tuning
• Online: How does it perform in the wild
- Model interaction effects between recommend
items (diversity)
Fundamental Differences in Usage

Right now vs. Planning

Cost of Being Wrong

Search vs. Recommendations


15	
  
Recommendation Stack
Query	
  Interpreta4on	
  
Retrieval	
  
Ranking	
  –	
  Item	
  &	
  Explana4on	
  
Index	
  
Building	
  
Context	
  for	
  Query	
  &	
  User	
  	
  
Model	
  
Building	
  
Explana4on	
  
Content	
  
Visualiza4on	
  
Collabora4ve	
  
Filters	
  
Item	
  /	
  User	
  
Metadata	
  
16	
  
Using Context, Frequency & Sentiment
•  Context
- Implicit: Location, Time, Mobile/Web
- Explicit: Query
•  High End Restaurant for Dinner
- Low Frequency, High Sentiment
•  Fast, Mediocre Sushi for Lunch
- High Frequency, Moderate
Sentiment

 

17	
  
How to use this data
• Frequency Data: 
- General: Popularity
- Personalized: Implicit CF
• Sentiment Data: 
- General: Good Experience
- Personalized: Explicit CF
• Good Recommendation
- Use both to drive your Business Metric
18	
  
Ranking
Phase 1: Bootstrap through heuristics

Phase 2: Learn to Rank
•  Many models
- E [ Revenue | Query, Position, Item, User ]
- E [ Engagement | Query, Position, Item, User ]
- Regression, RankSVM, LambdaMart…
•  Modeling Diversity is Important 


19	
  
Training Example
•  Context Free (Collaborative Filtering)
- Train for Content Based and Collaborative Filtering models.
- Create an Ensemble Model
- Perform Hyper-Parameter Tuning for each model
•  With Context (Search)
- Train a model using query (implicit & explicit)
§  Includes Context-Free Model
- Perform Hyper-Parameter Tuning
•  Evaluate Model using A/B
- Change models, objective functions, etc.
Training DataFlow
Collabora4ve	
  Filter	
  
Service	
  
(Real4me)	
  
Collabora4ve	
  Filter	
  
HyperParameter	
  Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Collabora4ve	
  Filter	
  
Training	
  
(Batch	
  with	
  Spark)	
  
Training DataFlow
Collabora4ve	
  Filter	
  
Service	
  
(Real4me)	
  
Collabora4ve	
  Filter	
  
HyperParameter	
  Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Collabora4ve	
  Filter	
  
Training	
  
(Batch	
  with	
  Spark)	
  
Search	
  Service	
  
(Real4me)	
  
Search	
  HyperParameter	
  
Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Search	
  Training	
  
(Batch	
  with	
  Spark)	
  
Training DataFlow
Collabora4ve	
  Filter	
  
Service	
  
(Real4me)	
  
Collabora4ve	
  Filter	
  
HyperParameter	
  Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Collabora4ve	
  Filter	
  
Training	
  
(Batch	
  with	
  Spark)	
  
Search	
  Service	
  
(Real4me)	
  
Search	
  HyperParameter	
  
Tuning	
  	
  
(Batch	
  with	
  Spark)	
  
Search	
  Training	
  
(Batch	
  with	
  Spark)	
  
User	
  Interac4on	
  Logs	
  
(Ka_a)	
  
A/B	
  Tes4ng	
  Dashboards	
  
Other	
  Services	
  
Compelling Recommendations
24	
  
Recommendation Explanations
•  Amazon

•  Ness
•  Netflix

•  Ness - Social
25	
  
Summarizing Content

• Essential for Mobile
• Balance Utility With Trust?
- Summarize, but surface raw
data

• Example: 
- Initially, read every review
- Later, use average star rating
26	
  
Summarizing Restaurant Attributes
27	
  
Dish Recommendation
• What to try once I have arrived?

28	
  
Edit	
  via	
  the	
  Header	
  &	
  Footer	
  menu	
  in	
  
PowerPoint	
  
29	
  29	
  
Analyzing Review Content
30	
  
The ingredients of a spectacular

dining experience…
31	
  
… and a spectacularly bad one
32	
  
Content Features

Pandora
• Music Genome Project

Natural Language Processing
• Topics & Tags
33	
  
Topic Modeling Methods
We applied two main topic
modeling methods:
• Latent Dirichlet Allocation
(LDA) 
- (Blei et al. 2003)
• Non-negative Matrix
Factorization (NMF) 
- (Aurora et al. 2012)
34	
  
The food was great! I loved the view of the
sailboats.

Bag of Words Model
food	
   great	
   chicken	
   sailboat	
   view	
   service	
  
1	
   1	
   0	
   1	
   1	
   0	
  
35	
  
Topics with NMF using TF-IDF
Word	
  1	
   Word	
  …	
   Word	
  N	
  
Review	
  1	
   0.8	
   0.9	
   0	
  
Review	
  …	
   0.6	
   0	
   0.8	
  
Review	
  N	
   0.9	
   0	
   0.8	
  
Reviews	
  
X	
  
Words	
  
Reviews	
  
X	
  
Topics	
  
Topics	
  
X	
  
Words	
  
36	
  
Describing Restaurants as Topics
Each	
  review	
  for	
  a	
  
given	
  restaurant	
  	
  
has	
  certain	
  topic	
  
distribuCon	
  
Combining	
  them,	
  
we	
  idenCfy	
  the	
  top	
  
topics	
  for	
  that	
  
restaurant.	
  
Topic 01! Topic 02! Topic 03! Topic 04! Topic 05!
Topic 01! Topic 02! Topic 03! Topic 04! Topic 05!
Topic 01! Topic 02! Topic 03! Topic 04! Topic 05!
review	
  1	
  
review	
  2	
  
review	
  N	
  
.	
  
.	
  
.	
  
Topic 01! Topic 02! Topic 03! Topic 04! Topic 05!
Restaurant	
  
37	
  
Examples of Topics
38	
  
Varying Topic By Region
•  San Francisco
•  `

•  London
•  Chicago

•  New York
39	
  
Building Recommendation Systems
• Importance of A/B
Testing
• Generating
Recommendations
• Recommendation
Explanations

40	
  
Thanks!
Jeremy Schiff
jschiff@opentable.com

Contenu connexe

Similaire à Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC

Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...Jeremy Schiff
 
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTableRecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTableJeremy Schiff
 
GraphTour: De-cyphering recipes with Neo4j
GraphTour: De-cyphering recipes with Neo4jGraphTour: De-cyphering recipes with Neo4j
GraphTour: De-cyphering recipes with Neo4jNeo4j
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyMaya Hristakeva
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyKris Jack
 
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016Talis
 
Website Best Practices for the High Tech Marketer
Website Best Practices for the High Tech MarketerWebsite Best Practices for the High Tech Marketer
Website Best Practices for the High Tech Marketeredynamic
 
Making your site easier to use, an in-house usability testing case study
Making your site easier to use, an in-house usability testing case studyMaking your site easier to use, an in-house usability testing case study
Making your site easier to use, an in-house usability testing case studyJason Samuels
 
UX Webinar: Always Be Testing
UX Webinar: Always Be TestingUX Webinar: Always Be Testing
UX Webinar: Always Be TestingCharity Dynamics
 
Lets eat presentation_final_20160521
Lets eat presentation_final_20160521Lets eat presentation_final_20160521
Lets eat presentation_final_20160521Lesley Chapman
 
SEO Get up to speed - Williton
SEO  Get up to speed - WillitonSEO  Get up to speed - Williton
SEO Get up to speed - WillitonGet up to Speed
 
Startupmetrics4pirates dave mcclure
Startupmetrics4pirates dave mcclureStartupmetrics4pirates dave mcclure
Startupmetrics4pirates dave mcclureYounghwan Cheon
 
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...Scrum Bangalore
 
Embrace and Beyond Mobility: Design for the Ideal Dining Experience | 拥抱和超越移...
Embrace and Beyond Mobility:  Design for the Ideal Dining Experience | 拥抱和超越移...Embrace and Beyond Mobility:  Design for the Ideal Dining Experience | 拥抱和超越移...
Embrace and Beyond Mobility: Design for the Ideal Dining Experience | 拥抱和超越移...UX Consulting Pte Ltd
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimJin Young Kim
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation SystemsAxel de Romblay
 
Getting Design in Your Company's DNA
Getting Design in Your Company's DNAGetting Design in Your Company's DNA
Getting Design in Your Company's DNABrian Sullivan
 
digital marketing company in pcmc
digital marketing company in pcmcdigital marketing company in pcmc
digital marketing company in pcmcjiyapatsing
 

Similaire à Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC (20)

Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
Recommendation Architecture - OpenTable - RecSys 2014 - Large Scale Recommend...
 
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTableRecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
RecSys 2015 - Unifying the Problem of Search and Recommendations at OpenTable
 
GraphTour: De-cyphering recipes with Neo4j
GraphTour: De-cyphering recipes with Neo4jGraphTour: De-cyphering recipes with Neo4j
GraphTour: De-cyphering recipes with Neo4j
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
Talis Aspire Update - Keji Adedeji | Talis Insight Europe 2016
 
Edge Benchmarks
Edge BenchmarksEdge Benchmarks
Edge Benchmarks
 
Website Best Practices for the High Tech Marketer
Website Best Practices for the High Tech MarketerWebsite Best Practices for the High Tech Marketer
Website Best Practices for the High Tech Marketer
 
Making your site easier to use, an in-house usability testing case study
Making your site easier to use, an in-house usability testing case studyMaking your site easier to use, an in-house usability testing case study
Making your site easier to use, an in-house usability testing case study
 
UX Webinar: Always Be Testing
UX Webinar: Always Be TestingUX Webinar: Always Be Testing
UX Webinar: Always Be Testing
 
Omnyscope e245 march 2014 final
Omnyscope e245 march 2014 finalOmnyscope e245 march 2014 final
Omnyscope e245 march 2014 final
 
Lets eat presentation_final_20160521
Lets eat presentation_final_20160521Lets eat presentation_final_20160521
Lets eat presentation_final_20160521
 
SEO Get up to speed - Williton
SEO  Get up to speed - WillitonSEO  Get up to speed - Williton
SEO Get up to speed - Williton
 
Startupmetrics4pirates dave mcclure
Startupmetrics4pirates dave mcclureStartupmetrics4pirates dave mcclure
Startupmetrics4pirates dave mcclure
 
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
Don't drive your Race car on a dirt track!! - Athresh Krishnappa, Scrum Banga...
 
Embrace and Beyond Mobility: Design for the Ideal Dining Experience | 拥抱和超越移...
Embrace and Beyond Mobility:  Design for the Ideal Dining Experience | 拥抱和超越移...Embrace and Beyond Mobility:  Design for the Ideal Dining Experience | 拥抱和超越移...
Embrace and Beyond Mobility: Design for the Ideal Dining Experience | 拥抱和超越移...
 
Measuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kimMeasuring the Quality of Online Service - Jinyoung kim
Measuring the Quality of Online Service - Jinyoung kim
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems
 
Getting Design in Your Company's DNA
Getting Design in Your Company's DNAGetting Design in Your Company's DNA
Getting Design in Your Company's DNA
 
digital marketing company in pcmc
digital marketing company in pcmcdigital marketing company in pcmc
digital marketing company in pcmc
 

Plus de MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 

Plus de MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Dernier

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 

Dernier (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC

  • 2. BEFORE   DURING     AFTER   DINERS  RESTAURANTS   Understanding   &  Evolving   A2rac4ng  &   Planning   OpenTable: Deliver great experiences at 
 every step, based on who you are Proprietary   2  
  • 3. OpenTable in Numbers • Our network connects diners with more than 32,000 restaurants worldwide. • Our diners have spent more than $30 billion at our partner restaurants. • OpenTable seats more than 16 million diners each month. • Every month, OpenTable diners write more than 450,000 restaurant reviews 3  
  • 5. So what are recommendations? 5  
  • 6. Building Recommendation Systems • Importance of A/B Testing • Generating Recommendations • Recommendation Explanations 6  
  • 7. What’s the Goal Minimizing Engineering Time to Improve The Metric that Matters • Make it Easy to Measure • Make it Easy to Iterate • Reduce Iteration Cycle Times 7  
  • 8. Importance of A/B Testing • If you don’t measure it, you can’t improve it • Metrics Drive Behavior • Continued Forward Progress 8  
  • 9. Pick Your Business Metric Revenue, Conversions • OpenTable • Amazon Engagement • Netflix • Pandora • Spotify 9  
  • 10. Measuring & The Iteration Loop A/B     Tes4ng   Weeks   Measure   10  
  • 11. Measuring & The Iteration Loop Op4mize     Models   A/B     Tes4ng   Days   Weeks   Predict   Measure   11  
  • 12. Measuring & The Iteration Loop Analyze  &     Introspect   Op4mize     Models   A/B     Tes4ng   Hours   Days   Weeks   Insights   Predict   Measure   12  
  • 13. Ranking Objectives Objectives: • Training Error - Minimize Loss Function §  Often Convex • Generalization Error - Precision at K • A/B Metric - Conversion / Engagement 13  
  • 14. Training, Generalization, and Online Error • Training: Train on your specific dataset - Dealing with Sparseness • Test/Generalization: How does it generalize to unseen data? - Hyper-Parameter Tuning • Online: How does it perform in the wild - Model interaction effects between recommend items (diversity)
  • 15. Fundamental Differences in Usage Right now vs. Planning Cost of Being Wrong Search vs. Recommendations 15  
  • 16. Recommendation Stack Query  Interpreta4on   Retrieval   Ranking  –  Item  &  Explana4on   Index   Building   Context  for  Query  &  User     Model   Building   Explana4on   Content   Visualiza4on   Collabora4ve   Filters   Item  /  User   Metadata   16  
  • 17. Using Context, Frequency & Sentiment •  Context - Implicit: Location, Time, Mobile/Web - Explicit: Query •  High End Restaurant for Dinner - Low Frequency, High Sentiment •  Fast, Mediocre Sushi for Lunch - High Frequency, Moderate Sentiment 17  
  • 18. How to use this data • Frequency Data: - General: Popularity - Personalized: Implicit CF • Sentiment Data: - General: Good Experience - Personalized: Explicit CF • Good Recommendation - Use both to drive your Business Metric 18  
  • 19. Ranking Phase 1: Bootstrap through heuristics Phase 2: Learn to Rank •  Many models - E [ Revenue | Query, Position, Item, User ] - E [ Engagement | Query, Position, Item, User ] - Regression, RankSVM, LambdaMart… •  Modeling Diversity is Important 19  
  • 20. Training Example •  Context Free (Collaborative Filtering) - Train for Content Based and Collaborative Filtering models. - Create an Ensemble Model - Perform Hyper-Parameter Tuning for each model •  With Context (Search) - Train a model using query (implicit & explicit) §  Includes Context-Free Model - Perform Hyper-Parameter Tuning •  Evaluate Model using A/B - Change models, objective functions, etc.
  • 21. Training DataFlow Collabora4ve  Filter   Service   (Real4me)   Collabora4ve  Filter   HyperParameter  Tuning     (Batch  with  Spark)   Collabora4ve  Filter   Training   (Batch  with  Spark)  
  • 22. Training DataFlow Collabora4ve  Filter   Service   (Real4me)   Collabora4ve  Filter   HyperParameter  Tuning     (Batch  with  Spark)   Collabora4ve  Filter   Training   (Batch  with  Spark)   Search  Service   (Real4me)   Search  HyperParameter   Tuning     (Batch  with  Spark)   Search  Training   (Batch  with  Spark)  
  • 23. Training DataFlow Collabora4ve  Filter   Service   (Real4me)   Collabora4ve  Filter   HyperParameter  Tuning     (Batch  with  Spark)   Collabora4ve  Filter   Training   (Batch  with  Spark)   Search  Service   (Real4me)   Search  HyperParameter   Tuning     (Batch  with  Spark)   Search  Training   (Batch  with  Spark)   User  Interac4on  Logs   (Ka_a)   A/B  Tes4ng  Dashboards   Other  Services  
  • 25. Recommendation Explanations •  Amazon •  Ness •  Netflix •  Ness - Social 25  
  • 26. Summarizing Content • Essential for Mobile • Balance Utility With Trust? - Summarize, but surface raw data • Example: - Initially, read every review - Later, use average star rating 26  
  • 28. Dish Recommendation • What to try once I have arrived? 28  
  • 29. Edit  via  the  Header  &  Footer  menu  in   PowerPoint   29  29  
  • 31. The ingredients of a spectacular
 dining experience… 31  
  • 32. … and a spectacularly bad one 32  
  • 33. Content Features Pandora • Music Genome Project Natural Language Processing • Topics & Tags 33  
  • 34. Topic Modeling Methods We applied two main topic modeling methods: • Latent Dirichlet Allocation (LDA) - (Blei et al. 2003) • Non-negative Matrix Factorization (NMF) - (Aurora et al. 2012) 34  
  • 35. The food was great! I loved the view of the sailboats. Bag of Words Model food   great   chicken   sailboat   view   service   1   1   0   1   1   0   35  
  • 36. Topics with NMF using TF-IDF Word  1   Word  …   Word  N   Review  1   0.8   0.9   0   Review  …   0.6   0   0.8   Review  N   0.9   0   0.8   Reviews   X   Words   Reviews   X   Topics   Topics   X   Words   36  
  • 37. Describing Restaurants as Topics Each  review  for  a   given  restaurant     has  certain  topic   distribuCon   Combining  them,   we  idenCfy  the  top   topics  for  that   restaurant.   Topic 01! Topic 02! Topic 03! Topic 04! Topic 05! Topic 01! Topic 02! Topic 03! Topic 04! Topic 05! Topic 01! Topic 02! Topic 03! Topic 04! Topic 05! review  1   review  2   review  N   .   .   .   Topic 01! Topic 02! Topic 03! Topic 04! Topic 05! Restaurant   37  
  • 39. Varying Topic By Region •  San Francisco •  ` •  London •  Chicago •  New York 39  
  • 40. Building Recommendation Systems • Importance of A/B Testing • Generating Recommendations • Recommendation Explanations 40