SlideShare une entreprise Scribd logo
1  sur  47
Télécharger pour lire hors ligne
Emmys, Oscars, and

Machine Learning Algorithms

@ Netflix Scale
November,	
  2013	
  
	
  

Xavier	
  Amatriain	
  
Director	
  -­‐	
  Algorithms	
  Engineering	
  -­‐	
  Ne<lix	
  

@xamat	
  
“In a simple Netlfix-style item recommender, we would
simply apply some form of matrix factorization (i.e NMF)”
What we were interested in:
§  High quality recommendations

Proxy question:
§  Accuracy in predicted rating
§  Improve by 10% = $1million!
From the Netflix Prize
to today

2006

2013
Big Data @Netflix

§  > 40M subscribers
§  Ratings: ~5M/day

Time

§  Searches: >3M/day

Geo-information

§  Plays: > 50M/day

Impressions Device Info
§  Streamed hours:
Metadata
Social

Ratings o  5B hours in Q3 2013
Member Behavior

Demographics
Smart Models

§  Regression models (Logistic, Linear,
Elastic nets)
§  SVD & other MF models
§  Factorization Machines
§  Restricted Boltzmann Machines
§  Markov Chains & other graph models
§  Clustering (from k-means to HDP)
§  Deep ANN
§  LDA
§  Association Rules
§  GBDT/RF
§  …
Deep-dive into
Netflix Algorithms
Rating Prediction
2007 Progress Prize
▪  Top 2 algorithms
▪  SVD - Prize RMSE: 0.8914
▪  RBM - Prize RMSE: 0.8990

▪  Linear blend Prize RMSE: 0.88
▪  Currently in use as part of Netflix’ rating prediction component
▪  Limitations
▪  Designed for 100M ratings, we have 5B ratings
▪  Not adaptable as users add ratings
▪  Performance issues
SVD - Definition

A[n x m] = U[n x r] Λ [ r x r] (V[m x r])T
▪ 

A: n x m matrix (e.g., n documents, m terms)

▪ 

U: n x r matrix (n documents, r concepts)

▪ 

▪ 

Λ: r x r diagonal matrix (strength of each ‘concept’) (r:
rank of the matrix)
V: m x r matrix (m terms, r concepts)
SVD - Properties
▪  ‘spectral decomposition’ of the matrix:

= u1

u2

x λ1

λ2

▪  ‘documents’, ‘terms’ and ‘concepts’:
§  U: document-to-concept similarity matrix
§  V: term-to-concept similarity matrix
§  Λ: its diagonal elements: ‘strength’ of each concept

x

v1
v2
SVD for Rating Prediction
§  User factor vectors
§  Baseline (bias)

and item-factors vectors
(user & item deviation

from average)

§  Predict rating as
§  SVD++ (Koren et. Al) asymmetric variation w.
implicit feedback

§  Where
§ 
are three item factor vectors
§  Users are not parametrized
Restricted Boltzmann Machines
§  Restrict the connectivity in ANN
to make learning easier.

§ 

Only one layer of hidden units.

§ 

§ 

Although multiple layers are
possible

No connections between
hidden units.

§  Hidden units are independent
given the visible states..
§  RBMs can be stacked to form
Deep Belief Networks (DBN) – 4th
generation of ANNs
What about the final prize ensembles?
▪  Our offline studies showed they were too
computationally intensive to scale
▪  Expected improvement not worth the
engineering effort

▪  Plus, focus had already shifted to other
issues that had more impact than rating
prediction...
Ranking

Ranking
Ranking
§  Ranking = Scoring + Sorting +
Filtering bags of movies for
presentation to a user

§  Factors
§  Accuracy
§  Novelty

Key algorithm, sorts titles in most
contexts

§  Diversity
§  Freshness

§  Goal: Find the best possible ordering of
a set of videos for a user within a
specific context in real-time

§  Scalability

§ 

§  Objective: maximize consumption &

“enjoyment”

§  …
Example: Two features, linear model

2	
  
3	
  
4	
  
5	
  

Popularity	
  

Linear	
  Model:	
  
frank(u,v)	
  =	
  w1	
  p(v)	
  +	
  w2	
  r(u,v)	
  +	
  b	
  

Final	
  Ranking	
  

Predicted	
  RaEng	
  

1	
  
Example: Two features, linear model

2	
  
3	
  
4	
  
5	
  

Popularity	
  

Final	
  Ranking	
  

Predicted	
  RaEng	
  

1	
  
Ranking
Learning to Rank
▪  Ranking is a very important problem in many contexts
(search, advertising, recommendations)
▪  Quality of ranking is measured using ranking metrics
such as NDCG, MRR, MAP, FPC…
▪  It is hard to optimize machine-learned models directly on
these measures
▪  They are not differentiable

▪  We would like to use the same measure for optimizing
and for the final evaluation
Learning to Rank Approaches
§  ML problem: construct ranking model from training data
1. 

Pointwise (Ordinal regression, Logistic regression, SVM, GBDT, …)
Loss function defined on individual relevance judgment

§ 

2. 

Pairwise (RankSVM, RankBoost, RankNet, FRank…)
§ 
§ 

3. 

Loss function defined on pair-wise preferences
Goal: minimize number of inversions in ranking

Listwise
§ 

Indirect Loss Function (RankCosine, ListNet…)

§ 

Directly optimize IR measures (NDCG, MRR, FCP…)
§ 

Genetic Programming or Simulated Annealing

§ 

Use boosting to optimize NDCG (Adarank)

§ 

Gradient descent on smoothed version (CLiMF, TFMAP, GAPfm @cikm13)

§ 

Iterative Coordinate Ascent (Direct Rank @kdd13)
Similarity
Similars
What is similarity?
§ 

Similarity can refer to different dimensions
§  Similar in metadata/tags
§  Similar in user play behavior
§  Similar in user rating behavior
§  …

§ 

You can learn a model for each of them and finally learn
an ensemble
Graph-based similarities

0.8
0.3
0.3
0.4
0.2

0.3
0.7
Example of graph-based similarity: SimRank
▪  SimRank (Jeh & Widom, 02): “two objects are
similar if they are referenced by similar objects.”
Final similarity ensemble
§ 

Come up with a score of play similarity, rating similarity,
tag-based similarity…

§ 

Combine them using an ensemble
§  Weights are learned using regression over existing response

§ 

The final concept of “similarity” responds to what users
vote as similar
Row Selection
Row	
  SelecEon	
  Inputs	
  
§  Visitor	
  data	
  
§  Video	
  history	
  
§  Queue	
  adds	
  
§  RaEngs	
  
§  Taste	
  preferences	
  
§  Predicted	
  raEngs	
  
§  PVR	
  ranking	
  
§  etc.	
  

§  Global data
§  Metadata
§  Popularity
§  etc.
Row	
  SelecEon	
  Core	
  Algorithm	
  
SelecEon	
  
constraints	
  
Candidate	
  
data	
  

Groupify	
  

Score/Select	
  
ParEal	
  
selecEon	
  

Group	
  
selecEon	
  
SelecEon	
  Constraints	
  
•  Many	
  business	
  rules	
  and	
  heurisEcs	
  
–  RelaEve	
  posiEon	
  of	
  groups	
  
–  Presence/absence	
  of	
  groups	
  
–  Deduping	
  rules	
  
–  Number	
  of	
  groups	
  of	
  a	
  given	
  type	
  

•  Determined	
  by	
  past	
  A/B	
  tests	
  
•  Evolved	
  over	
  Eme	
  from	
  a	
  simple	
  template	
  
GroupificaEon	
  
•  Ranking	
  of	
  Etles	
  according	
  to	
  
ranking	
  algo	
  
•  Filtering	
  
•  Deduping	
  
•  Forma^ng	
  

Candidate	
  
data	
  

•  SelecEon	
  of	
  evidence	
  
•  ...	
  

First	
  n	
  
selected	
  
groups	
  

Group	
  for	
  
posiEon	
  n
+1	
  
Scoring	
  and	
  SelecEon	
  
•  Features	
  
considered	
  

–  ARO	
  scorer	
  
–  Greedy	
  algorithm	
  

–  Quality	
  of	
  items	
  

–  Framework	
  supports	
  
other	
  methods	
  

–  Diversity	
  of	
  tags	
  
–  Quality	
  of	
  evidence	
  

§  Backtracking	
  

–  Freshness	
  

§  Approximate	
  integer	
  
programming	
  

–  Recency	
  
–  ...	
  

§  etc.	
  
	
  
Search Recommendations
Search Recommendation
CombinaEon	
  of	
  Different	
  models	
  
Play	
  Aaer	
  Search:	
  TransiEon	
  on	
  Play	
  aaer	
  Query	
  	
  
Play	
  Related	
  to	
  Search:	
  Similarity	
  between	
  user’s	
  (query,	
  
play)	
  	
  
…	
  

• 
• 
• 

Read this paper for a description of a similar system at LinkedIn
Unavailable Title Recommendations
Gamification Algorithms
rating gameGame
Rating
Mood Selection Algorithm
1.  Editorially selected moods
2.  Sort them according to users consumption
3.  Take into account freshness, diversity...
More data or
better models?
More data or better models?

Really?

Anand Rajaraman: Former Stanford Prof. &
Senior VP at Walmart
More data or better models?
Sometimes, it’s not
about more data
More data or better models?
[Banko and Brill, 2001]

Norvig: “Google does not
have better Algorithms,
only more Data”

Many features/
low-bias models
More data or better models?

Sometimes, it’s not
about more data
“Data without a sound approach = noise”
Conclusion
More data +
Smarter models +
More accurate metrics +
Better system architectures
Lots of room for improvement!
Xavier Amatriain (@xamat)
xavier@netflix.com

Thanks!

We’re hiring!

Contenu connexe

Tendances

Machine Learning to Grow the World's Knowledge
Machine Learning to Grow  the World's KnowledgeMachine Learning to Grow  the World's Knowledge
Machine Learning to Grow the World's KnowledgeXavier Amatriain
 
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Strata 2016 -  Lessons Learned from building real-life Machine Learning SystemsStrata 2016 -  Lessons Learned from building real-life Machine Learning Systems
Strata 2016 - Lessons Learned from building real-life Machine Learning SystemsXavier Amatriain
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialXavier Amatriain
 
MLConf Seattle 2015 - ML@Quora
MLConf Seattle 2015 - ML@QuoraMLConf Seattle 2015 - ML@Quora
MLConf Seattle 2015 - ML@QuoraXavier Amatriain
 
Machine learning the high interest credit card of technical debt [PWL]
Machine learning the high interest credit card of technical debt [PWL]Machine learning the high interest credit card of technical debt [PWL]
Machine learning the high interest credit card of technical debt [PWL]Jenia Gorokhovsky
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsAlejandro Bellogin
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectiveXavier Amatriain
 
Barcelona ML Meetup - Lessons Learned
Barcelona ML Meetup - Lessons LearnedBarcelona ML Meetup - Lessons Learned
Barcelona ML Meetup - Lessons LearnedXavier Amatriain
 
Recommendation at Netflix Scale
Recommendation at Netflix ScaleRecommendation at Netflix Scale
Recommendation at Netflix ScaleJustin Basilico
 
Recommending the world's knowledge
Recommending the world's knowledgeRecommending the world's knowledge
Recommending the world's knowledgeLei Yang
 
Recsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedXavier Amatriain
 
Machine Learning at Netflix Scale
Machine Learning at Netflix ScaleMachine Learning at Netflix Scale
Machine Learning at Netflix ScaleAish Fenton
 
Efficient Top-N Recommendation by Linear Regression
Efficient Top-N Recommendation by Linear RegressionEfficient Top-N Recommendation by Linear Regression
Efficient Top-N Recommendation by Linear RegressionMark Levy
 
Netflix Recommendations - Beyond the 5 Stars
Netflix Recommendations - Beyond the 5 StarsNetflix Recommendations - Beyond the 5 Stars
Netflix Recommendations - Beyond the 5 StarsXavier Amatriain
 
Recsys2016 Tutorial by Xavier and Deepak
Recsys2016 Tutorial by Xavier and DeepakRecsys2016 Tutorial by Xavier and Deepak
Recsys2016 Tutorial by Xavier and DeepakDeepak Agarwal
 
Aiinpractice2017deepaklongversion
Aiinpractice2017deepaklongversionAiinpractice2017deepaklongversion
Aiinpractice2017deepaklongversionDeepak Agarwal
 
Scaling Recommendations at Quora (RecSys talk 9/16/2016)
Scaling Recommendations at Quora (RecSys talk 9/16/2016)Scaling Recommendations at Quora (RecSys talk 9/16/2016)
Scaling Recommendations at Quora (RecSys talk 9/16/2016)Nikhil Dandekar
 
Recommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareRecommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareJustin Basilico
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Xavier Amatriain
 

Tendances (20)

Machine Learning to Grow the World's Knowledge
Machine Learning to Grow  the World's KnowledgeMachine Learning to Grow  the World's Knowledge
Machine Learning to Grow the World's Knowledge
 
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
Strata 2016 -  Lessons Learned from building real-life Machine Learning SystemsStrata 2016 -  Lessons Learned from building real-life Machine Learning Systems
Strata 2016 - Lessons Learned from building real-life Machine Learning Systems
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
 
MLConf Seattle 2015 - ML@Quora
MLConf Seattle 2015 - ML@QuoraMLConf Seattle 2015 - ML@Quora
MLConf Seattle 2015 - ML@Quora
 
Machine learning the high interest credit card of technical debt [PWL]
Machine learning the high interest credit card of technical debt [PWL]Machine learning the high interest credit card of technical debt [PWL]
Machine learning the high interest credit card of technical debt [PWL]
 
Replicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender SystemsReplicable Evaluation of Recommender Systems
Replicable Evaluation of Recommender Systems
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry Perspective
 
Barcelona ML Meetup - Lessons Learned
Barcelona ML Meetup - Lessons LearnedBarcelona ML Meetup - Lessons Learned
Barcelona ML Meetup - Lessons Learned
 
Recommendation at Netflix Scale
Recommendation at Netflix ScaleRecommendation at Netflix Scale
Recommendation at Netflix Scale
 
Recommending the world's knowledge
Recommending the world's knowledgeRecommending the world's knowledge
Recommending the world's knowledge
 
Recsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem RevisitedRecsys 2014 Tutorial - The Recommender Problem Revisited
Recsys 2014 Tutorial - The Recommender Problem Revisited
 
kdd2015
kdd2015kdd2015
kdd2015
 
Machine Learning at Netflix Scale
Machine Learning at Netflix ScaleMachine Learning at Netflix Scale
Machine Learning at Netflix Scale
 
Efficient Top-N Recommendation by Linear Regression
Efficient Top-N Recommendation by Linear RegressionEfficient Top-N Recommendation by Linear Regression
Efficient Top-N Recommendation by Linear Regression
 
Netflix Recommendations - Beyond the 5 Stars
Netflix Recommendations - Beyond the 5 StarsNetflix Recommendations - Beyond the 5 Stars
Netflix Recommendations - Beyond the 5 Stars
 
Recsys2016 Tutorial by Xavier and Deepak
Recsys2016 Tutorial by Xavier and DeepakRecsys2016 Tutorial by Xavier and Deepak
Recsys2016 Tutorial by Xavier and Deepak
 
Aiinpractice2017deepaklongversion
Aiinpractice2017deepaklongversionAiinpractice2017deepaklongversion
Aiinpractice2017deepaklongversion
 
Scaling Recommendations at Quora (RecSys talk 9/16/2016)
Scaling Recommendations at Quora (RecSys talk 9/16/2016)Scaling Recommendations at Quora (RecSys talk 9/16/2016)
Scaling Recommendations at Quora (RecSys talk 9/16/2016)
 
Recommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareRecommendations for Building Machine Learning Software
Recommendations for Building Machine Learning Software
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 

En vedette

The Top 10 Mistakes on Local Websites & How to Fix Them
The Top 10 Mistakes on Local Websites & How to Fix ThemThe Top 10 Mistakes on Local Websites & How to Fix Them
The Top 10 Mistakes on Local Websites & How to Fix ThemGreg Gifford
 
Social Media are serious or maybe #not
Social Media are serious or maybe #notSocial Media are serious or maybe #not
Social Media are serious or maybe #notSocialab
 
Рекоменд. письмо_МинСпорт
Рекоменд. письмо_МинСпортРекоменд. письмо_МинСпорт
Рекоменд. письмо_МинСпортNatalia Sidorkina
 
50 Culinary Hacks to make you a Kitchen Master
50 Culinary Hacks to make you a Kitchen Master50 Culinary Hacks to make you a Kitchen Master
50 Culinary Hacks to make you a Kitchen MasterSofia M
 
The Importance of Being Awesome: From Words and Pictures to Code and Cyborgs
The Importance of Being Awesome: From Words and Pictures to Code and CyborgsThe Importance of Being Awesome: From Words and Pictures to Code and Cyborgs
The Importance of Being Awesome: From Words and Pictures to Code and CyborgsTania Kasongo
 
Your Number One Fan: Turning Employees into Recruiters and Brand Ambassadors
Your Number One Fan: Turning Employees into Recruiters and Brand AmbassadorsYour Number One Fan: Turning Employees into Recruiters and Brand Ambassadors
Your Number One Fan: Turning Employees into Recruiters and Brand AmbassadorsLinkedIn Talent Solutions
 
Olympics team usa bsktball rio 2016 defenders
Olympics team usa bsktball rio 2016 defendersOlympics team usa bsktball rio 2016 defenders
Olympics team usa bsktball rio 2016 defendersAvi Dey
 
Hướng dẫn chuẩn bị cho kì thi JLPT 2014
Hướng dẫn chuẩn bị cho kì thi JLPT 2014Hướng dẫn chuẩn bị cho kì thi JLPT 2014
Hướng dẫn chuẩn bị cho kì thi JLPT 2014Mina Japanese Center
 
Using Social Media As A Takeoff Point For Campaigns
Using Social Media As A Takeoff Point For CampaignsUsing Social Media As A Takeoff Point For Campaigns
Using Social Media As A Takeoff Point For Campaignsixigo.com
 
Repurpose Your Career - 5 Steps to your Next Career Pivot
Repurpose Your Career - 5 Steps to your Next Career PivotRepurpose Your Career - 5 Steps to your Next Career Pivot
Repurpose Your Career - 5 Steps to your Next Career PivotCareer Pivot
 
Lire avec le Numérique
Lire avec le NumériqueLire avec le Numérique
Lire avec le NumériqueLaila Methnani
 

En vedette (19)

The Top 10 Mistakes on Local Websites & How to Fix Them
The Top 10 Mistakes on Local Websites & How to Fix ThemThe Top 10 Mistakes on Local Websites & How to Fix Them
The Top 10 Mistakes on Local Websites & How to Fix Them
 
Gesture drawing
Gesture drawingGesture drawing
Gesture drawing
 
Social Media are serious or maybe #not
Social Media are serious or maybe #notSocial Media are serious or maybe #not
Social Media are serious or maybe #not
 
Trucs et astuces de recherche: voir les mots-clés en contexte
Trucs et astuces de recherche: voir les mots-clés en contexteTrucs et astuces de recherche: voir les mots-clés en contexte
Trucs et astuces de recherche: voir les mots-clés en contexte
 
Рекоменд. письмо_МинСпорт
Рекоменд. письмо_МинСпортРекоменд. письмо_МинСпорт
Рекоменд. письмо_МинСпорт
 
50 Culinary Hacks to make you a Kitchen Master
50 Culinary Hacks to make you a Kitchen Master50 Culinary Hacks to make you a Kitchen Master
50 Culinary Hacks to make you a Kitchen Master
 
The Importance of Being Awesome: From Words and Pictures to Code and Cyborgs
The Importance of Being Awesome: From Words and Pictures to Code and CyborgsThe Importance of Being Awesome: From Words and Pictures to Code and Cyborgs
The Importance of Being Awesome: From Words and Pictures to Code and Cyborgs
 
Professor Joanne Goodell presents Performance Based Funding
Professor Joanne Goodell presents Performance Based FundingProfessor Joanne Goodell presents Performance Based Funding
Professor Joanne Goodell presents Performance Based Funding
 
Intro
IntroIntro
Intro
 
Your Number One Fan: Turning Employees into Recruiters and Brand Ambassadors
Your Number One Fan: Turning Employees into Recruiters and Brand AmbassadorsYour Number One Fan: Turning Employees into Recruiters and Brand Ambassadors
Your Number One Fan: Turning Employees into Recruiters and Brand Ambassadors
 
Olympics team usa bsktball rio 2016 defenders
Olympics team usa bsktball rio 2016 defendersOlympics team usa bsktball rio 2016 defenders
Olympics team usa bsktball rio 2016 defenders
 
Hướng dẫn chuẩn bị cho kì thi JLPT 2014
Hướng dẫn chuẩn bị cho kì thi JLPT 2014Hướng dẫn chuẩn bị cho kì thi JLPT 2014
Hướng dẫn chuẩn bị cho kì thi JLPT 2014
 
SEO & Social Media Explained
SEO & Social Media ExplainedSEO & Social Media Explained
SEO & Social Media Explained
 
Introduction to CQRS
Introduction to CQRSIntroduction to CQRS
Introduction to CQRS
 
Conozca la bvc
Conozca la bvcConozca la bvc
Conozca la bvc
 
El diario
El diarioEl diario
El diario
 
Using Social Media As A Takeoff Point For Campaigns
Using Social Media As A Takeoff Point For CampaignsUsing Social Media As A Takeoff Point For Campaigns
Using Social Media As A Takeoff Point For Campaigns
 
Repurpose Your Career - 5 Steps to your Next Career Pivot
Repurpose Your Career - 5 Steps to your Next Career PivotRepurpose Your Career - 5 Steps to your Next Career Pivot
Repurpose Your Career - 5 Steps to your Next Career Pivot
 
Lire avec le Numérique
Lire avec le NumériqueLire avec le Numérique
Lire avec le Numérique
 

Similaire à MLConf - Emmys, Oscars & Machine Learning Algorithms at Netflix

acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxdongchangim30
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxElasticsearch
 
Dataiku at SF DataMining Meetup - Kaggle Yandex Challenge
Dataiku at SF DataMining Meetup - Kaggle Yandex ChallengeDataiku at SF DataMining Meetup - Kaggle Yandex Challenge
Dataiku at SF DataMining Meetup - Kaggle Yandex ChallengeDataiku
 
Rokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxRokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxJadna Almeida
 
Rokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxRokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxJadna Almeida
 
Webpage Personalization and User Profiling
Webpage Personalization and User ProfilingWebpage Personalization and User Profiling
Webpage Personalization and User Profilingyingfeng
 
Download
DownloadDownload
Downloadbutest
 
Download
DownloadDownload
Downloadbutest
 
Recommender systems in practice
Recommender systems in practiceRecommender systems in practice
Recommender systems in practiceBigData Republic
 
Buidling large scale recommendation engine
Buidling large scale recommendation engineBuidling large scale recommendation engine
Buidling large scale recommendation engineKeeyong Han
 
Nose Dive into Apache Spark ML
Nose Dive into Apache Spark MLNose Dive into Apache Spark ML
Nose Dive into Apache Spark MLAhmet Bulut
 
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality EvaluationAlessandro Benedetti
 
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...OpenSource Connections
 
Florian Douetteau @ Dataiku
Florian Douetteau @ DataikuFlorian Douetteau @ Dataiku
Florian Douetteau @ DataikuPAPIs.io
 
Personalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsPersonalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsJustin Basilico
 
Rated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality EvaluationSease
 
Lucene/Solr Revolution 2015: Where Search Meets Machine Learning
Lucene/Solr Revolution 2015: Where Search Meets Machine LearningLucene/Solr Revolution 2015: Where Search Meets Machine Learning
Lucene/Solr Revolution 2015: Where Search Meets Machine LearningJoaquin Delgado PhD.
 
Lucene/Solr Revolution 2015: Where Search Meets Machine Learning
Lucene/Solr Revolution 2015: Where Search Meets Machine LearningLucene/Solr Revolution 2015: Where Search Meets Machine Learning
Lucene/Solr Revolution 2015: Where Search Meets Machine LearningS. Diana Hu
 

Similaire à MLConf - Emmys, Oscars & Machine Learning Algorithms at Netflix (20)

acmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptxacmsigtalkshare-121023190142-phpapp01.pptx
acmsigtalkshare-121023190142-phpapp01.pptx
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolbox
 
Dataiku at SF DataMining Meetup - Kaggle Yandex Challenge
Dataiku at SF DataMining Meetup - Kaggle Yandex ChallengeDataiku at SF DataMining Meetup - Kaggle Yandex Challenge
Dataiku at SF DataMining Meetup - Kaggle Yandex Challenge
 
Rokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptxRokach-GomaxSlides.pptx
Rokach-GomaxSlides.pptx
 
Rokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptxRokach-GomaxSlides (1).pptx
Rokach-GomaxSlides (1).pptx
 
Webpage Personalization and User Profiling
Webpage Personalization and User ProfilingWebpage Personalization and User Profiling
Webpage Personalization and User Profiling
 
Download
DownloadDownload
Download
 
Download
DownloadDownload
Download
 
Recommender systems in practice
Recommender systems in practiceRecommender systems in practice
Recommender systems in practice
 
Buidling large scale recommendation engine
Buidling large scale recommendation engineBuidling large scale recommendation engine
Buidling large scale recommendation engine
 
Weka bike rental
Weka bike rentalWeka bike rental
Weka bike rental
 
JavaScript and Artificial Intelligence by Aatman & Sagar - AhmedabadJS
JavaScript and Artificial Intelligence by Aatman & Sagar - AhmedabadJSJavaScript and Artificial Intelligence by Aatman & Sagar - AhmedabadJS
JavaScript and Artificial Intelligence by Aatman & Sagar - AhmedabadJS
 
Nose Dive into Apache Spark ML
Nose Dive into Apache Spark MLNose Dive into Apache Spark ML
Nose Dive into Apache Spark ML
 
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
 
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
 
Florian Douetteau @ Dataiku
Florian Douetteau @ DataikuFlorian Douetteau @ Dataiku
Florian Douetteau @ Dataiku
 
Personalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing RecommendationsPersonalized Page Generation for Browsing Recommendations
Personalized Page Generation for Browsing Recommendations
 
Rated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
 
Lucene/Solr Revolution 2015: Where Search Meets Machine Learning
Lucene/Solr Revolution 2015: Where Search Meets Machine LearningLucene/Solr Revolution 2015: Where Search Meets Machine Learning
Lucene/Solr Revolution 2015: Where Search Meets Machine Learning
 
Lucene/Solr Revolution 2015: Where Search Meets Machine Learning
Lucene/Solr Revolution 2015: Where Search Meets Machine LearningLucene/Solr Revolution 2015: Where Search Meets Machine Learning
Lucene/Solr Revolution 2015: Where Search Meets Machine Learning
 

Plus de Xavier Amatriain

Data/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthData/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthXavier Amatriain
 
AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19Xavier Amatriain
 
AI for COVID-19 - Q42020 update
AI for COVID-19 - Q42020 updateAI for COVID-19 - Q42020 update
AI for COVID-19 - Q42020 updateXavier Amatriain
 
AI for COVID-19: An online virtual care approach
AI for COVID-19: An online virtual care approachAI for COVID-19: An online virtual care approach
AI for COVID-19: An online virtual care approachXavier Amatriain
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsXavier Amatriain
 
AI for healthcare: Scaling Access and Quality of Care for Everyone
AI for healthcare: Scaling Access and Quality of Care for EveryoneAI for healthcare: Scaling Access and Quality of Care for Everyone
AI for healthcare: Scaling Access and Quality of Care for EveryoneXavier Amatriain
 
Towards online universal quality healthcare through AI
Towards online universal quality healthcare through AITowards online universal quality healthcare through AI
Towards online universal quality healthcare through AIXavier Amatriain
 
From one to zero: Going smaller as a growth strategy
From one to zero: Going smaller as a growth strategyFrom one to zero: Going smaller as a growth strategy
From one to zero: Going smaller as a growth strategyXavier Amatriain
 
Learning to speak medicine
Learning to speak medicineLearning to speak medicine
Learning to speak medicineXavier Amatriain
 
Medical advice as a Recommender System
Medical advice as a Recommender SystemMedical advice as a Recommender System
Medical advice as a Recommender SystemXavier Amatriain
 
Machine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleMachine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleXavier Amatriain
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain
 
10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConfXavier Amatriain
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systemsXavier Amatriain
 
Lean DevOps - Lessons Learned from Innovation-driven Companies
Lean DevOps - Lessons Learned from Innovation-driven CompaniesLean DevOps - Lessons Learned from Innovation-driven Companies
Lean DevOps - Lessons Learned from Innovation-driven CompaniesXavier Amatriain
 
10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning SystemsXavier Amatriain
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisitedXavier Amatriain
 

Plus de Xavier Amatriain (18)

Data/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealthData/AI driven product development: from video streaming to telehealth
Data/AI driven product development: from video streaming to telehealth
 
AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19
 
AI for COVID-19 - Q42020 update
AI for COVID-19 - Q42020 updateAI for COVID-19 - Q42020 update
AI for COVID-19 - Q42020 update
 
AI for COVID-19: An online virtual care approach
AI for COVID-19: An online virtual care approachAI for COVID-19: An online virtual care approach
AI for COVID-19: An online virtual care approach
 
Lessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systemsLessons learned from building practical deep learning systems
Lessons learned from building practical deep learning systems
 
AI for healthcare: Scaling Access and Quality of Care for Everyone
AI for healthcare: Scaling Access and Quality of Care for EveryoneAI for healthcare: Scaling Access and Quality of Care for Everyone
AI for healthcare: Scaling Access and Quality of Care for Everyone
 
Towards online universal quality healthcare through AI
Towards online universal quality healthcare through AITowards online universal quality healthcare through AI
Towards online universal quality healthcare through AI
 
From one to zero: Going smaller as a growth strategy
From one to zero: Going smaller as a growth strategyFrom one to zero: Going smaller as a growth strategy
From one to zero: Going smaller as a growth strategy
 
Learning to speak medicine
Learning to speak medicineLearning to speak medicine
Learning to speak medicine
 
ML to cure the world
ML to cure the worldML to cure the world
ML to cure the world
 
Medical advice as a Recommender System
Medical advice as a Recommender SystemMedical advice as a Recommender System
Medical advice as a Recommender System
 
Machine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora ExampleMachine Learning for Q&A Sites: The Quora Example
Machine Learning for Q&A Sites: The Quora Example
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf10 more lessons learned from building Machine Learning systems - MLConf
10 more lessons learned from building Machine Learning systems - MLConf
 
10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems10 more lessons learned from building Machine Learning systems
10 more lessons learned from building Machine Learning systems
 
Lean DevOps - Lessons Learned from Innovation-driven Companies
Lean DevOps - Lessons Learned from Innovation-driven CompaniesLean DevOps - Lessons Learned from Innovation-driven Companies
Lean DevOps - Lessons Learned from Innovation-driven Companies
 
10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems10 Lessons Learned from Building Machine Learning Systems
10 Lessons Learned from Building Machine Learning Systems
 
Kdd 2014 Tutorial - the recommender problem revisited
Kdd 2014 Tutorial -  the recommender problem revisitedKdd 2014 Tutorial -  the recommender problem revisited
Kdd 2014 Tutorial - the recommender problem revisited
 

Dernier

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Dernier (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

MLConf - Emmys, Oscars & Machine Learning Algorithms at Netflix

  • 1. Emmys, Oscars, and Machine Learning Algorithms @ Netflix Scale November,  2013     Xavier  Amatriain   Director  -­‐  Algorithms  Engineering  -­‐  Ne<lix   @xamat  
  • 2. “In a simple Netlfix-style item recommender, we would simply apply some form of matrix factorization (i.e NMF)”
  • 3. What we were interested in: §  High quality recommendations Proxy question: §  Accuracy in predicted rating §  Improve by 10% = $1million!
  • 4. From the Netflix Prize to today 2006 2013
  • 5. Big Data @Netflix §  > 40M subscribers §  Ratings: ~5M/day Time §  Searches: >3M/day Geo-information §  Plays: > 50M/day Impressions Device Info §  Streamed hours: Metadata Social Ratings o  5B hours in Q3 2013 Member Behavior Demographics
  • 6. Smart Models §  Regression models (Logistic, Linear, Elastic nets) §  SVD & other MF models §  Factorization Machines §  Restricted Boltzmann Machines §  Markov Chains & other graph models §  Clustering (from k-means to HDP) §  Deep ANN §  LDA §  Association Rules §  GBDT/RF §  …
  • 9. 2007 Progress Prize ▪  Top 2 algorithms ▪  SVD - Prize RMSE: 0.8914 ▪  RBM - Prize RMSE: 0.8990 ▪  Linear blend Prize RMSE: 0.88 ▪  Currently in use as part of Netflix’ rating prediction component ▪  Limitations ▪  Designed for 100M ratings, we have 5B ratings ▪  Not adaptable as users add ratings ▪  Performance issues
  • 10. SVD - Definition A[n x m] = U[n x r] Λ [ r x r] (V[m x r])T ▪  A: n x m matrix (e.g., n documents, m terms) ▪  U: n x r matrix (n documents, r concepts) ▪  ▪  Λ: r x r diagonal matrix (strength of each ‘concept’) (r: rank of the matrix) V: m x r matrix (m terms, r concepts)
  • 11. SVD - Properties ▪  ‘spectral decomposition’ of the matrix: = u1 u2 x λ1 λ2 ▪  ‘documents’, ‘terms’ and ‘concepts’: §  U: document-to-concept similarity matrix §  V: term-to-concept similarity matrix §  Λ: its diagonal elements: ‘strength’ of each concept x v1 v2
  • 12. SVD for Rating Prediction §  User factor vectors §  Baseline (bias) and item-factors vectors (user & item deviation from average) §  Predict rating as §  SVD++ (Koren et. Al) asymmetric variation w. implicit feedback §  Where §  are three item factor vectors §  Users are not parametrized
  • 13. Restricted Boltzmann Machines §  Restrict the connectivity in ANN to make learning easier. §  Only one layer of hidden units. §  §  Although multiple layers are possible No connections between hidden units. §  Hidden units are independent given the visible states.. §  RBMs can be stacked to form Deep Belief Networks (DBN) – 4th generation of ANNs
  • 14. What about the final prize ensembles? ▪  Our offline studies showed they were too computationally intensive to scale ▪  Expected improvement not worth the engineering effort ▪  Plus, focus had already shifted to other issues that had more impact than rating prediction...
  • 16. Ranking §  Ranking = Scoring + Sorting + Filtering bags of movies for presentation to a user §  Factors §  Accuracy §  Novelty Key algorithm, sorts titles in most contexts §  Diversity §  Freshness §  Goal: Find the best possible ordering of a set of videos for a user within a specific context in real-time §  Scalability §  §  Objective: maximize consumption & “enjoyment” §  …
  • 17. Example: Two features, linear model 2   3   4   5   Popularity   Linear  Model:   frank(u,v)  =  w1  p(v)  +  w2  r(u,v)  +  b   Final  Ranking   Predicted  RaEng   1  
  • 18. Example: Two features, linear model 2   3   4   5   Popularity   Final  Ranking   Predicted  RaEng   1  
  • 20. Learning to Rank ▪  Ranking is a very important problem in many contexts (search, advertising, recommendations) ▪  Quality of ranking is measured using ranking metrics such as NDCG, MRR, MAP, FPC… ▪  It is hard to optimize machine-learned models directly on these measures ▪  They are not differentiable ▪  We would like to use the same measure for optimizing and for the final evaluation
  • 21. Learning to Rank Approaches §  ML problem: construct ranking model from training data 1.  Pointwise (Ordinal regression, Logistic regression, SVM, GBDT, …) Loss function defined on individual relevance judgment §  2.  Pairwise (RankSVM, RankBoost, RankNet, FRank…) §  §  3.  Loss function defined on pair-wise preferences Goal: minimize number of inversions in ranking Listwise §  Indirect Loss Function (RankCosine, ListNet…) §  Directly optimize IR measures (NDCG, MRR, FCP…) §  Genetic Programming or Simulated Annealing §  Use boosting to optimize NDCG (Adarank) §  Gradient descent on smoothed version (CLiMF, TFMAP, GAPfm @cikm13) §  Iterative Coordinate Ascent (Direct Rank @kdd13)
  • 24. What is similarity? §  Similarity can refer to different dimensions §  Similar in metadata/tags §  Similar in user play behavior §  Similar in user rating behavior §  … §  You can learn a model for each of them and finally learn an ensemble
  • 26. Example of graph-based similarity: SimRank ▪  SimRank (Jeh & Widom, 02): “two objects are similar if they are referenced by similar objects.”
  • 27. Final similarity ensemble §  Come up with a score of play similarity, rating similarity, tag-based similarity… §  Combine them using an ensemble §  Weights are learned using regression over existing response §  The final concept of “similarity” responds to what users vote as similar
  • 29. Row  SelecEon  Inputs   §  Visitor  data   §  Video  history   §  Queue  adds   §  RaEngs   §  Taste  preferences   §  Predicted  raEngs   §  PVR  ranking   §  etc.   §  Global data §  Metadata §  Popularity §  etc.
  • 30. Row  SelecEon  Core  Algorithm   SelecEon   constraints   Candidate   data   Groupify   Score/Select   ParEal   selecEon   Group   selecEon  
  • 31. SelecEon  Constraints   •  Many  business  rules  and  heurisEcs   –  RelaEve  posiEon  of  groups   –  Presence/absence  of  groups   –  Deduping  rules   –  Number  of  groups  of  a  given  type   •  Determined  by  past  A/B  tests   •  Evolved  over  Eme  from  a  simple  template  
  • 32. GroupificaEon   •  Ranking  of  Etles  according  to   ranking  algo   •  Filtering   •  Deduping   •  Forma^ng   Candidate   data   •  SelecEon  of  evidence   •  ...   First  n   selected   groups   Group  for   posiEon  n +1  
  • 33. Scoring  and  SelecEon   •  Features   considered   –  ARO  scorer   –  Greedy  algorithm   –  Quality  of  items   –  Framework  supports   other  methods   –  Diversity  of  tags   –  Quality  of  evidence   §  Backtracking   –  Freshness   §  Approximate  integer   programming   –  Recency   –  ...   §  etc.    
  • 35. Search Recommendation CombinaEon  of  Different  models   Play  Aaer  Search:  TransiEon  on  Play  aaer  Query     Play  Related  to  Search:  Similarity  between  user’s  (query,   play)     …   •  •  •  Read this paper for a description of a similar system at LinkedIn
  • 38. rating gameGame Rating Mood Selection Algorithm 1.  Editorially selected moods 2.  Sort them according to users consumption 3.  Take into account freshness, diversity...
  • 40. More data or better models? Really? Anand Rajaraman: Former Stanford Prof. & Senior VP at Walmart
  • 41. More data or better models? Sometimes, it’s not about more data
  • 42. More data or better models? [Banko and Brill, 2001] Norvig: “Google does not have better Algorithms, only more Data” Many features/ low-bias models
  • 43. More data or better models? Sometimes, it’s not about more data
  • 44. “Data without a sound approach = noise”
  • 46. More data + Smarter models + More accurate metrics + Better system architectures Lots of room for improvement!