Recommending for the World

The Netflix experience is driven by a number of machine learning algorithms: personalized ranking, page generation, search, similarity, ratings, etc. On January 6, 2016, we simultaneously launched Netflix in 130 new countries around the world, bringing the total to over 190 countries. Preparing for such a rapid expansion while ensuring each algorithm was ready to work seamlessly created new challenges for our recommendation and search teams. In this post, we highlight the four most interesting challenges we’ve encountered in making our algorithms operate globally and, most importantly, how this improved our ability to connect members worldwide with stories they'll love.

Recommending for the World
Yves Raimond (@moustaki)
03/16
Research/Engineering Manager
Search & Recommendations Algorithm Engineering
Netflix

Netflix scale
● > 75M members
● > 190 countries
● > 3.7B hours of content streamed every month
● > 1000 device types
● 36% of peak US downstream traffic

Goal
Help members find content to watch and enjoy
to maximize satisfaction and retention

Models & Algorithms

Going global
● How do we make sure all these algorithms are ready to work on a global scale?
● This led us to investigate many challenges and to roll out many new algorithms over the last year
○ Tech blog post
○ Company blog post

What would have happened if the two
videos were available to the same
members?

[Figure: US and FR audiences for two videos: 1,000 users, 100 users, 100,000 users, 10 users]

[Figure: US and FR availability timeline, 2016-01-01 to 2016-01-02, with a "Newly available" marker]

What would have happened if the two
videos were available to the same
members for the same amount of time?
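
One way to read the counterfactual question above is to count co-plays only among members who could have played both videos. The following is a minimal sketch under that assumption, not the method used in the talk; the function names, member ids and sets are hypothetical.

```python
def raw_cooccurrence(plays, video_a, video_b, all_members):
    """Share of all members who played both videos, ignoring availability."""
    played_both = plays[video_a] & plays[video_b]
    return len(played_both) / len(all_members)


def availability_corrected_cooccurrence(plays, availability, video_a, video_b):
    """Share of members who played both videos, among members who had both available."""
    both_available = availability[video_a] & availability[video_b]
    if not both_available:
        return 0.0  # no member could have played both, so there is no evidence
    played_both = plays[video_a] & plays[video_b] & both_available
    return len(played_both) / len(both_available)


# Hypothetical data: video "A" is in the US and FR catalogs, video "B" only in FR.
all_members = {"us1", "us2", "us3", "us4", "fr1", "fr2"}
availability = {"A": set(all_members), "B": {"fr1", "fr2"}}
plays = {"A": {"us1", "us2", "fr1"}, "B": {"fr1"}}

raw = raw_cooccurrence(plays, "A", "B", all_members)
corrected = availability_corrected_cooccurrence(plays, availability, "A", "B")
```

With these made-up sets the raw share is 1/6 while the corrected share is 1/2, because US members never had the chance to play "B". The same idea could be extended to time by weighting each member by the number of days both videos were available to them.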

1) Similar users, in two different countries. Should they get similar
recommendations?

2) Overall, should recommendations be different for users in Japan vs users
in Argentina? What about new users?

Regional models
Group countries into regions, and train individual models on each region.
Pros
● Easy!
● Catalog can be constrained to be relatively uniform
● Solves question 2
Cons
● Doesn’t solve question 1
● How to define groupings?
● The number of models to maintain multiplies: algorithms × A/B model variants × regions
● Biggest country in the region will dominate
● Sparsity

Sparsity and global models
Only a small fraction of users from all countries would be interested in these titles. Models trained locally perform poorly (lack of data). Pooling data from all countries discovers a worldwide community of interest, making recommendations better for these users.
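
The sparsity argument can be illustrated with a small sketch. It assumes a hypothetical minimum-support threshold below which an estimate is treated as unreliable; the counts and the threshold are invented for illustration.

```python
MIN_SUPPORT = 50  # hypothetical minimum number of co-plays for a stable estimate

# Hypothetical co-play counts for one pair of niche titles, per country.
co_plays_by_country = {"US": 30, "FR": 12, "JP": 25, "BR": 8}


def has_enough_data(count, min_support=MIN_SUPPORT):
    """True when the count reaches the minimum support threshold."""
    return count >= min_support


# Each country alone is below the threshold...
local_ok = {country: has_enough_data(n) for country, n in co_plays_by_country.items()}

# ...but the pooled count across all countries is above it.
pooled_count = sum(co_plays_by_country.values())
pooled_ok = has_enough_data(pooled_count)
```

In this toy example no single country reaches the threshold, while the pooled count of 75 does, which is the effect the slide attributes to pooling data across countries.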

Local taste vs personal taste
● Personal taste benefits from global algorithms
○ Taste patterns travel globally
● Local taste still needs to be taken into account to answer question 2
● Incorporate signals and priors capturing local taste patterns (e.g. country and language)
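
One simple way to combine a global personalized score with a local prior is shrinkage: lean on the country-level prior when a member has little history and on the personal score as history accumulates. This is a sketch of that general idea, not the model described in the talk; `blended_score`, `prior_strength` and the example values are hypothetical.

```python
def blended_score(personal_score, country_prior, n_interactions, prior_strength=10.0):
    """Weighted mix of a personal score and a country-level prior.

    The personal score gets weight n / (n + prior_strength), so a brand-new
    member (n = 0) is scored entirely by the local prior.
    """
    weight = n_interactions / (n_interactions + prior_strength)
    return weight * personal_score + (1.0 - weight) * country_prior


# Hypothetical values: a title the global model scores 0.8 for this member,
# with a country-level prior of 0.2.
new_member = blended_score(0.8, 0.2, n_interactions=0)
active_member = blended_score(0.8, 0.2, n_interactions=90)
```

Here the new member receives the country prior (0.2), while the member with 90 interactions receives a score close to the personal one (0.74).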

Instant search
● Ranking entities for partial queries
● Optimizing for the minimum number of interactions needed to find something
● Different languages involve very different interaction patterns
● How to automatically detect and adapt to such patterns in newly introduced languages?
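
Ranking entities for partial queries can be sketched with a prefix index that returns the most popular matches first. This is a minimal illustration only: the titles, popularity scores and function names are hypothetical, and a production system would rank on far richer signals.

```python
from collections import defaultdict


def build_prefix_index(popularity):
    """Map every lowercase prefix of each title to titles in descending popularity."""
    index = defaultdict(list)
    for title in sorted(popularity, key=popularity.get, reverse=True):
        key = title.lower()
        for end in range(1, len(key) + 1):
            index[key[:end]].append(title)
    return dict(index)


def suggest(index, partial_query, limit=3):
    """Return up to `limit` titles whose name starts with the partial query."""
    return index.get(partial_query.lower(), [])[:limit]


# Hypothetical catalog with made-up popularity scores.
catalog = {"Star Voyage": 0.9, "Stone Garden": 0.5, "Starlight": 0.7}
index = build_prefix_index(catalog)

two_chars = suggest(index, "st")
three_chars = suggest(index, "sta")
```

Each extra character narrows the candidate list, so a better ranking for short prefixes directly reduces the number of interactions a member needs.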

Hangul alphabet: a 3-syllable query requires 7 (2 + 3 + 2) interactions
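
The gap between syllables and interactions can be approximated by decomposing Hangul syllables into their jamo with Unicode NFD normalization. The sketch below uses a hypothetical example word (not necessarily the one on the slide), and the jamo count is only a proxy for key presses, since some compound jamo take more than one key press depending on the input method.

```python
import unicodedata


def jamo_per_syllable(text):
    """Number of conjoining jamo in each character after NFD decomposition."""
    return [len(unicodedata.normalize("NFD", ch)) for ch in text]


def jamo_prefix_match(title, partial_query):
    """True when the decomposed title starts with the decomposed partial query."""
    decomposed_title = unicodedata.normalize("NFD", title)
    decomposed_query = unicodedata.normalize("NFD", partial_query)
    return decomposed_title.startswith(decomposed_query)


# Hypothetical 3-syllable query, written with escapes to guarantee precomposed input.
query = "\uc0ac\ub791\ud574"   # three precomposed Hangul syllables
partial = "\uc0ac\ub77c"       # what is on screen after 4 of the 7 jamo are typed

counts = jamo_per_syllable(query)                     # jamo per syllable
syllable_match = query.startswith(partial)            # comparison on whole syllables
jamo_match = jamo_prefix_match(query, partial)        # comparison on jamo
```

For this word the per-syllable counts are 2, 3 and 2, and matching on jamo finds the title from a partially typed second syllable where matching on whole syllables does not. A limitation of this sketch: a consonant that the input method temporarily attaches to the previous syllable as a final consonant decomposes to a different code point than the same consonant in leading position, so a real system would need to handle that case.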

Language & Recommendations
[Figure: US, US/AU, FR]

Tracking quality
● Objective: build algorithms that work equally well for all our members
● Looking at global metrics might hide issues with small subsets of members
● How to identify sub-optimality for a subset of our members?
○ Language, country, device, …
○ Slicing on all dimensions leads to sparsity and noisiness
○ Automatically group observations in order to automatically detect outliers
● Metrics, instrumentation and monitoring
○ Detect problems
○ Highlight areas of improvement
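
A simple form of slice-level monitoring is to compare each slice's success rate with the global rate and flag slices that fall far below it, skipping slices that are too small to be trusted. This is a sketch under those assumptions rather than the monitoring system from the talk; the slice names, counts, thresholds and `flag_underperforming_slices` are hypothetical.

```python
import math


def flag_underperforming_slices(slices, min_samples=100, z_threshold=3.0):
    """Return slice names whose success rate is far below the global rate.

    `slices` maps a slice name to (successes, trials). Slices with fewer than
    `min_samples` trials are skipped as too noisy to judge.
    """
    total_successes = sum(s for s, _ in slices.values())
    total_trials = sum(n for _, n in slices.values())
    if total_trials == 0:
        return []
    global_rate = total_successes / total_trials

    flagged = []
    for name, (successes, trials) in slices.items():
        if trials < min_samples:
            continue
        # Standard error of a proportion under the global rate.
        std_err = math.sqrt(global_rate * (1.0 - global_rate) / trials)
        if std_err == 0.0:
            continue
        z = (successes / trials - global_rate) / std_err
        if z < -z_threshold:
            flagged.append(name)
    return flagged


# Hypothetical (successes, trials) per country/device slice.
observed = {
    "US/tv": (5000, 10000),
    "FR/tv": (480, 1000),
    "KR/mobile": (300, 1000),
    "AR/web": (3, 10),  # too few trials to evaluate
}
flagged = flag_underperforming_slices(observed)
```

In this example the global rate hides the "KR/mobile" slice, which is flagged, while "AR/web" is skipped for lack of data. Grouping sparse slices before testing, as the slide suggests, would be a natural extension.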

Conclusion
● Catalog differences, cultural awareness, language and metrics
● Worldwide communities of interest for better recommendations
○ Thinking globally actually led us to test and release better algorithms
○ But we also need to capture signals and priors related to cultural preferences
● Quickly finding entities in any language
● Detecting issues at a finer grain
● … Still a lot of work to do!
○ Better global algorithms… (Now that we have data)
○ Better cultural/language awareness
○ Better user and item cold start
○ Reactiveness
○ Better algorithms for anomaly detection

Recommending for the World

3. Some background

8. Recommendations @ Netflix

22. Challenge 1: Uneven Video Availability

23-28. [Figures: US and FR audiences (1,000 users, 100 users); co-occurrences; R ≈ UM]

34. Challenge 2: Cultural Awareness

35. Two questions

40. Global communities - Anime

41. Global communities - Bollywood

44. Challenge 3: Language

47. One interaction

49. Challenge 4: Does it even work?

53. Questions?