SlideShare une entreprise Scribd logo
1  sur  39
Recommender System
    Introduction
  xiangliang@hulu.com
What is good recommender
         system?
Outline
• What is recommender system?
  – Mission
  – History
  – Problems
• What is good recommender system?
  – Experiment Methods
  – Evaluation Metric
Information Overload
How to solve information overload
• Catalog
  – Yahoo, DMOZ


• Search Engine
  – Google, Bing
Mission
• Help user find item of their interest.
• Help item provider deliver their item to
  right user.
• Help website improve user engagement.
Recommender
   System
Search Engine vs. Recommender
               System
• User will try search engine if
  – they have specific needs
  – they can use keywords to describe needs
• User will try recommender system if
  – they do not know what they want now
  – they can not use keywords to describe needs
History: Before 1992
• Content Filtering
  – An architecture for large scale information
    systems [1985] (Gifford, D.K)
  – MAFIA: An active mail-filter agent for an
    intelligent document processing support [1990]
    (Lutz, E.)
  – A rule-based message filtering system [1988]
    (Pollock, S. )
History: 1992-1998
• Tapestry by Xerox Palo Alto [1992]
  – First system designed by collaborative filtering
• Grouplens [1994]
  – First recommender system using rating data
• Movielens [1997]
  – First movie recommender system
  – Provide well-known dataset for researchers
History: 1992-1998
• Fab : content-based collaborative
  recommendation
  – First unified recommender system
• Empirical Analysis of Predictive Algorithms
  for Collaborative Filtering [1998] (John S.
  Breese)
  – Systematically evaluate user-based
    collaborative filtering
History: 1999-2005
• Amazon proposed item-based collaborative
  filtering (Patent is filed in 1998 and issued
  in 2001) [link]
• Thomas Hofmann proposed pLSA [1999]
  and apply similar method on collaborative
  filtering [2004]
• Pandora began music genome project
  [2000]
History: 1999-2005
• Lastfm using Audioscrobbler to generate
  user taste profile on musics.
• Evaluating collaborative filtering
  recommender systems [2004] (Jonathan L.
  Herlocker)
History: 2005-2009
• Toward the Next Generation of
  Recommender Systems: A Survey of the
  State-of-the-Art and Possible Extensions.
  [2005] (Alexander Tuzhilin)
• Netflix Prize [link]
  – Latent Factor Model (SVD, RSVD, NSVD, SVD++)
  – Temporal Dynamic Collaborative Filtering
  – Yehuda Koren [link]’s team get prize
History: 2005-2009
• ACM Conference on Recommender System
  [2007] (Minneapolis, Minnesota, USA)
• Digg, Youtube try recommender system.
History: 2010-now
•   Context-Aware Recommender Systems
•   Music Recommendation and Discovery
•   Recommender Systems and the Social Web
•   Information Heterogeneity and Fusion in
    Recommender Systems
•   Human Decision Making in Recommender Systems
•   Personalization in Mobile Applications
•   Novelty and Diversity in Recommender Systems
•   User-Centric Evaluation
History: 2010-now
• Facebook launches instant personalization
  [2010]
  – Clicker
  – Bing
  – Trip Advisor
  – Rotten Tomatoes
  – Pandora
  – ……
Problems
• Main Problems
  – Top-N Recommendation
  – Rating Prediction
Problems
• Top-N Recommendation
  – Input
        user      item
            A      a
            B      a
            B      b
            …      …

  – Output
Problems
• Top-N Recommendation
  – Input
        user      item     rating
            A      a
            B      a
            B      b
            …      …         …

  – Output
What is good recommender
         system?
Experiment Methods
• Offline Experiment
• User Survey
• Online Experiment
  – AB Testing
Experiment Methods
• Offline Experiment

                    DataSet




                Train                    Test


  • Advantage:
       • Only rely on dataset
       •
  • Disadvantage:
       • Offline metric can not reflect business goal
Experiment Methods
• User Survey
  – Advantage:
    • Can get subjective metrics
    • Lower risk than online testing
  – Disadvantage:
    • Higher cost than offline experiments
    • Some results may not have statistical significance
    • Users may have different behaviors under testing
      environment or real environment
    • It’s difficult to design double blink experiments.
Experiment Methods
• On line experiments (AB Testing)
  – Advantage:
     • Can get metrics related to business goal
  – Disadvantage:
     • High risk/cost
     • Need large user set to get statistical significant result
Experiment Metrics
•   User Satisfaction
•   Prediction Accuracy
•   Coverage
•   Diversity
•   Novelty
•   Serendipity
•   Trust
•   Robust
•   Real-time
Experiment Metrics
• User Satisfaction
  – Subjective metric
  – Measured by user survey or online experiments
Experiment Metrics
• Prediction Accuracy
  – Measured by offline experiments
  – Top-N Recommendation
     • Precision / Recall


  – Rating Prediction
     • MAE, RMSE
Experiment Metrics
• Coverage
  – Measure the ability of recommender system to
    recommend long-tail items.

                |         R (u , N ) |
                     u U
  Coverage
                           |I|

  – Entropy, Gini Index
Experiment Metrics
• Diversity
  – Measure the ability of recommender system to
    cover users’ different interests.
  – Different similarity metric generate different
    diversity metric.
Experiment Metrics
• Diversity (Example)




 Watch History           Related Items
Experiment Metrics
• Novelty
  – Measure the ability of recommender system to
    introduce long tail items to users.
  – International Workshop on Novelty and
    Diversity in Recommender Systems [link]
  – Music Recommendation and Discovery in the
    Long Tail [Oscar Celma]
Experiment Metrics
• Serendipity
  – A recommendation result is serendipity if:
     • it’s not related with user’s historical interest
     • it’s novelty to user
     • user will find it’s interesting after user view it
Experiment Metrics
• Trust
  – If user trust recommender system, they will
    interact with it.
  – Ways to improve trust:
     • Transparency
     • Social
     • Trust System (Epinion)
Experiment Metrics
• Robust
  – The ability of recommender system to prevent
    attack.
  – Neil Hurley. Tutorial on Robustness of
    Recommender System. ACM RecSys 2011.
Experiment Metrics
• Real-time
  – Generate new recommendations when user
    have new behaviors immediately.
Too many metric!
Which is most important?
How to do trade-off
• Business goal
• Our belief
• Making new algorithms by 3 steps
  experiments:
  – Offline testing
  – User survey
  – Online testing
Thanks!

Contenu connexe

Tendances

Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
Georgian Micsa
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
Ernesto Mislej
 
Item Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation AlgorithmsItem Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation Algorithms
nextlib
 

Tendances (20)

Recommender systems using collaborative filtering
Recommender systems using collaborative filteringRecommender systems using collaborative filtering
Recommender systems using collaborative filtering
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender Systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation Systems
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems Basics
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
Recommendation Engine Project Presentation
Recommendation Engine Project PresentationRecommendation Engine Project Presentation
Recommendation Engine Project Presentation
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
 
[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative Filtering
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
 
Boston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsBoston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender Systems
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Item Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation AlgorithmsItem Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation Algorithms
 
Past, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry PerspectivePast, Present & Future of Recommender Systems: An Industry Perspective
Past, Present & Future of Recommender Systems: An Industry Perspective
 

En vedette

Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
njit-ronbrown
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFM
Liangjie Hong
 
Matrix factorization
Matrix factorizationMatrix factorization
Matrix factorization
rubyyc
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engine
NYC Predictive Analytics
 

En vedette (15)

Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFM
 
Matrix factorization
Matrix factorizationMatrix factorization
Matrix factorization
 
Neighbor methods vs matrix factorization - case studies of real-life recommen...
Neighbor methods vs matrix factorization - case studies of real-life recommen...Neighbor methods vs matrix factorization - case studies of real-life recommen...
Neighbor methods vs matrix factorization - case studies of real-life recommen...
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with Spark
 
Intro to Factorization Machines
Intro to Factorization MachinesIntro to Factorization Machines
Intro to Factorization Machines
 
آموزش محاسبات عددی - بخش دوم
آموزش محاسبات عددی - بخش دومآموزش محاسبات عددی - بخش دوم
آموزش محاسبات عددی - بخش دوم
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix Factorization
 
Matrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsMatrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender Systems
 
Introduction to Matrix Factorization Methods Collaborative Filtering
Introduction to Matrix Factorization Methods Collaborative FilteringIntroduction to Matrix Factorization Methods Collaborative Filtering
Introduction to Matrix Factorization Methods Collaborative Filtering
 
Beginners Guide to Non-Negative Matrix Factorization
Beginners Guide to Non-Negative Matrix FactorizationBeginners Guide to Non-Negative Matrix Factorization
Beginners Guide to Non-Negative Matrix Factorization
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Recommendation system
Recommendation system Recommendation system
Recommendation system
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engine
 

Similaire à Recommender system introduction

Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
晓愚 孟
 
Data Mining and Recommendation Systems
Data Mining and Recommendation SystemsData Mining and Recommendation Systems
Data Mining and Recommendation Systems
Salil Navgire
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
Satyam Sharma
 

Similaire à Recommender system introduction (20)

Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Use of data science in recommendation system
Use of data science in  recommendation systemUse of data science in  recommendation system
Use of data science in recommendation system
 
productionising-recommenders
productionising-recommendersproductionising-recommenders
productionising-recommenders
 
Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
 
Recommender Systems in a nutshell
Recommender Systems in a nutshellRecommender Systems in a nutshell
Recommender Systems in a nutshell
 
Evaluation in Audio Music Similarity
Evaluation in Audio Music SimilarityEvaluation in Audio Music Similarity
Evaluation in Audio Music Similarity
 
Design Recommender systems from scratch
Design Recommender systems from scratchDesign Recommender systems from scratch
Design Recommender systems from scratch
 
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
NISO Altmetrics Initiative: A Project Update - Martin Fenner, Technical Lead ...
 
recommendation system techunique and issue
recommendation system techunique and issuerecommendation system techunique and issue
recommendation system techunique and issue
 
Олександр Обєдніков “Рекомендательные системы”
Олександр Обєдніков “Рекомендательные системы”Олександр Обєдніков “Рекомендательные системы”
Олександр Обєдніков “Рекомендательные системы”
 
The Hive Think Tank: Machine Learning at Pinterest by Jure Leskovec
The Hive Think Tank: Machine Learning at Pinterest by Jure LeskovecThe Hive Think Tank: Machine Learning at Pinterest by Jure Leskovec
The Hive Think Tank: Machine Learning at Pinterest by Jure Leskovec
 
Recommandation systems -
Recommandation systems - Recommandation systems -
Recommandation systems -
 
Modern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in MendeleyModern Perspectives on Recommender Systems and their Applications in Mendeley
Modern Perspectives on Recommender Systems and their Applications in Mendeley
 
Data Mining and Recommendation Systems
Data Mining and Recommendation SystemsData Mining and Recommendation Systems
Data Mining and Recommendation Systems
 
Demystifying Recommendation Systems
Demystifying Recommendation SystemsDemystifying Recommendation Systems
Demystifying Recommendation Systems
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
 
Agent technology for e commerce-recommendation systems
Agent technology for e commerce-recommendation systemsAgent technology for e commerce-recommendation systems
Agent technology for e commerce-recommendation systems
 
Mini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation DemystifiedMini-training: Personalization & Recommendation Demystified
Mini-training: Personalization & Recommendation Demystified
 
Paper prototype evaluation
Paper prototype evaluationPaper prototype evaluation
Paper prototype evaluation
 

Plus de Liang Xiang

Phd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal RecommendationPhd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal Recommendation
Liang Xiang
 
How to do model ensemble
How to do model ensembleHow to do model ensemble
How to do model ensemble
Liang Xiang
 
Temporal recommendation on graphs via long and short-term
Temporal recommendation on graphs via long  and short-termTemporal recommendation on graphs via long  and short-term
Temporal recommendation on graphs via long and short-term
Liang Xiang
 

Plus de Liang Xiang (6)

Kddcup2011
Kddcup2011Kddcup2011
Kddcup2011
 
Phd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal RecommendationPhd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal Recommendation
 
动态推荐系统关键技术研究
动态推荐系统关键技术研究动态推荐系统关键技术研究
动态推荐系统关键技术研究
 
动态推荐系统关键技术研究
动态推荐系统关键技术研究动态推荐系统关键技术研究
动态推荐系统关键技术研究
 
How to do model ensemble
How to do model ensembleHow to do model ensemble
How to do model ensemble
 
Temporal recommendation on graphs via long and short-term
Temporal recommendation on graphs via long  and short-termTemporal recommendation on graphs via long  and short-term
Temporal recommendation on graphs via long and short-term
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Recommender system introduction

  • 1. Recommender System Introduction xiangliang@hulu.com
  • 2. What is good recommender system?
  • 3. Outline • What is recommender system? – Mission – History – Problems • What is good recommender system? – Experiment Methods – Evaluation Metric
  • 5. How to solve information overload • Catalog – Yahoo, DMOZ • Search Engine – Google, Bing
  • 6. Mission • Help user find item of their interest. • Help item provider deliver their item to right user. • Help website improve user engagement.
  • 7. Recommender System
  • 8. Search Engine vs. Recommender System • User will try search engine if – they have specific needs – they can use keywords to describe needs • User will try recommender system if – they do not know what they want now – they can not use keywords to describe needs
  • 9. History: Before 1992 • Content Filtering – An architecture for large scale information systems [1985] (Gifford, D.K) – MAFIA: An active mail-filter agent for an intelligent document processing support [1990] (Lutz, E.) – A rule-based message filtering system [1988] (Pollock, S. )
  • 10. History: 1992-1998 • Tapestry by Xerox Palo Alto [1992] – First system designed by collaborative filtering • Grouplens [1994] – First recommender system using rating data • Movielens [1997] – First movie recommender system – Provide well-known dataset for researchers
  • 11. History: 1992-1998 • Fab : content-based collaborative recommendation – First unified recommender system • Empirical Analysis of Predictive Algorithms for Collaborative Filtering [1998] (John S. Breese) – Systematically evaluate user-based collaborative filtering
  • 12. History: 1999-2005 • Amazon proposed item-based collaborative filtering (Patent is filed in 1998 and issued in 2001) [link] • Thomas Hofmann proposed pLSA [1999] and apply similar method on collaborative filtering [2004] • Pandora began music genome project [2000]
  • 13. History: 1999-2005 • Lastfm using Audioscrobbler to generate user taste profile on musics. • Evaluating collaborative filtering recommender systems [2004] (Jonathan L. Herlocker)
  • 14. History: 2005-2009 • Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions. [2005] (Alexander Tuzhilin) • Netflix Prize [link] – Latent Factor Model (SVD, RSVD, NSVD, SVD++) – Temporal Dynamic Collaborative Filtering – Yehuda Koren [link]’s team get prize
  • 15. History: 2005-2009 • ACM Conference on Recommender System [2007] (Minneapolis, Minnesota, USA) • Digg, Youtube try recommender system.
  • 16. History: 2010-now • Context-Aware Recommender Systems • Music Recommendation and Discovery • Recommender Systems and the Social Web • Information Heterogeneity and Fusion in Recommender Systems • Human Decision Making in Recommender Systems • Personalization in Mobile Applications • Novelty and Diversity in Recommender Systems • User-Centric Evaluation
  • 17. History: 2010-now • Facebook launches instant personalization [2010] – Clicker – Bing – Trip Advisor – Rotten Tomatoes – Pandora – ……
  • 18. Problems • Main Problems – Top-N Recommendation – Rating Prediction
  • 19. Problems • Top-N Recommendation – Input user item A a B a B b … … – Output
  • 20. Problems • Top-N Recommendation – Input user item rating A a B a B b … … … – Output
  • 21. What is good recommender system?
  • 22. Experiment Methods • Offline Experiment • User Survey • Online Experiment – AB Testing
  • 23. Experiment Methods • Offline Experiment DataSet Train Test • Advantage: • Only rely on dataset • • Disadvantage: • Offline metric can not reflect business goal
  • 24. Experiment Methods • User Survey – Advantage: • Can get subjective metrics • Lower risk than online testing – Disadvantage: • Higher cost than offline experiments • Some results may not have statistical significance • Users may have different behaviors under testing environment or real environment • It’s difficult to design double blink experiments.
  • 25. Experiment Methods • On line experiments (AB Testing) – Advantage: • Can get metrics related to business goal – Disadvantage: • High risk/cost • Need large user set to get statistical significant result
  • 26. Experiment Metrics • User Satisfaction • Prediction Accuracy • Coverage • Diversity • Novelty • Serendipity • Trust • Robust • Real-time
  • 27. Experiment Metrics • User Satisfaction – Subjective metric – Measured by user survey or online experiments
  • 28. Experiment Metrics • Prediction Accuracy – Measured by offline experiments – Top-N Recommendation • Precision / Recall – Rating Prediction • MAE, RMSE
  • 29. Experiment Metrics • Coverage – Measure the ability of recommender system to recommend long-tail items. | R (u , N ) | u U Coverage |I| – Entropy, Gini Index
  • 30. Experiment Metrics • Diversity – Measure the ability of recommender system to cover users’ different interests. – Different similarity metric generate different diversity metric.
  • 31. Experiment Metrics • Diversity (Example) Watch History Related Items
  • 32. Experiment Metrics • Novelty – Measure the ability of recommender system to introduce long tail items to users. – International Workshop on Novelty and Diversity in Recommender Systems [link] – Music Recommendation and Discovery in the Long Tail [Oscar Celma]
  • 33. Experiment Metrics • Serendipity – A recommendation result is serendipity if: • it’s not related with user’s historical interest • it’s novelty to user • user will find it’s interesting after user view it
  • 34. Experiment Metrics • Trust – If user trust recommender system, they will interact with it. – Ways to improve trust: • Transparency • Social • Trust System (Epinion)
  • 35. Experiment Metrics • Robust – The ability of recommender system to prevent attack. – Neil Hurley. Tutorial on Robustness of Recommender System. ACM RecSys 2011.
  • 36. Experiment Metrics • Real-time – Generate new recommendations when user have new behaviors immediately.
  • 37. Too many metric! Which is most important?
  • 38. How to do trade-off • Business goal • Our belief • Making new algorithms by 3 steps experiments: – Offline testing – User survey – Online testing