SlideShare une entreprise Scribd logo
1  sur  19
Télécharger pour lire hors ligne
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items




                              Bayesian Personalized Ranking
                            for Non-Uniformly Sampled Items

                Zeno Gantner, Lucas Drumond, Christoph Freudenthaler,
                                Lars Schmidt-Thieme

                                                University of Hildesheim


                                                 21 August 2011




Zeno Gantner et al., University of Hildesheim                              1 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Questions (and Answers)




                                                What?
               Who?                                                          Which?
                                                How?
                                                                           Where?
                             Why?

Zeno Gantner et al., University of Hildesheim                                            2 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Which problem to solve?


Which problem to solve?




              Rating Prediction (Track 1)
                                                        vs.
                  Item Prediction (Track 2)


Zeno Gantner et al., University of Hildesheim                                            3 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   How did we tackle the problem?


How did we tackle the problem?
 Bayesian Personalized Ranking:
                                                                                                2
       BPR(DS ) = argmax                                 ln σ(ˆu,i (Θ) − ˆu,j (Θ) )−λ Θ
                                                              s          s
                                    Θ
                                            (u,i,j)∈DS

         DS contains all pairs of positive and negative items for each user,
                     1
         σ(x) = 1+e −x is the logistic function,
         Θ represents the model parameters,
         ˆu,i (Θ) is the predicted score for user u and item i, and
         s
         λ Θ 2 is a regularization term to prevent overfitting.

 interpretation 1: reduce ranking to pairwise classif. [Balcan et al. 2008]
 interpretation 2: optimize for smoothed area under the ROC curve (AUC)
 Model: matrix factorization
 Learning: stochastic gradient ascent

                                                                                 [Rendle et al., UAI 2009]
Zeno Gantner et al., University of Hildesheim                                                          4 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   How did we tackle the problem?


How did we tackle the problem?

                                                                                                2
              BPR(DS ) = argmax                                ln σ(ˆu,i − ˆu,j ) − λ Θ
                                                                    s      s
                                          Θ
                                                (u,i,j)∈DS

 problem: all negative items j are given the same weight




Zeno Gantner et al., University of Hildesheim                                                       5 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items    How did we tackle the problem?


How did we tackle the problem?

                                                                                                 2
              BPR(DS ) = argmax                                 ln σ(ˆu,i − ˆu,j ) − λ Θ
                                                                     s      s
                                          Θ
                                                 (u,i,j)∈DS

 problem: all negative items j are given the same weight

 solution: adapt weights in the optimization criterion (and sampling
 probabilities in the learning algorithm)


        WBPR(DS ) = argmax                                     wu wi wj ln σ(ˆu,i − ˆu,j ) − λ Θ 2 ,
                                                                             s      s
                                         Θ
                                                (u,i,j)∈DS

 where
                                                                      +
                                                wj =           δ(j ∈ Iu ).                             (1)
                                                       u∈U

Zeno Gantner et al., University of Hildesheim                                                          5 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Why did we not win?


Why did we not win?
But also: Why did we perform better than others?



 Why did we perform better than others?
         straightforward model that matches the prediction task pretty well
         scalability (e.g. k = 480 factors per user/item)
         integration of rating information (see paper)
         ensembles (see paper)




 Why did we not win?
         . . . two possible answers . . .



Zeno Gantner et al., University of Hildesheim                                        6 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Why did we not win?


Taxonomy




Zeno Gantner et al., University of Hildesheim                                        7 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items         Why did we not win?


Learn the right contrast


                                                                          rating < 80

                                                rating >= 80   liked?

                                                                           no rating




                                                rating >= 80

                                                               rated?      no rating

                                                rating < 80




                                                rating >= 80     ?          no rating




Zeno Gantner et al., University of Hildesheim                                              8 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items          Why did we not win?


Learn the right contrast


                                                                                        rating < 80

                              rating >= 80                     liked?

                                                                                         no rating




                                                rating >= 80

                                                                rated?      no rating

                                                rating < 80




                                                rating >= 80      ?          no rating




Zeno Gantner et al., University of Hildesheim                                                         9 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items          Why did we not win?


Learn the right contrast

                                                                           rating < 80

                                                rating >= 80    liked?

                                                                            no rating




                              rating >= 80

                                                               rated?                    no rating

                               rating < 80




                                                rating >= 80      ?          no rating




Zeno Gantner et al., University of Hildesheim                                                        10 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items         Why did we not win?


Learn the right contrast

                                                                          rating < 80

                                                rating >= 80   liked?

                                                                           no rating




                                                rating >= 80

                                                               rated?      no rating

                                                rating < 80




                              rating >= 80                       ?                      no rating




Zeno Gantner et al., University of Hildesheim                                                       11 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Where?


Where next?




         classification → ranking → pairwise classification
         pairwise classification: try other losses, e.g. soft margin (hinge) loss
         Bayesian2 Personalized Ranking
         beyond KDD Cup: consider different sampling schemes . . .
Zeno Gantner et al., University of Hildesheim                                 12 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Summary


Summary
        Use matrix factorization optimized for Bayesian
        Personalized Ranking (BPR) to solve the item
        ranking problem.
                BPR reduces ranking (in this case: binary
                variables) to pairwise classification.
        Extend BPR to use different sampling scheme:
        Weighted BPR (WBPR).
        Open question: Learn a different contrast?
        Details can be found in the paper.
        Code: http://ismll.de/mymedialite/
        examples/kddcup2011.html


                               advertisement: Contribute to http://recsyswiki.com!

Zeno Gantner et al., University of Hildesheim                                  13 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Questions




Zeno Gantner et al., University of Hildesheim                              14 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items


Acknowledgements
 Thank you
         The organizers, for hosting a great competition.
         The participants, for sharing their insights.

 Funding
         German Research Council (Deutsche Forschungsgemeinschaft, DFG) project
         Multirelational Factorization Models.
         Development of the MyMediaLite software was co-funded by the European
         Commission FP7 project MyMedia under the grant agreement no. 215006.

 Picture credits
             by Michael Sauers, under Creative Commons by-nc-sa 2.0
         http://www.flickr.com/photos/travelinlibrarian/223839049/

               by Rob Starling, under Creative Commons by-sa 2.0
         http://en.wikipedia.org/wiki/File:Air_New_Zealand_B747-400_ZK-SUI_at_LHR.jpg

Zeno Gantner et al., University of Hildesheim                                           15 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items


Numbers?



                                                 k     error in %
                                                 “liked” contrast
                                                320       5.52
                                                480       5.08
                                                “rated” contrast
                                                320       5.15
                                                480       4.87


 Estimated error on validation split (not leaderboard).




Zeno Gantner et al., University of Hildesheim                       16 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Advertisement


MyMediaLite: Recommender System Algorithm Library
 functionality
         rating prediction
         item recommendation from implicit feedback
         group recommendation
 target groups                                                                 simple
         researchers, educators and students                                   free
         application developers                                                scalable
 development                                                                   well-documented
         written in C#, runs on Mono                                           well-tested
         GNU General Public License (GPL)                                      choice
         regular releases (ca. 1 per month)

                                    http://ismll.de/mymedialite

Zeno Gantner et al., University of Hildesheim                                                17 / 15
Bayesian Personalized Rankingfor Non-Uniformly Sampled Items   Advertisement


RecSys Wiki is looking for contributions




                                                                               Alan




                                                                               Zeno


                                   http://recsyswiki.com

Zeno Gantner et al., University of Hildesheim                                         18 / 15

Contenu connexe

En vedette

Blaha Forum 21 - Veränderung Führung Nachhaltigkeit - Unternehmen auf Kurs br...
Blaha Forum 21 - Veränderung Führung Nachhaltigkeit - Unternehmen auf Kurs br...Blaha Forum 21 - Veränderung Führung Nachhaltigkeit - Unternehmen auf Kurs br...
Blaha Forum 21 - Veränderung Führung Nachhaltigkeit - Unternehmen auf Kurs br...Heinz Peter Wallner
 
10 distribuciones linux mas importantes y sus características.
10 distribuciones linux mas importantes y sus características.10 distribuciones linux mas importantes y sus características.
10 distribuciones linux mas importantes y sus características.Michelle Love
 
Ensayo ecotecnologias (1)
Ensayo ecotecnologias (1)Ensayo ecotecnologias (1)
Ensayo ecotecnologias (1)Robert180696
 
Mind Mapping Literatur Personal Brand
Mind Mapping Literatur Personal BrandMind Mapping Literatur Personal Brand
Mind Mapping Literatur Personal BrandNatalia Gultom
 
Scada Hidrodata - Monitorización de Procesos
Scada Hidrodata - Monitorización de ProcesosScada Hidrodata - Monitorización de Procesos
Scada Hidrodata - Monitorización de ProcesosMiguel Gallardo Cabrerizo
 
Plattform Kooperativismus als Antwort auf den Plattform Kapitalismus
Plattform Kooperativismus als Antwort auf den Plattform KapitalismusPlattform Kooperativismus als Antwort auf den Plattform Kapitalismus
Plattform Kooperativismus als Antwort auf den Plattform KapitalismusThomas Doennebrink
 
Entonces, cual es mi identidad
Entonces, cual es mi identidadEntonces, cual es mi identidad
Entonces, cual es mi identidadYeshiva Torah
 
13119 60c0af96eefbe938c72eb484bc5ac596
13119 60c0af96eefbe938c72eb484bc5ac59613119 60c0af96eefbe938c72eb484bc5ac596
13119 60c0af96eefbe938c72eb484bc5ac5960nike7c
 
Income tax calculator 2015 - 2016
Income tax calculator 2015 - 2016Income tax calculator 2015 - 2016
Income tax calculator 2015 - 2016Altacit Global
 
Jensen Precast Redi-Rock Retaining Walls
Jensen Precast Redi-Rock Retaining WallsJensen Precast Redi-Rock Retaining Walls
Jensen Precast Redi-Rock Retaining WallsJensen Precast
 

En vedette (20)

Touch Toy - Kids Corner - Kinderterminal
Touch Toy - Kids Corner - KinderterminalTouch Toy - Kids Corner - Kinderterminal
Touch Toy - Kids Corner - Kinderterminal
 
Blaha Forum 21 - Veränderung Führung Nachhaltigkeit - Unternehmen auf Kurs br...
Blaha Forum 21 - Veränderung Führung Nachhaltigkeit - Unternehmen auf Kurs br...Blaha Forum 21 - Veränderung Führung Nachhaltigkeit - Unternehmen auf Kurs br...
Blaha Forum 21 - Veränderung Führung Nachhaltigkeit - Unternehmen auf Kurs br...
 
10 distribuciones linux mas importantes y sus características.
10 distribuciones linux mas importantes y sus características.10 distribuciones linux mas importantes y sus características.
10 distribuciones linux mas importantes y sus características.
 
Coaching de pnl
Coaching de pnlCoaching de pnl
Coaching de pnl
 
Recetario de cocina mexicana
Recetario de cocina mexicanaRecetario de cocina mexicana
Recetario de cocina mexicana
 
Ensayo ecotecnologias (1)
Ensayo ecotecnologias (1)Ensayo ecotecnologias (1)
Ensayo ecotecnologias (1)
 
Mind Mapping Literatur Personal Brand
Mind Mapping Literatur Personal BrandMind Mapping Literatur Personal Brand
Mind Mapping Literatur Personal Brand
 
Scada Hidrodata - Monitorización de Procesos
Scada Hidrodata - Monitorización de ProcesosScada Hidrodata - Monitorización de Procesos
Scada Hidrodata - Monitorización de Procesos
 
058-062_Cars&Bikes.ADM_asOK-2
058-062_Cars&Bikes.ADM_asOK-2058-062_Cars&Bikes.ADM_asOK-2
058-062_Cars&Bikes.ADM_asOK-2
 
Plattform Kooperativismus als Antwort auf den Plattform Kapitalismus
Plattform Kooperativismus als Antwort auf den Plattform KapitalismusPlattform Kooperativismus als Antwort auf den Plattform Kapitalismus
Plattform Kooperativismus als Antwort auf den Plattform Kapitalismus
 
3b2013quansiguigran
3b2013quansiguigran3b2013quansiguigran
3b2013quansiguigran
 
Entonces, cual es mi identidad
Entonces, cual es mi identidadEntonces, cual es mi identidad
Entonces, cual es mi identidad
 
13119 60c0af96eefbe938c72eb484bc5ac596
13119 60c0af96eefbe938c72eb484bc5ac59613119 60c0af96eefbe938c72eb484bc5ac596
13119 60c0af96eefbe938c72eb484bc5ac596
 
Income tax calculator 2015 - 2016
Income tax calculator 2015 - 2016Income tax calculator 2015 - 2016
Income tax calculator 2015 - 2016
 
Conferencia Empresarial
Conferencia EmpresarialConferencia Empresarial
Conferencia Empresarial
 
2012 SINCRONIZACION EN TIEMPO CERO
2012  SINCRONIZACION EN TIEMPO CERO2012  SINCRONIZACION EN TIEMPO CERO
2012 SINCRONIZACION EN TIEMPO CERO
 
Wi-Fi Evolution
Wi-Fi EvolutionWi-Fi Evolution
Wi-Fi Evolution
 
Jensen Precast Redi-Rock Retaining Walls
Jensen Precast Redi-Rock Retaining WallsJensen Precast Redi-Rock Retaining Walls
Jensen Precast Redi-Rock Retaining Walls
 
Los fitofármacos
Los fitofármacosLos fitofármacos
Los fitofármacos
 
Repetidores
RepetidoresRepetidores
Repetidores
 

Dernier

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Dernier (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Bayesian Personalized Ranking for Non-Uniformly Sampled Items

  • 1. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Bayesian Personalized Ranking for Non-Uniformly Sampled Items Zeno Gantner, Lucas Drumond, Christoph Freudenthaler, Lars Schmidt-Thieme University of Hildesheim 21 August 2011 Zeno Gantner et al., University of Hildesheim 1 / 15
  • 2. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Questions (and Answers) What? Who? Which? How? Where? Why? Zeno Gantner et al., University of Hildesheim 2 / 15
  • 3. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Which problem to solve? Which problem to solve? Rating Prediction (Track 1) vs. Item Prediction (Track 2) Zeno Gantner et al., University of Hildesheim 3 / 15
  • 4. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items How did we tackle the problem? How did we tackle the problem? Bayesian Personalized Ranking: 2 BPR(DS ) = argmax ln σ(ˆu,i (Θ) − ˆu,j (Θ) )−λ Θ s s Θ (u,i,j)∈DS DS contains all pairs of positive and negative items for each user, 1 σ(x) = 1+e −x is the logistic function, Θ represents the model parameters, ˆu,i (Θ) is the predicted score for user u and item i, and s λ Θ 2 is a regularization term to prevent overfitting. interpretation 1: reduce ranking to pairwise classif. [Balcan et al. 2008] interpretation 2: optimize for smoothed area under the ROC curve (AUC) Model: matrix factorization Learning: stochastic gradient ascent [Rendle et al., UAI 2009] Zeno Gantner et al., University of Hildesheim 4 / 15
  • 5. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items How did we tackle the problem? How did we tackle the problem? 2 BPR(DS ) = argmax ln σ(ˆu,i − ˆu,j ) − λ Θ s s Θ (u,i,j)∈DS problem: all negative items j are given the same weight Zeno Gantner et al., University of Hildesheim 5 / 15
  • 6. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items How did we tackle the problem? How did we tackle the problem? 2 BPR(DS ) = argmax ln σ(ˆu,i − ˆu,j ) − λ Θ s s Θ (u,i,j)∈DS problem: all negative items j are given the same weight solution: adapt weights in the optimization criterion (and sampling probabilities in the learning algorithm) WBPR(DS ) = argmax wu wi wj ln σ(ˆu,i − ˆu,j ) − λ Θ 2 , s s Θ (u,i,j)∈DS where + wj = δ(j ∈ Iu ). (1) u∈U Zeno Gantner et al., University of Hildesheim 5 / 15
  • 7. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Why did we not win? Why did we not win? But also: Why did we perform better than others? Why did we perform better than others? straightforward model that matches the prediction task pretty well scalability (e.g. k = 480 factors per user/item) integration of rating information (see paper) ensembles (see paper) Why did we not win? . . . two possible answers . . . Zeno Gantner et al., University of Hildesheim 6 / 15
  • 8. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Why did we not win? Taxonomy Zeno Gantner et al., University of Hildesheim 7 / 15
  • 9. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Why did we not win? Learn the right contrast rating < 80 rating >= 80 liked? no rating rating >= 80 rated? no rating rating < 80 rating >= 80 ? no rating Zeno Gantner et al., University of Hildesheim 8 / 15
  • 10. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Why did we not win? Learn the right contrast rating < 80 rating >= 80 liked? no rating rating >= 80 rated? no rating rating < 80 rating >= 80 ? no rating Zeno Gantner et al., University of Hildesheim 9 / 15
  • 11. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Why did we not win? Learn the right contrast rating < 80 rating >= 80 liked? no rating rating >= 80 rated? no rating rating < 80 rating >= 80 ? no rating Zeno Gantner et al., University of Hildesheim 10 / 15
  • 12. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Why did we not win? Learn the right contrast rating < 80 rating >= 80 liked? no rating rating >= 80 rated? no rating rating < 80 rating >= 80 ? no rating Zeno Gantner et al., University of Hildesheim 11 / 15
  • 13. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Where? Where next? classification → ranking → pairwise classification pairwise classification: try other losses, e.g. soft margin (hinge) loss Bayesian2 Personalized Ranking beyond KDD Cup: consider different sampling schemes . . . Zeno Gantner et al., University of Hildesheim 12 / 15
  • 14. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Summary Summary Use matrix factorization optimized for Bayesian Personalized Ranking (BPR) to solve the item ranking problem. BPR reduces ranking (in this case: binary variables) to pairwise classification. Extend BPR to use different sampling scheme: Weighted BPR (WBPR). Open question: Learn a different contrast? Details can be found in the paper. Code: http://ismll.de/mymedialite/ examples/kddcup2011.html advertisement: Contribute to http://recsyswiki.com! Zeno Gantner et al., University of Hildesheim 13 / 15
  • 15. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Questions Zeno Gantner et al., University of Hildesheim 14 / 15
  • 16. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Acknowledgements Thank you The organizers, for hosting a great competition. The participants, for sharing their insights. Funding German Research Council (Deutsche Forschungsgemeinschaft, DFG) project Multirelational Factorization Models. Development of the MyMediaLite software was co-funded by the European Commission FP7 project MyMedia under the grant agreement no. 215006. Picture credits by Michael Sauers, under Creative Commons by-nc-sa 2.0 http://www.flickr.com/photos/travelinlibrarian/223839049/ by Rob Starling, under Creative Commons by-sa 2.0 http://en.wikipedia.org/wiki/File:Air_New_Zealand_B747-400_ZK-SUI_at_LHR.jpg Zeno Gantner et al., University of Hildesheim 15 / 15
  • 17. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Numbers? k error in % “liked” contrast 320 5.52 480 5.08 “rated” contrast 320 5.15 480 4.87 Estimated error on validation split (not leaderboard). Zeno Gantner et al., University of Hildesheim 16 / 15
  • 18. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Advertisement MyMediaLite: Recommender System Algorithm Library functionality rating prediction item recommendation from implicit feedback group recommendation target groups simple researchers, educators and students free application developers scalable development well-documented written in C#, runs on Mono well-tested GNU General Public License (GPL) choice regular releases (ca. 1 per month) http://ismll.de/mymedialite Zeno Gantner et al., University of Hildesheim 17 / 15
  • 19. Bayesian Personalized Rankingfor Non-Uniformly Sampled Items Advertisement RecSys Wiki is looking for contributions Alan Zeno http://recsyswiki.com Zeno Gantner et al., University of Hildesheim 18 / 15