SlideShare a Scribd company logo
1 of 47
Download to read offline
Recommender Systems
Francesco Ricci
Free University of Bozen-Bolzano
fricci@unibz.it
2
Content
p  Paradox of choice and information overload
p  Personalization
p  Recommender system
p  Step 1: Preference elicitation
p  Step 2: Preference prediction - rating estimation
techniques
n  Contextualization
p  Step 3: Recommendations' presentation
p  Issues and problems
p  Questions
Explosion of Choice
p  A trip to a local supermarket:
n  85 different varieties and brands of crackers.
n  285 varieties of cookies.
n  165 varieties of “juice drinks”
n  75 iced teas
n  275 varieties of cereal
n  120 different pasta sauces
n  80 different pain relievers
n  40 options for toothpaste
n  95 varieties of snacks (chips, pretzels, etc.)
n  61 varieties of sun tan oil and sunblock
n  360 types of shampoo, conditioner, gel, and mousse.
n  90 different cold remedies and decongestants.
n  230 soups, including 29 different chicken soups
n  175 different salad dressings and if none of them suited,
15 extra-virgin olive oils and 42 vinegars and make
one’s own
New Domains for Choice
p  Telephone Services
p  Retirement Pensions
p  Medical Care
p  News
p  Choosing how to work
p  Choosing how to love
p  Choosing how to be
Choice and Well-Being
p  We have more choice, more freedom,
autonomy, and self determination
p  It seems that increased choice improves well-
being:
n  added options can only make us better off:
those who care will benefit, and those who do
not care can always ignore the added options
p  Various assessment of well-being have shown
that increased affluence have accompanied by
decreased well-being.
Neuroscience and Information Overload
p  Neuroscientists have discovered that
unproductivity and loss of drive can result from
decision overload
p  Our brains (120 bits per second) are configured
to make a certain number of decisions per
day and once we reach that limit, we can’t make
any more
p  Information processing has a cost: we can
have trouble separating the
trivial from the important –
this inf. processing makes
us tired.
6
7
Information Overload
p  Internet = information overload =
having too much information to make a
decision or remain informed about a topic
p  To make a decision or remain informed
about a topic you must perform exploratory search
(e.g., comparison, knowledge acquisition, product
selection, etc.)
n  not aware of the range of available options
n  may not know what to search
n  if presented with some results may not be able to
choose.
Personalization
p  “If I have 3 million customers on the
Web, I should have 3 million stores on the
Web”
n  Jeff Bezos, CEO and
founder, Amazon.com
n  Degree in Computer
Science
n  $34.2 billion (net worth),
ranked no. 15 in the
Forbes list of the
America's Wealthiest
People
8
Amazon.it
9
Movie Recommendation – YouTube
10Recommendations account for about 60% of all video clicks from
the home page.
Consumer Attitudes
11
12
The Long Tail
p  Economic model in which the market for non-hits (typically
large numbers of low-volume items) could be significant
and sometimes even greater than the market for big hits
(typically small numbers of high-volume items).
Goal
p  Recommend items that are good for you!
n  relevant
n  improve well being
n  rational choices
n  optimal
13
Step 1: Preference Elicitation
14
Last.fm – Preference Elicitation
Rating Recommendations
16
Alternative Methods
17
Remembering
p  D. Kahneman (nobel prize): what we
remember about an experience is
determined by (peak-end rule)
n  How the experience felt when it was at its peak
(best or worst)
n  How it felt when it ended
p  We rely on this summary later to remind how the
experience felt and decide whether to have that
experience again
p  So how well do we know what we want?
n  It is doubtful that we prefer an experience to
another very similar just because the first
ended better.
Bias of Remembered Utility 18
Step 2: Model Building
19
20
scoredatemovieuser
15/7/02211
58/2/042131
43/6/013452
45/1/051232
37/15/027682
51/22/01763
48/3/00454
19/10/055685
23/5/033425
212/28/002345
58/11/02766
46/15/03566
scoredatemovieuser
?1/6/05621
?9/13/04961
?8/18/0572
?11/22/0532
?6/13/02473
?8/12/01153
?9/1/00414
?8/27/05284
?4/4/05935
?7/16/03745
?2/14/04696
?10/3/03836
Training data Test data
Movie rating data
21
Items
Users
Matrix of ratings
Item-to-Item Collaborative Filtering
p  Suppose the prediction is made using two nearest-
neighbors, and that the items most similar to “Titanic” are
“Forrest Gump” and “Wall-E”
p  wtitanic, forrest = 0.85
p  wtitanic, wall-e = 0.75
p  r*eric, titanic = (0.85*5 + 0.75*4)/(0.85 + 0.75) = 4.53
22
target neigh. neigh.
23
User-Based Collaborative Filtering
p  A collection of n users U and a collection of m items I
p  A n × m matrix of ratings rui , with rui = ? if user u did not
rate item i
p  Prediction for user u and item j is computed as
p  Where, ru is the average rating of user u, K is a
normalization factor such that the absolute values of wuv sum
to 1, and
wuv =
(ruj −ru )(rvj −rv )
j∈Iuv
∑
(ruj −ru )2
(rvj −rv )2
j∈Iuv
∑j∈Iuv
∑
Pearson
Correlation of
users u and v
[Breese et al., 1998]
ruj
*
= ru + K wuv (rvj −rv )
v∈Nj (u)
∑ A set of neighbours of
u that have rated j
24
Geared
towards
females
Geared
towards
males
serious
escapist
The Princess
Diaries
The Lion King
Braveheart
Lethal
Weapon
Independence
Day
AmadeusThe Color
Purple
Dumb and
Dumber
Ocean’s 11
Sense and
Sensibility
Gus
Dave
Latent Factor Models
25
Basic Matrix Factorization Model
45531
312445
53432142
24542
522434
42331users
.2-.4.1
.5.6-.5
.5.3-.2
.32.11.1
-22.1-.7
.3.7-1
-.92.41.4.3-.4.8-.5-2.5.3-.21.1
1.3-.11.2-.72.91.4-1.31.4.5.7-.8
.1-.6.7.8.4-.3.92.41.7.6-.42.1
~
~
users
items
items
A rank-3 approximation
12 items
6 users
max 72 entries
12 x 3 entries
6 x 3 entries
54 total entries
26
Estimate Unknown Ratings
45531
312445
53432142
24542
522434
42331users
.2-.4.1
.5.6-.5
.5.3-.2
.32.11.1
-22.1-.7
.3.7-1
-.92.41.4.3-.4.8-.5-2.5.3-.21.1
1.3-.11.2-.72.91.4-1.31.4.5.7-.8
.1-.6.7.8.4-.3.92.41.7.6-.42.1
~
~
users
items
A rank-3 approximation
items
?
27
Estimate Unknown Ratings
45531
312445
53432142
24542
522434
42331
users
.2-.4.1
.5.6-.5
.5.3-.2
.32.11.1
-22.1-.7
.3.7-1
-.92.41.4.3-.4.8-.5-2.5.3-.21.1
1.3-.11.2-.72.91.4-1.31.4.5.7-.8
.1-.6.7.8.4-.3.92.41.7.6-.42.1
~
~
users
items
2.4
A rank-3 approximation
items -0.5*(-2) + 0.6*0.3 + 0.5*2.4 = 2.4
28
Matrix factorization as a cost function
Minp*,q*
rui
− pu
T
qi( )
2
+ λ pu
2
+ qi
2"
#
$
%
&
'
(
)*
+
,-
known rui
∑
regularization- user-factors of u
- item-factors of i
- rating by u for iuir
iq
up
•  Optimize by either stochastic gradient-descent or
alternating least squares
29
“Core” Recommendation Techniques
[Burke, 2007]
U is a set of users
I is a set of items/products
30
Content-Based Recommender with Centroid
Interesting Documents
Not interesting Documents
Centroid
User Model
Doc1
Doc2
Doc1 is estimated more interesting than Doc2
Centroid
politics
sports
Recommendations can be wrong
p  Recommenders tend to recommend items similar
to those browsed or purchased in the past
31
Context-Aware Computing
p  Gartner Top 10 strategic technology trends for IT
p  Context-aware computing is a style of computing
in which situational and environmental information
about people, places and things is used to
anticipate immediate
needs and proactively
offer enriched,
situation-aware and
usable content, functions
and experiences.
32http://www.gartner.com/it-glossary/context-aware-computing-2
Google Now
33
https://www.google.com/landing/now/
Types of Context - Mobile
p  Physical context
n  time, position, and activity of the user,
weather, light, and temperature ...
p  Social context
n  the presence and role of other people around the
user
p  Interaction media context
n  the device used to access the system and the type
of media that are browsed and personalized (text,
music, images, movies, …)
p  Modal context
n  The state of mind of the user, the user’s goals,
mood, experience, and cognitive capabilities. 34
[Fling, 2009]
Factors influencing Holiday Decision
Decision
Personal
Motivators
Personality
Disposable
Income
Health
Family
commitments
Past experience
Works
commitments
Hobbies and
interests
Knowledge of
potential
holidays
Lifestyle Attitudes,
opinions and
perceptions
Internal to the tourist External to the tourist
Availability of
products Advice of travel
agents
Information obtained
from tourism
organization and
media
Word-of-mouth
recommendations
Political restrictions:
visa, terrorism,
Health problems
Special promotion
and offers
Climate
[Swarbrooke & Horner, 2006]
q  Only ratings acquired in exactly the same context are
used
q  Hypothesis: pre-filtering can be enhanced by
exploiting semantic similarities between contexts
Traditional contextual pre-filtering
36	
  	
  
"sunny"
ratings
in-context
ratings
Ratings
filtering
Prediction
model
target context
predicted
rating
Distributional semantics of context
37
p  Assumption: two contexts are similar if their
composing conditions influence ratings
similarly
Condition User1 User2 User3 User4 User5 User6 User7
1 -0.7 0 0.9 0.1 -0.6 0
0.7 -0.8 0.5 0.8 0.4 -0.2 0
-0.5 0.7 0.2 -1 0.9 0.8 0.5
Semantic contextual pre-filtering
38
q  Key idea: reuse ratings acquired in similar
contexts
"similar context"
ratings
Ratings
filtering
Prediction
model
≈
≠ semantic
similarities
in-context
ratings
target
context
predicted
rating
Semantic Pre-Filtering vs. State of the art
39
0%	
  
2%	
  
4%	
  
6%	
  
8%	
  
10%	
  
12%	
  
14%	
  
16%	
  
18%	
  
Tourism	
   Music	
   Adom	
   Comoda	
   Movie	
   Library	
  
Semantic Pre-Filtering UI-Splitting CAMF
% = MAE (mean absolute error) reduction with respect to a
context-free Matrix Factorization model (the higher, the
better)
South Tyrol Suggest (STS)
•  A mobile Android context-aware RS
that recommends places of interests
(POIs) from a total of 27,000 POIs in
South Tyrol region
•  STS computes rating predictions for
all POIs using the personality of the
users, the ratings, and 14 contextual
factors, such as: weather forecast,
mood, budget, and travel goal.
Neuroticism Conscientious-
ness
Openness
ExtraversionAgreeableness
Big Five 
Personality
Traits
Food Advisor for a Family
Step 3: Recommendation Presentation
42
Colnago Ferrari
Anchoring
p  How do we determine what is reasonable to
spend for a race bicycle?
n  In an online shop that presents only bicycles
costing over 3.000E we may believe that
1.500 is not enough, or that a bicycle at that
price will be a bargain
n  Even if nobody will select the
highest-priced models, the
shop can reap benefits from
listing them – people is
induced to buy the cheaper
(but still expensive) ones.
Context increases Expected Utility 43
Dissatisfaction because of opportunity
costs
p  A study in which people were asked how much they
would be willing to pay for subscriptions to magazines
[Brenner, Rottenstreich,& Sood, 1999].
n  Some were asked about individual magazines or
videos
n  Others were asked about these same items as part
of a group with other magazines or videos
p  Respondents placed a higher value on the magazine
or the video when they were evaluating it in
isolation
n  If evaluated as part of a group, opportunity costs
associated with the other options reduce the
value of each of them.
Context decreases Expected Utility 44
ReRex
45
Context increases Expected Utility
Context used to
differentiate options
and decrease
opportunity cost.
Problems and Issues
p  Cold Start (new user and new item) - old items
are less interesting
p  Learning to interact
p  Measuring
p  Filter Bubble
p  How much to personalize
p  When to contextualize
p  How to deliver
contextualized content?
p  Multiple devices (synchronization)
46
47New edition is coming in 2015
Questions?

More Related Content

Viewers also liked

Viewers also liked (20)

Offline vs. online intermediation: a study of booking behaviour of tourists t...
Offline vs. online intermediation: a study of booking behaviour of tourists t...Offline vs. online intermediation: a study of booking behaviour of tourists t...
Offline vs. online intermediation: a study of booking behaviour of tourists t...
 
Corporate identity communication on corporate websites: Evidence from the Hon...
Corporate identity communication on corporate websites: Evidence from the Hon...Corporate identity communication on corporate websites: Evidence from the Hon...
Corporate identity communication on corporate websites: Evidence from the Hon...
 
Co-creation of a tourist experience enhanced by technology, in the context of...
Co-creation of a tourist experience enhanced by technology, in the context of...Co-creation of a tourist experience enhanced by technology, in the context of...
Co-creation of a tourist experience enhanced by technology, in the context of...
 
Technology as a Catalyst of Change: Enablers and Barriers of the Tourist Expe...
Technology as a Catalyst of Change: Enablers and Barriers of the Tourist Expe...Technology as a Catalyst of Change: Enablers and Barriers of the Tourist Expe...
Technology as a Catalyst of Change: Enablers and Barriers of the Tourist Expe...
 
Meetings Industry Stakeholders’ Cooperation relevance to City Destinations Pe...
Meetings Industry Stakeholders’ Cooperation relevance to City Destinations Pe...Meetings Industry Stakeholders’ Cooperation relevance to City Destinations Pe...
Meetings Industry Stakeholders’ Cooperation relevance to City Destinations Pe...
 
Tourists and Municipal Wi-Fi Networks. The case of Lugano (CH
Tourists and Municipal Wi-Fi Networks. The case of Lugano (CHTourists and Municipal Wi-Fi Networks. The case of Lugano (CH
Tourists and Municipal Wi-Fi Networks. The case of Lugano (CH
 
Content Analysis of TripAdvisor Reviews on Restaurants: A Case Study of Macau
Content Analysis of TripAdvisor Reviews on Restaurants: A Case Study of MacauContent Analysis of TripAdvisor Reviews on Restaurants: A Case Study of Macau
Content Analysis of TripAdvisor Reviews on Restaurants: A Case Study of Macau
 
Travel Social Media Involvement: A Proposed Measure
Travel Social Media Involvement: A Proposed MeasureTravel Social Media Involvement: A Proposed Measure
Travel Social Media Involvement: A Proposed Measure
 
Religious World Heritage Sites as Anchor Attractions for Developing Sustainab...
Religious World Heritage Sites as Anchor Attractions for Developing Sustainab...Religious World Heritage Sites as Anchor Attractions for Developing Sustainab...
Religious World Heritage Sites as Anchor Attractions for Developing Sustainab...
 
Usages and Role of Instant Messaging Applications during the Beatification of...
Usages and Role of Instant Messaging Applications during the Beatification of...Usages and Role of Instant Messaging Applications during the Beatification of...
Usages and Role of Instant Messaging Applications during the Beatification of...
 
Smart Tourism Challenges
Smart Tourism ChallengesSmart Tourism Challenges
Smart Tourism Challenges
 
An Investigation of Hotel Room Reservation: What Are The Diverse Pricing Stra...
An Investigation of Hotel Room Reservation: What Are The Diverse Pricing Stra...An Investigation of Hotel Room Reservation: What Are The Diverse Pricing Stra...
An Investigation of Hotel Room Reservation: What Are The Diverse Pricing Stra...
 
How effective are Asian hotels in communicating CSR efforts through the prope...
How effective are Asian hotels in communicating CSR efforts through the prope...How effective are Asian hotels in communicating CSR efforts through the prope...
How effective are Asian hotels in communicating CSR efforts through the prope...
 
Travellers' intended future trip arrangement strategies for things to do duri...
Travellers' intended future trip arrangement strategies for things to do duri...Travellers' intended future trip arrangement strategies for things to do duri...
Travellers' intended future trip arrangement strategies for things to do duri...
 
Innovation Practices in Alternative Tourism
Innovation Practices in Alternative TourismInnovation Practices in Alternative Tourism
Innovation Practices in Alternative Tourism
 
Conceptualising Smart Tourism Destination Dimensions
Conceptualising Smart Tourism Destination DimensionsConceptualising Smart Tourism Destination Dimensions
Conceptualising Smart Tourism Destination Dimensions
 
What strategies to dynamize a destination digitally?
What strategies to dynamize a destination digitally?What strategies to dynamize a destination digitally?
What strategies to dynamize a destination digitally?
 
GUEST FOCUS IN A DIGITAL WORLD
GUEST FOCUS IN A DIGITAL WORLDGUEST FOCUS IN A DIGITAL WORLD
GUEST FOCUS IN A DIGITAL WORLD
 
Visual Appeal of Hotel Websites: An Exploratory Eye Tracking Study on Chinese...
Visual Appeal of Hotel Websites: An Exploratory Eye Tracking Study on Chinese...Visual Appeal of Hotel Websites: An Exploratory Eye Tracking Study on Chinese...
Visual Appeal of Hotel Websites: An Exploratory Eye Tracking Study on Chinese...
 
The Role of Personal Value in Information Search Strategies for Community-Bas...
The Role of Personal Value in Information Search Strategies for Community-Bas...The Role of Personal Value in Information Search Strategies for Community-Bas...
The Role of Personal Value in Information Search Strategies for Community-Bas...
 

Similar to Recommender Systems

CMU Trecvid med13 nist
CMU Trecvid med13 nistCMU Trecvid med13 nist
CMU Trecvid med13 nist
Lu Jiang
 
Making sense of community engagement, impacts and outcomes
Making sense of community engagement, impacts and outcomesMaking sense of community engagement, impacts and outcomes
Making sense of community engagement, impacts and outcomes
MetroWater
 

Similar to Recommender Systems (20)

Observational studies in social media
Observational studies in social mediaObservational studies in social media
Observational studies in social media
 
Making sense of engagement
Making sense of engagementMaking sense of engagement
Making sense of engagement
 
Williams thinking fast and slow
Williams thinking fast and slowWilliams thinking fast and slow
Williams thinking fast and slow
 
An Early Warning System for Ambient Assisted Living
An Early Warning System for Ambient Assisted LivingAn Early Warning System for Ambient Assisted Living
An Early Warning System for Ambient Assisted Living
 
Fashiondatasc
FashiondatascFashiondatasc
Fashiondatasc
 
Cist2014 slides
Cist2014 slidesCist2014 slides
Cist2014 slides
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
CMU Trecvid med13 nist
CMU Trecvid med13 nistCMU Trecvid med13 nist
CMU Trecvid med13 nist
 
Pinellas behavioral economics and financial decision making
Pinellas behavioral economics and financial decision makingPinellas behavioral economics and financial decision making
Pinellas behavioral economics and financial decision making
 
master_thesis.pdf
master_thesis.pdfmaster_thesis.pdf
master_thesis.pdf
 
Making sense of community engagement, impacts and outcomes
Making sense of community engagement, impacts and outcomesMaking sense of community engagement, impacts and outcomes
Making sense of community engagement, impacts and outcomes
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
AI Driven Product Innovation
AI Driven Product InnovationAI Driven Product Innovation
AI Driven Product Innovation
 
AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19AI-driven product innovation: from Recommender Systems to COVID-19
AI-driven product innovation: from Recommender Systems to COVID-19
 
Yelp Dataset Challenge
Yelp Dataset ChallengeYelp Dataset Challenge
Yelp Dataset Challenge
 
Java andml may17-v1
Java andml may17-v1Java andml may17-v1
Java andml may17-v1
 
Euro 2015 prioritisation presentation
Euro 2015 prioritisation presentationEuro 2015 prioritisation presentation
Euro 2015 prioritisation presentation
 
SDNC13 -Day2- The subjective science of persona building by Stephen Masiclat
SDNC13 -Day2- The subjective science of persona building by Stephen Masiclat SDNC13 -Day2- The subjective science of persona building by Stephen Masiclat
SDNC13 -Day2- The subjective science of persona building by Stephen Masiclat
 
Tutorial: Context-awareness In Information Retrieval and Recommender Systems
Tutorial: Context-awareness In Information Retrieval and Recommender SystemsTutorial: Context-awareness In Information Retrieval and Recommender Systems
Tutorial: Context-awareness In Information Retrieval and Recommender Systems
 
IntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdfIntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdf
 

Recently uploaded

Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
ssuserdda66b
 

Recently uploaded (20)

Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 

Recommender Systems

  • 1. Recommender Systems Francesco Ricci Free University of Bozen-Bolzano fricci@unibz.it
  • 2. 2 Content p  Paradox of choice and information overload p  Personalization p  Recommender system p  Step 1: Preference elicitation p  Step 2: Preference prediction - rating estimation techniques n  Contextualization p  Step 3: Recommendations' presentation p  Issues and problems p  Questions
  • 3. Explosion of Choice p  A trip to a local supermarket: n  85 different varieties and brands of crackers. n  285 varieties of cookies. n  165 varieties of “juice drinks” n  75 iced teas n  275 varieties of cereal n  120 different pasta sauces n  80 different pain relievers n  40 options for toothpaste n  95 varieties of snacks (chips, pretzels, etc.) n  61 varieties of sun tan oil and sunblock n  360 types of shampoo, conditioner, gel, and mousse. n  90 different cold remedies and decongestants. n  230 soups, including 29 different chicken soups n  175 different salad dressings and if none of them suited, 15 extra-virgin olive oils and 42 vinegars and make one’s own
  • 4. New Domains for Choice p  Telephone Services p  Retirement Pensions p  Medical Care p  News p  Choosing how to work p  Choosing how to love p  Choosing how to be
  • 5. Choice and Well-Being p  We have more choice, more freedom, autonomy, and self determination p  It seems that increased choice improves well- being: n  added options can only make us better off: those who care will benefit, and those who do not care can always ignore the added options p  Various assessment of well-being have shown that increased affluence have accompanied by decreased well-being.
  • 6. Neuroscience and Information Overload p  Neuroscientists have discovered that unproductivity and loss of drive can result from decision overload p  Our brains (120 bits per second) are configured to make a certain number of decisions per day and once we reach that limit, we can’t make any more p  Information processing has a cost: we can have trouble separating the trivial from the important – this inf. processing makes us tired. 6
  • 7. 7 Information Overload p  Internet = information overload = having too much information to make a decision or remain informed about a topic p  To make a decision or remain informed about a topic you must perform exploratory search (e.g., comparison, knowledge acquisition, product selection, etc.) n  not aware of the range of available options n  may not know what to search n  if presented with some results may not be able to choose.
  • 8. Personalization p  “If I have 3 million customers on the Web, I should have 3 million stores on the Web” n  Jeff Bezos, CEO and founder, Amazon.com n  Degree in Computer Science n  $34.2 billion (net worth), ranked no. 15 in the Forbes list of the America's Wealthiest People 8
  • 10. Movie Recommendation – YouTube 10Recommendations account for about 60% of all video clicks from the home page.
  • 12. 12 The Long Tail p  Economic model in which the market for non-hits (typically large numbers of low-volume items) could be significant and sometimes even greater than the market for big hits (typically small numbers of high-volume items).
  • 13. Goal p  Recommend items that are good for you! n  relevant n  improve well being n  rational choices n  optimal 13
  • 14. Step 1: Preference Elicitation 14
  • 15. Last.fm – Preference Elicitation
  • 18. Remembering p  D. Kahneman (nobel prize): what we remember about an experience is determined by (peak-end rule) n  How the experience felt when it was at its peak (best or worst) n  How it felt when it ended p  We rely on this summary later to remind how the experience felt and decide whether to have that experience again p  So how well do we know what we want? n  It is doubtful that we prefer an experience to another very similar just because the first ended better. Bias of Remembered Utility 18
  • 19. Step 2: Model Building 19
  • 22. Item-to-Item Collaborative Filtering p  Suppose the prediction is made using two nearest- neighbors, and that the items most similar to “Titanic” are “Forrest Gump” and “Wall-E” p  wtitanic, forrest = 0.85 p  wtitanic, wall-e = 0.75 p  r*eric, titanic = (0.85*5 + 0.75*4)/(0.85 + 0.75) = 4.53 22 target neigh. neigh.
  • 23. 23 User-Based Collaborative Filtering p  A collection of n users U and a collection of m items I p  A n × m matrix of ratings rui , with rui = ? if user u did not rate item i p  Prediction for user u and item j is computed as p  Where, ru is the average rating of user u, K is a normalization factor such that the absolute values of wuv sum to 1, and wuv = (ruj −ru )(rvj −rv ) j∈Iuv ∑ (ruj −ru )2 (rvj −rv )2 j∈Iuv ∑j∈Iuv ∑ Pearson Correlation of users u and v [Breese et al., 1998] ruj * = ru + K wuv (rvj −rv ) v∈Nj (u) ∑ A set of neighbours of u that have rated j
  • 24. 24 Geared towards females Geared towards males serious escapist The Princess Diaries The Lion King Braveheart Lethal Weapon Independence Day AmadeusThe Color Purple Dumb and Dumber Ocean’s 11 Sense and Sensibility Gus Dave Latent Factor Models
  • 25. 25 Basic Matrix Factorization Model 45531 312445 53432142 24542 522434 42331users .2-.4.1 .5.6-.5 .5.3-.2 .32.11.1 -22.1-.7 .3.7-1 -.92.41.4.3-.4.8-.5-2.5.3-.21.1 1.3-.11.2-.72.91.4-1.31.4.5.7-.8 .1-.6.7.8.4-.3.92.41.7.6-.42.1 ~ ~ users items items A rank-3 approximation 12 items 6 users max 72 entries 12 x 3 entries 6 x 3 entries 54 total entries
  • 28. 28 Matrix factorization as a cost function Minp*,q* rui − pu T qi( ) 2 + λ pu 2 + qi 2" # $ % & ' ( )* + ,- known rui ∑ regularization- user-factors of u - item-factors of i - rating by u for iuir iq up •  Optimize by either stochastic gradient-descent or alternating least squares
  • 29. 29 “Core” Recommendation Techniques [Burke, 2007] U is a set of users I is a set of items/products
  • 30. 30 Content-Based Recommender with Centroid Interesting Documents Not interesting Documents Centroid User Model Doc1 Doc2 Doc1 is estimated more interesting than Doc2 Centroid politics sports
  • 31. Recommendations can be wrong p  Recommenders tend to recommend items similar to those browsed or purchased in the past 31
  • 32. Context-Aware Computing p  Gartner Top 10 strategic technology trends for IT p  Context-aware computing is a style of computing in which situational and environmental information about people, places and things is used to anticipate immediate needs and proactively offer enriched, situation-aware and usable content, functions and experiences. 32http://www.gartner.com/it-glossary/context-aware-computing-2
  • 34. Types of Context - Mobile p  Physical context n  time, position, and activity of the user, weather, light, and temperature ... p  Social context n  the presence and role of other people around the user p  Interaction media context n  the device used to access the system and the type of media that are browsed and personalized (text, music, images, movies, …) p  Modal context n  The state of mind of the user, the user’s goals, mood, experience, and cognitive capabilities. 34 [Fling, 2009]
  • 35. Factors influencing Holiday Decision Decision Personal Motivators Personality Disposable Income Health Family commitments Past experience Works commitments Hobbies and interests Knowledge of potential holidays Lifestyle Attitudes, opinions and perceptions Internal to the tourist External to the tourist Availability of products Advice of travel agents Information obtained from tourism organization and media Word-of-mouth recommendations Political restrictions: visa, terrorism, Health problems Special promotion and offers Climate [Swarbrooke & Horner, 2006]
  • 36. q  Only ratings acquired in exactly the same context are used q  Hypothesis: pre-filtering can be enhanced by exploiting semantic similarities between contexts Traditional contextual pre-filtering 36     "sunny" ratings in-context ratings Ratings filtering Prediction model target context predicted rating
  • 37. Distributional semantics of context 37 p  Assumption: two contexts are similar if their composing conditions influence ratings similarly Condition User1 User2 User3 User4 User5 User6 User7 1 -0.7 0 0.9 0.1 -0.6 0 0.7 -0.8 0.5 0.8 0.4 -0.2 0 -0.5 0.7 0.2 -1 0.9 0.8 0.5
  • 38. Semantic contextual pre-filtering 38 q  Key idea: reuse ratings acquired in similar contexts "similar context" ratings Ratings filtering Prediction model ≈ ≠ semantic similarities in-context ratings target context predicted rating
  • 39. Semantic Pre-Filtering vs. State of the art 39 0%   2%   4%   6%   8%   10%   12%   14%   16%   18%   Tourism   Music   Adom   Comoda   Movie   Library   Semantic Pre-Filtering UI-Splitting CAMF % = MAE (mean absolute error) reduction with respect to a context-free Matrix Factorization model (the higher, the better)
  • 40. South Tyrol Suggest (STS) •  A mobile Android context-aware RS that recommends places of interests (POIs) from a total of 27,000 POIs in South Tyrol region •  STS computes rating predictions for all POIs using the personality of the users, the ratings, and 14 contextual factors, such as: weather forecast, mood, budget, and travel goal. Neuroticism Conscientious- ness Openness ExtraversionAgreeableness Big Five Personality Traits
  • 41. Food Advisor for a Family
  • 42. Step 3: Recommendation Presentation 42
  • 43. Colnago Ferrari Anchoring p  How do we determine what is reasonable to spend for a race bicycle? n  In an online shop that presents only bicycles costing over 3.000E we may believe that 1.500 is not enough, or that a bicycle at that price will be a bargain n  Even if nobody will select the highest-priced models, the shop can reap benefits from listing them – people is induced to buy the cheaper (but still expensive) ones. Context increases Expected Utility 43
  • 44. Dissatisfaction because of opportunity costs p  A study in which people were asked how much they would be willing to pay for subscriptions to magazines [Brenner, Rottenstreich,& Sood, 1999]. n  Some were asked about individual magazines or videos n  Others were asked about these same items as part of a group with other magazines or videos p  Respondents placed a higher value on the magazine or the video when they were evaluating it in isolation n  If evaluated as part of a group, opportunity costs associated with the other options reduce the value of each of them. Context decreases Expected Utility 44
  • 45. ReRex 45 Context increases Expected Utility Context used to differentiate options and decrease opportunity cost.
  • 46. Problems and Issues p  Cold Start (new user and new item) - old items are less interesting p  Learning to interact p  Measuring p  Filter Bubble p  How much to personalize p  When to contextualize p  How to deliver contextualized content? p  Multiple devices (synchronization) 46
  • 47. 47New edition is coming in 2015 Questions?