Recommender Systems –
Collaborative Filtering
Prof. Rajib L. Saha
Online Recommender Systems: collaborative
filtering (different from Association Rules!)
Collaborative Filtering
• If Person A has the same opinion as
Person B on an issue, A is more likely
to share B's opinion on a different
issue x than to share the opinion of a
randomly chosen person
Traditional Collaborative
Filtering
• Customer as a p-dimensional vector of
items
– p: the number of distinct catalog items
– Components: bought (1)/not bought (0);
ratings; rated (1)/not rated(0).
• Find Similarity between Customers A
& B
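As a minimal sketch of the vector representation described above, the snippet below turns purchase histories into bought (1)/not bought (0) vectors. The catalog, the customer names, and the helper `to_vector` are hypothetical and chosen only for illustration.

```python
# Hypothetical catalog and purchase histories, for illustration only.
catalog = ["book", "dvd", "lamp", "mug"]  # p = 4 distinct catalog items
purchases = {
    "A": {"book", "dvd", "mug"},
    "B": {"book", "dvd"},
}

def to_vector(bought, catalog):
    """Return a 0/1 vector with one component per catalog item."""
    return [1 if item in bought else 0 for item in catalog]

vectors = {name: to_vector(items, catalog) for name, items in purchases.items()}
print(vectors["A"])  # [1, 1, 0, 1]
print(vectors["B"])  # [1, 1, 0, 0]
```

The same layout works for ratings: replace the 1 with the rating value and keep 0 (or a missing marker) for unrated items.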
[Figure: matrix of n customers × p items. These entries could be numbers (ratings) as well.]
Information on rating along with the information
on whether items were rated or not rated can be
better than either of them alone. Example: Netflix
Recommendation
Cosine-based similarity
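$\cos(A,B) = \dfrac{A \cdot B}{|A|\,|B|}$
• A: $(a_1, a_2, \dots, a_N)$
• B: $(b_1, b_2, \dots, b_N)$
• $A \cdot B = a_1 b_1 + a_2 b_2 + \dots + a_N b_N$
• $|A| = (a_1^2 + a_2^2 + \dots + a_N^2)^{1/2}$; $|B| = (b_1^2 + b_2^2 + \dots + b_N^2)^{1/2}$
Example:
Item  1  2  3  4
A     3  5  0  1
B     1  4  0  0
$\cos(A,B) = \dfrac{3 \cdot 1 + 5 \cdot 4 + 0 \cdot 0 + 1 \cdot 0}{(3^2+5^2+0^2+1^2)^{1/2}\,(1^2+4^2+0^2+0^2)^{1/2}} = 0.94$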
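The cosine formula can be checked with a few lines of Python. This is a sketch using only the standard library; the function name `cosine_similarity` and the choice to return 0.0 for an all-zero vector are assumptions, not part of the slides.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length rating vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # assumption: an all-zero vector is similar to nothing
    return dot / (norm_a * norm_b)

A = [3, 5, 0, 1]
B = [1, 4, 0, 0]
print(round(cosine_similarity(A, B), 2))  # 0.94
```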
Correlation-based Similarity
Item  1  2  3  4
A     3  5  0  1
B     1  4  0  0
$\mathrm{Corr}_{AB} = \dfrac{\mathrm{Covariance}(A,B)}{\mathrm{Stdev}(A) \cdot \mathrm{Stdev}(B)}$
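A sketch of the correlation measure for the same two customers follows. It treats every component, including the zeros, as a rating, which is an assumption about how the example is meant to be read; the function name `correlation` is also illustrative.

```python
import math

def correlation(a, b):
    """Pearson correlation: covariance divided by the product of standard deviations."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # assumption: a constant vector has no defined correlation
    # The 1/n factors of covariance and variances cancel in the ratio.
    return cov / math.sqrt(var_a * var_b)

A = [3, 5, 0, 1]
B = [1, 4, 0, 0]
print(round(correlation(A, B), 2))  # 0.93
```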
[Figure: matrix of n customers × p items]
While computing similarity between Persons 1 & 2, Item 2’s
rating cannot be included, since Person 2 hasn’t bought Item 2.
For binary data (bought/didn’t buy), it is NOT an issue.
Normalizing Ratings
• Multiply the vector components by the
inverse frequency
• Inverse frequency: the inverse of the
number of customers who have
purchased or rated the item
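The inverse-frequency weighting can be sketched as below. Customers A and B reuse the example vectors from the similarity slides; customer C is hypothetical, and treating 0 as "not rated" is an assumption.

```python
# A and B are the example customers; C is a hypothetical third customer.
ratings = {
    "A": [3, 5, 0, 1],
    "B": [1, 4, 0, 0],
    "C": [0, 2, 0, 0],
}
num_items = 4

# Number of customers who purchased or rated each item (0 is read as "not rated").
counts = [sum(1 for row in ratings.values() if row[j] != 0) for j in range(num_items)]

# Inverse frequency; items nobody rated get weight 0 to avoid dividing by zero.
inverse_freq = [1 / c if c > 0 else 0.0 for c in counts]

weighted = {
    name: [value * weight for value, weight in zip(row, inverse_freq)]
    for name, row in ratings.items()
}
print(counts)         # [2, 3, 0, 1]
print(weighted["B"])  # item 2, rated by everyone, is down-weighted the most
```

The effect is that agreement on a rarely rated item counts for more than agreement on an item almost everyone has rated.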
Other measures …
• Find Nearest Neighbor(s) based on
distance (dissimilarity)
– Can use other Distance measures to
identify neighbors
• Euclidean distance
= $\sqrt{(3-1)^2 + (5-4)^2 + (0-0)^2 + (1-0)^2}$
• Manhattan distance
= $|3-1| + |5-4| + |0-0| + |1-0|$
Item  1  2  3  4
A     3  5  0  1
B     1  4  0  0
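Both distances can be evaluated for the example vectors with a short sketch; the function names are illustrative.

```python
import math

def euclidean(a, b):
    """Square root of the summed squared component differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    """Sum of the absolute component differences."""
    return sum(abs(x - y) for x, y in zip(a, b))

A = [3, 5, 0, 1]
B = [1, 4, 0, 0]
print(round(euclidean(A, B), 2))  # 2.45
print(manhattan(A, B))            # 4
```

Unlike cosine and correlation, these are dissimilarities: the nearest neighbors are the customers with the smallest values.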
Once similar, what item(s) to
recommend?
• The item that hasn’t been bought by the
user yet
• You may create a list of multiple items to
be considered for recommendation
• Finally, recommend the item he/she is
most likely to buy
– Rank each item according to how many
similar customers purchased it
– Or by how many customers rated it
– Or by highest rating
– Or by some other popularity criterion
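One way to sketch the first ranking rule, counting how many similar customers purchased each item the user has not bought yet, is shown below. The baskets and the function `recommend` are hypothetical; ties are broken alphabetically only to keep the output deterministic.

```python
from collections import Counter

def recommend(target_items, neighbor_baskets, top_k=2):
    """Rank items the target user lacks by how many similar customers bought them."""
    counts = Counter()
    for basket in neighbor_baskets:
        for item in basket - target_items:  # skip items already bought
            counts[item] += 1
    ranked = sorted(counts.items(), key=lambda pair: (-pair[1], pair[0]))
    return [item for item, _ in ranked[:top_k]]

# Hypothetical data: the target user's purchases and three similar customers.
target = {"book", "dvd"}
neighbors = [{"book", "dvd", "mug"}, {"book", "mug", "lamp"}, {"dvd", "mug"}]
print(recommend(target, neighbors))  # ['mug', 'lamp']
```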
Long Tail
Supply-side drivers:
• Centralized warehousing with more offerings
• Lower inventory cost of electronic products
Demand-side drivers:
• Search engines
• Recommender systems
Negatives
• Memory-based / Lazy-learning
– When does the recommendation engine
compute the “recommendation”?
• Computation-intensive
– Recall how it computes the
“recommendation”: $n^2$ similarities
How to reduce computation?
• Randomly sample customers
• Discard infrequent buyers
• Discard items that are very popular or
very unpopular
• Clustering can reduce the number of rows
• PCA can reduce the number of columns
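The first three reductions can be sketched on a small 0/1 purchase matrix. Everything here is an assumption made for illustration: the data, the function names, and the thresholds (`min_purchases`, `low`, `high`). Clustering and PCA are not shown.

```python
import random

def drop_infrequent_buyers(matrix, min_purchases=2):
    """Keep only customers with at least min_purchases purchases."""
    return {c: row for c, row in matrix.items() if sum(row) >= min_purchases}

def sample_customers(matrix, sample_size, seed=0):
    """Randomly keep sample_size customers (all of them if fewer exist)."""
    if sample_size >= len(matrix):
        return dict(matrix)
    rng = random.Random(seed)
    chosen = rng.sample(sorted(matrix), sample_size)
    return {c: matrix[c] for c in chosen}

def items_to_keep(matrix, low=0.1, high=0.9):
    """Column indices of items bought by a share of customers within [low, high]."""
    n_customers = len(matrix)
    n_items = len(next(iter(matrix.values())))
    keep = []
    for j in range(n_items):
        share = sum(row[j] for row in matrix.values()) / n_customers
        if low <= share <= high:
            keep.append(j)
    return keep

# Hypothetical purchase matrix: 4 customers x 4 items.
matrix = {"u1": [1, 1, 0, 1], "u2": [1, 0, 0, 0], "u3": [1, 1, 1, 0], "u4": [1, 0, 1, 1]}
print(sorted(drop_infrequent_buyers(matrix)))  # ['u1', 'u3', 'u4']
print(items_to_keep(matrix))                   # [1, 2, 3]
```

Item 0 is dropped because every customer bought it, so it carries no information for telling customers apart.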
Runtime vs. Quality of
Recommendation
• Recommend while the customer is
browsing
vis-à-vis
• Recommend better but later
Search-based Methods
• Based on previous purchases
– Books of the same/similar authors
– DVD titles of the same director
– Products that are identified by similar
keywords
A more sophisticated variation of the search-
based methods:
Item-to-Item Collaborative Filtering
• Cosine similarity among items
– Item being the vector
– Customers as components of the vector
• Correlation similarity among items
– Correlation of ratings of Items I & J
where users rated both I & J
While computing
similarity between Items
1 & 2, Person 2’s rating
cannot be included,
since Person 2 hasn’t
bought Item 2
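The co-rating restriction can be sketched as follows: each item is a vector over customers, and only customers who rated both items contribute. The ratings, the identifiers (`p1`, `i1`, ...), and the function names are hypothetical; cosine is used here, and correlation could be substituted over the same pairs.

```python
import math

# Hypothetical ratings: person -> {item: rating}; a missing key means "not rated".
ratings = {
    "p1": {"i1": 5, "i2": 4},
    "p2": {"i1": 3},            # has not rated i2, so p2 is left out
    "p3": {"i1": 1, "i2": 2},
    "p4": {"i1": 4, "i2": 5},
}

def co_rated(ratings, item_i, item_j):
    """Rating pairs from the people who rated both items."""
    return [(r[item_i], r[item_j]) for r in ratings.values()
            if item_i in r and item_j in r]

def item_cosine(ratings, item_i, item_j):
    """Cosine similarity between two items over their co-rating customers."""
    pairs = co_rated(ratings, item_i, item_j)
    if not pairs:
        return 0.0  # assumption: no common raters means no evidence of similarity
    dot = sum(x * y for x, y in pairs)
    norm_i = math.sqrt(sum(x * x for x, _ in pairs))
    norm_j = math.sqrt(sum(y * y for _, y in pairs))
    return dot / (norm_i * norm_j)

print(len(co_rated(ratings, "i1", "i2")))          # 3
print(round(item_cosine(ratings, "i1", "i2"), 2))  # 0.97
```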
Scalability and Performance of
Item-to-Item Collaborative Filtering
• Computation-expensive, however,
similar-items table is computed offline
• Online component: lookup similar
items for the user’s purchases &
ratings
• Dependent only on how many titles
the user has purchased or rated
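The offline/online split might look like the sketch below: the similar-items table is built once from all baskets, and serving a user is a lookup over that user's own items. The baskets and function names are hypothetical, and binary purchase vectors with cosine similarity are assumed.

```python
import math
from collections import defaultdict
from itertools import combinations

def build_similar_items(baskets):
    """Offline step: cosine similarity between binary item vectors over customers."""
    buyers = defaultdict(set)
    for customer, items in baskets.items():
        for item in items:
            buyers[item].add(customer)
    table = defaultdict(list)
    for i, j in combinations(sorted(buyers), 2):
        common = len(buyers[i] & buyers[j])
        if common == 0:
            continue
        sim = common / math.sqrt(len(buyers[i]) * len(buyers[j]))
        table[i].append((j, sim))
        table[j].append((i, sim))
    for item in table:
        table[item].sort(key=lambda pair: (-pair[1], pair[0]))
    return dict(table)

def recommend(user_items, table, top_k=2):
    """Online step: look up neighbors of the user's items and sum their similarities."""
    scores = defaultdict(float)
    for item in user_items:
        for other, sim in table.get(item, []):
            if other not in user_items:
                scores[other] += sim
    return sorted(scores, key=lambda it: (-scores[it], it))[:top_k]

# Hypothetical baskets.
baskets = {
    "u1": {"book", "dvd"},
    "u2": {"book", "dvd", "mug"},
    "u3": {"dvd", "mug"},
    "u4": {"book", "lamp"},
}
table = build_similar_items(baskets)
print(recommend({"book"}, table))  # ['dvd', 'lamp']
```

The online cost depends on the number of items the user has purchased or rated, not on the number of customers.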
Disadvantages of Item-based
Collaborative Filtering
• Less diversity between items, compared
to users’ taste, therefore the
recommendations are often obvious
• For a user who purchased a popular item,
association rules and item-based
collaborative filtering might yield the
same recommendation, whereas the
user-based recommendation will likely
differ.
Association Rules vs. Recommender Systems
Market basket analysis
(Association rules)
• Finds many baskets
that contain the same
items
• Need lots of baskets
• Used for generating
impersonal, common
strategies
• Useful for large
physical stores (chains,
supermarkets)
Recommender Systems
(Collaborative filtering)
• Finds items which
have a large fraction of
their baskets in
common
• Number of baskets
unimportant
• Used for
personalization
• Useful for online
recommendation (long-
tail)
A Critical Limitation of
Collaborative Filtering
• Cold Start:
– How to create recommendation for new
users
– How about new items
How to address Cold Start?
• Approaches to address cold start with new users:
– Popular items (get quick reaction of the users)
– Demographically relevant items
– Browsing history
– Secondary source of data --- social network,
subscription
– Netflix – start with rating a few movies
• Approaches to address cold start with new items:
– Recommend to random users/ or some selective
users based on certain criteria
– How about offering the product to influential people
in the social network
Issues with Rating Matrix-based
Recommender Systems
• Person i likes Adventure Movie 1 (AM1), but has never
watched or rated Adventure Movie 2 (AM2); whereas
Person j has watched AM2 but not watched/rated AM1.
• What happens to the similarity between Persons i & j?
– The ratings of AM1 and AM2 are not even included while
computing similarity!
• Rating matrices are huge, and usually sparse. What is
the implication on computational burden on dealing
with the rating matrix?
Recommendation using SVD
$R = U \Sigma V^T$
$R_{N \times n}$: Rating matrix
$U_{N \times r}$: User-feature matrix
$V_{n \times r}$: Item-feature matrix
Dealing with New Users
$R = U \Sigma V^T$
• $r_i$ = ith row of rating matrix = item ratings of
user i
• $u_i$ = ith row of user-feature matrix = feature
ratings of user i
• $r_i = u_i \Sigma V^T$ {dimension: $1 \times n = (1 \times r)(r \times r)(r \times n)$}
• $r_i V = u_i \Sigma V^T V = u_i \Sigma$
• $r_i V \Sigma^{-1} = u_i \Sigma \Sigma^{-1} = u_i$
• $u_{new} = r_{new} V \Sigma^{-1}$
– Let the new users rate a few items and use those
partial ratings to compute feature ratings
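The fold-in formula $u_{new} = r_{new} V \Sigma^{-1}$ can be illustrated without a linear-algebra library. The matrices below are hand-picked hypothetical values (n = 2 items, r = 2 features), not the output of a real decomposition; V has orthonormal columns so that $V^T V = I$ holds as the derivation requires. The sketch also assumes the new user's vector has no missing entries left.

```python
# Hypothetical factors, hand-picked so that V^T V = I (not from a real SVD).
V = [[0.6, -0.8],
     [0.8,  0.6]]     # n x r item-feature matrix
sigma = [5.0, 2.0]    # diagonal entries of Sigma

def fold_in(r_new, V, sigma):
    """Feature ratings of a new user: u_new = r_new V Sigma^{-1}."""
    n_features = len(sigma)
    projected = [sum(r_new[i] * V[i][k] for i in range(len(r_new)))
                 for k in range(n_features)]
    return [projected[k] / sigma[k] for k in range(n_features)]

def reconstruct(u, V, sigma):
    """Item ratings implied by feature ratings: r = u Sigma V^T."""
    return [sum(u[k] * sigma[k] * V[j][k] for k in range(len(sigma)))
            for j in range(len(V))]

r_new = [2.2, 4.6]                   # item ratings supplied by the new user
u_new = fold_in(r_new, V, sigma)     # approximately [1.0, 0.5]
print([round(x, 6) for x in u_new])
print([round(x, 6) for x in reconstruct(u_new, V, sigma)])
```

Reconstructing from `u_new` recovers `r_new` here only because r equals n in this toy case; with r < n the reconstruction is an approximation.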
Dealing with missing values
before applying SVD
• Impute the missing values in the
rating matrix with the user mean or the
item mean
• If the rating matrix is already
normalized (mean-subtracted),
missing values can simply be set to zero
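Both options can be sketched in a few lines. The matrix `R`, the use of `None` as the missing marker, and the function names are assumptions; user means are shown, and item means would work the same way over columns.

```python
def impute_with_user_mean(matrix):
    """Replace each missing rating (None) with that user's mean rating."""
    filled = []
    for row in matrix:
        known = [v for v in row if v is not None]
        mean = sum(known) / len(known) if known else 0.0
        filled.append([mean if v is None else v for v in row])
    return filled

def mean_center(matrix):
    """Subtract each user's mean; missing ratings become 0 after centering."""
    centered = []
    for row in matrix:
        known = [v for v in row if v is not None]
        mean = sum(known) / len(known) if known else 0.0
        centered.append([0.0 if v is None else v - mean for v in row])
    return centered

# Hypothetical rating matrix: 2 users x 3 items, None = not rated.
R = [[5, None, 3],
     [None, 2, 4]]
print(impute_with_user_mean(R))  # [[5, 4.0, 3], [3.0, 2, 4]]
print(mean_center(R))            # [[1.0, 0.0, -1.0], [0.0, -1.0, 1.0]]
```

The two outputs are consistent: a zero in the centered matrix corresponds to the user's mean in the original scale.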
Vulnerability of Recommender
Systems
• Recommender accuracy and neutrality may be
compromised
– Malicious users may try to push or kill a product
by using fake accounts
– Inconsistent ratings
– Some mechanism to establish integrity is necessary
• Privacy of users’ ratings or preferences can be
compromised
– In some systems it may be desired that users do not
get to know each other’s opinions
– A less transparent algorithm is less vulnerable
to hacking
– SVD decomposition
Amazon Recommender System
• Non-personal, based on sales statistics
– Best sellers, promotional, etc.
• Recommendations based on browsing
history, personal
• Recommendations based on Association
Rules, non-personal
• Personalized recommendation: Sign in >
Account > Your Recommendations
• Personalized recommendation over email
Netflix Recommender System
• How did Cinematch work?
• How did Cinematch add value to
Netflix?
• How did Netflix create an advantage
for itself through the Long Tail effect?
• Blockbuster’s data disadvantage / the
competitive advantage of Netflix
• Opportunities for Netflix in the
Video-On-Demand market
