SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
Considerations on
Recommendation Independence
for a Find-Good-Items Task
Toshihiro Kamishima* and Shotaro Akaho*

http://www.kamishima.net/
*National Institute of Advanced Industrial Science and Technology (AIST), Japan
Workshop on Responsible Recommendation (FATREC)

in conjunction with RecSys2017 @ Como, Italy, Aug. 31, 2017
1
START
This workshop paper
A preliminary experiment of applying a regularization approach to
a recommender for a find-good-items task

Independence of a preference score was enhanced, but that of
relevance was not

Errors in AUC was not worsen by enhancing independence
This Talk
2
Our previous work
We advocated a concept of recommendation independence

We developed two types of approaches to enhance
recommendation independence for a predicting-ratings task

a regularization approach using a constraint term

a model-based approach using a special graphical model
Sensitive Feature
3
S : sensitive feature
This represents information to be ignored in a recommendation
process

Its values are determined depending on a user and/or an item
As in a case of standard recommendation, we use random variables

X: a user, Y: an item, and R: a recommendation outcome
A sensitive feature is restricted to a binary type
We adopt a variable required for recommendation independence
Ex. Sensitive feature = movie’s popularity / user’s gender
Recommendation Independence
4
No information about a sensitive feature influences the outcome

The status of the sensitive feature is explicitly excluded from the
inference of the recommendation outcome
Recommendation Independence
the statistical independence
between a recommendation outcome, R, and a sensitive feature, S
Recommendation outcomes are predicted

under the constraint of recommendation independence
[Kamishima 12, Kamishima 13]
Pr[R | S] = Pr[R]
R ⫫ S
≡
dislike like
Effect of Independence Enhancement
5
Standard Independence-enhanced
two distributions are
largely diverged
two distributions
become closer
The bias that older movies were rated higher
could be successfully canceled by enhancing independence
dislike like
✽ each bin of histograms of predicted scores for older and newer movies
a sensitive feature = whether a movie is newer or older
6
Applications
Application
Adherence to Laws and Regulations
7
A recommendation service must be managed
while adhering to laws and regulations
suspicious placement keyword-matching advertisement
Advertisements indicating arrest records were more frequently
displayed for names that are more popular among individuals of
African descent than those of European descent

↓

Socially discriminative treatments must be avoided
sensitive feature = users’ demographic information
↓
Legally or socially sensitive information

can be excluded from the inference process of recommendation
[Sweeney 13]
Application
Fair Treatment of Content Providers
8
System managers should fairly treat their content providers
The US FTC has investigated Google to determine whether the search
engine ranks its own services higher than those of competitors
Fair treatment in search engines
sensitive feature = a content provider of a candidate item
↓

Information about who provides a candidate item can be ignored,

and providers are treated fairly
Fair treatment in recommendation
A hotel booking site should not abuse their position to recommend
hotels of its group company
[Bloomberg]
Application
Exclusion of Unwanted Information
9
sensitive feature = a political conviction of a friend candidate
↓

Information about whether a candidate is conservative or progressive

can be ignored in a recommendation process
Filter Bubble: To fit for Pariser’s preference, conservative people are
eliminated from his friend recommendation list in FaceBook
Information unwanted by a user is excluded from recommendation
[TED Talk by Eli Pariser, http://www.filterbubble.com/]
10
Independence-Enhanced
Recommendation Algorithms
Independence-Enhanced
Recommendation Task
11
Predicting Ratings: a task to predict a rating value that a user
would provide to an item
Dataset
Random variables: user X, item Y, rating R, sensitive feature S
Prediction Function
Dataset
Prediction Function
Standard Recommendation Independence-Enhanced Rec.
D = {(xi, yi, ri)} D = {(xi, yi, ri, si)}
Çr(x, y) Çr(x, y, s)
[Kamishima+ 12, Kamishima+ 13, Kamishima+ 14, Kamishima+16]
Çr(x, y) = + bx + cy + pxqÒ
y
Probabilistic Matrix Factorization
12
Probabilistic Matrix Factorization Model
predict a preference rating of an item y rated by a user x

well-performed and widely used
[Salakhutdinov 08, Koren 08]
For a given training dataset, model parameters are learned by
minimizing the squared loss function with an L2 regularizer.
cross effect of

users and itemsglobal bias
user-dependent bias item-dependent bias
≥
D (ri * Çr(xi, yi))2
+ Ò⇥Ò2
Prediction Function
Objective Function
L2 regularizer
regularization parameter
squared loss function
≥
D ri * Çr(xi, yi, si)
2
* ⌘ indep(R, S) + Ò⇥Ò2
Independence-Enhaned PMF
13
a prediction function is selected according to a sensitive value
sensitive feature
Çr(x, y, s) = (s)
+ b(s)
x + c(s)
y + p(s)
x q(s)
y
Ò
Prediction Function
Objective Function independence parameter: control the balance
between the independence and accuracy
independence term: a regularizer to constrain independence

The larger value indicates that ratings and sensitive values are more
independent 

Matching means of predicted ratings for two sensitive values
[Kamishima+ 13]
For a Find-Good-Items Task
14
cross-entropy loss
Find Good Items: a task to find some items preferred by a user

↓

making a preference score independent, instead of a predicted rating
≥
D CE ri * Çr(xi, yi, si) * ⌘ indep(R, S) + Ò⇥Ò2
Çr(x, y, s) = sig (s)
+ b(s)
x + c(s)
y + p(s)
x q(s)
y
Ò
Preference Score: How strongly a user prefers an item
enhancing the independence between
a preference score and a sensitive feature
sigmoid function
[Kamishima+ 17]
Preference Score vs Sensitive Feature
15
Observation 1: A preference score could be successfully made
independent from a sensitive feature
Observation 2: A ranking accuracy (AUC) did not worsen so
much by enhancement of the recommendation independence

This is was contrasted with the increase of a prediction error (MAE)
in a predicting ratings task
AUC
η0.70
0.75
0.80
0.85
0.90
10
−2
10
−1
1 10
1
10
2
KS
η0
0.05
0.10
0.15
0.20
10
−2
10
−1
1 10
1
10
2
Kolmogolov-Smirnov statistic

between Pr[R | S=0] and Pr[R | S=1]
ranking accuracy (AUC)
[Kamishima+ 17]
Relevance and Sensitive Feature
16
1.0 0.9 0.7 0.10.30.51.0
Recommending top-k items whose preference scores are the largest
sort according to their preference scores
select top-k items
irrelevant itemsrelevant items
check the independence from a relevance, not from a preference score
Observation 3: The relevance of items was not independent
from a sensitive feature for some values of k, in particular, small k

↓

A need for a new method that fits for a ranked item list
[Kamishima+ 17]
Conclusions
17
Contributions
We advocated a notion of recommendation independence and
developed methods to enhance it

We tested a preliminary approach to enhance independence, but it
was not effective for a ranked item list 

Future work
Developing an independence-enhancement method being fit for a
ranked item list

Acknowledgment
We would like to thank for providing datasets for the Grouplens research lab

This work is supported by MEXT/JSPS KAKENHI Grant Number JP24500194,
JP15K00327, and JP16H02864

Survey Slide of Fairness-Aware Data Mining
http://www.kamishima.net/archive/fadm.pdf
18
Extra Slides
Prediction:
Latent Class Model
19
[Hofmann 99]
z
y
x
r
Latent Class Model: A probabilistic model for collaborative filtering
A basic topic model, pLSA
extended so as to be able to deal with 

ratings r given by users x to items y
Çr(x, y) = EPr[rx,y][level(r)]
=
≥
r Pr[rx, y] level(r)
the r-th rating value
A rating value can be predicted by the expectation of ratings
Model parameters can be learned by an EM algorithm
latent topic variable
Independence-Enhanced LCM
20
z
y
x
r
s
z
y
x
r
s
Independence-Enhancement by a Model-based Approach
A sensitive variable is embedded into the original LCM

A rating and a sensitive variable are mutually independent

A user, an item, and a rating are conditionally independent given Z
A type 2 model can more strictly enhance recommendation independence,
because in addition to X and Y, Z depends on a sensitive variable
Type 1 model Type 2 model
[Kamishima+ 16]
Popularity Bias
21
[Celma 08]
Popularity Bias

the tendency for popular items to be recommended more frequently
Flixster data
The degree popularity of an item is measured

by the number of users who rated the item

Short-head items are frequently and highly rated
[Jamali+ 10]
long-tail (bottom 99%)
share in ratings: 52.8%

mean rating: 3.53
short-head (top 1%)
share in ratings: 47.2%

mean rating: 3.71
sensitive feature = popularity of items
↓

Popularity bias can be corrected
Kolmogorov-Smirnov Statistic
22
The statistic of the two-sample Kolmogorov-Smirnov test
a nonparametric test for the equality of two distribution

↓

Evaluating the degree of independence

by measuring the equality between Pr[R | S=0] and Pr[R | S=1]
Kolmogorov-Smirnov statistic
the area between two empirical
cumulative distributions
Wikipedia
Recommendation Diversity
23
[Ziegler+ 05, Zhang+ 08, Latha+ 09, Adomavicius+ 12]
Recommendation Diversity
Similar items are not recommended in a single list, to a single user,
to all users, or in a temporally successive lists
recommendation list
similar items
excluded
Diversity
Items that are similar in a
specified metric are excluded
from recommendation results

The mutual relations
among results
Independence
Information about a sensitive
feature is excluded from
recommendation results

The relations between
results and sensitive values
Diversity vs Independence
24
Diversity
Depending on the definition of
similarity measures
Independence
Depending on the specification of
sensitive feature
Similarity
A function of two items
Sensitive Feature
A function of a user-item pair
Because a sensitive feature depends on a user, neutrality can be
applicable for coping with users’ factor, such as, users’ gender or age,
which cannot be straightforwardly dealt by using diversity
Diversity vs Independence
25
short-head
long-tail
short-head
long-tail
standard recommendation diversified recommendation
Because a set of recommendations are diversified by abandoning
short-head items, predicted ratings are still biased
Prediction ratings themselves are unbiased by enhancing
recommendation independence
Privacy-preserving Data Mining
26
recommendation results, R, and sensitive features, S,

are statistically independent
In a context of privacy-preservation
Even if the information about R is disclosed,
the information about S will not exposed
mutual information between a recommendation result, R,

and a sensitive feature, S, is zero

I(R; S) = 0
In particular, a notion of the t-closeness has strong connection

Contenu connexe

Tendances

Fairness-aware Classifier with Prejudice Remover Regularizer
Fairness-aware Classifier with Prejudice Remover RegularizerFairness-aware Classifier with Prejudice Remover Regularizer
Fairness-aware Classifier with Prejudice Remover RegularizerToshihiro Kamishima
 
GTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGrubhubTech
 
Information Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesInformation Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesDaniel Valcarce
 
Boston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsBoston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsJames Kirk
 
Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system Mauryasuraj98
 
Movies recommendation system in R Studio, Machine learning
Movies recommendation system in  R Studio, Machine learning Movies recommendation system in  R Studio, Machine learning
Movies recommendation system in R Studio, Machine learning Mauryasuraj98
 
Module 6: Ensemble Algorithms
Module 6:  Ensemble AlgorithmsModule 6:  Ensemble Algorithms
Module 6: Ensemble AlgorithmsSara Hooker
 
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021Mirko Marras
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introductionLiang Xiang
 
Module 2: Machine Learning Deep Dive
Module 2:  Machine Learning Deep DiveModule 2:  Machine Learning Deep Dive
Module 2: Machine Learning Deep DiveSara Hooker
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemMilind Gokhale
 
Modern Recommendation for Advanced Practitioners part2
Modern Recommendation for Advanced Practitioners part2Modern Recommendation for Advanced Practitioners part2
Modern Recommendation for Advanced Practitioners part2Flavian Vasile
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesAlan Said
 
movieRecommendation_FinalReport
movieRecommendation_FinalReportmovieRecommendation_FinalReport
movieRecommendation_FinalReportSohini Sarkar
 
Movie recommendation project
Movie recommendation projectMovie recommendation project
Movie recommendation projectAbhishek Jaisingh
 
[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...
[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...
[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...YONG ZHENG
 
Anomaly Detection
Anomaly DetectionAnomaly Detection
Anomaly Detectionguest0edcaf
 
Module 3: Linear Regression
Module 3:  Linear RegressionModule 3:  Linear Regression
Module 3: Linear RegressionSara Hooker
 
Movie recommender system using the user's psychological profile
Movie recommender system using the user's psychological profileMovie recommender system using the user's psychological profile
Movie recommender system using the user's psychological profileUniversity Politehnica Bucharest
 

Tendances (20)

Fairness-aware Classifier with Prejudice Remover Regularizer
Fairness-aware Classifier with Prejudice Remover RegularizerFairness-aware Classifier with Prejudice Remover Regularizer
Fairness-aware Classifier with Prejudice Remover Regularizer
 
GTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerceGTC 2021: Counterfactual Learning to Rank in E-commerce
GTC 2021: Counterfactual Learning to Rank in E-commerce
 
Information Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesInformation Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slides
 
Boston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsBoston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender Systems
 
Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system
 
Movies recommendation system in R Studio, Machine learning
Movies recommendation system in  R Studio, Machine learning Movies recommendation system in  R Studio, Machine learning
Movies recommendation system in R Studio, Machine learning
 
Module 6: Ensemble Algorithms
Module 6:  Ensemble AlgorithmsModule 6:  Ensemble Algorithms
Module 6: Ensemble Algorithms
 
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
Tutorial on Advances in Bias-aware Recommendation on the Web @ WSDM 2021
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introduction
 
Module 2: Machine Learning Deep Dive
Module 2:  Machine Learning Deep DiveModule 2:  Machine Learning Deep Dive
Module 2: Machine Learning Deep Dive
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Modern Recommendation for Advanced Practitioners part2
Modern Recommendation for Advanced Practitioners part2Modern Recommendation for Advanced Practitioners part2
Modern Recommendation for Advanced Practitioners part2
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
 
movieRecommendation_FinalReport
movieRecommendation_FinalReportmovieRecommendation_FinalReport
movieRecommendation_FinalReport
 
Movie recommendation project
Movie recommendation projectMovie recommendation project
Movie recommendation project
 
[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...
[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...
[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Anomaly Detection
Anomaly DetectionAnomaly Detection
Anomaly Detection
 
Module 3: Linear Regression
Module 3:  Linear RegressionModule 3:  Linear Regression
Module 3: Linear Regression
 
Movie recommender system using the user's psychological profile
Movie recommender system using the user's psychological profileMovie recommender system using the user's psychological profile
Movie recommender system using the user's psychological profile
 

En vedette

はじめてのパターン認識4章後編
はじめてのパターン認識4章後編はじめてのパターン認識4章後編
はじめてのパターン認識4章後編Tanaka Hidenori
 
計算論的学習理論入門 -PAC学習とかVC次元とか-
計算論的学習理論入門 -PAC学習とかVC次元とか-計算論的学習理論入門 -PAC学習とかVC次元とか-
計算論的学習理論入門 -PAC学習とかVC次元とか-sleepy_yoshi
 
非線形データの次元圧縮 150905 WACODE 2nd
非線形データの次元圧縮 150905 WACODE 2nd非線形データの次元圧縮 150905 WACODE 2nd
非線形データの次元圧縮 150905 WACODE 2ndMika Yoshimura
 
機械学習とその理論 (情報オリンピック2015春合宿講義資料)
機械学習とその理論 (情報オリンピック2015春合宿講義資料)機械学習とその理論 (情報オリンピック2015春合宿講義資料)
機械学習とその理論 (情報オリンピック2015春合宿講義資料)irrrrr
 
今さら聞けないカーネル法とサポートベクターマシン
今さら聞けないカーネル法とサポートベクターマシン今さら聞けないカーネル法とサポートベクターマシン
今さら聞けないカーネル法とサポートベクターマシンShinya Shimizu
 
パターン認識と機械学習(PRML)第2章 確率分布 2.3 ガウス分布
パターン認識と機械学習(PRML)第2章 確率分布 2.3 ガウス分布パターン認識と機械学習(PRML)第2章 確率分布 2.3 ガウス分布
パターン認識と機械学習(PRML)第2章 確率分布 2.3 ガウス分布Nagayoshi Yamashita
 

En vedette (7)

はじめてのパターン認識4章後編
はじめてのパターン認識4章後編はじめてのパターン認識4章後編
はじめてのパターン認識4章後編
 
計算論的学習理論入門 -PAC学習とかVC次元とか-
計算論的学習理論入門 -PAC学習とかVC次元とか-計算論的学習理論入門 -PAC学習とかVC次元とか-
計算論的学習理論入門 -PAC学習とかVC次元とか-
 
非線形データの次元圧縮 150905 WACODE 2nd
非線形データの次元圧縮 150905 WACODE 2nd非線形データの次元圧縮 150905 WACODE 2nd
非線形データの次元圧縮 150905 WACODE 2nd
 
機械学習とその理論 (情報オリンピック2015春合宿講義資料)
機械学習とその理論 (情報オリンピック2015春合宿講義資料)機械学習とその理論 (情報オリンピック2015春合宿講義資料)
機械学習とその理論 (情報オリンピック2015春合宿講義資料)
 
EMアルゴリズム
EMアルゴリズムEMアルゴリズム
EMアルゴリズム
 
今さら聞けないカーネル法とサポートベクターマシン
今さら聞けないカーネル法とサポートベクターマシン今さら聞けないカーネル法とサポートベクターマシン
今さら聞けないカーネル法とサポートベクターマシン
 
パターン認識と機械学習(PRML)第2章 確率分布 2.3 ガウス分布
パターン認識と機械学習(PRML)第2章 確率分布 2.3 ガウス分布パターン認識と機械学習(PRML)第2章 確率分布 2.3 ガウス分布
パターン認識と機械学習(PRML)第2章 確率分布 2.3 ガウス分布
 

Similaire à Considerations on Recommendation Independence for a Find-Good-Items Task

IRJET-Survey on Identification of Top-K Competitors using Data Mining
IRJET-Survey on Identification of Top-K Competitors using Data MiningIRJET-Survey on Identification of Top-K Competitors using Data Mining
IRJET-Survey on Identification of Top-K Competitors using Data MiningIRJET Journal
 
Novel Algorithms for Ranking and Suggesting True Popular Items
Novel Algorithms for Ranking and Suggesting True Popular ItemsNovel Algorithms for Ranking and Suggesting True Popular Items
Novel Algorithms for Ranking and Suggesting True Popular ItemsIJMER
 
A Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionA Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionIOSR Journals
 
Empirical Evaluation of Active Learning in Recommender Systems
Empirical Evaluation of Active Learning in Recommender SystemsEmpirical Evaluation of Active Learning in Recommender Systems
Empirical Evaluation of Active Learning in Recommender SystemsUniversity of Bergen
 
AI in Entertainment – Movie Recommendation System
AI in Entertainment – Movie Recommendation SystemAI in Entertainment – Movie Recommendation System
AI in Entertainment – Movie Recommendation SystemIRJET Journal
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringChangsung Moon
 
Recommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross EntropyRecommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross EntropyVito Walter Anelli
 
Rating System Algorithms Document
Rating System Algorithms DocumentRating System Algorithms Document
Rating System Algorithms DocumentScandala Tamang
 
reinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pdfreinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pdfVaishnavGhadge1
 
Classification Algorithms with Attribute Selection: an evaluation study using...
Classification Algorithms with Attribute Selection: an evaluation study using...Classification Algorithms with Attribute Selection: an evaluation study using...
Classification Algorithms with Attribute Selection: an evaluation study using...Eswar Publications
 
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...inventionjournals
 
Exploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthExploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthmoresmile
 
Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...
Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...
Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...IRJET Journal
 
Reinforcement learning
Reinforcement  learningReinforcement  learning
Reinforcement learningSKS
 
Default Credit Card Prediction
Default Credit Card PredictionDefault Credit Card Prediction
Default Credit Card PredictionAlexandre Pinto
 
A Hybrid Theory Of Power Theft Detection
A Hybrid Theory Of Power Theft DetectionA Hybrid Theory Of Power Theft Detection
A Hybrid Theory Of Power Theft DetectionCamella Taylor
 
Feature selection concepts and methods
Feature selection concepts and methodsFeature selection concepts and methods
Feature selection concepts and methodsReza Ramezani
 

Similaire à Considerations on Recommendation Independence for a Find-Good-Items Task (20)

IRJET-Survey on Identification of Top-K Competitors using Data Mining
IRJET-Survey on Identification of Top-K Competitors using Data MiningIRJET-Survey on Identification of Top-K Competitors using Data Mining
IRJET-Survey on Identification of Top-K Competitors using Data Mining
 
Novel Algorithms for Ranking and Suggesting True Popular Items
Novel Algorithms for Ranking and Suggesting True Popular ItemsNovel Algorithms for Ranking and Suggesting True Popular Items
Novel Algorithms for Ranking and Suggesting True Popular Items
 
A Modified KS-test for Feature Selection
A Modified KS-test for Feature SelectionA Modified KS-test for Feature Selection
A Modified KS-test for Feature Selection
 
Empirical Evaluation of Active Learning in Recommender Systems
Empirical Evaluation of Active Learning in Recommender SystemsEmpirical Evaluation of Active Learning in Recommender Systems
Empirical Evaluation of Active Learning in Recommender Systems
 
AI in Entertainment – Movie Recommendation System
AI in Entertainment – Movie Recommendation SystemAI in Entertainment – Movie Recommendation System
AI in Entertainment – Movie Recommendation System
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative Filtering
 
South Tyrol Suggests - STS
South Tyrol Suggests - STSSouth Tyrol Suggests - STS
South Tyrol Suggests - STS
 
Recommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross EntropyRecommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross Entropy
 
Rating System Algorithms Document
Rating System Algorithms DocumentRating System Algorithms Document
Rating System Algorithms Document
 
reinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pdfreinforcement-learning-141009013546-conversion-gate02.pdf
reinforcement-learning-141009013546-conversion-gate02.pdf
 
Classification Algorithms with Attribute Selection: an evaluation study using...
Classification Algorithms with Attribute Selection: an evaluation study using...Classification Algorithms with Attribute Selection: an evaluation study using...
Classification Algorithms with Attribute Selection: an evaluation study using...
 
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
Predicting an Applicant Status Using Principal Component, Discriminant and Lo...
 
Exploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthExploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouth
 
Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...
Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...
Selecting Best Tractor Ranking Wise by Software using MADM(Multiple –Attribut...
 
Reinforcement learning
Reinforcement  learningReinforcement  learning
Reinforcement learning
 
Default Credit Card Prediction
Default Credit Card PredictionDefault Credit Card Prediction
Default Credit Card Prediction
 
A Hybrid Theory Of Power Theft Detection
A Hybrid Theory Of Power Theft DetectionA Hybrid Theory Of Power Theft Detection
A Hybrid Theory Of Power Theft Detection
 
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
Fuzzy Rough Set Feature Selection to Enhance Phishing Attack Detection
 
Feature selection concepts and methods
Feature selection concepts and methodsFeature selection concepts and methods
Feature selection concepts and methods
 
T0 numtq0njc=
T0 numtq0njc=T0 numtq0njc=
T0 numtq0njc=
 

Plus de Toshihiro Kamishima

RecSys2018論文読み会 資料
RecSys2018論文読み会 資料RecSys2018論文読み会 資料
RecSys2018論文読み会 資料Toshihiro Kamishima
 
機械学習研究でのPythonの利用
機械学習研究でのPythonの利用機械学習研究でのPythonの利用
機械学習研究でのPythonの利用Toshihiro Kamishima
 
科学技術計算関連Pythonパッケージの概要
科学技術計算関連Pythonパッケージの概要科学技術計算関連Pythonパッケージの概要
科学技術計算関連Pythonパッケージの概要Toshihiro Kamishima
 
PyMCがあれば,ベイズ推定でもう泣いたりなんかしない
PyMCがあれば,ベイズ推定でもう泣いたりなんかしないPyMCがあれば,ベイズ推定でもう泣いたりなんかしない
PyMCがあれば,ベイズ推定でもう泣いたりなんかしないToshihiro Kamishima
 
Absolute and Relative Clustering
Absolute and Relative ClusteringAbsolute and Relative Clustering
Absolute and Relative ClusteringToshihiro Kamishima
 
Enhancement of the Neutrality in Recommendation
Enhancement of the Neutrality in RecommendationEnhancement of the Neutrality in Recommendation
Enhancement of the Neutrality in RecommendationToshihiro Kamishima
 
OpenOpt の線形計画で圧縮センシング
OpenOpt の線形計画で圧縮センシングOpenOpt の線形計画で圧縮センシング
OpenOpt の線形計画で圧縮センシングToshihiro Kamishima
 
Pythonによる機械学習実験の管理
Pythonによる機械学習実験の管理Pythonによる機械学習実験の管理
Pythonによる機械学習実験の管理Toshihiro Kamishima
 

Plus de Toshihiro Kamishima (12)

RecSys2018論文読み会 資料
RecSys2018論文読み会 資料RecSys2018論文読み会 資料
RecSys2018論文読み会 資料
 
WSDM2018読み会 資料
WSDM2018読み会 資料WSDM2018読み会 資料
WSDM2018読み会 資料
 
機械学習研究でのPythonの利用
機械学習研究でのPythonの利用機械学習研究でのPythonの利用
機械学習研究でのPythonの利用
 
KDD2016勉強会 資料
KDD2016勉強会 資料KDD2016勉強会 資料
KDD2016勉強会 資料
 
科学技術計算関連Pythonパッケージの概要
科学技術計算関連Pythonパッケージの概要科学技術計算関連Pythonパッケージの概要
科学技術計算関連Pythonパッケージの概要
 
WSDM2016勉強会 資料
WSDM2016勉強会 資料WSDM2016勉強会 資料
WSDM2016勉強会 資料
 
ICML2015読み会 資料
ICML2015読み会 資料ICML2015読み会 資料
ICML2015読み会 資料
 
PyMCがあれば,ベイズ推定でもう泣いたりなんかしない
PyMCがあれば,ベイズ推定でもう泣いたりなんかしないPyMCがあれば,ベイズ推定でもう泣いたりなんかしない
PyMCがあれば,ベイズ推定でもう泣いたりなんかしない
 
Absolute and Relative Clustering
Absolute and Relative ClusteringAbsolute and Relative Clustering
Absolute and Relative Clustering
 
Enhancement of the Neutrality in Recommendation
Enhancement of the Neutrality in RecommendationEnhancement of the Neutrality in Recommendation
Enhancement of the Neutrality in Recommendation
 
OpenOpt の線形計画で圧縮センシング
OpenOpt の線形計画で圧縮センシングOpenOpt の線形計画で圧縮センシング
OpenOpt の線形計画で圧縮センシング
 
Pythonによる機械学習実験の管理
Pythonによる機械学習実験の管理Pythonによる機械学習実験の管理
Pythonによる機械学習実験の管理
 

Dernier

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxolyaivanovalion
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 

Dernier (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 

Considerations on Recommendation Independence for a Find-Good-Items Task

  • 1. Considerations on Recommendation Independence for a Find-Good-Items Task Toshihiro Kamishima* and Shotaro Akaho* http://www.kamishima.net/ *National Institute of Advanced Industrial Science and Technology (AIST), Japan Workshop on Responsible Recommendation (FATREC) in conjunction with RecSys2017 @ Como, Italy, Aug. 31, 2017 1 START
  • 2. This workshop paper A preliminary experiment of applying a regularization approach to a recommender for a find-good-items task Independence of a preference score was enhanced, but that of relevance was not Errors in AUC was not worsen by enhancing independence This Talk 2 Our previous work We advocated a concept of recommendation independence We developed two types of approaches to enhance recommendation independence for a predicting-ratings task a regularization approach using a constraint term a model-based approach using a special graphical model
  • 3. Sensitive Feature 3 S : sensitive feature This represents information to be ignored in a recommendation process Its values are determined depending on a user and/or an item As in a case of standard recommendation, we use random variables X: a user, Y: an item, and R: a recommendation outcome A sensitive feature is restricted to a binary type We adopt a variable required for recommendation independence Ex. Sensitive feature = movie’s popularity / user’s gender
  • 4. Recommendation Independence 4 No information about a sensitive feature influences the outcome The status of the sensitive feature is explicitly excluded from the inference of the recommendation outcome Recommendation Independence the statistical independence between a recommendation outcome, R, and a sensitive feature, S Recommendation outcomes are predicted under the constraint of recommendation independence [Kamishima 12, Kamishima 13] Pr[R | S] = Pr[R] R ⫫ S ≡
  • 5. dislike like Effect of Independence Enhancement 5 Standard Independence-enhanced two distributions are largely diverged two distributions become closer The bias that older movies were rated higher could be successfully canceled by enhancing independence dislike like ✽ each bin of histograms of predicted scores for older and newer movies a sensitive feature = whether a movie is newer or older
  • 7. Application Adherence to Laws and Regulations 7 A recommendation service must be managed while adhering to laws and regulations suspicious placement keyword-matching advertisement Advertisements indicating arrest records were more frequently displayed for names that are more popular among individuals of African descent than those of European descent ↓ Socially discriminative treatments must be avoided sensitive feature = users’ demographic information ↓ Legally or socially sensitive information can be excluded from the inference process of recommendation [Sweeney 13]
  • 8. Application Fair Treatment of Content Providers 8 System managers should fairly treat their content providers The US FTC has investigated Google to determine whether the search engine ranks its own services higher than those of competitors Fair treatment in search engines sensitive feature = a content provider of a candidate item ↓ Information about who provides a candidate item can be ignored, and providers are treated fairly Fair treatment in recommendation A hotel booking site should not abuse their position to recommend hotels of its group company [Bloomberg]
  • 9. Application Exclusion of Unwanted Information 9 sensitive feature = a political conviction of a friend candidate ↓ Information about whether a candidate is conservative or progressive can be ignored in a recommendation process Filter Bubble: To fit for Pariser’s preference, conservative people are eliminated from his friend recommendation list in FaceBook Information unwanted by a user is excluded from recommendation [TED Talk by Eli Pariser, http://www.filterbubble.com/]
  • 11. Independence-Enhanced Recommendation Task 11 Predicting Ratings: a task to predict a rating value that a user would provide to an item Dataset Random variables: user X, item Y, rating R, sensitive feature S Prediction Function Dataset Prediction Function Standard Recommendation Independence-Enhanced Rec. D = {(xi, yi, ri)} D = {(xi, yi, ri, si)} Çr(x, y) Çr(x, y, s) [Kamishima+ 12, Kamishima+ 13, Kamishima+ 14, Kamishima+16]
  • 12. Çr(x, y) = + bx + cy + pxqÒ y Probabilistic Matrix Factorization 12 Probabilistic Matrix Factorization Model predict a preference rating of an item y rated by a user x well-performed and widely used [Salakhutdinov 08, Koren 08] For a given training dataset, model parameters are learned by minimizing the squared loss function with an L2 regularizer. cross effect of users and itemsglobal bias user-dependent bias item-dependent bias ≥ D (ri * Çr(xi, yi))2 + Ò⇥Ò2 Prediction Function Objective Function L2 regularizer regularization parameter squared loss function
  • 13. ≥ D ri * Çr(xi, yi, si) 2 * ⌘ indep(R, S) + Ò⇥Ò2 Independence-Enhaned PMF 13 a prediction function is selected according to a sensitive value sensitive feature Çr(x, y, s) = (s) + b(s) x + c(s) y + p(s) x q(s) y Ò Prediction Function Objective Function independence parameter: control the balance between the independence and accuracy independence term: a regularizer to constrain independence The larger value indicates that ratings and sensitive values are more independent Matching means of predicted ratings for two sensitive values [Kamishima+ 13]
  • 14. For a Find-Good-Items Task 14 cross-entropy loss Find Good Items: a task to find some items preferred by a user ↓ making a preference score independent, instead of a predicted rating ≥ D CE ri * Çr(xi, yi, si) * ⌘ indep(R, S) + Ò⇥Ò2 Çr(x, y, s) = sig (s) + b(s) x + c(s) y + p(s) x q(s) y Ò Preference Score: How strongly a user prefers an item enhancing the independence between a preference score and a sensitive feature sigmoid function [Kamishima+ 17]
  • 15. Preference Score vs Sensitive Feature 15 Observation 1: A preference score could be successfully made independent from a sensitive feature Observation 2: A ranking accuracy (AUC) did not worsen so much by enhancement of the recommendation independence This is was contrasted with the increase of a prediction error (MAE) in a predicting ratings task AUC η0.70 0.75 0.80 0.85 0.90 10 −2 10 −1 1 10 1 10 2 KS η0 0.05 0.10 0.15 0.20 10 −2 10 −1 1 10 1 10 2 Kolmogolov-Smirnov statistic between Pr[R | S=0] and Pr[R | S=1] ranking accuracy (AUC) [Kamishima+ 17]
  • 16. Relevance and Sensitive Feature 16 1.0 0.9 0.7 0.10.30.51.0 Recommending top-k items whose preference scores are the largest sort according to their preference scores select top-k items irrelevant itemsrelevant items check the independence from a relevance, not from a preference score Observation 3: The relevance of items was not independent from a sensitive feature for some values of k, in particular, small k ↓ A need for a new method that fits for a ranked item list [Kamishima+ 17]
  • 17. Conclusions 17 Contributions We advocated a notion of recommendation independence and developed methods to enhance it We tested a preliminary approach to enhance independence, but it was not effective for a ranked item list Future work Developing an independence-enhancement method being fit for a ranked item list Acknowledgment We would like to thank for providing datasets for the Grouplens research lab This work is supported by MEXT/JSPS KAKENHI Grant Number JP24500194, JP15K00327, and JP16H02864 Survey Slide of Fairness-Aware Data Mining http://www.kamishima.net/archive/fadm.pdf
  • 19. Prediction: Latent Class Model 19 [Hofmann 99] z y x r Latent Class Model: A probabilistic model for collaborative filtering A basic topic model, pLSA extended so as to be able to deal with ratings r given by users x to items y Çr(x, y) = EPr[rx,y][level(r)] = ≥ r Pr[rx, y] level(r) the r-th rating value A rating value can be predicted by the expectation of ratings Model parameters can be learned by an EM algorithm latent topic variable
  • 20. Independence-Enhanced LCM 20 z y x r s z y x r s Independence-Enhancement by a Model-based Approach A sensitive variable is embedded into the original LCM A rating and a sensitive variable are mutually independent A user, an item, and a rating are conditionally independent given Z A type 2 model can more strictly enhance recommendation independence, because in addition to X and Y, Z depends on a sensitive variable Type 1 model Type 2 model [Kamishima+ 16]
  • 21. Popularity Bias 21 [Celma 08] Popularity Bias the tendency for popular items to be recommended more frequently Flixster data The degree popularity of an item is measured by the number of users who rated the item Short-head items are frequently and highly rated [Jamali+ 10] long-tail (bottom 99%) share in ratings: 52.8% mean rating: 3.53 short-head (top 1%) share in ratings: 47.2% mean rating: 3.71 sensitive feature = popularity of items ↓ Popularity bias can be corrected
  • 22. Kolmogorov-Smirnov Statistic 22 The statistic of the two-sample Kolmogorov-Smirnov test a nonparametric test for the equality of two distribution ↓ Evaluating the degree of independence by measuring the equality between Pr[R | S=0] and Pr[R | S=1] Kolmogorov-Smirnov statistic the area between two empirical cumulative distributions Wikipedia
  • 23. Recommendation Diversity 23 [Ziegler+ 05, Zhang+ 08, Latha+ 09, Adomavicius+ 12] Recommendation Diversity Similar items are not recommended in a single list, to a single user, to all users, or in a temporally successive lists recommendation list similar items excluded Diversity Items that are similar in a specified metric are excluded from recommendation results The mutual relations among results Independence Information about a sensitive feature is excluded from recommendation results The relations between results and sensitive values
  • 24. Diversity vs Independence 24 Diversity Depending on the definition of similarity measures Independence Depending on the specification of sensitive feature Similarity A function of two items Sensitive Feature A function of a user-item pair Because a sensitive feature depends on a user, neutrality can be applicable for coping with users’ factor, such as, users’ gender or age, which cannot be straightforwardly dealt by using diversity
  • 25. Diversity vs Independence 25 short-head long-tail short-head long-tail standard recommendation diversified recommendation Because a set of recommendations are diversified by abandoning short-head items, predicted ratings are still biased Prediction ratings themselves are unbiased by enhancing recommendation independence
  • 26. Privacy-preserving Data Mining 26 recommendation results, R, and sensitive features, S, are statistically independent In a context of privacy-preservation Even if the information about R is disclosed, the information about S will not exposed mutual information between a recommendation result, R, and a sensitive feature, S, is zero I(R; S) = 0 In particular, a notion of the t-closeness has strong connection