SlideShare une entreprise Scribd logo
1  sur  17
True Fit Skin Care 
Chang Liu 
Fellow at Insight Data Science 2014
So many products… 
What makes it so hard? Overwhelming information
What makes it so hard? Overwhelming information 
So many products… So many reviews…
What makes it so hard? Overwhelming information 
So many products… So many reviews… 
Reviews can be so long…
What makes it so hard? Overwhelming information 
So many products… So many reviews… 
Reviews can be so long… 
So many ingredients…
So many products… So many reviews… 
Time 
spent 
Money 
wasted 
Happiness 
What makes it so hard? Overwhelming information 
Reviews can be so long… 
So many ingredients…
32k Reviewers 
• w/ 2+ reviews 
~1200 Products 
• ~80 brands 
• 8 categories 
184k Reviews 
• Rating [1-5] 
• Review text 
• Quick take 
Collaborative Filter using User Reviews from Sephora.com 
Product 
X Y … 
Reviewers 
1 … 
2 … 
3 … 
… … 
… … 
N … 
Algorithm: 
• Item-centric collaborative filter 
• Pearson’s correlation coefficients 
to measure pairwise similarity
32k Reviewers 
• w/ 2+ reviews 
~1200 Products 
• ~80 brands 
• 8 categories 
184k Reviews 
• Rating [1-5] 
• Review text 
• Quick take 
Collaborative Filter using User Reviews from Sephora.com 
Product 
X Y … 
Reviewers 
1 … 
2 … 
3 … 
… … 
… … 
N … 
Algorithm: 
• Item-centric collaborative filter 
• Pearson’s correlation coefficients 
to measure pairwise similarity 
Similarity = cXY = 
(Xi - X)(Yi -Y ) 
N 
å 
i=1 
N 
å (X- X)2 
(Y-Y )2 
i i i=1 
N 
å 
i=1 
M 
å / cij 
recommendation scoreui = rujcij 
j
32k Reviewers 
• w/ 2+ reviews 
~1200 Products 
• ~80 brands 
• 8 categories 
184k Reviews 
• Rating [1-5] 
• Review text 
• Quick take 
Collaborative Filter using User Reviews from Sephora.com 
Product 
X Y … 
Reviewers 
1 … 
2 … 
3 … 
… … 
… … 
N … 
Algorithm: 
• Item-centric collaborative filter 
• Pearson’s correlation coefficients 
to measure pairwise similarity 
Similarity = cXY = 
(Xi - X)(Yi -Y ) 
N 
å 
i=1 
N 
å (X- X)2 
(Y-Y )2 
i i i=1 
N 
å 
M 
å / cij 
Cross Validation 
• 5-fold for reviewer 
• Leave-one-out for product 
• Accuracy = 86.3% ± 1% 
i=1 
recommendation scoreui = rujcij 
j
Visualize the similarity matrix 
White = high similarity 
Black = low similarity 
Sorted by brands 
alphabetically
White in a square 
= 
Users reviews are similar 
for all products in a brand 
= 
Strong customer loyalty 
There are structures!
Expensive! 
“Organic 
& Natural” 
There are structures! For example… 
Cost effective
There are structures! For example… 
Expensive! 
“Organic 
& Natural” 
Cost effective 
Actionable Insights 
For Sephora.com: 
Send marketing emails to 
new customers of brands 
with stronger customer 
loyalty!
Chang Liu 
PhD. in Civil Engineering @CMU 
J8D8L5@gmail.com 
linkedin.com/in/changliucmu 
github.com/R4trtry
Is the rating a good measure of reviewers’ perspective? 
• Trained a NaïveBaysian classifier for 
sentiment analysis 
• W/ 250 thousand reviews from 
Birchbox.com 
• A website that sends out free 
samples from smaller brands and 
gathers massive user reviews 
Most common words Most informative feature 
Word Count Negative Positive 
skin 91349 re-wash Penny 
product 82481 garbage hook 
use 64044 mediocre gorgeous 
love 55691 ketchup perk 
feel 47879 trash stock 
face 42615 unimpressive glowing 
like 41427 survey splurge 
great 34155 ineffective effortless 
really 31672 gag Christmas 
smell 27621 worthless happily 
text quick take 
Precision 95.3% 85.4% 
Recall 89.8% 93.1% 
Worth 
every 
penny! 
Another Validation
Another Validation 
Is the rating a good measure of reviewers’ perspective?
Product X 
Algorithm: Item-centric collaborative filter 
similarity 
87.4% 
Product Y 
Product X 
Product Y 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
Reviewers 
Product 
X Y … 
1 … 
2 … 
3 … 
… … 
… … 
N … 
M products reviewed by N reviewers 
Pairwise similarities are measured 
by Pearson's correlation coefficients: 
cXY = 
(Xi - X)(Yi -Y ) 
N 
å 
i=1 
N 
å (X- X)2 
(Y-Y )2 
i i i=1 
N 
å 
i=1 
Then weight the ratings 
based on the correlation coefficients: 
Scorei = 
cijr uj 
M 
å 
j 
| cij | 
ruj : User u's preference on item j

Contenu connexe

Similaire à Chang liu insight 2014

Top data science institutes in hyderabad
Top data science institutes in hyderabadTop data science institutes in hyderabad
Top data science institutes in hyderabadprathyusha1234
 
best online data science courses
best online data science coursesbest online data science courses
best online data science coursesprathyusha1234
 
Online feedback correlation using clustering
Online feedback correlation using clusteringOnline feedback correlation using clustering
Online feedback correlation using clusteringawesomesos
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender systemStanley Wang
 
alaTest Dmexco 22 Sept 2011 - Increase Conversion with Reviews
alaTest Dmexco 22 Sept 2011 - Increase Conversion with ReviewsalaTest Dmexco 22 Sept 2011 - Increase Conversion with Reviews
alaTest Dmexco 22 Sept 2011 - Increase Conversion with Reviewsalatest_jakob
 
Product Recommendations Enhanced with Reviews
Product Recommendations Enhanced with ReviewsProduct Recommendations Enhanced with Reviews
Product Recommendations Enhanced with Reviewsmaranlar
 
Actual cases of applying AI related technologiesin Rakuten
Actual cases of applying AI related technologiesin RakutenActual cases of applying AI related technologiesin Rakuten
Actual cases of applying AI related technologiesin RakutenRakuten Group, Inc.
 
Automated solutions for product and pricing research
Automated solutions for product and pricing researchAutomated solutions for product and pricing research
Automated solutions for product and pricing researchRay Poynter
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation SystemsRobin Reni
 
The Case For Reviews Webinar
The Case For Reviews WebinarThe Case For Reviews Webinar
The Case For Reviews WebinarTrustpilot
 
Control Your Online Reputation - MSP Social Media Breakfast
Control Your Online Reputation - MSP Social Media BreakfastControl Your Online Reputation - MSP Social Media Breakfast
Control Your Online Reputation - MSP Social Media BreakfastAaron Weiche
 
Building a Recommendation System for EverQuest Landmark’s Marketplace
Building a Recommendation System for EverQuest Landmark’s MarketplaceBuilding a Recommendation System for EverQuest Landmark’s Marketplace
Building a Recommendation System for EverQuest Landmark’s MarketplaceBen Weber
 
Building a Recommendation System for EverQuest Landmark's Marketplace
Building a Recommendation System for EverQuest Landmark's MarketplaceBuilding a Recommendation System for EverQuest Landmark's Marketplace
Building a Recommendation System for EverQuest Landmark's MarketplaceBen Weber
 
Introduction to recommender systems
Introduction to recommender systemsIntroduction to recommender systems
Introduction to recommender systemsAndrea Gigli
 
Collaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFCollaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFYusuke Yamamoto
 
Email Marketing + Social Proof = The Perfect Formula to Convert More Browsers...
Email Marketing + Social Proof = The Perfect Formula to Convert More Browsers...Email Marketing + Social Proof = The Perfect Formula to Convert More Browsers...
Email Marketing + Social Proof = The Perfect Formula to Convert More Browsers...Trustpilot
 
Daniel Lemin - Why Online Reviews Are The Future of Local & Search Marketing
Daniel Lemin - Why Online Reviews Are The Future of Local & Search MarketingDaniel Lemin - Why Online Reviews Are The Future of Local & Search Marketing
Daniel Lemin - Why Online Reviews Are The Future of Local & Search MarketingJulia Grosman
 

Similaire à Chang liu insight 2014 (20)

Top data science institutes in hyderabad
Top data science institutes in hyderabadTop data science institutes in hyderabad
Top data science institutes in hyderabad
 
best online data science courses
best online data science coursesbest online data science courses
best online data science courses
 
Online feedback correlation using clustering
Online feedback correlation using clusteringOnline feedback correlation using clustering
Online feedback correlation using clustering
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
alaTest Dmexco 22 Sept 2011 - Increase Conversion with Reviews
alaTest Dmexco 22 Sept 2011 - Increase Conversion with ReviewsalaTest Dmexco 22 Sept 2011 - Increase Conversion with Reviews
alaTest Dmexco 22 Sept 2011 - Increase Conversion with Reviews
 
Product Recommendations Enhanced with Reviews
Product Recommendations Enhanced with ReviewsProduct Recommendations Enhanced with Reviews
Product Recommendations Enhanced with Reviews
 
Gio
GioGio
Gio
 
Actual cases of applying AI related technologiesin Rakuten
Actual cases of applying AI related technologiesin RakutenActual cases of applying AI related technologiesin Rakuten
Actual cases of applying AI related technologiesin Rakuten
 
Automated solutions for product and pricing research
Automated solutions for product and pricing researchAutomated solutions for product and pricing research
Automated solutions for product and pricing research
 
Fashiondatasc
FashiondatascFashiondatasc
Fashiondatasc
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation Systems
 
Voice of the Market, Tom Anderson
Voice of the Market, Tom AndersonVoice of the Market, Tom Anderson
Voice of the Market, Tom Anderson
 
The Case For Reviews Webinar
The Case For Reviews WebinarThe Case For Reviews Webinar
The Case For Reviews Webinar
 
Control Your Online Reputation - MSP Social Media Breakfast
Control Your Online Reputation - MSP Social Media BreakfastControl Your Online Reputation - MSP Social Media Breakfast
Control Your Online Reputation - MSP Social Media Breakfast
 
Building a Recommendation System for EverQuest Landmark’s Marketplace
Building a Recommendation System for EverQuest Landmark’s MarketplaceBuilding a Recommendation System for EverQuest Landmark’s Marketplace
Building a Recommendation System for EverQuest Landmark’s Marketplace
 
Building a Recommendation System for EverQuest Landmark's Marketplace
Building a Recommendation System for EverQuest Landmark's MarketplaceBuilding a Recommendation System for EverQuest Landmark's Marketplace
Building a Recommendation System for EverQuest Landmark's Marketplace
 
Introduction to recommender systems
Introduction to recommender systemsIntroduction to recommender systems
Introduction to recommender systems
 
Collaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CFCollaborative Filtering 1: User-based CF
Collaborative Filtering 1: User-based CF
 
Email Marketing + Social Proof = The Perfect Formula to Convert More Browsers...
Email Marketing + Social Proof = The Perfect Formula to Convert More Browsers...Email Marketing + Social Proof = The Perfect Formula to Convert More Browsers...
Email Marketing + Social Proof = The Perfect Formula to Convert More Browsers...
 
Daniel Lemin - Why Online Reviews Are The Future of Local & Search Marketing
Daniel Lemin - Why Online Reviews Are The Future of Local & Search MarketingDaniel Lemin - Why Online Reviews Are The Future of Local & Search Marketing
Daniel Lemin - Why Online Reviews Are The Future of Local & Search Marketing
 

Dernier

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 

Dernier (20)

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 

Chang liu insight 2014

  • 1. True Fit Skin Care Chang Liu Fellow at Insight Data Science 2014
  • 2. So many products… What makes it so hard? Overwhelming information
  • 3. What makes it so hard? Overwhelming information So many products… So many reviews…
  • 4. What makes it so hard? Overwhelming information So many products… So many reviews… Reviews can be so long…
  • 5. What makes it so hard? Overwhelming information So many products… So many reviews… Reviews can be so long… So many ingredients…
  • 6. So many products… So many reviews… Time spent Money wasted Happiness What makes it so hard? Overwhelming information Reviews can be so long… So many ingredients…
  • 7. 32k Reviewers • w/ 2+ reviews ~1200 Products • ~80 brands • 8 categories 184k Reviews • Rating [1-5] • Review text • Quick take Collaborative Filter using User Reviews from Sephora.com Product X Y … Reviewers 1 … 2 … 3 … … … … … N … Algorithm: • Item-centric collaborative filter • Pearson’s correlation coefficients to measure pairwise similarity
  • 8. 32k Reviewers • w/ 2+ reviews ~1200 Products • ~80 brands • 8 categories 184k Reviews • Rating [1-5] • Review text • Quick take Collaborative Filter using User Reviews from Sephora.com Product X Y … Reviewers 1 … 2 … 3 … … … … … N … Algorithm: • Item-centric collaborative filter • Pearson’s correlation coefficients to measure pairwise similarity Similarity = cXY = (Xi - X)(Yi -Y ) N å i=1 N å (X- X)2 (Y-Y )2 i i i=1 N å i=1 M å / cij recommendation scoreui = rujcij j
  • 9. 32k Reviewers • w/ 2+ reviews ~1200 Products • ~80 brands • 8 categories 184k Reviews • Rating [1-5] • Review text • Quick take Collaborative Filter using User Reviews from Sephora.com Product X Y … Reviewers 1 … 2 … 3 … … … … … N … Algorithm: • Item-centric collaborative filter • Pearson’s correlation coefficients to measure pairwise similarity Similarity = cXY = (Xi - X)(Yi -Y ) N å i=1 N å (X- X)2 (Y-Y )2 i i i=1 N å M å / cij Cross Validation • 5-fold for reviewer • Leave-one-out for product • Accuracy = 86.3% ± 1% i=1 recommendation scoreui = rujcij j
  • 10. Visualize the similarity matrix White = high similarity Black = low similarity Sorted by brands alphabetically
  • 11. White in a square = Users reviews are similar for all products in a brand = Strong customer loyalty There are structures!
  • 12. Expensive! “Organic & Natural” There are structures! For example… Cost effective
  • 13. There are structures! For example… Expensive! “Organic & Natural” Cost effective Actionable Insights For Sephora.com: Send marketing emails to new customers of brands with stronger customer loyalty!
  • 14. Chang Liu PhD. in Civil Engineering @CMU J8D8L5@gmail.com linkedin.com/in/changliucmu github.com/R4trtry
  • 15. Is the rating a good measure of reviewers’ perspective? • Trained a NaïveBaysian classifier for sentiment analysis • W/ 250 thousand reviews from Birchbox.com • A website that sends out free samples from smaller brands and gathers massive user reviews Most common words Most informative feature Word Count Negative Positive skin 91349 re-wash Penny product 82481 garbage hook use 64044 mediocre gorgeous love 55691 ketchup perk feel 47879 trash stock face 42615 unimpressive glowing like 41427 survey splurge great 34155 ineffective effortless really 31672 gag Christmas smell 27621 worthless happily text quick take Precision 95.3% 85.4% Recall 89.8% 93.1% Worth every penny! Another Validation
  • 16. Another Validation Is the rating a good measure of reviewers’ perspective?
  • 17. Product X Algorithm: Item-centric collaborative filter similarity 87.4% Product Y Product X Product Y 1 1 1 1 1 1 1 1 1 1 1 1 Reviewers Product X Y … 1 … 2 … 3 … … … … … N … M products reviewed by N reviewers Pairwise similarities are measured by Pearson's correlation coefficients: cXY = (Xi - X)(Yi -Y ) N å i=1 N å (X- X)2 (Y-Y )2 i i i=1 N å i=1 Then weight the ratings based on the correlation coefficients: Scorei = cijr uj M å j | cij | ruj : User u's preference on item j

Notes de l'éditeur

  1. Hi My name is Chang. I created true fit skin care, a web-app that recommend skin care products for you. I’m not an expert As you can see, the background is crowded with skin care products in boxes, bottles and jars. This is what it looks like in out bathroom. My wife
  2. Estee lauder, is not doing so well, it’s a bit expensive, so there are actually very small number of reviews per product. FAB, instead, is very cost effective, therefore has pretty good customer loyalty. Origins, on the other hand, makes products with organic and natural ingredients. Therefore, customer who likes their product are paying for this natural concept.
  3. Estee lauder, is not doing so well, it’s a bit expensive, so there are actually very small number of reviews per product. FAB, instead, is very cost effective, therefore has pretty good customer loyalty. Origins, on the other hand, makes products with organic and natural ingredients. Therefore, customer who likes their product are paying for this natural concept.
  4. And this is me, just finiwww.linkedin.com/in/changliucmu/shed phd in civil engineering at carnegie Mellon University. I studied pipe monitoring using data driven approach. The image here shows the transmission pipe lines across the US.