SlideShare une entreprise Scribd logo
1  sur  33
Télécharger pour lire hors ligne
Jure Leskovec (@jure)
Pinterest and Stanford
1Jure Leskove, Pinterest & Stanford University
2
Recommendations
drive whole businesses!
Jure Leskove, Pinterest & Stanford University
People and Items
3
+
–
+
+
+
–
People Items
Fundamental problem:
Making items discoverable!Jure Leskove, Pinterest & Stanford University
Understanding Products
To make relevant recommendations
we need to understand the products
and how they fit together
Discovering relationships
between products
4Jure Leskove, Pinterest & Stanford University
Product Graph
Ingest product catalogs:
 10s of millions of products
 100s of millions of descriptions, reviews
Infer product networks with multiple
types of directed relationships:
 Input:
Data about items (products)
 Output:
Network with multiple types of relationships
5Jure Leskove, Pinterest & Stanford University
Product Graph: Relations
6
Substitutes:
Purchase
instead
Complements:
Purchase
in addition
Jure Leskove, Pinterest & Stanford University
Product Graph: Description
7
: cleaner; quieter
: cheaper; high power
: well made, easy to install
: fits perfectly, great value
Jure Leskove, Pinterest & Stanford University
Product Graph: Overview
8
substitute
complement
Jure Leskove, Pinterest & Stanford University
Product Graph:What it does?
1. Understands the notions of
substitute and complement goods
is substitutable for
complements
9Jure Leskove, Pinterest & Stanford University
Product Graph:What it does?
2. Generates explanations
of why certain products are
preferred
“Good quality, soft, light
weight, the colors are
beautiful and exactly like
the picture!”
People prefer this
because:
10Jure Leskove, Pinterest & Stanford University
Product Graph:What it does?
3. Discovers micro-categories
of products
Small clusters of tightly related products:
11Jure Leskove, Pinterest & Stanford University
Product Graph:What it does?
4. Recommends
baskets of related products
Query: Suggested outfit:
Query: Suggested outfit:
12Jure Leskove, Pinterest & Stanford University
Product Graph: Overview
Building networks from products
Modeling: Can we use product data
to model product relationships?
Understanding: Can we explain
why people prefer certain products
over others?
13Jure Leskove, Pinterest & Stanford University
Problem Setting
Binary prediction task:
Given a pair of products, x and y, predict
whether they are related
(substitute/complementary)
Goal: Build a probabilistic model
that encodes
14Jure Leskove, Pinterest & Stanford University
Problem Setting
How to learn
from data
Train by maximum likelihood:
15
XComplementary
Not
Complementary
Jure Leskove, Pinterest & Stanford University
Approach
Products are described by their properties:
 Review text, Product description,
Brand, Price, …
[0.3, 0, 0, 0.3, 0, 0, 0.2, 0, 0, 0.1]
[0.1, 0, 0, 0, 0.2, 0, 0, 0.1, 0, 0.2]
Challenges:
 How do we discover right features?
 How do we explain relationships?
 How do we identify micro-categories?
16
Shoes Female
Jure Leskove, Pinterest & Stanford University
Our Solution: SCEPTRE
Link Prediction Review “topics”
Discover topics that “explain” product relations
17
Learn to discover topics that
explain the product graph
Jure Leskove, Pinterest & Stanford University
Challenges: Relation Direction
why do people who view
X eventually buy Y?
Relationships we want to learn
are not symmetric
18
Relationships: Explained by product “properties”
“baby, pajamas, pants, colorful”
Directedness: Subjective/qualitative language
“true size, fits well, items are the same color as on the picture”
Jure Leskove, Pinterest & Stanford University
Challenges: Multiple Relations
19
We want to learn multiple
relationships simultaneously
Solution: Learn multiple regressors (one for each
graph), that operate on a single set of topics
Jure Leskove, Pinterest & Stanford University
Challenges: Micro-Categories
20
Model discovers thousands of topics
but no micro-categories
Solution: Product hierarchy
Laptop charger specific topics
are only active for chargers.
These are micro-categories.
Topics at the top are common to all
electronics products, and will contain
generic electronics language
Associate each node in the category
tree with a small number of topics:
Jure Leskove, Pinterest & Stanford University
Building the Graph
C++ implementation that runs
on a single (large-memory) machine
 OpenMP to parallelize computations
Experimental results:
Active part of the Amazon catalog
 10m products
 150m reviews
 250m relationships
21Jure Leskove, Pinterest & Stanford University
Example: Product Graph
22Jure Leskove, Pinterest & Stanford University
Example: Product Graph
23Jure Leskove, Pinterest & Stanford University
Edge Prediction Accuracy
24
Substitute Complement
Men’s
Clothing
96.7% 94.1%
Women’s
Clothing
95.9% 94.1%
Books 93.8% 89.9%
Electronics 95.7% 88.8%
Movies 85.6% -
Music 90.4% -
OVERALL 94.83% 90.23%
Jure Leskove, Pinterest & Stanford University
Results: Micro-Categories
25Jure Leskove, Pinterest & Stanford University
27
How does all this fit
into Pinterest?
Jure Leskove, Pinterest & Stanford University
Connecting People & Objects
28Jure Leskove, Pinterest & Stanford University
Pins: Richly Annotated Objects
29Jure Leskove, Pinterest & Stanford University
Pins are Collected in Boards
30Jure Leskove, Pinterest & Stanford University
30+ Billion Pins
categorized by people into more than
750 Million Boards
50% of pins have been created
in the last 6 months
31
32
Discovering relationships
between objectsJure Leskove, Pinterest & Stanford University
We are hiring!
33
jure@pinterest.com
References
 Inferring Networks of Substitutable and Complementary
Products by J. McAuley, R. Pandey, J. Leskovec. ACM SIGKDD
International Conference on Knowledge Discovery and Data
Mining (KDD), 2015.
 Hidden Factors and Hidden Topics: Understanding Rating
Dimensions with Review Text by J. McAuley, J. Leskovec. ACM
Conference on Recommender Systems (RecSys), 2013.
 Learning Attitudes and Attributes from Multi-Aspect
Reviews by J. McAuley, J. Leskovec, D. Jurafsky. IEEE
International Conference On Data Mining (ICDM), 2012.
34Jure Leskove, Pinterest & Stanford University

Contenu connexe

Similaire à Inferring networks of substitute and complementary products

Master Minds on Data Science - Maarten de Rijke
Master Minds on Data Science - Maarten de RijkeMaster Minds on Data Science - Maarten de Rijke
Master Minds on Data Science - Maarten de RijkeMedia Perspectives
 
Personalizing the web building effective recommender systems
Personalizing the web building effective recommender systemsPersonalizing the web building effective recommender systems
Personalizing the web building effective recommender systemsAravindharamanan S
 
Machine Support for Interacting with Scientific Publications Improving Inform...
Machine Support for Interacting with Scientific Publications Improving Inform...Machine Support for Interacting with Scientific Publications Improving Inform...
Machine Support for Interacting with Scientific Publications Improving Inform...Christoph Lange
 
Introduction to recommendation system
Introduction to recommendation systemIntroduction to recommendation system
Introduction to recommendation systemAravindharamanan S
 
Big Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday LearningBig Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday LearningStefan Dietze
 
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...Alexander Serebrenik
 
Testing the ideas
Testing the ideasTesting the ideas
Testing the ideasDan Ashby
 
Exploratory Testing Explained
Exploratory Testing ExplainedExploratory Testing Explained
Exploratory Testing ExplainedTechWell
 
Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)James Howison
 
Brochure data science learning path board-infinity (1)
Brochure   data science learning path board-infinity (1)Brochure   data science learning path board-infinity (1)
Brochure data science learning path board-infinity (1)NirupamNishant2
 
Sa discover text webinar
Sa discover text webinarSa discover text webinar
Sa discover text webinarQuestionPro
 
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...CS, NcState
 
Data science essentials in azure ml
Data science essentials in azure mlData science essentials in azure ml
Data science essentials in azure mlMostafa
 
Top 5 algorithms used in Data Science
Top 5 algorithms used in Data ScienceTop 5 algorithms used in Data Science
Top 5 algorithms used in Data ScienceEdureka!
 
Module 1.3 data exploratory
Module 1.3  data exploratoryModule 1.3  data exploratory
Module 1.3 data exploratorySara Hooker
 
Databases set for scientific research.pptx
Databases set for scientific research.pptxDatabases set for scientific research.pptx
Databases set for scientific research.pptxzahraashouman
 

Similaire à Inferring networks of substitute and complementary products (20)

Master Minds on Data Science - Maarten de Rijke
Master Minds on Data Science - Maarten de RijkeMaster Minds on Data Science - Maarten de Rijke
Master Minds on Data Science - Maarten de Rijke
 
Personalizing the web building effective recommender systems
Personalizing the web building effective recommender systemsPersonalizing the web building effective recommender systems
Personalizing the web building effective recommender systems
 
Machine Support for Interacting with Scientific Publications Improving Inform...
Machine Support for Interacting with Scientific Publications Improving Inform...Machine Support for Interacting with Scientific Publications Improving Inform...
Machine Support for Interacting with Scientific Publications Improving Inform...
 
DataMind Pitch August 2013
DataMind Pitch August 2013DataMind Pitch August 2013
DataMind Pitch August 2013
 
Introduction to recommendation system
Introduction to recommendation systemIntroduction to recommendation system
Introduction to recommendation system
 
Big Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday LearningBig Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday Learning
 
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
 
Testing the ideas
Testing the ideasTesting the ideas
Testing the ideas
 
Exploratory Testing Explained
Exploratory Testing ExplainedExploratory Testing Explained
Exploratory Testing Explained
 
Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)
 
Brochure data science learning path board-infinity (1)
Brochure   data science learning path board-infinity (1)Brochure   data science learning path board-infinity (1)
Brochure data science learning path board-infinity (1)
 
Elsevier02012011
Elsevier02012011Elsevier02012011
Elsevier02012011
 
Sa discover text webinar
Sa discover text webinarSa discover text webinar
Sa discover text webinar
 
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
PROMISE 2011: Seven Habits of High Impactful Empirical Software Engineers (La...
 
Introduction to spss
Introduction to spssIntroduction to spss
Introduction to spss
 
Bayesian reasoning
Bayesian reasoningBayesian reasoning
Bayesian reasoning
 
Data science essentials in azure ml
Data science essentials in azure mlData science essentials in azure ml
Data science essentials in azure ml
 
Top 5 algorithms used in Data Science
Top 5 algorithms used in Data ScienceTop 5 algorithms used in Data Science
Top 5 algorithms used in Data Science
 
Module 1.3 data exploratory
Module 1.3  data exploratoryModule 1.3  data exploratory
Module 1.3 data exploratory
 
Databases set for scientific research.pptx
Databases set for scientific research.pptxDatabases set for scientific research.pptx
Databases set for scientific research.pptx
 

Plus de Turi, Inc.

Webinar - Analyzing Video
Webinar - Analyzing VideoWebinar - Analyzing Video
Webinar - Analyzing VideoTuri, Inc.
 
Webinar - Patient Readmission Risk
Webinar - Patient Readmission RiskWebinar - Patient Readmission Risk
Webinar - Patient Readmission RiskTuri, Inc.
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Turi, Inc.
 
Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Turi, Inc.
 
Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Turi, Inc.
 
Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Turi, Inc.
 
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsScaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsTuri, Inc.
 
Pattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataPattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataTuri, Inc.
 
Intelligent Applications with Machine Learning Toolkits
Intelligent Applications with Machine Learning ToolkitsIntelligent Applications with Machine Learning Toolkits
Intelligent Applications with Machine Learning ToolkitsTuri, Inc.
 
Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine LearningTuri, Inc.
 
Machine Learning with GraphLab Create
Machine Learning with GraphLab CreateMachine Learning with GraphLab Create
Machine Learning with GraphLab CreateTuri, Inc.
 
Machine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesMachine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesTuri, Inc.
 
Machine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinMachine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinTuri, Inc.
 
Scalable data structures for data science
Scalable data structures for data scienceScalable data structures for data science
Scalable data structures for data scienceTuri, Inc.
 
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Turi, Inc.
 
Introduction to Recommender Systems
Introduction to Recommender SystemsIntroduction to Recommender Systems
Introduction to Recommender SystemsTuri, Inc.
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in productionTuri, Inc.
 
Overview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringOverview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringTuri, Inc.
 
Building Personalized Data Products with Dato
Building Personalized Data Products with DatoBuilding Personalized Data Products with Dato
Building Personalized Data Products with DatoTuri, Inc.
 

Plus de Turi, Inc. (20)

Webinar - Analyzing Video
Webinar - Analyzing VideoWebinar - Analyzing Video
Webinar - Analyzing Video
 
Webinar - Patient Readmission Risk
Webinar - Patient Readmission RiskWebinar - Patient Readmission Risk
Webinar - Patient Readmission Risk
 
Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)Webinar - Know Your Customer - Arya (20160526)
Webinar - Know Your Customer - Arya (20160526)
 
Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)Webinar - Product Matching - Palombo (20160428)
Webinar - Product Matching - Palombo (20160428)
 
Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)Webinar - Pattern Mining Log Data - Vega (20160426)
Webinar - Pattern Mining Log Data - Vega (20160426)
 
Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)Webinar - Fraud Detection - Palombo (20160428)
Webinar - Fraud Detection - Palombo (20160428)
 
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge DatasetsScaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
Scaling Up Machine Learning: How to Benchmark GraphLab Create on Huge Datasets
 
Pattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log DataPattern Mining: Extracting Value from Log Data
Pattern Mining: Extracting Value from Log Data
 
Intelligent Applications with Machine Learning Toolkits
Intelligent Applications with Machine Learning ToolkitsIntelligent Applications with Machine Learning Toolkits
Intelligent Applications with Machine Learning Toolkits
 
Text Analysis with Machine Learning
Text Analysis with Machine LearningText Analysis with Machine Learning
Text Analysis with Machine Learning
 
Machine Learning with GraphLab Create
Machine Learning with GraphLab CreateMachine Learning with GraphLab Create
Machine Learning with GraphLab Create
 
Machine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesMachine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive Services
 
Machine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos GuestrinMachine Learning in 2016: Live Q&A with Carlos Guestrin
Machine Learning in 2016: Live Q&A with Carlos Guestrin
 
Scalable data structures for data science
Scalable data structures for data scienceScalable data structures for data science
Scalable data structures for data science
 
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
Introduction to Deep Learning for Image Analysis at Strata NYC, Sep 2015
 
Introduction to Recommender Systems
Introduction to Recommender SystemsIntroduction to Recommender Systems
Introduction to Recommender Systems
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in production
 
Overview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature EngineeringOverview of Machine Learning and Feature Engineering
Overview of Machine Learning and Feature Engineering
 
SFrame
SFrameSFrame
SFrame
 
Building Personalized Data Products with Dato
Building Personalized Data Products with DatoBuilding Personalized Data Products with Dato
Building Personalized Data Products with Dato
 

Dernier

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 

Dernier (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Inferring networks of substitute and complementary products

  • 1. Jure Leskovec (@jure) Pinterest and Stanford 1Jure Leskove, Pinterest & Stanford University
  • 2. 2 Recommendations drive whole businesses! Jure Leskove, Pinterest & Stanford University
  • 3. People and Items 3 + – + + + – People Items Fundamental problem: Making items discoverable!Jure Leskove, Pinterest & Stanford University
  • 4. Understanding Products To make relevant recommendations we need to understand the products and how they fit together Discovering relationships between products 4Jure Leskove, Pinterest & Stanford University
  • 5. Product Graph Ingest product catalogs:  10s of millions of products  100s of millions of descriptions, reviews Infer product networks with multiple types of directed relationships:  Input: Data about items (products)  Output: Network with multiple types of relationships 5Jure Leskove, Pinterest & Stanford University
  • 6. Product Graph: Relations 6 Substitutes: Purchase instead Complements: Purchase in addition Jure Leskove, Pinterest & Stanford University
  • 7. Product Graph: Description 7 : cleaner; quieter : cheaper; high power : well made, easy to install : fits perfectly, great value Jure Leskove, Pinterest & Stanford University
  • 8. Product Graph: Overview 8 substitute complement Jure Leskove, Pinterest & Stanford University
  • 9. Product Graph:What it does? 1. Understands the notions of substitute and complement goods is substitutable for complements 9Jure Leskove, Pinterest & Stanford University
  • 10. Product Graph:What it does? 2. Generates explanations of why certain products are preferred “Good quality, soft, light weight, the colors are beautiful and exactly like the picture!” People prefer this because: 10Jure Leskove, Pinterest & Stanford University
  • 11. Product Graph:What it does? 3. Discovers micro-categories of products Small clusters of tightly related products: 11Jure Leskove, Pinterest & Stanford University
  • 12. Product Graph:What it does? 4. Recommends baskets of related products Query: Suggested outfit: Query: Suggested outfit: 12Jure Leskove, Pinterest & Stanford University
  • 13. Product Graph: Overview Building networks from products Modeling: Can we use product data to model product relationships? Understanding: Can we explain why people prefer certain products over others? 13Jure Leskove, Pinterest & Stanford University
  • 14. Problem Setting Binary prediction task: Given a pair of products, x and y, predict whether they are related (substitute/complementary) Goal: Build a probabilistic model that encodes 14Jure Leskove, Pinterest & Stanford University
  • 15. Problem Setting How to learn from data Train by maximum likelihood: 15 XComplementary Not Complementary Jure Leskove, Pinterest & Stanford University
  • 16. Approach Products are described by their properties:  Review text, Product description, Brand, Price, … [0.3, 0, 0, 0.3, 0, 0, 0.2, 0, 0, 0.1] [0.1, 0, 0, 0, 0.2, 0, 0, 0.1, 0, 0.2] Challenges:  How do we discover right features?  How do we explain relationships?  How do we identify micro-categories? 16 Shoes Female Jure Leskove, Pinterest & Stanford University
  • 17. Our Solution: SCEPTRE Link Prediction Review “topics” Discover topics that “explain” product relations 17 Learn to discover topics that explain the product graph Jure Leskove, Pinterest & Stanford University
  • 18. Challenges: Relation Direction why do people who view X eventually buy Y? Relationships we want to learn are not symmetric 18 Relationships: Explained by product “properties” “baby, pajamas, pants, colorful” Directedness: Subjective/qualitative language “true size, fits well, items are the same color as on the picture” Jure Leskove, Pinterest & Stanford University
  • 19. Challenges: Multiple Relations 19 We want to learn multiple relationships simultaneously Solution: Learn multiple regressors (one for each graph), that operate on a single set of topics Jure Leskove, Pinterest & Stanford University
  • 20. Challenges: Micro-Categories 20 Model discovers thousands of topics but no micro-categories Solution: Product hierarchy Laptop charger specific topics are only active for chargers. These are micro-categories. Topics at the top are common to all electronics products, and will contain generic electronics language Associate each node in the category tree with a small number of topics: Jure Leskove, Pinterest & Stanford University
  • 21. Building the Graph C++ implementation that runs on a single (large-memory) machine  OpenMP to parallelize computations Experimental results: Active part of the Amazon catalog  10m products  150m reviews  250m relationships 21Jure Leskove, Pinterest & Stanford University
  • 22. Example: Product Graph 22Jure Leskove, Pinterest & Stanford University
  • 23. Example: Product Graph 23Jure Leskove, Pinterest & Stanford University
  • 24. Edge Prediction Accuracy 24 Substitute Complement Men’s Clothing 96.7% 94.1% Women’s Clothing 95.9% 94.1% Books 93.8% 89.9% Electronics 95.7% 88.8% Movies 85.6% - Music 90.4% - OVERALL 94.83% 90.23% Jure Leskove, Pinterest & Stanford University
  • 25. Results: Micro-Categories 25Jure Leskove, Pinterest & Stanford University
  • 26. 27 How does all this fit into Pinterest? Jure Leskove, Pinterest & Stanford University
  • 27. Connecting People & Objects 28Jure Leskove, Pinterest & Stanford University
  • 28. Pins: Richly Annotated Objects 29Jure Leskove, Pinterest & Stanford University
  • 29. Pins are Collected in Boards 30Jure Leskove, Pinterest & Stanford University
  • 30. 30+ Billion Pins categorized by people into more than 750 Million Boards 50% of pins have been created in the last 6 months 31
  • 31. 32 Discovering relationships between objectsJure Leskove, Pinterest & Stanford University
  • 33. References  Inferring Networks of Substitutable and Complementary Products by J. McAuley, R. Pandey, J. Leskovec. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2015.  Hidden Factors and Hidden Topics: Understanding Rating Dimensions with Review Text by J. McAuley, J. Leskovec. ACM Conference on Recommender Systems (RecSys), 2013.  Learning Attitudes and Attributes from Multi-Aspect Reviews by J. McAuley, J. Leskovec, D. Jurafsky. IEEE International Conference On Data Mining (ICDM), 2012. 34Jure Leskove, Pinterest & Stanford University