SlideShare une entreprise Scribd logo
1  sur  46
Télécharger pour lire hors ligne
Deep Learning for Semantic Search in E-commerce
Somnath Banerjee
Head of Search Algorithms at Walmart Labs
https://www.linkedin.com/in/somnath-banerjee/
March 19, 2019
Walmart E-commerce search problem
2
100M+
Customers
100M+
Queries
100M+
Items
Store Associate E-commerce Search
providesthe functionalityof a human butatscale
3
????
Flash Drive
USB Drive
Thumb Drive
Jump Drive
Pen Drive
Zip Drive
Memory Stick
USB Stick
USB Flash Drive
USB Memory
USB Storage Device
Flush Drive
USC Drive
Thamb Drive
Jmp Drive
Pin Drive
Zap Drive
Memory Steak
USB Stock
USB Flash Drve
MisspelledQueries
4
saddle
Horse Saddle
Bike Saddle
Outline
5
• Core problems of e-commerce search
• Semantic search in e-commerce
• Deep Learning for semantic search
– Query classification
– Query token tagging
– Neural IR
– Image understanding (sneak peek)
Core of E-commerce Search
6
Text Query
Text
Query
Find Items
Catalog
Core problems of E-commerce search
7
Learning book
Tide 100 oz
Tide 100 fl oz
Tide 100 ounce
Neck style?
Fabric?
No. of pockets?
Ziploc
Ambiguity
Missing catalog values
Levi’s
Levi Strauss
Signature by Levi Strauss and Co.
Open vocabulary in query and catalog
Buying decision is influenced by item attractiveness
8
pump shoes
Tags
$300!!!
Presence of
expensive items
Image quality
Matching
query to items
Ranking
items
Core technical problems of e-commerce search
9
Pump shoes
✅
✅
❌
Position 1
Position 2
Text matching is not enough
10
Lemon
Lemon BalmLemon Fruit
Nivea 16oz
Nivea 15.5oz
Tire sealant
16oz
Query understanding
•Attributeunderstanding
Matching query and
item
•Text matching
•Attributematching
Ranking Items
Semantic Search
11
Deep learning for semantic search
12
Deep Learning for
Query understanding
Matching query and
item
•Text matching
•Attributematching
Ranking Items
Deep learning for semantic search
13
Deep Learning for
Query understanding
Matching query and
item
•Text matching
•Attributematching
Ranking Items
Neural IR
End-to-end matching and ranking
Image understanding
Not just text search
Outline
14
• Core problems of e-commerce search
• Semantic search in e-commerce
• Deep Learning for semantic search
– Query classification
– Query token tagging
– Neural IR
– Image understanding (sneak peek)
Query Classification
15
Text query
product type 1 : confidence level
product type 2 : confidence level
product type 3 : confidence level
Product Type
• A predefined list
• Indicates a specific product in the catalog
• Every item in the catalog is tagged with a product type
Query classification examples
16
Computer Video Cards: 0.85
Laptop Computers: 0.08
Desktop Computers: 0.06
nvidia gpu
Food StorageBags: 1.0ziploc bags
Largenumber of product
typesbedroom furniture
Hard to balance
precision vs recall
Query classification challenges
17
Shorttext
•Queries areof 2-3
tokens
Largescale
classification
•Thousandsof
producttypes
(classes)
Multi-class, multi-
label problem
•Samequery can
have multiple
producttypes
Needs to respond
in few milliseconds
•Classifies queries
at runtime
Unbalanced class
distribution
•Someproduct
types aremuch
morepopular
Data and Model
18
BiLSTM
<query, product type
ordered>
<query, item ordered>
Historical Search Log
https://guillaumegenthial.github.io/sequence-tagging-with-tensorflow.html
Output Layer
word2vec
Softmax/sigmoid
Usage of query classification
19
Without Query Classification After we understand the query “lemon” as a fruit
lemon
20% reduction of
irrelevant items
in certain query
segments
Key Learnings
20
Logistic
RegressionDeep
Learning
6%
higher
accuracy
CNN
BiLSTM
More
accurate
1 K80 GPU
8 Core CPU
6X
faster
1 K80 GPU48 Core CPU
Equal
Accuracy Training Time
Key Learnings - instability in Prediction
21
Television Stands: 0.32;
Laptop Computers: 0.27
Hard Drives: 0.11
Hard Drives: 1.00
Old model
New model
samsung 850 evo 250gb 2.5 inch
Instability in Prediction
22
Training data N Training data (N + 1)2.5% difference
in training data
Model N Model (N + 1)
Top predicted class is different for 10% of the test set
Instability in prediction – different seeds
23
Training data N
Model N Model N’
Top predicted class is different for 7% of the test set
Seed 1 Seed 2
Different tensorflow
and numpy seeds
Sources of Instability
24
Overfitting
•Deep Learning model has high
variance, particularly on the low
traffic queries
•Simpler models could be more
stable but less accurate
Sigmoid (1-vs-all) classifier
is more stable
•Softmax scores are
interdependent across classes
and less stable
Noisy training data
•Item order data is less noisy
than click
Rounding errors in the
arithmetic operations
•CPU is more stable than GPU
Reduction of Instability
25
40% reduction
of instability
Softmax
Clicks
CNN
Sigmoid
Orders
BiLSTM
• Product Type
• Brand
• Color
• Gender
• Age Group
• Size (value & unit)
– Pack Size
– Screen Size
– Shoe Size
– …
• Character
• Style
• Material
• …
Attributes to match
26
Not Feasible – Separate classifier for
each attribute
• Too many classes (e.g. 100K+ brand values)
• Sparse attributes; most attribute prediction
should be NA
• Creating training data of <query, attribute> is
more noisy and inaccurate
Query token tagging
27
Query
Query tokens
tagged with
Attribute Names
faded glory long sleeve shirts for women
Faded Glory
Long sleeve
shirts
women
for
Product type
Brand
Sleeve length
Gender
NULL
Training data
28
blue women levis jeans
Brand Product
Type
GenderColor
toys for girls 3 – 6 years
Age
Value
Age
Unit
GenderProduct
Type
Human curated data
It is a hard task for human
•Is "outside” a producttype token in the
query, “canopytents for outside”?
Disagreement between taggers are
high (~30%)
Fortunately 10K training data is a
good start
Model – BiLSTM-CRF
29
word2vec
Features
for CRF
https://guillaumegenthial.github.io/sequence-tagging-with-tensorflow.html
Linear Chain CRF
querytokens
𝑃(𝑡𝑎𝑔1, … , 𝑡𝑎𝑔 𝑛)
Char
embedding
Char Embeddings
30
G P U
word2vec
BiLSTM-CRF Network
Character embedding network
word2vec type
learnt on character
sequence
• Maps a sequence of characters to a fixed size vector
• Handles out of vocabulary words
• Handles misspellings
Char Embedding
31
sansung tvsansung tv
Brand Product
Type
NULL Product
Type
With Char EmbeddingWithout Char Embedding
Improving search results using query tagging
32
Women citizen eco drive watch
Before After understanding the Gender token
Regex matchwill be incorrect for
queries like
pioneer women dinnerware
wonder women bedding
spider man car seats
Other use cases of query tagging
33
samsung tv 32 in
32 in vizio tv
sanyo flat screen tv
led tv sony 55”
samsung tv stand
sony tv remote
TV queries
Not
TV queries
Customer Demand Analysis
• Most searched brand of TV
Attribute filter suggestion
• Suggest top attributes (e.g. brand,
screen size) that customers look for
for in a product type query (e.g. TV)
Search query log
Traditional
IR
Semantic
Search
Neural IR
Neural IR
34
Token and synonym
match
Learning to Rank
• Attribute extraction
• Token, synonym and
attribute match
• Learning to rank
End-to-end matching
and ranking
Neural IR – Design 1
35
Query
Item
Title
Input
Embedding
Concatenation
Neural
Transformation
Transformed
feature
Relevance
Score
• Runtime computation
• Not scalable for large
number of items
Neural IR – Design 2
36
Query
Item
Title
Input
Embedding
Neural
Transformation
Query, item
embeddings
Relevance
Score
item embeddings can be
computed offline and
indexed
shared weights
Input Embedding
37
Comparable Accuracy
Input
Embedding
token 1
…
token n
Query
or
Item Title
… AVG
word2vec
token 1
…
token n
… CNN
word2vec
Input
Embedding
Query
or
Item Title
Training Data
38
query, item title, click through rate (ctr)*
Historical search log
*Position bias correction for ctr of a query, item pair
𝑐𝑡𝑟 =
σ 𝑟 𝑐𝑙𝑖𝑐𝑘𝑠_𝑐𝑜𝑟𝑟𝑒𝑐𝑡𝑒𝑑 𝑟
σ 𝑟 𝑖𝑚𝑝𝑟𝑒𝑠𝑠𝑖𝑜𝑛𝑠 𝑟
𝑐𝑙𝑖𝑐𝑘𝑠_𝑐𝑜𝑟𝑟𝑒𝑐𝑡𝑒𝑑 𝑟 = 𝑐𝑙𝑖𝑐𝑘𝑠 𝑟 + 𝑖𝑚𝑝𝑟𝑒𝑠𝑠𝑖𝑜𝑛𝑠 𝑟 − 𝑐𝑙𝑖𝑐𝑘𝑠 𝑟 ∗ 𝑃 𝑐𝑙𝑖𝑐𝑘 𝑟)
𝑟 = 𝑟𝑎𝑛𝑘 𝑎𝑡 𝑤ℎ𝑖𝑐ℎ 𝑡ℎ𝑒 𝑖𝑡𝑒𝑚 𝑤𝑎𝑠 𝑑𝑖𝑠𝑝𝑙𝑎𝑦𝑒𝑑
Training Loss
39
Point-wise Pair-wise
𝑥 𝑞 = 𝑞𝑢𝑒𝑟𝑦 𝑓𝑒𝑎𝑡𝑢𝑟𝑒𝑠
𝑥 𝑝 = 𝑖𝑡𝑒𝑚 𝑓𝑒𝑎𝑡𝑢𝑟𝑒𝑠
𝑓 𝑥 𝑞, 𝑥 𝑝 → ctr
Regression problem
Sigmoid cross entropy loss
Brooks shoes
𝑥 𝑞 𝑥 𝑝
𝑥 𝑛
relevant less relevantquery
𝑓 𝑥 𝑞, 𝑥 𝑝 > 𝑓 𝑥 𝑞, 𝑥 𝑛
𝑤ℎ𝑒𝑛 𝑐𝑡𝑟 𝑥 𝑞, 𝑥 𝑝 > 𝑐𝑡𝑟(𝑥 𝑞, 𝑥 𝑛)
Minimize pair inversions
Pair-wise logistic loss
Accuracy on pair-wise loss
40
NDCG captures quality of overall ranking
Pair accuracy captures if higher ctr (relevant) items ranked above the lower ctr items
0.00%
10.00%
20.00%
30.00%
40.00%
50.00%
60.00%
70.00%
Design 1 Design 2
NDCG@10liftagainstbaseline
30.00%
31.00%
32.00%
33.00%
34.00%
35.00%
36.00%
Design 1 Design 2
PairAccuracyliftagainstbaseline
Pros
• End to end approach
• Enables Semantic matching implicitly
• Handles different data types (text, image)
Cons
• Not scalable (yet)
• Not so successful (yet)
Neural IR
41
Image understanding
42
Predicted Attributes
• Product type
• Style
• Material
• Color
Attribute
Prediction
Visual
Search
Compatible
Outfit
Image understanding key learnings
43
• Multi-task learning is
more accurate
• Predicting style is
harder than predicting
product type
Attribute
Prediction
Visual
Search
Compatible
Outfit
• A/B test on hayneedle.com
• Comparable results against
a well established startup
• Under exploration
• Early results beating
token based approach
Future
44
Evolution of mobile phone
Future
45
Web Search E-commerce Search
Conversational
commerce
Seamless search
and personalized
results
V-Commerce
46

Contenu connexe

Tendances

Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Bhaskar Mitra
 
제 14회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [주식팀] : 특정 시간대의 주가 변동 패턴을 이용한 실시간 주가 예측
제 14회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [주식팀] : 특정 시간대의 주가 변동 패턴을 이용한 실시간 주가 예측제 14회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [주식팀] : 특정 시간대의 주가 변동 패턴을 이용한 실시간 주가 예측
제 14회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [주식팀] : 특정 시간대의 주가 변동 패턴을 이용한 실시간 주가 예측BOAZ Bigdata
 
Graph kernels
Graph kernelsGraph kernels
Graph kernelsLuc Brun
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialXavier Amatriain
 
A Technical Introduction to WiredTiger
A Technical Introduction to WiredTigerA Technical Introduction to WiredTiger
A Technical Introduction to WiredTigerMongoDB
 
Feature Engineering
Feature EngineeringFeature Engineering
Feature EngineeringSri Ambati
 
Vector databases and neural search
Vector databases and neural searchVector databases and neural search
Vector databases and neural searchDmitry Kan
 
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...Haystack 2019 - Making the case for human judgement relevance testing - Tara ...
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...OpenSource Connections
 
[236] 카카오의데이터파이프라인 윤도영
[236] 카카오의데이터파이프라인 윤도영[236] 카카오의데이터파이프라인 윤도영
[236] 카카오의데이터파이프라인 윤도영NAVER D2
 
Machine Learning at Netflix Scale
Machine Learning at Netflix ScaleMachine Learning at Netflix Scale
Machine Learning at Netflix ScaleAish Fenton
 
ONNX and MLflow
ONNX and MLflowONNX and MLflow
ONNX and MLflowamesar0
 
딥 러닝 자연어 처리를 학습을 위한 파워포인트. (Deep Learning for Natural Language Processing)
딥 러닝 자연어 처리를 학습을 위한 파워포인트. (Deep Learning for Natural Language Processing)딥 러닝 자연어 처리를 학습을 위한 파워포인트. (Deep Learning for Natural Language Processing)
딥 러닝 자연어 처리를 학습을 위한 파워포인트. (Deep Learning for Natural Language Processing)WON JOON YOO
 
Advanced Model Comparison and Automated Deployment Using ML
Advanced Model Comparison and Automated Deployment Using MLAdvanced Model Comparison and Automated Deployment Using ML
Advanced Model Comparison and Automated Deployment Using MLDatabricks
 
Locality Sensitive Hashing By Spark
Locality Sensitive Hashing By SparkLocality Sensitive Hashing By Spark
Locality Sensitive Hashing By SparkSpark Summit
 
트위터의 추천 시스템 파헤치기
트위터의 추천 시스템 파헤치기트위터의 추천 시스템 파헤치기
트위터의 추천 시스템 파헤치기Yan So
 
Transactional writes to cloud storage with Eric Liang
Transactional writes to cloud storage with Eric LiangTransactional writes to cloud storage with Eric Liang
Transactional writes to cloud storage with Eric LiangDatabricks
 
Deep Learning for Personalized Search and Recommender Systems
Deep Learning for Personalized Search and Recommender SystemsDeep Learning for Personalized Search and Recommender Systems
Deep Learning for Personalized Search and Recommender SystemsBenjamin Le
 
Neo4j in Production: A look at Neo4j in the Real World
Neo4j in Production: A look at Neo4j in the Real WorldNeo4j in Production: A look at Neo4j in the Real World
Neo4j in Production: A look at Neo4j in the Real WorldNeo4j
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxElasticsearch
 

Tendances (20)

Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)Neural Text Embeddings for Information Retrieval (WSDM 2017)
Neural Text Embeddings for Information Retrieval (WSDM 2017)
 
제 14회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [주식팀] : 특정 시간대의 주가 변동 패턴을 이용한 실시간 주가 예측
제 14회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [주식팀] : 특정 시간대의 주가 변동 패턴을 이용한 실시간 주가 예측제 14회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [주식팀] : 특정 시간대의 주가 변동 패턴을 이용한 실시간 주가 예측
제 14회 보아즈(BOAZ) 빅데이터 컨퍼런스 - [주식팀] : 특정 시간대의 주가 변동 패턴을 이용한 실시간 주가 예측
 
Graph kernels
Graph kernelsGraph kernels
Graph kernels
 
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorialBuilding Large-scale Real-world Recommender Systems - Recsys2012 tutorial
Building Large-scale Real-world Recommender Systems - Recsys2012 tutorial
 
A Technical Introduction to WiredTiger
A Technical Introduction to WiredTigerA Technical Introduction to WiredTiger
A Technical Introduction to WiredTiger
 
Feature Engineering
Feature EngineeringFeature Engineering
Feature Engineering
 
Vector databases and neural search
Vector databases and neural searchVector databases and neural search
Vector databases and neural search
 
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...Haystack 2019 - Making the case for human judgement relevance testing - Tara ...
Haystack 2019 - Making the case for human judgement relevance testing - Tara ...
 
[236] 카카오의데이터파이프라인 윤도영
[236] 카카오의데이터파이프라인 윤도영[236] 카카오의데이터파이프라인 윤도영
[236] 카카오의데이터파이프라인 윤도영
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Machine Learning at Netflix Scale
Machine Learning at Netflix ScaleMachine Learning at Netflix Scale
Machine Learning at Netflix Scale
 
ONNX and MLflow
ONNX and MLflowONNX and MLflow
ONNX and MLflow
 
딥 러닝 자연어 처리를 학습을 위한 파워포인트. (Deep Learning for Natural Language Processing)
딥 러닝 자연어 처리를 학습을 위한 파워포인트. (Deep Learning for Natural Language Processing)딥 러닝 자연어 처리를 학습을 위한 파워포인트. (Deep Learning for Natural Language Processing)
딥 러닝 자연어 처리를 학습을 위한 파워포인트. (Deep Learning for Natural Language Processing)
 
Advanced Model Comparison and Automated Deployment Using ML
Advanced Model Comparison and Automated Deployment Using MLAdvanced Model Comparison and Automated Deployment Using ML
Advanced Model Comparison and Automated Deployment Using ML
 
Locality Sensitive Hashing By Spark
Locality Sensitive Hashing By SparkLocality Sensitive Hashing By Spark
Locality Sensitive Hashing By Spark
 
트위터의 추천 시스템 파헤치기
트위터의 추천 시스템 파헤치기트위터의 추천 시스템 파헤치기
트위터의 추천 시스템 파헤치기
 
Transactional writes to cloud storage with Eric Liang
Transactional writes to cloud storage with Eric LiangTransactional writes to cloud storage with Eric Liang
Transactional writes to cloud storage with Eric Liang
 
Deep Learning for Personalized Search and Recommender Systems
Deep Learning for Personalized Search and Recommender SystemsDeep Learning for Personalized Search and Recommender Systems
Deep Learning for Personalized Search and Recommender Systems
 
Neo4j in Production: A look at Neo4j in the Real World
Neo4j in Production: A look at Neo4j in the Real WorldNeo4j in Production: A look at Neo4j in the Real World
Neo4j in Production: A look at Neo4j in the Real World
 
An introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolboxAn introduction to Elasticsearch's advanced relevance ranking toolbox
An introduction to Elasticsearch's advanced relevance ranking toolbox
 

Similaire à Deep Learning for Semantic Search in E-commerce​

Which library should you choose for data-science? That's the question!
Which library should you choose for data-science? That's the question!Which library should you choose for data-science? That's the question!
Which library should you choose for data-science? That's the question!Anastasia Bobyreva
 
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr... Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...Databricks
 
Strata London - Deep Learning 05-2015
Strata London - Deep Learning 05-2015Strata London - Deep Learning 05-2015
Strata London - Deep Learning 05-2015Turi, Inc.
 
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...Rodney Joyce
 
Automated Hyperparameter Tuning, Scaling and Tracking
Automated Hyperparameter Tuning, Scaling and TrackingAutomated Hyperparameter Tuning, Scaling and Tracking
Automated Hyperparameter Tuning, Scaling and TrackingDatabricks
 
Practical End-to-End Learning to Rank Using Fusion - Andy Liu, Lucidworks
Practical End-to-End Learning to Rank Using Fusion - Andy Liu, Lucidworks Practical End-to-End Learning to Rank Using Fusion - Andy Liu, Lucidworks
Practical End-to-End Learning to Rank Using Fusion - Andy Liu, Lucidworks Lucidworks
 
[RakutenTechConf2013] [B-3_3] Rakuten Category
[RakutenTechConf2013] [B-3_3] Rakuten Category[RakutenTechConf2013] [B-3_3] Rakuten Category
[RakutenTechConf2013] [B-3_3] Rakuten CategoryRakuten Group, Inc.
 
VSSML18. Clustering and Latent Dirichlet Allocation
VSSML18. Clustering and Latent Dirichlet AllocationVSSML18. Clustering and Latent Dirichlet Allocation
VSSML18. Clustering and Latent Dirichlet AllocationBigML, Inc
 
Análisis del roadmap del Elastic Stack
Análisis del roadmap del Elastic StackAnálisis del roadmap del Elastic Stack
Análisis del roadmap del Elastic StackElasticsearch
 
Telecom datascience master_public
Telecom datascience master_publicTelecom datascience master_public
Telecom datascience master_publicVincent Michel
 
Faceted search with Oracle InMemory option
Faceted search with Oracle InMemory optionFaceted search with Oracle InMemory option
Faceted search with Oracle InMemory optionAlexander Tokarev
 
Approximate "Now" is Better Than Accurate "Later"
Approximate "Now" is Better Than Accurate "Later"Approximate "Now" is Better Than Accurate "Later"
Approximate "Now" is Better Than Accurate "Later"NUS-ISS
 
BSSML17 - Deepnets
BSSML17 - DeepnetsBSSML17 - Deepnets
BSSML17 - DeepnetsBigML, Inc
 
ML Infra @ Spotify: Lessons Learned - Romain Yon - NYC ML Meetup
ML Infra @ Spotify: Lessons Learned - Romain Yon -  NYC ML MeetupML Infra @ Spotify: Lessons Learned - Romain Yon -  NYC ML Meetup
ML Infra @ Spotify: Lessons Learned - Romain Yon - NYC ML MeetupRomain Yon
 
Modern Search: Using ML & NLP advances to enhance search and discovery
Modern Search: Using ML & NLP advances to enhance search and discoveryModern Search: Using ML & NLP advances to enhance search and discovery
Modern Search: Using ML & NLP advances to enhance search and discoveryAll Things Open
 
Using Deep Learning and Customized Solr Components to Improve search Relevanc...
Using Deep Learning and Customized Solr Components to Improve search Relevanc...Using Deep Learning and Customized Solr Components to Improve search Relevanc...
Using Deep Learning and Customized Solr Components to Improve search Relevanc...Lucidworks
 
Introduction to machine learning with GPUs
Introduction to machine learning with GPUsIntroduction to machine learning with GPUs
Introduction to machine learning with GPUsCarol McDonald
 

Similaire à Deep Learning for Semantic Search in E-commerce​ (20)

Which library should you choose for data-science? That's the question!
Which library should you choose for data-science? That's the question!Which library should you choose for data-science? That's the question!
Which library should you choose for data-science? That's the question!
 
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr... Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr...
 
Summit EU Machine Learning
Summit EU Machine LearningSummit EU Machine Learning
Summit EU Machine Learning
 
Strata London - Deep Learning 05-2015
Strata London - Deep Learning 05-2015Strata London - Deep Learning 05-2015
Strata London - Deep Learning 05-2015
 
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
Data Science for Dummies - Data Engineering with Titanic dataset + Databricks...
 
Automated Hyperparameter Tuning, Scaling and Tracking
Automated Hyperparameter Tuning, Scaling and TrackingAutomated Hyperparameter Tuning, Scaling and Tracking
Automated Hyperparameter Tuning, Scaling and Tracking
 
Role of Data Science in eCommerce
Role of Data Science in eCommerceRole of Data Science in eCommerce
Role of Data Science in eCommerce
 
Practical End-to-End Learning to Rank Using Fusion - Andy Liu, Lucidworks
Practical End-to-End Learning to Rank Using Fusion - Andy Liu, Lucidworks Practical End-to-End Learning to Rank Using Fusion - Andy Liu, Lucidworks
Practical End-to-End Learning to Rank Using Fusion - Andy Liu, Lucidworks
 
[RakutenTechConf2013] [B-3_3] Rakuten Category
[RakutenTechConf2013] [B-3_3] Rakuten Category[RakutenTechConf2013] [B-3_3] Rakuten Category
[RakutenTechConf2013] [B-3_3] Rakuten Category
 
ATP 2014
ATP 2014ATP 2014
ATP 2014
 
VSSML18. Clustering and Latent Dirichlet Allocation
VSSML18. Clustering and Latent Dirichlet AllocationVSSML18. Clustering and Latent Dirichlet Allocation
VSSML18. Clustering and Latent Dirichlet Allocation
 
Análisis del roadmap del Elastic Stack
Análisis del roadmap del Elastic StackAnálisis del roadmap del Elastic Stack
Análisis del roadmap del Elastic Stack
 
Telecom datascience master_public
Telecom datascience master_publicTelecom datascience master_public
Telecom datascience master_public
 
Faceted search with Oracle InMemory option
Faceted search with Oracle InMemory optionFaceted search with Oracle InMemory option
Faceted search with Oracle InMemory option
 
Approximate "Now" is Better Than Accurate "Later"
Approximate "Now" is Better Than Accurate "Later"Approximate "Now" is Better Than Accurate "Later"
Approximate "Now" is Better Than Accurate "Later"
 
BSSML17 - Deepnets
BSSML17 - DeepnetsBSSML17 - Deepnets
BSSML17 - Deepnets
 
ML Infra @ Spotify: Lessons Learned - Romain Yon - NYC ML Meetup
ML Infra @ Spotify: Lessons Learned - Romain Yon -  NYC ML MeetupML Infra @ Spotify: Lessons Learned - Romain Yon -  NYC ML Meetup
ML Infra @ Spotify: Lessons Learned - Romain Yon - NYC ML Meetup
 
Modern Search: Using ML & NLP advances to enhance search and discovery
Modern Search: Using ML & NLP advances to enhance search and discoveryModern Search: Using ML & NLP advances to enhance search and discovery
Modern Search: Using ML & NLP advances to enhance search and discovery
 
Using Deep Learning and Customized Solr Components to Improve search Relevanc...
Using Deep Learning and Customized Solr Components to Improve search Relevanc...Using Deep Learning and Customized Solr Components to Improve search Relevanc...
Using Deep Learning and Customized Solr Components to Improve search Relevanc...
 
Introduction to machine learning with GPUs
Introduction to machine learning with GPUsIntroduction to machine learning with GPUs
Introduction to machine learning with GPUs
 

Dernier

Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesPrabhanshu Chaturvedi
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxfenichawla
 

Dernier (20)

DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 

Deep Learning for Semantic Search in E-commerce​

  • 1. Deep Learning for Semantic Search in E-commerce Somnath Banerjee Head of Search Algorithms at Walmart Labs https://www.linkedin.com/in/somnath-banerjee/ March 19, 2019
  • 2. Walmart E-commerce search problem 2 100M+ Customers 100M+ Queries 100M+ Items Store Associate E-commerce Search providesthe functionalityof a human butatscale
  • 3. 3 ???? Flash Drive USB Drive Thumb Drive Jump Drive Pen Drive Zip Drive Memory Stick USB Stick USB Flash Drive USB Memory USB Storage Device Flush Drive USC Drive Thamb Drive Jmp Drive Pin Drive Zap Drive Memory Steak USB Stock USB Flash Drve MisspelledQueries
  • 5. Outline 5 • Core problems of e-commerce search • Semantic search in e-commerce • Deep Learning for semantic search – Query classification – Query token tagging – Neural IR – Image understanding (sneak peek)
  • 6. Core of E-commerce Search 6 Text Query Text Query Find Items Catalog
  • 7. Core problems of E-commerce search 7 Learning book Tide 100 oz Tide 100 fl oz Tide 100 ounce Neck style? Fabric? No. of pockets? Ziploc Ambiguity Missing catalog values Levi’s Levi Strauss Signature by Levi Strauss and Co. Open vocabulary in query and catalog
  • 8. Buying decision is influenced by item attractiveness 8 pump shoes Tags $300!!! Presence of expensive items Image quality
  • 9. Matching query to items Ranking items Core technical problems of e-commerce search 9 Pump shoes ✅ ✅ ❌ Position 1 Position 2
  • 10. Text matching is not enough 10 Lemon Lemon BalmLemon Fruit Nivea 16oz Nivea 15.5oz Tire sealant 16oz
  • 11. Query understanding •Attributeunderstanding Matching query and item •Text matching •Attributematching Ranking Items Semantic Search 11
  • 12. Deep learning for semantic search 12 Deep Learning for Query understanding Matching query and item •Text matching •Attributematching Ranking Items
  • 13. Deep learning for semantic search 13 Deep Learning for Query understanding Matching query and item •Text matching •Attributematching Ranking Items Neural IR End-to-end matching and ranking Image understanding Not just text search
  • 14. Outline 14 • Core problems of e-commerce search • Semantic search in e-commerce • Deep Learning for semantic search – Query classification – Query token tagging – Neural IR – Image understanding (sneak peek)
  • 15. Query Classification 15 Text query product type 1 : confidence level product type 2 : confidence level product type 3 : confidence level Product Type • A predefined list • Indicates a specific product in the catalog • Every item in the catalog is tagged with a product type
  • 16. Query classification examples 16 Computer Video Cards: 0.85 Laptop Computers: 0.08 Desktop Computers: 0.06 nvidia gpu Food StorageBags: 1.0ziploc bags Largenumber of product typesbedroom furniture Hard to balance precision vs recall
  • 17. Query classification challenges 17 Shorttext •Queries areof 2-3 tokens Largescale classification •Thousandsof producttypes (classes) Multi-class, multi- label problem •Samequery can have multiple producttypes Needs to respond in few milliseconds •Classifies queries at runtime Unbalanced class distribution •Someproduct types aremuch morepopular
  • 18. Data and Model 18 BiLSTM <query, product type ordered> <query, item ordered> Historical Search Log https://guillaumegenthial.github.io/sequence-tagging-with-tensorflow.html Output Layer word2vec Softmax/sigmoid
  • 19. Usage of query classification 19 Without Query Classification After we understand the query “lemon” as a fruit lemon 20% reduction of irrelevant items in certain query segments
  • 20. Key Learnings 20 Logistic RegressionDeep Learning 6% higher accuracy CNN BiLSTM More accurate 1 K80 GPU 8 Core CPU 6X faster 1 K80 GPU48 Core CPU Equal Accuracy Training Time
  • 21. Key Learnings - instability in Prediction 21 Television Stands: 0.32; Laptop Computers: 0.27 Hard Drives: 0.11 Hard Drives: 1.00 Old model New model samsung 850 evo 250gb 2.5 inch
  • 22. Instability in Prediction 22 Training data N Training data (N + 1)2.5% difference in training data Model N Model (N + 1) Top predicted class is different for 10% of the test set
  • 23. Instability in prediction – different seeds 23 Training data N Model N Model N’ Top predicted class is different for 7% of the test set Seed 1 Seed 2 Different tensorflow and numpy seeds
  • 24. Sources of Instability 24 Overfitting •Deep Learning model has high variance, particularly on the low traffic queries •Simpler models could be more stable but less accurate Sigmoid (1-vs-all) classifier is more stable •Softmax scores are interdependent across classes and less stable Noisy training data •Item order data is less noisy than click Rounding errors in the arithmetic operations •CPU is more stable than GPU
  • 25. Reduction of Instability 25 40% reduction of instability Softmax Clicks CNN Sigmoid Orders BiLSTM
  • 26. • Product Type • Brand • Color • Gender • Age Group • Size (value & unit) – Pack Size – Screen Size – Shoe Size – … • Character • Style • Material • … Attributes to match 26 Not Feasible – Separate classifier for each attribute • Too many classes (e.g. 100K+ brand values) • Sparse attributes; most attribute prediction should be NA • Creating training data of <query, attribute> is more noisy and inaccurate
  • 27. Query token tagging 27 Query Query tokens tagged with Attribute Names faded glory long sleeve shirts for women Faded Glory Long sleeve shirts women for Product type Brand Sleeve length Gender NULL
  • 28. Training data 28 blue women levis jeans Brand Product Type GenderColor toys for girls 3 – 6 years Age Value Age Unit GenderProduct Type Human curated data It is a hard task for human •Is "outside” a producttype token in the query, “canopytents for outside”? Disagreement between taggers are high (~30%) Fortunately 10K training data is a good start
  • 29. Model – BiLSTM-CRF 29 word2vec Features for CRF https://guillaumegenthial.github.io/sequence-tagging-with-tensorflow.html Linear Chain CRF querytokens 𝑃(𝑡𝑎𝑔1, … , 𝑡𝑎𝑔 𝑛) Char embedding
  • 30. Char Embeddings 30 G P U word2vec BiLSTM-CRF Network Character embedding network word2vec type learnt on character sequence
  • 31. • Maps a sequence of characters to a fixed size vector • Handles out of vocabulary words • Handles misspellings Char Embedding 31 sansung tvsansung tv Brand Product Type NULL Product Type With Char EmbeddingWithout Char Embedding
  • 32. Improving search results using query tagging 32 Women citizen eco drive watch Before After understanding the Gender token Regex matchwill be incorrect for queries like pioneer women dinnerware wonder women bedding spider man car seats
  • 33. Other use cases of query tagging 33 samsung tv 32 in 32 in vizio tv sanyo flat screen tv led tv sony 55” samsung tv stand sony tv remote TV queries Not TV queries Customer Demand Analysis • Most searched brand of TV Attribute filter suggestion • Suggest top attributes (e.g. brand, screen size) that customers look for for in a product type query (e.g. TV) Search query log
  • 34. Traditional IR Semantic Search Neural IR Neural IR 34 Token and synonym match Learning to Rank • Attribute extraction • Token, synonym and attribute match • Learning to rank End-to-end matching and ranking
  • 35. Neural IR – Design 1 35 Query Item Title Input Embedding Concatenation Neural Transformation Transformed feature Relevance Score • Runtime computation • Not scalable for large number of items
  • 36. Neural IR – Design 2 36 Query Item Title Input Embedding Neural Transformation Query, item embeddings Relevance Score item embeddings can be computed offline and indexed shared weights
  • 37. Input Embedding 37 Comparable Accuracy Input Embedding token 1 … token n Query or Item Title … AVG word2vec token 1 … token n … CNN word2vec Input Embedding Query or Item Title
  • 38. Training Data 38 query, item title, click through rate (ctr)* Historical search log *Position bias correction for ctr of a query, item pair 𝑐𝑡𝑟 = σ 𝑟 𝑐𝑙𝑖𝑐𝑘𝑠_𝑐𝑜𝑟𝑟𝑒𝑐𝑡𝑒𝑑 𝑟 σ 𝑟 𝑖𝑚𝑝𝑟𝑒𝑠𝑠𝑖𝑜𝑛𝑠 𝑟 𝑐𝑙𝑖𝑐𝑘𝑠_𝑐𝑜𝑟𝑟𝑒𝑐𝑡𝑒𝑑 𝑟 = 𝑐𝑙𝑖𝑐𝑘𝑠 𝑟 + 𝑖𝑚𝑝𝑟𝑒𝑠𝑠𝑖𝑜𝑛𝑠 𝑟 − 𝑐𝑙𝑖𝑐𝑘𝑠 𝑟 ∗ 𝑃 𝑐𝑙𝑖𝑐𝑘 𝑟) 𝑟 = 𝑟𝑎𝑛𝑘 𝑎𝑡 𝑤ℎ𝑖𝑐ℎ 𝑡ℎ𝑒 𝑖𝑡𝑒𝑚 𝑤𝑎𝑠 𝑑𝑖𝑠𝑝𝑙𝑎𝑦𝑒𝑑
  • 39. Training Loss 39 Point-wise Pair-wise 𝑥 𝑞 = 𝑞𝑢𝑒𝑟𝑦 𝑓𝑒𝑎𝑡𝑢𝑟𝑒𝑠 𝑥 𝑝 = 𝑖𝑡𝑒𝑚 𝑓𝑒𝑎𝑡𝑢𝑟𝑒𝑠 𝑓 𝑥 𝑞, 𝑥 𝑝 → ctr Regression problem Sigmoid cross entropy loss Brooks shoes 𝑥 𝑞 𝑥 𝑝 𝑥 𝑛 relevant less relevantquery 𝑓 𝑥 𝑞, 𝑥 𝑝 > 𝑓 𝑥 𝑞, 𝑥 𝑛 𝑤ℎ𝑒𝑛 𝑐𝑡𝑟 𝑥 𝑞, 𝑥 𝑝 > 𝑐𝑡𝑟(𝑥 𝑞, 𝑥 𝑛) Minimize pair inversions Pair-wise logistic loss
  • 40. Accuracy on pair-wise loss 40 NDCG captures quality of overall ranking Pair accuracy captures if higher ctr (relevant) items ranked above the lower ctr items 0.00% 10.00% 20.00% 30.00% 40.00% 50.00% 60.00% 70.00% Design 1 Design 2 NDCG@10liftagainstbaseline 30.00% 31.00% 32.00% 33.00% 34.00% 35.00% 36.00% Design 1 Design 2 PairAccuracyliftagainstbaseline
  • 41. Pros • End to end approach • Enables Semantic matching implicitly • Handles different data types (text, image) Cons • Not scalable (yet) • Not so successful (yet) Neural IR 41
  • 42. Image understanding 42 Predicted Attributes • Product type • Style • Material • Color Attribute Prediction Visual Search Compatible Outfit
  • 43. Image understanding key learnings 43 • Multi-task learning is more accurate • Predicting style is harder than predicting product type Attribute Prediction Visual Search Compatible Outfit • A/B test on hayneedle.com • Comparable results against a well established startup • Under exploration • Early results beating token based approach
  • 45. Future 45 Web Search E-commerce Search Conversational commerce Seamless search and personalized results V-Commerce
  • 46. 46