Real-time Machine Learning
with Hopsworks
An integrated Feature Store and Model Serving
platform
Jim Dowling - CEO
ML Operational Capabilities
Business
Value
Online predictions
Batch updates
Offline predictions
Batch updates
Traditional
Analytics
Training/Test Data
Analytical ML
Operational ML
Real-Time
Machine Learning
Where business value is generated in AI
Online inference
Batch features
Offline inference
Batch features
Model Serving
Online Feature Store
Batch jobs
Offline Feature Store
Model Serving
Online Feature Store
Online inference
Streaming features
Online predictions
Real-time updates
Data
warehouse
Applications
-
Services
Search, Versioning, Statistics, Code
Lineage, Provenance
Feature Views
Model Registry
Feature Groups
Online
Applications &
Services
KServe
Feature Store Models
Where Feature Stores and Model Serving meet
Feature
Groups
Feature
Views
Batch
(DataFrames)
Read Feature Vectors
Online API
Read Files/DataFrames
Offline API
Streaming
(Data Instances)
Models
Feature Store
Transformer Prediction
Service
Predictor
Model
Artifact
Online Predictions
REST API
Model Registry
Deploy
Inference logs
(Data Instances)
Model Serving
Code
Model
files
Model Server
Inference Logger
From Raw Data to Online Predictions
Search, Versioning, Statistics, Transformations
Lineage, Provenance
Versioning, Experiments, Metrics, Code Canary, A/B Testing
Keeping Your Pipelines on Track
Model
Registry
Batch Apps
Online Apps
Feature Groups
Feature Views
Vector DB
Training
Pipelines
Inference
Pipelines
Online
Offline
Model
artifact
Index Creation
Encoder
schema
transformation
functions
versioning
versioning
versioning
experiments
versioning
schema
schema
schema
✓ Versioning →
■ code: feature engineering, transformation functions, model training, model serving scripts
■ assets: model files, model artifacts, experiments
■ configuration: experiment settings, deployments, indexes
✓ Schema management → columns, data types (for feature groups, feature views, models, deployments)
✓ Transformation functions → avoid training / serving skew
✓ Provenance and Lineage → track predictions down to the ingested features
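A minimal sketch of how a shared transformation function can avoid training/serving skew: the statistics are fitted once on training data and the same fitted function is reused at inference time. `MinMaxScaler`, `fit_min_max`, and the price values are hypothetical names and data chosen for illustration; they are not the Hopsworks API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MinMaxScaler:
    """Statistics fitted once on training data, then reused unchanged."""
    minimum: float
    maximum: float

    def __call__(self, value: float) -> float:
        span = self.maximum - self.minimum
        if span == 0:
            return 0.0
        return (value - self.minimum) / span


def fit_min_max(values: list[float]) -> MinMaxScaler:
    """Compute the scaling statistics from the training split only."""
    return MinMaxScaler(min(values), max(values))


# Training pipeline: fit on historical values, then transform them.
train_prices = [10.0, 20.0, 50.0]
scale_price = fit_min_max(train_prices)
train_features = [scale_price(p) for p in train_prices]

# Inference pipeline: apply the very same fitted function to a new value,
# so the model sees inputs on the scale it was trained on.
online_feature = scale_price(30.0)
```

Storing the fitted function next to the feature schema, rather than re-implementing the scaling in the serving code, is what keeps the two pipelines consistent.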
Provenance
versioning
Data warehouse
(historical data)
Applications, Service
(context, trends)
Feature
Pipelines
Batch
Streaming
versioning
A Closer Look at Inference Pipelines
Data warehouse
(historical data)
Model
Registry
Batch Apps
Online Apps
Feature Groups
Feature Views
Applications, Service
(context, trends)
Feature
Pipelines
Vector DB
Batch
Streaming
Training
Pipelines
Offline
Index Creation
Encoder
Model artifact
Batch Inference Jobs
Prediction Service
Transformer
Predictor
Model artifact
Online
Recent
features
Embeddings
Online
predictions
Inference logs
Inference logger
Batch data
Batch
predictions
Feature Store
Inference Request
Streaming
Feature Pipeline
Feature Group
FG 1
FG 2
FG 4
FG 3
Feature View
FV 1
FV 2
FG 5
FV 3
Features
Feature 1
Feature 2
Feature 4 (pk)
Feature 3
Feature 5
Feature 6
Feature 7 (pk)
Feature 9
Feature 8
Model Serving
Transformer
Feature 4 (pk)
Feature Vector
Vector DB
Embedding
Embedding
Embedding
Embeddings
Predictor
Embeddings
Model Input
Inference Response
Prediction
Prediction
Embedding space
Online Apps
Similarity
search
Feedback
Lookups
Inference logs
Model
A Deeper Look at Real-time Inference Pipelines
mapping
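The request flow on this slide (inference request with a primary key, feature vector lookup in the transformer, model input, prediction, inference log) can be sketched roughly as follows. Everything here is an illustrative stand-in: `ONLINE_FEATURES` plays the role of the online feature store, the `Transformer` and `Predictor` classes are plain Python rather than KServe components, and the linear weights are made up.

```python
# Hypothetical in-memory stand-in for the online feature store, keyed by primary key.
ONLINE_FEATURES = {
    "user_42": {"avg_basket": 31.5, "visits_7d": 4},
}


class Transformer:
    """Turns an inference request into model input by looking up features."""

    def preprocess(self, request: dict) -> list[float]:
        row = ONLINE_FEATURES[request["user_id"]]
        # Precomputed features are merged with values sent in the request.
        return [row["avg_basket"], float(row["visits_7d"]), request["cart_value"]]


class Predictor:
    """Wraps the model artifact; a fixed linear scorer is used as a placeholder."""

    weights = [0.01, 0.1, 0.02]

    def predict(self, model_input: list[float]) -> float:
        return sum(w * x for w, x in zip(self.weights, model_input))


def handle(request: dict, inference_log: list) -> float:
    """Transformer -> Predictor -> inference logger, in that order."""
    model_input = Transformer().preprocess(request)
    prediction = Predictor().predict(model_input)
    inference_log.append(
        {"request": request, "input": model_input, "prediction": prediction}
    )
    return prediction


log: list = []
prediction = handle({"user_id": "user_42", "cart_value": 10.0}, log)
```

The client only sends the primary key and request-time values; the logged model input is what later allows a prediction to be traced back to its features.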
Real-Time, Personalized
Recommendation Systems
Candidate Retrieval and Ranking
Embedding
User-Query
Encoder
Features
Embeddings compress high dimensional data, retaining semantic relationships
current user search
user session data
user purchases
user profile
What about Multi-Modal Similarity Search?
Can a “user query” find “items” with similarity search?
Yes, by mapping the “user query” embedding into the “item” embedding space with a two-tower model.
Representation learning for retrieval usually involves supervised learning with labeled or pseudo-labeled data from user-item interactions.
Training data for our Two-Tower Model will be User-Item Interactions
Log user-item interactions as training data for our two-tower model and ranking model.
Retail Website
Search
Item 1
Item 2
Item 3
Item 4
Purchase 3
Click 2
Click 3
Score: 0
Item 1
Score: 1
Item 2
Score: 5
Item 3
Score: 0
Item 4
Features
Features
Features
Features
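One way the logged interactions could be turned into the per-item scores shown on this slide is sketched below. The slide does not say how the scores are derived; the weights (click = 1, purchase = 4, summed per item) are an assumption chosen only because they reproduce the numbers shown, and `score_interactions` is a hypothetical helper.

```python
# Assumed event weights: not stated on the slide, picked to match its scores.
EVENT_WEIGHTS = {"click": 1, "purchase": 4}


def score_interactions(events, shown_items):
    """Sum the event weights per item; items without events keep score 0."""
    scores = {item: 0 for item in shown_items}
    for event_type, item in events:
        scores[item] += EVENT_WEIGHTS[event_type]
    return scores


# Events from the slide: Purchase 3, Click 2, Click 3.
events = [("purchase", 3), ("click", 2), ("click", 3)]
scores = score_interactions(events, shown_items=[1, 2, 3, 4])
```

Each scored (user query, item) pair, joined with its features, then becomes one row of training data for the two-tower and ranking models.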
Training the Two-Tower Embedding Model
User Query
embeddings
User Query
encoder
Item
embeddings
Item encoder
Item category, price, popularity, etc.
User features, preferences, history
Dot product
(Loss fn)
0 → Non-interaction
LOSS
1 → highest interaction
User-Item
Interactions
Training Data
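As a rough illustration of the training signal on this slide, the sketch below encodes a query and an item with two separate towers, takes the dot product of the embeddings, and compares it with a label between 0 (non-interaction) and 1 (highest interaction). The slide only names the dot product and a loss; squashing the dot product with a sigmoid and using binary cross-entropy is an assumption, and the single-layer linear towers with fixed weights are placeholders for real encoders.

```python
import math


def encode(features, weights):
    """One linear 'tower': each row of weights produces one embedding dimension."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def interaction_loss(query_emb, item_emb, label):
    """Binary cross-entropy between sigmoid(dot product) and a label in [0, 1]."""
    p = 1.0 / (1.0 + math.exp(-dot(query_emb, item_emb)))
    eps = 1e-12  # guards against log(0)
    return -(label * math.log(p + eps) + (1.0 - label) * math.log(1.0 - p + eps))


# Placeholder tower weights; in training these are the learned parameters.
QUERY_TOWER = [[1.0, 0.0], [0.0, 1.0]]
ITEM_TOWER = [[0.5, 0.5, 0.0], [0.0, 0.5, 0.5]]

query_emb = encode([2.0, -1.0], QUERY_TOWER)
item_emb = encode([4.0, 0.0, 0.0], ITEM_TOWER)

# The same pair is cheap if the user interacted and expensive if they did not,
# which pushes embeddings of interacting pairs together during training.
loss_if_interacted = interaction_loss(query_emb, item_emb, 1.0)
loss_if_ignored = interaction_loss(query_emb, item_emb, 0.0)
```

Because both towers output vectors of the same dimension, a query embedding can later be compared directly with precomputed item embeddings.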
Model Training for Embedding Models and Ranking Model
Feature Views
items
user queries
Feature Store
Training Data
retrieval.csv
ranking.csv
Ranking
User/Query
Embedding
Item
Embedding
Hopsworks Model Registry
Train Models Train Models
Models
item user clicks
Build the ANN Index on Items. Similarity Search with user queries on it.
OpenSearch k-NN
(ANN Index)
items.csv
Job computes
embeddings for all
Items
Encode all items
Insert all pairs
(item-ID, embedding)
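The index-building job described here (encode all items, insert all (item-ID, embedding) pairs, then search with a user query embedding) can be sketched with an in-memory stand-in. `BruteForceIndex` does an exact dot-product search and is only a placeholder for an approximate index such as OpenSearch k-NN; `item_encoder` and the item records are hypothetical.

```python
class BruteForceIndex:
    """Exact nearest-neighbor search; a stand-in for a real ANN index."""

    def __init__(self):
        self._rows = []  # list of (item_id, embedding) pairs

    def insert(self, item_id, embedding):
        self._rows.append((item_id, embedding))

    def search(self, query_embedding, k):
        """Return the ids of the k items with the highest dot product."""
        scored = [
            (sum(q * e for q, e in zip(query_embedding, emb)), item_id)
            for item_id, emb in self._rows
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [item_id for _, item_id in scored[:k]]


def item_encoder(item):
    """Hypothetical item tower: maps item features to a 2-d embedding."""
    return [item["price"] / 100.0, item["popularity"]]


items = [
    {"id": "a", "price": 20.0, "popularity": 0.9},
    {"id": "b", "price": 80.0, "popularity": 0.1},
    {"id": "c", "price": 50.0, "popularity": 0.5},
]

# Batch job: encode every item and insert the (item-ID, embedding) pair.
index = BruteForceIndex()
for item in items:
    index.insert(item["id"], item_encoder(item))

# Online: search the index with a user query embedding.
nearest = index.search([0.0, 1.0], k=2)
```

A real ANN index trades a little recall for much lower latency on large catalogs; the insert and search interface stays conceptually the same.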
Two-Tower Network with a Vector Database for ANN Search
Source: https://cloud.google.com/blog/products/ai-machine-learning/vertex-matching-engine-blazing-fast-and-massively-scalable-nearest-neighbor-search
Retrieval and Ranking for Personalized Real-time Recommendation Systems
User-Query
Embedding
User-Query
Encoder
Features
Candidate
Retrieval
Ranking
Model
Ranked items
Hopsworks
Feature Store
OpenSearch k-NN
(items)
Candidate
Items
Trends,
Feedback
Search
Get
Features
for
items
Features
Real-time Recommendation Systems
Query
Model
Retrieve closest
candidates using
similarity search
Enrich with
features for
candidates
Ranked
candidates
Recommended
candidates
Ranking
Model
Candidate 1
Candidate 2
Candidate N
Recommendation
request
Enrich with
item/user features
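The stages on this slide can be read as a small pipeline: query model, candidate retrieval, feature enrichment, ranking. The sketch below wires them together with pluggable callables. All function names, the stub implementations, and the popularity-based ranking are hypothetical placeholders, not the deployed components.

```python
def recommend(request, query_model, retrieve, get_features, ranking_model, top_n):
    """Query embedding -> candidate retrieval -> feature enrichment -> ranking."""
    query_embedding = query_model(request)
    candidates = retrieve(query_embedding)
    enriched = [(c, get_features(request["user_id"], c)) for c in candidates]
    ranked = sorted(enriched, key=lambda pair: ranking_model(pair[1]), reverse=True)
    return [candidate for candidate, _ in ranked[:top_n]]


# Stub components, standing in for the embedding model, the similarity
# search, the feature store lookup, and the ranking model.
ITEM_FEATURES = {
    "item_1": {"popularity": 0.2},
    "item_2": {"popularity": 0.9},
    "item_3": {"popularity": 0.5},
}


def query_model(request):
    return [1.0, 0.0]


def retrieve(query_embedding):
    return ["item_1", "item_2", "item_3"]


def get_features(user_id, item_id):
    return ITEM_FEATURES[item_id]


def ranking_model(features):
    return features["popularity"]


recommended = recommend(
    {"user_id": "u1", "query": "shoes"},
    query_model, retrieve, get_features, ranking_model, top_n=2,
)
```

Keeping retrieval cheap and broad, and ranking expensive but limited to a few candidates, is what lets the whole request stay within a real-time latency budget.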
Real-time Recommendation Systems with Hopsworks
User
Query
Model
Retrieve closest
candidates with
similarity search
Enrich with
features for
candidates
Recommendation
request
Recommended
candidates
Enrich with
item/user features
Ranking
Model
Ranked
candidates
Candidate 1
Candidate 2
Candidate N
Hopsworks Feature Store
Predictor Predictor
KServe
Deployment
OpenSearch K-NN
KServe
Deployment
Transformer
Transformer
Extended Retrieval and Ranking Architecture
Embeddings, Retrieval, Filtering, Ranking
User/Query & Item Embeddings
Jointly train the user/query embedding model and the item embedding model with a two-tower model. Build an Approximate Nearest Neighbor (ANN) index from the items using the item embedding model.
Retrieval
Retrieve candidate items from the ANN index with a similarity search on the user embedding.
Filtering
Remove candidate items for various reasons:
• underage user
• item sold out
• item bought before
• item not available in user’s region
Ranking
With a ranking model, score all the candidate items with both user and item features, ensuring candidate diversity.
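The filtering step of this architecture, which removes candidates for the reasons listed on the slide (underage user, item sold out, item bought before, item not available in the user's region), could look roughly like the sketch below. The `User` and `Item` fields such as `min_age` and `regions` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class User:
    age: int
    region: str
    purchased: set = field(default_factory=set)


@dataclass
class Item:
    item_id: str
    in_stock: bool
    regions: set
    min_age: int = 0


def keep(user: User, item: Item) -> bool:
    """A candidate survives only if none of the removal reasons applies."""
    return (
        user.age >= item.min_age                  # underage user
        and item.in_stock                         # item sold out
        and item.item_id not in user.purchased    # item bought before
        and user.region in item.regions           # not available in region
    )


def filter_candidates(user: User, candidates: list) -> list:
    return [item for item in candidates if keep(user, item)]


user = User(age=16, region="SE", purchased={"book"})
candidates = [
    Item("wine", in_stock=True, regions={"SE"}, min_age=18),
    Item("book", in_stock=True, regions={"SE"}),
    Item("lamp", in_stock=False, regions={"SE"}),
    Item("desk", in_stock=True, regions={"US"}),
    Item("pen", in_stock=True, regions={"SE", "US"}),
]
remaining = [item.item_id for item in filter_candidates(user, candidates)]
```

Running the filter before ranking keeps the more expensive ranking model from scoring items that could never be shown.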
