Model Drift Monitoring using
TensorFlow Model Analysis (TFMA)
Vivek Raja P S
A little about myself...
● From Madurai, Tamil Nadu
● Bachelor’s in Computer Science and Engineering (2020)
● Founder & Organizer of Tamil FOSS Community
● 14x Multi-cloud certified (Azure, Oracle & Alibaba)
● Microsoft Certified Data Scientist Associate, AI Engineer
Associate, Data Engineer Associate
● OCI Certified Solution Architect Professional and
Developer Associate
● 15x Hackathon Winner
● Published 3 AI & IoT research papers, 1 patent
● Loves playing guitar, learning French, and binge-watching
Netflix
Session’s agenda
● Introduction to ML in Production
● Model Drift Monitoring (MDM) – What the definition states
● MDM from production point of view
● How Model Drift can be monitored using TFMA
● Demo
● Queries
ML In Production
We all know how to train, test,
and deploy ML models.
But what really happens
after that?
Credits: Laurence Moroney
What do we
need to care
about?
● Packaging the model
● Finding the right
infrastructure
● Deploying the model
● High-availability API
● Versioning the model
● Monitoring the model
Model Drift Monitoring
- What the definition states
Understanding Model Drift
Many machine learning models tend to be black boxes, where
explainability is very limited, which can make it difficult to
understand why a model is not performing as expected. This is
especially true with regard to how a model performs over time with
new training data.
What is concept drift?
A model that initially worked well can later degrade because of
data drift or concept drift. Data drift occurs when the underlying
statistical structure of the input data changes over time; concept drift
occurs when the relationship between the inputs and the target changes.
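One way to make data drift concrete is to compare a feature's distribution at training time with its distribution in production. The following is a minimal sketch, assuming a single numeric feature and synthetic Gaussian samples; the function name `ks_statistic` and the sample data are illustrative and not from the talk. It computes the two-sample Kolmogorov-Smirnov statistic, that is, the largest gap between the two empirical CDFs.

```python
import bisect
import random


def ks_statistic(reference, current):
    """Two-sample Kolmogorov-Smirnov statistic: the largest absolute
    difference between the empirical CDFs of the two samples."""
    ref = sorted(reference)
    cur = sorted(current)
    largest_gap = 0.0
    # The largest gap is always attained at one of the observed values.
    for x in ref + cur:
        cdf_ref = bisect.bisect_right(ref, x) / len(ref)
        cdf_cur = bisect.bisect_right(cur, x) / len(cur)
        largest_gap = max(largest_gap, abs(cdf_ref - cdf_cur))
    return largest_gap


# Synthetic data (an assumption for illustration only).
rng = random.Random(0)
training = [rng.gauss(0.0, 1.0) for _ in range(2000)]
production_same = [rng.gauss(0.0, 1.0) for _ in range(2000)]
production_shifted = [rng.gauss(1.0, 1.0) for _ in range(2000)]

print("no drift:", ks_statistic(training, production_same))     # close to 0
print("drifted: ", ks_statistic(training, production_shifted))  # clearly larger
```

A statistic near 0 suggests the two samples come from the same distribution; how large a value should count as drift is a judgment call that depends on the sample size and the application.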
Let's understand with an example
Technically, predictive modeling is the problem of approximating a mapping function (f)
given input data (X) to predict an output value (y).
Often, this mapping is assumed to be static, meaning that the mapping learned from
historical data is just as valid in the future on new data and that the relationships between
input and output data do not change.
Let's understand with an example
The change to the data can take several forms:
● A gradual change over time.
● A recurring or cyclical change.
● A sudden or abrupt change.
In domains where predictions are ordered by time, such as time series
forecasting and predictions on streaming data, concept drift is more
likely and should be explicitly tested for and addressed.
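The three change patterns above can be sketched with a toy simulation. This is an illustrative assumption, not the talk's demo: the true relationship is y = slope * x, the slope changes over time in one of the three ways, and a static model that learned slope 2.0 from historical data is scored with a windowed mean absolute error. All names (`true_slope`, `windowed_mae`, `LEARNED_SLOPE`) are hypothetical.

```python
import math

LEARNED_SLOPE = 2.0  # what a model fitted on historical data is assumed to have learned


def true_slope(t, kind):
    """Slope of the real relationship y = slope * x at time step t."""
    if kind == "sudden":
        return 2.0 if t < 50 else 3.0                     # abrupt jump at t = 50
    if kind == "gradual":
        return 2.0 + 0.01 * t                             # slow, steady change
    if kind == "recurring":
        return 2.0 + math.sin(2 * math.pi * t / 25)       # cycle of 25 steps
    raise ValueError(f"unknown drift kind: {kind}")


def windowed_mae(kind, steps=100, window=25):
    """Mean absolute error of the static model per consecutive window."""
    errors = []
    for t in range(steps):
        x = 1.0  # constant input keeps the example deterministic
        y = true_slope(t, kind) * x
        errors.append(abs(y - LEARNED_SLOPE * x))
    return [sum(errors[i:i + window]) / window for i in range(0, steps, window)]


for kind in ("sudden", "gradual", "recurring"):
    print(kind, [round(e, 3) for e in windowed_mae(kind)])
# sudden    -> [0.0, 0.0, 1.0, 1.0]: error appears all at once
# gradual   -> error grows window by window
# recurring -> error is the same in every full cycle
```

The window size matters: with a window equal to the cycle length, the recurring pattern looks like a constant error level, which is why the window used for monitoring is itself a design decision.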
Let's understand with an example
Forecasting COVID-19 cases in India -
bit.ly/vivekarticle
Website: bit.ly/covidtrack
Model Drift Monitoring
- Production point of view
From Production POV
Indre Zliobaite, in the 2010 paper “Learning under Concept Drift: An Overview”, identifies the following design considerations:
● Future assumption: a designer needs to make an assumption about the future data
source.
● Change type: a designer needs to identify possible change patterns.
● Learner adaptivity: based on the change type and the future assumption, a designer
chooses the mechanisms which make the learner adaptive.
● Model selection: a designer needs a criterion to choose a particular parametrization of
the selected learner at every time step (e.g. the weights for ensemble members, the
window size for variable window method).
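The window-size choice mentioned in the last bullet can be illustrated with a deliberately simple adaptive learner. This is a sketch under stated assumptions, not a method from the paper: the "learner" predicts each value as the mean of the previous `window` observations, and the stream contains one sudden change. The function name and data are hypothetical.

```python
from collections import deque


def sliding_window_error(stream, window):
    """Predict each value as the mean of the previous `window` values
    and return the total absolute prediction error over the stream."""
    recent = deque(maxlen=window)  # old observations are forgotten automatically
    total_error = 0.0
    for y in stream:
        if recent:  # no prediction is possible before the first observation
            prediction = sum(recent) / len(recent)
            total_error += abs(y - prediction)
        recent.append(y)
    return total_error


# A stream with a sudden change halfway through (illustrative data).
stream = [10.0] * 50 + [20.0] * 50

print("window=5: ", sliding_window_error(stream, window=5))   # 30.0
print("window=50:", sliding_window_error(stream, window=50))  # much larger
```

The small window recovers from the change within a few steps, while the large window keeps averaging in stale data. On a noisy but stable stream the trade-off reverses, which is why a selection criterion for the parametrization is needed.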
From Production POV
● Defining critical thresholds, detecting violations of these thresholds, and safeguarding the
machine learning production system from degradation are the main goals of model
monitoring.
● Model monitoring is very similar to continuous integration/continuous deployment
(CI/CD) in traditional software development. In CI/CD systems, you monitor the whole
software development and deployment lifecycle using automated tools and alerting. The
goal of model monitoring is to bring some of these well-established rules and systems
into modern production systems that utilize machine learning.
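The idea of defining thresholds and detecting violations can be sketched in a few lines. This is a hypothetical, library-free illustration; the `Threshold` class, the metric names, and the bounds are assumptions, and a real system would feed the result into its alerting tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    metric: str         # name of the monitored metric
    lower_bound: float  # smallest acceptable value


def find_violations(latest_metrics, thresholds):
    """Return the names of metrics that are missing or below their lower bound."""
    violations = []
    for threshold in thresholds:
        value = latest_metrics.get(threshold.metric)
        if value is None or value < threshold.lower_bound:
            violations.append(threshold.metric)
    return violations


# Illustrative thresholds and metric snapshots.
thresholds = [Threshold("accuracy", 0.90), Threshold("auc", 0.85)]
healthy = {"accuracy": 0.93, "auc": 0.91}
degraded = {"accuracy": 0.84, "auc": 0.88}

print(find_violations(healthy, thresholds))   # []
print(find_violations(degraded, thresholds))  # ['accuracy']
```

Treating a missing metric as a violation is a design choice made here so that a broken evaluation job does not pass silently.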
From Production POV
● Data drift:
● Model learning capacity:
● Performance degrading change:
● Maintenance and manual deployment cycles:
TensorFlow Model Analysis
Overview
TensorFlow Model Analysis (TFMA) is a library for performing model
evaluation.
● For: Machine Learning Engineers or Data Scientists
● who: want to analyze and understand their TensorFlow models
● it is: a standalone library or component of a TFX pipeline
● that: evaluates models on large amounts of data in a distributed manner on
the same metrics defined in training. These metrics are compared over
slices of data, and visualized in Jupyter or Colab notebooks.
● unlike: some model introspection tools like TensorBoard that offer model
introspection
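A rough sketch of what a standalone TFMA evaluation over slices can look like. It assumes the `tensorflow-model-analysis` package is installed, a binary classifier exported as a SavedModel, and evaluation data stored as TFRecords; the label key, the slice feature `region`, the 0.8 bound, and all paths are placeholders rather than values from the talk. Exact arguments can differ between TFMA versions, so treat this as a starting point and check the documentation for the installed release.

```python
def build_eval_config(label_key="label", slice_feature="region"):
    """Build an EvalConfig with one thresholded metric and two slicing specs.

    The label key and feature name are placeholders (assumptions).
    """
    import tensorflow_model_analysis as tfma  # pip install tensorflow-model-analysis

    return tfma.EvalConfig(
        model_specs=[tfma.ModelSpec(label_key=label_key)],
        metrics_specs=[
            tfma.MetricsSpec(
                metrics=[
                    tfma.MetricConfig(class_name="ExampleCount"),
                    tfma.MetricConfig(
                        class_name="BinaryAccuracy",
                        # Validation fails if accuracy drops below this bound.
                        threshold=tfma.MetricThreshold(
                            value_threshold=tfma.GenericValueThreshold(
                                lower_bound={"value": 0.8}))),
                ])
        ],
        slicing_specs=[
            tfma.SlicingSpec(),                              # overall metrics
            tfma.SlicingSpec(feature_keys=[slice_feature]),  # metrics per feature value
        ],
    )


def evaluate(model_path, data_path, output_path, eval_config):
    """Run TFMA on a SavedModel and write the results to output_path."""
    import tensorflow_model_analysis as tfma

    eval_shared_model = tfma.default_eval_shared_model(
        eval_saved_model_path=model_path, eval_config=eval_config)
    return tfma.run_model_analysis(
        eval_shared_model=eval_shared_model,
        eval_config=eval_config,
        data_location=data_path,
        output_path=output_path)
```

In a Jupyter or Colab notebook, the result returned by `evaluate` can then be passed to `tfma.view.render_slicing_metrics(eval_result, slicing_column="region")` to compare the metrics across slices.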
Overview
TensorFlow Model Analysis allows you to perform model evaluations in the TFX
pipeline, and view resultant metrics and plots in a Jupyter notebook. Specifically, it
can provide:
● Metrics computed on the entire training and holdout datasets, as well as next-day
evaluations
● Tracking metrics over time
● Model quality performance on different feature slices
● Model validation for ensuring that models maintain consistent performance
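For drift monitoring, the two most relevant capabilities in this list are tracking metrics over time and model validation. A hedged sketch of how earlier TFMA runs might be read back, assuming each evaluation (for example one per day) was written to its own output directory and that thresholds were configured in the evaluation config; the helper names and directory layout are hypothetical, and the TFMA calls should be checked against the installed version.

```python
def load_runs(output_paths):
    """Load the results of several TFMA runs, e.g. one output directory per day."""
    import tensorflow_model_analysis as tfma  # pip install tensorflow-model-analysis

    return tfma.load_eval_results(output_paths)


def validation_passed(output_path):
    """Return True if every configured metric threshold was met for this run."""
    import tensorflow_model_analysis as tfma

    validation_result = tfma.load_validation_result(output_path)
    return validation_result.validation_ok
```

In a notebook, `tfma.view.render_time_series(load_runs([...]))` plots the metrics of the loaded runs over time, and a scheduled job could call `validation_passed` on the latest output directory to decide whether to raise an alert.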
Time to code
Open for Q&A
Let’s connect
Email ID: vivekraja98@gmail.com
LinkedIn: @Vivek Raja P S
GitHub: @Vivek0712
Twitter: @vivekraja007
