SlideShare une entreprise Scribd logo
1  sur  42
Testing AI,ML Applications
BY:
Divya Rakhiani
Tarun Maini
VodQA 2019 pune
1
AGENDA
Intro + Quick agenda walkthrough(brief talk)
a. What is AI/ML
b. How technology is shifting towards AI, ML
c. Where does a QA step in
d. Challenges while testing AI,ML application
Hands-ON Activity:
1. Create and Test a basic Beer-Wine Classifier
1. Create an Image Classifier ( via CLI )
a. Retrain a Mobile Net
b. Generate test data
c. Create Optimized graphs
d. Test you classifier
1. Dynamic Image Classifier via Android App - (OPTIONAL)
a. Retrain a Mobile Net
b. Generate test data
c. Create Optimized graphs
d. Test you classifier
2
What is AI/ML ? Why the
buzzword Data Science ?
3
4
“Machine learning is an application
of artificial intelligence (AI) that
provides systems the ability to
automatically learn and improve
from experience without being
explicitly programmed“
5
6
How technology is shifting
towards AI/ML & affected
the world around us ?
7
8
What are the types of
ML ?
9
©ThoughtWorks 2017 Commercial in Confidence
10
Example:
11
Problem we are dealing with:
Beer-Wine
Classification
12
COMPONENTS
1
3
Training Data --> Algorithm --> Model --> Test Data --> Prediction/Output
● Label: Is what you're attempting to predict or forecast
● Features: are an individual measurable property OR the descriptive attributes
● Feature Vectors: A feature vector is a vector in which each dimension represent a certain
feature of an example
● Learning Rate: number of time data is reread in a model to perform accurate predictions.
● Hyperparameters : is a parameter whose value is set before the learning process begins to fine
tune performance such as coefficient of features for logistic regression model.
Frequent terms used in ML
1
4
©ThoughtWorks 2017 Commercial in Confidence
Supervised Learning Recipe
15Source: http://slideplayer.com/slide/9493622/
Where does a QA step in
16
©ThoughtWorks 2017 Commercial in Confidence
17
Training data Vs Test data
● Training set— Data subset to train a model
● Test set— Data subset to test the trained model
You could imagine slicing the single data set as follows:
1
8
Validation Set
1
9
Guidelines to generate test
data for ML features
20
Avoid Data snooping bias
2
1
Stratified Sampling
2
2
Avoid UnderFitting or OverFitting
2
3
Testing the feature
● Test whether the value of features lies between the threshold values
● Test whether the feature importance changed with respect to previous QA run
● Test the feature unsuitability by testing RAM, usage, inference latency etc.
● Test/Review whether the generated feature violates the data compliance related issues
2
4
Image Classification problem statement
2
5
It depends on application type.
Examples :
● Decision tree → classification
● Random forest → categorization
● Naive bayes algorithm → classification
APIs of few libraries used to develop/test ML models
● Tensorflow
● Cloud Vision API
● Natural Language
● Google Speech
Some algorithmic models
2
6
TensorFlow
2
7
Decision tree
2
8
How good is the model ?
29
30
Train Classifier - by hyperparameters
Random_brightness = 0
Architecture = inception_v3
Random_crop = 0
Flip_left_right = false
Bottleneck_dir = /tmp/bottleneck'
Testing_percentage = 10
Validation_percentage = 10
Learning_rate = 0.01
How_many_training_steps = 4000
3
1
32
Accuracy of the classification models ?
3
3
Confusion matrix
3
4
Accuracy
True positive + True Negative
Total Predictions
3
5
Precision
Out of all the predictions predicted as beer , how many are correctly classified as beer ?
True Positive +False Positive
True Positive
3
6
Recall
Out of all the drinks labeled as beer , How many were correctly predicted ?
True Positive
True Positive +False Positive
3
7
Metrics used for Regression Model
● Root Mean Square Error : is a measure of accuracy, to compare forecasting errors of different
models for a particular dataset and not between datasets
● Mean Absolute Error : how much % error the model makes in its predictions.
● Entropy : is used as an impurity measure of the model.
3
8
In Conclusion
3
9
Challenges in testing
● Fast machines and processors
● Generate training data
● Generate test Data
● Know the Threshold and test with new data
● Data Filtering/quality of data - Enhancing data, Prevent overfitting & underfitting
4
0
PREREQUISITES
Please complete all the following steps:
● Clone all the following repositories at local:
a. https://github.com/tarunmaini16/beer-wine-classifier
b. https://github.com/tarunmaini16/image-classifier
c. https://github.com/tarunmaini16/android-image-classifier
● Pull following docker images (optional):
a. https://cloud.docker.com/u/tarunmaini/repository/docker/tarunmaini/wine-beer-classification
b. https://cloud.docker.com/u/tarunmaini/repository/docker/tarunmaini/image-classifier
● Install Python at system and python plugin in IntelliJ
● Install Tensorflow via terminal $ pip install --upgrade “tensorflow==1.9*”
● Android Studio Setup [v3.1+]
● Android Device OR Virtual Emulator ( API Level = 27/28, Target = Android 8.1/9 )
● Bring your data Cables to connect mobile device
● ADB setup
41
SNAPSHOTS
42

Contenu connexe

Tendances

ATAGTR2017 CDC Tests - Integration Tests cant be made simpler than this!
ATAGTR2017 CDC Tests - Integration Tests cant be made simpler than this!ATAGTR2017 CDC Tests - Integration Tests cant be made simpler than this!
ATAGTR2017 CDC Tests - Integration Tests cant be made simpler than this!Agile Testing Alliance
 
THE PLEASURES OF ON-PREM, TOMER GABEL
THE PLEASURES OF ON-PREM, TOMER GABELTHE PLEASURES OF ON-PREM, TOMER GABEL
THE PLEASURES OF ON-PREM, TOMER GABELDevOpsDays Tel Aviv
 
Testing in DevOps: UKStar conference
Testing in DevOps: UKStar conferenceTesting in DevOps: UKStar conference
Testing in DevOps: UKStar conferenceLaurent PY
 
AWS Well-Architected: Build Better Architecture, Better Business
AWS Well-Architected: Build Better Architecture, Better BusinessAWS Well-Architected: Build Better Architecture, Better Business
AWS Well-Architected: Build Better Architecture, Better BusinessDevOps.com
 
Real Testing Scenario Strategy - The Role of Exploratory Testing
Real Testing Scenario Strategy - The Role of Exploratory TestingReal Testing Scenario Strategy - The Role of Exploratory Testing
Real Testing Scenario Strategy - The Role of Exploratory TestingAdam Sandman
 
Datadog: From a single product to a growing platform by Alexis Lê-Quôc, CTO
Datadog: From a single product to a growing platform by Alexis Lê-Quôc, CTODatadog: From a single product to a growing platform by Alexis Lê-Quôc, CTO
Datadog: From a single product to a growing platform by Alexis Lê-Quôc, CTOTheFamily
 
Implementing BDD at scale for agile and DevOps teams
Implementing BDD at scale for agile and DevOps teamsImplementing BDD at scale for agile and DevOps teams
Implementing BDD at scale for agile and DevOps teamsLaurent PY
 
Codemotion tech pills - Continuous performance
Codemotion tech pills  - Continuous performanceCodemotion tech pills  - Continuous performance
Codemotion tech pills - Continuous performanceBert Jan Schrijver
 
A differnt Type of Supermarket Delivery
A differnt Type of Supermarket DeliveryA differnt Type of Supermarket Delivery
A differnt Type of Supermarket DeliveryThoughtworks
 
SeleniumCamp 2020 - Shift Right and Observability
SeleniumCamp 2020 - Shift Right and ObservabilitySeleniumCamp 2020 - Shift Right and Observability
SeleniumCamp 2020 - Shift Right and ObservabilityMarcus Merrell
 
CD in Machine Learning Systems
CD in Machine Learning SystemsCD in Machine Learning Systems
CD in Machine Learning SystemsThoughtworks
 
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...StormForge .io
 
Continuous Behavior - BDD in Continuous Delivery (CoDers Who Test, Gothenburg...
Continuous Behavior - BDD in Continuous Delivery (CoDers Who Test, Gothenburg...Continuous Behavior - BDD in Continuous Delivery (CoDers Who Test, Gothenburg...
Continuous Behavior - BDD in Continuous Delivery (CoDers Who Test, Gothenburg...Gáspár Nagy
 
Amsterdam JUG - Continuous performance
Amsterdam JUG - Continuous performanceAmsterdam JUG - Continuous performance
Amsterdam JUG - Continuous performanceBert Jan Schrijver
 
We are sinking: Hitting the testing iceberg (CukenFest London, 2018)
We are sinking: Hitting the testing iceberg (CukenFest London, 2018)We are sinking: Hitting the testing iceberg (CukenFest London, 2018)
We are sinking: Hitting the testing iceberg (CukenFest London, 2018)Gáspár Nagy
 
DeTesters meetup november 2018 - Continuous performance: load testing with G...
DeTesters meetup november 2018  - Continuous performance: load testing with G...DeTesters meetup november 2018  - Continuous performance: load testing with G...
DeTesters meetup november 2018 - Continuous performance: load testing with G...Bert Jan Schrijver
 
Can i service this from my raspberry pi
Can i service this from my raspberry piCan i service this from my raspberry pi
Can i service this from my raspberry piThoughtworks
 
A Prophet in Production Shiri Hochhauser
A Prophet in Production   Shiri HochhauserA Prophet in Production   Shiri Hochhauser
A Prophet in Production Shiri HochhauserDevOpsDays Tel Aviv
 
QMetry test management for jira factsheet
QMetry test management for jira factsheetQMetry test management for jira factsheet
QMetry test management for jira factsheetSuketu Patel
 
Connecting the clouds, A TrueLime Story
Connecting the clouds, A TrueLime StoryConnecting the clouds, A TrueLime Story
Connecting the clouds, A TrueLime StoryJeroen Fürst
 

Tendances (20)

ATAGTR2017 CDC Tests - Integration Tests cant be made simpler than this!
ATAGTR2017 CDC Tests - Integration Tests cant be made simpler than this!ATAGTR2017 CDC Tests - Integration Tests cant be made simpler than this!
ATAGTR2017 CDC Tests - Integration Tests cant be made simpler than this!
 
THE PLEASURES OF ON-PREM, TOMER GABEL
THE PLEASURES OF ON-PREM, TOMER GABELTHE PLEASURES OF ON-PREM, TOMER GABEL
THE PLEASURES OF ON-PREM, TOMER GABEL
 
Testing in DevOps: UKStar conference
Testing in DevOps: UKStar conferenceTesting in DevOps: UKStar conference
Testing in DevOps: UKStar conference
 
AWS Well-Architected: Build Better Architecture, Better Business
AWS Well-Architected: Build Better Architecture, Better BusinessAWS Well-Architected: Build Better Architecture, Better Business
AWS Well-Architected: Build Better Architecture, Better Business
 
Real Testing Scenario Strategy - The Role of Exploratory Testing
Real Testing Scenario Strategy - The Role of Exploratory TestingReal Testing Scenario Strategy - The Role of Exploratory Testing
Real Testing Scenario Strategy - The Role of Exploratory Testing
 
Datadog: From a single product to a growing platform by Alexis Lê-Quôc, CTO
Datadog: From a single product to a growing platform by Alexis Lê-Quôc, CTODatadog: From a single product to a growing platform by Alexis Lê-Quôc, CTO
Datadog: From a single product to a growing platform by Alexis Lê-Quôc, CTO
 
Implementing BDD at scale for agile and DevOps teams
Implementing BDD at scale for agile and DevOps teamsImplementing BDD at scale for agile and DevOps teams
Implementing BDD at scale for agile and DevOps teams
 
Codemotion tech pills - Continuous performance
Codemotion tech pills  - Continuous performanceCodemotion tech pills  - Continuous performance
Codemotion tech pills - Continuous performance
 
A differnt Type of Supermarket Delivery
A differnt Type of Supermarket DeliveryA differnt Type of Supermarket Delivery
A differnt Type of Supermarket Delivery
 
SeleniumCamp 2020 - Shift Right and Observability
SeleniumCamp 2020 - Shift Right and ObservabilitySeleniumCamp 2020 - Shift Right and Observability
SeleniumCamp 2020 - Shift Right and Observability
 
CD in Machine Learning Systems
CD in Machine Learning SystemsCD in Machine Learning Systems
CD in Machine Learning Systems
 
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
Your Testing Is Flawed: Introducing A New Open Source Tool For Accurate Kuber...
 
Continuous Behavior - BDD in Continuous Delivery (CoDers Who Test, Gothenburg...
Continuous Behavior - BDD in Continuous Delivery (CoDers Who Test, Gothenburg...Continuous Behavior - BDD in Continuous Delivery (CoDers Who Test, Gothenburg...
Continuous Behavior - BDD in Continuous Delivery (CoDers Who Test, Gothenburg...
 
Amsterdam JUG - Continuous performance
Amsterdam JUG - Continuous performanceAmsterdam JUG - Continuous performance
Amsterdam JUG - Continuous performance
 
We are sinking: Hitting the testing iceberg (CukenFest London, 2018)
We are sinking: Hitting the testing iceberg (CukenFest London, 2018)We are sinking: Hitting the testing iceberg (CukenFest London, 2018)
We are sinking: Hitting the testing iceberg (CukenFest London, 2018)
 
DeTesters meetup november 2018 - Continuous performance: load testing with G...
DeTesters meetup november 2018  - Continuous performance: load testing with G...DeTesters meetup november 2018  - Continuous performance: load testing with G...
DeTesters meetup november 2018 - Continuous performance: load testing with G...
 
Can i service this from my raspberry pi
Can i service this from my raspberry piCan i service this from my raspberry pi
Can i service this from my raspberry pi
 
A Prophet in Production Shiri Hochhauser
A Prophet in Production   Shiri HochhauserA Prophet in Production   Shiri Hochhauser
A Prophet in Production Shiri Hochhauser
 
QMetry test management for jira factsheet
QMetry test management for jira factsheetQMetry test management for jira factsheet
QMetry test management for jira factsheet
 
Connecting the clouds, A TrueLime Story
Connecting the clouds, A TrueLime StoryConnecting the clouds, A TrueLime Story
Connecting the clouds, A TrueLime Story
 

Similaire à vodQA Pune (2019) - Testing AI,ML applications

2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptxgdgsurrey
 
Freenome's Biological Machine Learning Platform
Freenome's Biological Machine Learning PlatformFreenome's Biological Machine Learning Platform
Freenome's Biological Machine Learning PlatformBrandon White
 
implementing_ai_for_improved_performance_testing_the_key_to_success.pdf
implementing_ai_for_improved_performance_testing_the_key_to_success.pdfimplementing_ai_for_improved_performance_testing_the_key_to_success.pdf
implementing_ai_for_improved_performance_testing_the_key_to_success.pdfsarah david
 
Reproducibility and experiments management in Machine Learning
Reproducibility and experiments management in Machine Learning Reproducibility and experiments management in Machine Learning
Reproducibility and experiments management in Machine Learning Mikhail Rozhkov
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionProvectus
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in productionTuri, Inc.
 
Integrating AI in software quality in absence of a well-defined requirements
Integrating AI in software quality in absence of a well-defined requirementsIntegrating AI in software quality in absence of a well-defined requirements
Integrating AI in software quality in absence of a well-defined requirementsNagarro
 
Experimentation at Blue Apron (webinar)
Experimentation at Blue Apron (webinar)Experimentation at Blue Apron (webinar)
Experimentation at Blue Apron (webinar)Optimizely
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA
 
Testing and Deployment - Full Stack Deep Learning
Testing and Deployment - Full Stack Deep LearningTesting and Deployment - Full Stack Deep Learning
Testing and Deployment - Full Stack Deep LearningSergey Karayev
 
implementing_ai_for_improved_performance_testing_the_key_to_success.pptx
implementing_ai_for_improved_performance_testing_the_key_to_success.pptximplementing_ai_for_improved_performance_testing_the_key_to_success.pptx
implementing_ai_for_improved_performance_testing_the_key_to_success.pptxsarah david
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10Roger Barga
 
State of the Market - Data Quality in 2023
State of the Market - Data Quality in 2023State of the Market - Data Quality in 2023
State of the Market - Data Quality in 2023RTTS
 
To Open Banking and Beyond: Developing APIs that are Resilient to every new I...
To Open Banking and Beyond: Developing APIs that are Resilient to every new I...To Open Banking and Beyond: Developing APIs that are Resilient to every new I...
To Open Banking and Beyond: Developing APIs that are Resilient to every new I...Curiosity Software Ireland
 
How Will Your ML Project Fail
How Will Your ML Project FailHow Will Your ML Project Fail
How Will Your ML Project FailElena Samuylova
 
Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...
Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...
Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...QA or the Highway
 
Model Monitoring at Scale with Apache Spark and Verta
Model Monitoring at Scale with Apache Spark and VertaModel Monitoring at Scale with Apache Spark and Verta
Model Monitoring at Scale with Apache Spark and VertaDatabricks
 
Performance Testing
Performance TestingPerformance Testing
Performance TestingSelin Gungor
 

Similaire à vodQA Pune (2019) - Testing AI,ML applications (20)

Test AI/ML Applications
Test AI/ML ApplicationsTest AI/ML Applications
Test AI/ML Applications
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx
 
Freenome's Biological Machine Learning Platform
Freenome's Biological Machine Learning PlatformFreenome's Biological Machine Learning Platform
Freenome's Biological Machine Learning Platform
 
implementing_ai_for_improved_performance_testing_the_key_to_success.pdf
implementing_ai_for_improved_performance_testing_the_key_to_success.pdfimplementing_ai_for_improved_performance_testing_the_key_to_success.pdf
implementing_ai_for_improved_performance_testing_the_key_to_success.pdf
 
Reproducibility and experiments management in Machine Learning
Reproducibility and experiments management in Machine Learning Reproducibility and experiments management in Machine Learning
Reproducibility and experiments management in Machine Learning
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in production
 
Integrating AI in software quality in absence of a well-defined requirements
Integrating AI in software quality in absence of a well-defined requirementsIntegrating AI in software quality in absence of a well-defined requirements
Integrating AI in software quality in absence of a well-defined requirements
 
Experimentation at Blue Apron (webinar)
Experimentation at Blue Apron (webinar)Experimentation at Blue Apron (webinar)
Experimentation at Blue Apron (webinar)
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Testing and Deployment - Full Stack Deep Learning
Testing and Deployment - Full Stack Deep LearningTesting and Deployment - Full Stack Deep Learning
Testing and Deployment - Full Stack Deep Learning
 
implementing_ai_for_improved_performance_testing_the_key_to_success.pptx
implementing_ai_for_improved_performance_testing_the_key_to_success.pptximplementing_ai_for_improved_performance_testing_the_key_to_success.pptx
implementing_ai_for_improved_performance_testing_the_key_to_success.pptx
 
Managing machine learning
Managing machine learningManaging machine learning
Managing machine learning
 
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10
 
State of the Market - Data Quality in 2023
State of the Market - Data Quality in 2023State of the Market - Data Quality in 2023
State of the Market - Data Quality in 2023
 
To Open Banking and Beyond: Developing APIs that are Resilient to every new I...
To Open Banking and Beyond: Developing APIs that are Resilient to every new I...To Open Banking and Beyond: Developing APIs that are Resilient to every new I...
To Open Banking and Beyond: Developing APIs that are Resilient to every new I...
 
How Will Your ML Project Fail
How Will Your ML Project FailHow Will Your ML Project Fail
How Will Your ML Project Fail
 
Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...
Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...
Scale your Testing and Quality with Automation Engineering and ML - Carlos Ki...
 
Model Monitoring at Scale with Apache Spark and Verta
Model Monitoring at Scale with Apache Spark and VertaModel Monitoring at Scale with Apache Spark and Verta
Model Monitoring at Scale with Apache Spark and Verta
 
Performance Testing
Performance TestingPerformance Testing
Performance Testing
 

Plus de vodQA

Performance Testing
Performance TestingPerformance Testing
Performance TestingvodQA
 
Testing Strategy in Micro Frontend architecture
Testing Strategy in Micro Frontend architectureTesting Strategy in Micro Frontend architecture
Testing Strategy in Micro Frontend architecturevodQA
 
Api testing libraries using java script an overview
Api testing libraries using java script   an overviewApi testing libraries using java script   an overview
Api testing libraries using java script an overviewvodQA
 
Testing face authentication on mobile
Testing face authentication on mobileTesting face authentication on mobile
Testing face authentication on mobilevodQA
 
Testing cna
Testing cnaTesting cna
Testing cnavodQA
 
Etl engine testing with scala
Etl engine testing with scalaEtl engine testing with scala
Etl engine testing with scalavodQA
 
EDA for QAs
EDA for QAsEDA for QAs
EDA for QAsvodQA
 
vodQA Pune (2019) - Browser automation using dev tools
vodQA Pune (2019) - Browser automation using dev toolsvodQA Pune (2019) - Browser automation using dev tools
vodQA Pune (2019) - Browser automation using dev toolsvodQA
 
vodQA Pune (2019) - Augmented reality overview and testing challenges
vodQA Pune (2019) - Augmented reality overview and testing challengesvodQA Pune (2019) - Augmented reality overview and testing challenges
vodQA Pune (2019) - Augmented reality overview and testing challengesvodQA
 
vodQA Pune (2019) - Performance testing cloud deployments
vodQA Pune (2019) - Performance testing cloud deploymentsvodQA Pune (2019) - Performance testing cloud deployments
vodQA Pune (2019) - Performance testing cloud deploymentsvodQA
 
vodQA Pune (2019) - Jenkins pipeline As code
vodQA Pune (2019) - Jenkins pipeline As codevodQA Pune (2019) - Jenkins pipeline As code
vodQA Pune (2019) - Jenkins pipeline As codevodQA
 
vodQA(Pune) 2018 - Consumer driven contract testing using pact
vodQA(Pune) 2018 - Consumer driven contract testing using pactvodQA(Pune) 2018 - Consumer driven contract testing using pact
vodQA(Pune) 2018 - Consumer driven contract testing using pactvodQA
 
vodQA(Pune) 2018 - Visual testing of web apps in headless environment manis...
vodQA(Pune) 2018 - Visual testing of web apps in headless environment   manis...vodQA(Pune) 2018 - Visual testing of web apps in headless environment   manis...
vodQA(Pune) 2018 - Visual testing of web apps in headless environment manis...vodQA
 
vodQA(Pune) 2018 - Enhancing the capabilities of testing team preparing for...
vodQA(Pune) 2018 - Enhancing the capabilities of testing team   preparing for...vodQA(Pune) 2018 - Enhancing the capabilities of testing team   preparing for...
vodQA(Pune) 2018 - Enhancing the capabilities of testing team preparing for...vodQA
 
vodQA(Pune) 2018 - QAing the security way
vodQA(Pune) 2018 - QAing the security wayvodQA(Pune) 2018 - QAing the security way
vodQA(Pune) 2018 - QAing the security wayvodQA
 
vodQA(Pune) 2018 - Docker in Testing
vodQA(Pune) 2018 - Docker in TestingvodQA(Pune) 2018 - Docker in Testing
vodQA(Pune) 2018 - Docker in TestingvodQA
 
Mobile automation using appium.pptx
Mobile automation using appium.pptxMobile automation using appium.pptx
Mobile automation using appium.pptxvodQA
 
An approach to app security - For beginners
An approach to app security - For beginnersAn approach to app security - For beginners
An approach to app security - For beginnersvodQA
 
Retrospective
RetrospectiveRetrospective
RetrospectivevodQA
 
Whys and Hows of Automation
Whys and Hows of AutomationWhys and Hows of Automation
Whys and Hows of AutomationvodQA
 

Plus de vodQA (20)

Performance Testing
Performance TestingPerformance Testing
Performance Testing
 
Testing Strategy in Micro Frontend architecture
Testing Strategy in Micro Frontend architectureTesting Strategy in Micro Frontend architecture
Testing Strategy in Micro Frontend architecture
 
Api testing libraries using java script an overview
Api testing libraries using java script   an overviewApi testing libraries using java script   an overview
Api testing libraries using java script an overview
 
Testing face authentication on mobile
Testing face authentication on mobileTesting face authentication on mobile
Testing face authentication on mobile
 
Testing cna
Testing cnaTesting cna
Testing cna
 
Etl engine testing with scala
Etl engine testing with scalaEtl engine testing with scala
Etl engine testing with scala
 
EDA for QAs
EDA for QAsEDA for QAs
EDA for QAs
 
vodQA Pune (2019) - Browser automation using dev tools
vodQA Pune (2019) - Browser automation using dev toolsvodQA Pune (2019) - Browser automation using dev tools
vodQA Pune (2019) - Browser automation using dev tools
 
vodQA Pune (2019) - Augmented reality overview and testing challenges
vodQA Pune (2019) - Augmented reality overview and testing challengesvodQA Pune (2019) - Augmented reality overview and testing challenges
vodQA Pune (2019) - Augmented reality overview and testing challenges
 
vodQA Pune (2019) - Performance testing cloud deployments
vodQA Pune (2019) - Performance testing cloud deploymentsvodQA Pune (2019) - Performance testing cloud deployments
vodQA Pune (2019) - Performance testing cloud deployments
 
vodQA Pune (2019) - Jenkins pipeline As code
vodQA Pune (2019) - Jenkins pipeline As codevodQA Pune (2019) - Jenkins pipeline As code
vodQA Pune (2019) - Jenkins pipeline As code
 
vodQA(Pune) 2018 - Consumer driven contract testing using pact
vodQA(Pune) 2018 - Consumer driven contract testing using pactvodQA(Pune) 2018 - Consumer driven contract testing using pact
vodQA(Pune) 2018 - Consumer driven contract testing using pact
 
vodQA(Pune) 2018 - Visual testing of web apps in headless environment manis...
vodQA(Pune) 2018 - Visual testing of web apps in headless environment   manis...vodQA(Pune) 2018 - Visual testing of web apps in headless environment   manis...
vodQA(Pune) 2018 - Visual testing of web apps in headless environment manis...
 
vodQA(Pune) 2018 - Enhancing the capabilities of testing team preparing for...
vodQA(Pune) 2018 - Enhancing the capabilities of testing team   preparing for...vodQA(Pune) 2018 - Enhancing the capabilities of testing team   preparing for...
vodQA(Pune) 2018 - Enhancing the capabilities of testing team preparing for...
 
vodQA(Pune) 2018 - QAing the security way
vodQA(Pune) 2018 - QAing the security wayvodQA(Pune) 2018 - QAing the security way
vodQA(Pune) 2018 - QAing the security way
 
vodQA(Pune) 2018 - Docker in Testing
vodQA(Pune) 2018 - Docker in TestingvodQA(Pune) 2018 - Docker in Testing
vodQA(Pune) 2018 - Docker in Testing
 
Mobile automation using appium.pptx
Mobile automation using appium.pptxMobile automation using appium.pptx
Mobile automation using appium.pptx
 
An approach to app security - For beginners
An approach to app security - For beginnersAn approach to app security - For beginners
An approach to app security - For beginners
 
Retrospective
RetrospectiveRetrospective
Retrospective
 
Whys and Hows of Automation
Whys and Hows of AutomationWhys and Hows of Automation
Whys and Hows of Automation
 

Dernier

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 

Dernier (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 

vodQA Pune (2019) - Testing AI,ML applications

  • 1. Testing AI,ML Applications BY: Divya Rakhiani Tarun Maini VodQA 2019 pune 1
  • 2. AGENDA Intro + Quick agenda walkthrough(brief talk) a. What is AI/ML b. How technology is shifting towards AI, ML c. Where does a QA step in d. Challenges while testing AI,ML application Hands-ON Activity: 1. Create and Test a basic Beer-Wine Classifier 1. Create an Image Classifier ( via CLI ) a. Retrain a Mobile Net b. Generate test data c. Create Optimized graphs d. Test you classifier 1. Dynamic Image Classifier via Android App - (OPTIONAL) a. Retrain a Mobile Net b. Generate test data c. Create Optimized graphs d. Test you classifier 2
  • 3. What is AI/ML ? Why the buzzword Data Science ? 3
  • 4. 4
  • 5. “Machine learning is an application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed“ 5
  • 6. 6
  • 7. How technology is shifting towards AI/ML & affected the world around us ? 7
  • 8. 8
  • 9. What are the types of ML ? 9
  • 10. ©ThoughtWorks 2017 Commercial in Confidence 10
  • 12. Problem we are dealing with: Beer-Wine Classification 12
  • 13. COMPONENTS 1 3 Training Data --> Algorithm --> Model --> Test Data --> Prediction/Output
  • 14. ● Label: Is what you're attempting to predict or forecast ● Features: are an individual measurable property OR the descriptive attributes ● Feature Vectors: A feature vector is a vector in which each dimension represent a certain feature of an example ● Learning Rate: number of time data is reread in a model to perform accurate predictions. ● Hyperparameters : is a parameter whose value is set before the learning process begins to fine tune performance such as coefficient of features for logistic regression model. Frequent terms used in ML 1 4
  • 15. ©ThoughtWorks 2017 Commercial in Confidence Supervised Learning Recipe 15Source: http://slideplayer.com/slide/9493622/
  • 16. Where does a QA step in 16
  • 17. ©ThoughtWorks 2017 Commercial in Confidence 17
  • 18. Training data Vs Test data ● Training set— Data subset to train a model ● Test set— Data subset to test the trained model You could imagine slicing the single data set as follows: 1 8
  • 20. Guidelines to generate test data for ML features 20
  • 23. Avoid UnderFitting or OverFitting 2 3
  • 24. Testing the feature ● Test whether the value of features lies between the threshold values ● Test whether the feature importance changed with respect to previous QA run ● Test the feature unsuitability by testing RAM, usage, inference latency etc. ● Test/Review whether the generated feature violates the data compliance related issues 2 4
  • 26. It depends on application type. Examples : ● Decision tree → classification ● Random forest → categorization ● Naive bayes algorithm → classification APIs of few libraries used to develop/test ML models ● Tensorflow ● Cloud Vision API ● Natural Language ● Google Speech Some algorithmic models 2 6
  • 29. How good is the model ? 29
  • 30. 30
  • 31. Train Classifier - by hyperparameters Random_brightness = 0 Architecture = inception_v3 Random_crop = 0 Flip_left_right = false Bottleneck_dir = /tmp/bottleneck' Testing_percentage = 10 Validation_percentage = 10 Learning_rate = 0.01 How_many_training_steps = 4000 3 1
  • 32. 32
  • 33. Accuracy of the classification models ? 3 3
  • 35. Accuracy True positive + True Negative Total Predictions 3 5
  • 36. Precision Out of all the predictions predicted as beer , how many are correctly classified as beer ? True Positive +False Positive True Positive 3 6
  • 37. Recall Out of all the drinks labeled as beer , How many were correctly predicted ? True Positive True Positive +False Positive 3 7
  • 38. Metrics used for Regression Model ● Root Mean Square Error : is a measure of accuracy, to compare forecasting errors of different models for a particular dataset and not between datasets ● Mean Absolute Error : how much % error the model makes in its predictions. ● Entropy : is used as an impurity measure of the model. 3 8
  • 40. Challenges in testing ● Fast machines and processors ● Generate training data ● Generate test Data ● Know the Threshold and test with new data ● Data Filtering/quality of data - Enhancing data, Prevent overfitting & underfitting 4 0
  • 41. PREREQUISITES Please complete all the following steps: ● Clone all the following repositories at local: a. https://github.com/tarunmaini16/beer-wine-classifier b. https://github.com/tarunmaini16/image-classifier c. https://github.com/tarunmaini16/android-image-classifier ● Pull following docker images (optional): a. https://cloud.docker.com/u/tarunmaini/repository/docker/tarunmaini/wine-beer-classification b. https://cloud.docker.com/u/tarunmaini/repository/docker/tarunmaini/image-classifier ● Install Python at system and python plugin in IntelliJ ● Install Tensorflow via terminal $ pip install --upgrade “tensorflow==1.9*” ● Android Studio Setup [v3.1+] ● Android Device OR Virtual Emulator ( API Level = 27/28, Target = Android 8.1/9 ) ● Bring your data Cables to connect mobile device ● ADB setup 41

Notes de l'éditeur

  1. Machine Learning is the field of study that gives computers the ability to learn without being explicitly programmed.” - Arthur Samuel, 1959 Machine Learning is the science of programming computers so they can “learn from data” A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E. - Tom Mitchell, 1997
  2. Machine learning is an application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed Machine learning is a form of AI that enables a system to learn from data rather than through explicit programming. However, machine learning is not a simple process. As the algorithms ingest training data, it is then possible to produce more precise models based on that data. A machine learning model is the output generated when you train your machine learning algorithm with data. After training, when you provide a model with an input, you will be given an output. For example, a predictive algorithm will create a predictive model. Then, when you provide the predictive model with data, you will receive a prediction based on the data that trained the model.
  3. Social Networking: FB automatically recognises faces suggests to tag a friend. Banking / Finance: Fraud detection algorithms to classify fraudulent transactions are in place. Mobile: -Personal Assistants -Voice to text -Technology Online Shopping: Recommendations of similar products Search Engines: Google’s autocomplete suggestions for search Medicine : Researches on using ML for disease diagnosis. - Google’s DeepMind Health
  4. Machine learning has the potential to automate a large portion of skilled labor, but the degree to which this affects a workforce depends on the level of difficulty involved in the job. Education : 1.)Algorithms can analyze test results, drastically reducing the time teachers spend in their leisure time on grading 2.)A student's attendance and academic history can help determine gaps in knowledge and learning disabilities. Law: J.P. Morgan, for example, uses a software program dubbed COIN (Control Intelligence) to review documents and previous cases in seconds that would otherwise take 360,000 hours. Transportation : 1.) Rolls Royce and Google have teamed up to design and launch the world's first self-driving ship by 2020. 2.)NASA having successfully launched and landed an autonomous space shuttle Manual Labour : driverless trucks operating in mining pits in Australia, operated remotely from a distant control center.(particular jobs that involve some element of danger or potential harm, such as work in factories and mining) Healthcare: Hospitals are currently using AI algorithms to more accurately detect tumors in radiology scans and analyze different moles for skin cancer, and machine learning is being adapted to accelerate research toward a cure for cancer. Alexa: voice-activated control of your smart-home (the dimming of lights, closing of blinds, locking of doors, etc., all at your command).
  5. Supervised: Supervised learning identifies patterns in data given pre-determined features and labeled data. Unsupervised: Unsupervised learning identifies patterns in data, which is particularly helpful for unlabeled and unstructured data. Semi-supervised: A blend of supervised and unsupervised learning. Best in situations where there is some labeled data but not a lot. Reinforcement: Reinforcement learning provides feedback to the algorithm as it trains; it is essentially experience-driven decision Typical business uses of supervised learning include recognizing objects in images, predicting financial results, detecting fraud, and evaluating risk. Unsupervised : Categorizing news, books, and other things, recommending items to customers. Semi : detecting spam, classifying web-content, and analyzing speech
  6. Color Taste (differs with acidity /Alcoholic content)
  7. https://semanti.ca/blog/?glossary-of-machine-learning-terms A feature vector is a one dimensional matrix which is used to describe a feature of an image. It can be used to describe an entire image (Global feature) or a feature present at in a location in the image space (local feature) Bias The bias is an error from erroneous assumptions in the learning algorithm. High bias can cause an algorithm to miss the relevant relations between features and target outputs (underfitting) Only thing in the process that has human intervention
  8. Gathering data Preparing that data Choosing a model Training Evaluation Hyperparameter tuning Prediction.
  9. Things to explain here: The approach to Development and testing isn’t traditional here . But does that mean NO QA for ML applications ? NO, the answer is being adaptive enough to learn how to test those predictions and as of now due to lack to knowledge Data Scientist develop + test the models that they create . Which in long term will not work when applications scale. So now , the problem at hand is How do we test predictions ![ The challenge for QA ]
  10. Gathering data Preparing that data Choosing a model Training Evaluation Hyperparameter tuning Prediction.
  11. Training Set : 80% , Test Data : 20% Make sure that your test set meets the following two conditions: Is large enough to yield statistically meaningful results. Is representative of the data set as a whole. In other words, don't pick a test set with different characteristics than the training set. Never train on test data. If you are seeing surprisingly good results on your evaluation metrics, it might be a sign that you are accidentally training on the test set. For example, high accuracy might indicate that test data has leaked into the training set. For example, consider a model that predicts whether an email is spam, using the subject line, email body, and sender's email address as features. We apportion the data into training and test sets, with an 80-20 split. After training, the model achieves 99% precision on both the training set and the test set. We'd expect a lower precision on the test set, so we take another look at the data and discover that many of the examples in the test set are duplicates of examples in the training set (we neglected to scrub duplicate entries for the same spam email from our input database before splitting the data). We've inadvertently trained on some of our test data, and as a result, we're no longer accurately measuring how well our model generalizes to new data. Data snooping bias: Test set have to be created immediately after receiving the dataset. Otherwise as humans we derive a pattern around all the data and there is a possibility of bias while training the model, which is called as the ‘data snooping bias’.
  12. A validation dataset is a dataset of examples used to tune the hyperparameters (i.e. the architecture) of a classifier. It is sometimes also called the development set or the "dev set". In artificial neural networks, a hyperparameter is, for example, the number of hidden units.[7][8] It, as well as the testing set (as mentioned above), should follow the same probability distribution as the training dataset In the figure, "Tweak model" means adjusting anything about the model you can dream up—from changing the learning rate, to adding or removing features, to designing a completely new model from scratch. At the end of this workflow, you pick the model that does best on the test set. Dividing the data set into two sets is a good idea, but not a panacea. You can greatly reduce your chances of overfitting by partitioning the data set into the three subsets shown in the following figure: Use the validation set to evaluate results from the training set. Then, use the test set to double-check your evaluation after the model has "passed" the validation set. The following figure shows this new workflow: In this improved workflow: Pick the model that does best on the validation set. Double-check that model against the test set.
  13. Things to explain here: The approach to Development and testing isn’t traditional here . But does that mean NO QA for ML applications ? NO, the answer is being adaptive enough to learn how to test those predictions and as of now due to lack to knowledge Data Scientist develop + test the models that they create . Which in long term will not work when applications scale. So now, the problem at hand is How do we test predictions ![ The challenge for QA ] <Show this data to tarun-k> Some of the ways of generating data are: E.g In Linear Regression make_regression() takes several inputs as shown in the example above. The inputs configured above are the number of test data points generated n_samples the number of input features n_features and finally the noise level noise in the output date * * what was this star for - divya? In Clustering - make_blobs() from sklearn can be used to clustering data for any number of features n_features with corresponding labels
  14. Underfitting: A statistical model or a machine learning algorithm is said to have underfitting when it cannot capture the underlying trend of the data. (It’s just like trying to fit undersized pants!) Underfitting destroys the accuracy of our machine learning model. Its occurrence simply means that our model or the algorithm does not fit the data well enough. It usually happens when we have less data to build an accurate model and also when we try to build a linear model with a non-linear data. In such cases the rules of the machine learning model are too easy and flexible to be applied on such a minimal data and therefore the model will probably make a lot of wrong predictions. Underfitting can be avoided by using more data and also reducing the features by feature selection. Overfitting: A statistical model is said to be overfitted, when we train it with a lot of data (just like fitting ourselves in an oversized pants!). When a model gets trained with so much of data, it starts learning from the noise and inaccurate data entries in our data set. Then the model does not categorize the data correctly, because of too much of details and noise. The causes of overfitting are the non-parametric and non-linear methods because these types of machine learning algorithms have more freedom in building the model based on the dataset and therefore they can really build unrealistic models. A solution to avoid overfitting is using a linear algorithm if we have linear data or using the parameters like the maximal depth if we are using decision trees.
  15. Revise prevoios ustuff https://dzone.com/articles/testing-features-of-ml-models
  16. What do u think was involved in building this algo ? Take is as your mind reads information after it has been fed similar information ! In this exercise, we will retrain a MobileNet. MobileNet is a a small efficient convolutional neural network. "Convolutional" just means that the same calculations are performed at each location in the image.
  17. Tensorflow: is used for acquiring data, training models, serving predictions, and refining future results Cloud Vision API provides a REST API to understand and extract information from an image. It uses powerful machine learning models to classify images into thousands of categories, detect faces, identify adult content, emotions, OCR support and more. Natural Language API is used to identify parts of speech and to detect multiple types of entities like persons, monuments, etc. It can also perform sentiment analysis. It currently supports three languages: English, Spanish and Japanese Speech API is used to translate audio files into text. It is able to identify over 80 languages and their variants, and can work with most audio files
  18. Description TensorFlow is an open-source software library for dataflow and differentiable programming across a range of tasks. It is a symbolic math library, and is also used for machine learning applications such as neural networks TensorFlow can train and run deep neural networks for handwritten digit classification, image recognition, word embeddings, recurrent neural networks, sequence-to-sequence models for machine translation, natural language processing, and PDE (partial differential equation) based simulations. Best of all, TensorFlow supports production prediction at scale, with the same models used for training TensorFlow allows developers to create dataflow graphs—structures that describe how data moves through a graph, or a series of processing nodes. Each node in the graph represents a mathematical operation, and each connection or edge between nodes is a multidimensional data array, or tensor. -wher edoes it come from Used for? Wherwe using it* *
  19. Decision trees can be applied to both classification & regression tasks. For regression task, decision trees use the MSE instead of gini score. Scikit uses CART Algorithm to grow decision trees. Main issue with Decision trees is the sensitivity to change in training data --------------Random Forest ------------------ Random forest is an ensemble of Decision trees. Instead of searching for the best feature to split a node, it searches for the best feature among a random subset of features, thus introducing more randomness hence less bias. Important quality of Random Forests is that they make it easy to measure the relative importance of a feature. It takes the features which reduces impurity on average to grow trees. ------Naives Bayes----------- Random forest is an ensemble of Decision trees. Instead of searching for the best feature to split a node, it searches for the best feature among a random subset of features, thus introducing more randomness hence less bias. Important quality of Random Forests is that they make it easy to measure the relative importance of a feature. It takes the features which reduces impurity on average to grow trees.
  20. Things to explain here: The approach to Development and testing isn’t traditional here . But does that mean NO QA for ML applications ? NO, the answer is being adaptive enough to learn how to test those predictions and as of now due to lack to knowledge Data Scientist develop + test the models that they create . Which in long term will not work when applications scale. So now , the problem at hand is How do we test predictions ![ The challenge for QA ]
  21. If you specify a small learning_rate, like 0.005, the training will take longer, but the overall precision might increase For example, 'mobilenet_1.0_224' will pick a model that is 17 MB in size and takes 224 pixel input images, while 'mobilenet_0.25_128_quantized' will choose a much less accurate, but smaller and faster network that's 920 KB on disk and takes 128x128 images
  22. Should we talk about F-beta score ? When False positives are ok and False negatives are NOT ok → use precision .. like you cannot tell a sick person that he is healthy . But you may tell a person that healthy person is sick and needs re-test When False negatives are OK but False positives are not ok .Then use recall . Eg. If Important mail goes to spam is wrong . Spam mail in inbox might be ok .
  23. RMSE: In meteorology, to see how effectively a mathematical model predicts the behavior of the atmosphhere. This is type regression
  24. So, we have data for which we are trying to achieve a prediction/output and we have to chose the best model/ algorithm to achieve accurate prediction . So , we evaluate the model using the different metrics