vodQA Pune (2019) - Testing AI,ML applications

vodQA

Testing AI, ML Applications
By: Divya Rakhiani, Tarun Maini
vodQA Pune 2019

AGENDA
Intro + quick agenda walkthrough (brief talk)
a. What is AI/ML
b. How technology is shifting towards AI/ML
c. Where does a QA step in
d. Challenges while testing AI/ML applications
Hands-on activity:
1. Create and test a basic Beer-Wine Classifier
2. Create an Image Classifier (via CLI)
a. Retrain a MobileNet
b. Generate test data
c. Create optimized graphs
d. Test your classifier
3. Dynamic Image Classifier via Android app (optional)
a. Retrain a MobileNet
b. Generate test data
c. Create optimized graphs
d. Test your classifier

What is AI/ML? Why the buzzword Data Science?

“Machine learning is an application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed.”

How is technology shifting towards AI/ML, and how has it affected the world around us?

©ThoughtWorks 2017 Commercial in Confidence

Problem we are dealing with: Beer-Wine Classification

COMPONENTS
Training Data --> Algorithm --> Model --> Test Data --> Prediction/Output
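The flow above can be sketched in a few lines of Python. This is only an illustration, not the workshop's classifier: the single feature (alcohol percentage), the sample values, and the midpoint-threshold "algorithm" are all assumptions invented for the example.

```python
# Hypothetical training data: (alcohol_percent, label). Values are invented.
training_data = [(4.5, "beer"), (5.0, "beer"), (6.0, "beer"),
                 (11.5, "wine"), (12.5, "wine"), (13.5, "wine")]


def train(examples):
    """'Algorithm': learn a threshold halfway between the two class means."""
    beer = [x for x, label in examples if label == "beer"]
    wine = [x for x, label in examples if label == "wine"]
    return (sum(beer) / len(beer) + sum(wine) / len(wine)) / 2


def predict(threshold, alcohol_percent):
    """'Model': classify one drink using the learned threshold."""
    return "wine" if alcohol_percent > threshold else "beer"


model = train(training_data)                 # Training Data --> Algorithm --> Model
test_data = [(5.5, "beer"), (12.0, "wine")]  # examples not seen during training
predictions = [predict(model, x) for x, _ in test_data]  # Test Data --> Prediction
print(model, predictions)
```

The trained "model" here is just one number (the threshold); in a real classifier it would be the full set of learned weights.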

Frequent terms used in ML
● Label: what you are attempting to predict or forecast
● Features: individual measurable properties or descriptive attributes of an example
● Feature vector: a vector in which each dimension represents a certain feature of an example
● Learning rate: the step size by which the model's weights are adjusted on each training update. (The number of times the training data is reread is the number of epochs.)
● Hyperparameter: a parameter whose value is set before the learning process begins in order to tune performance, such as the learning rate or the number of training steps. The coefficients of features in a logistic regression model are learned during training, so they are model parameters rather than hyperparameters.
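In the usual definition, the learning rate is the step size of each weight update, and it is fixed before training starts. A minimal sketch of its effect, assuming plain gradient descent on the made-up function f(w) = (w - 3)^2 (the function and all values are chosen for illustration only):

```python
def gradient_descent(learning_rate, steps, start=0.0):
    """Minimise f(w) = (w - 3)**2, whose gradient is 2 * (w - 3)."""
    w = start
    for _ in range(steps):
        w -= learning_rate * 2 * (w - 3)  # update scaled by the learning rate
    return w


# learning_rate and steps are hyperparameters: both are set before training.
slow = gradient_descent(learning_rate=0.01, steps=50)
fast = gradient_descent(learning_rate=0.1, steps=50)
print(slow, fast)  # the larger learning rate ends much closer to the minimum at 3
```

The value of `w` is what training learns; `learning_rate` and `steps` are what the tester or data scientist chooses up front.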

Supervised Learning Recipe
Source: http://slideplayer.com/slide/9493622/

Training data vs. test data
● Training set: data subset used to train a model
● Test set: data subset used to test the trained model
You could imagine slicing the single data set into these two subsets.
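Slicing one data set into the two subsets might look like the sketch below. The 80/20 ratio, the fixed seed, and the function name are assumptions for illustration; the integers stand in for real labelled examples.

```python
import random


def train_test_split(examples, test_fraction=0.2, seed=42):
    """Shuffle a copy of the data and slice it into (training set, test set)."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)  # shuffle so the slice is not ordered
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


data = list(range(100))  # stand-in for 100 labelled examples
train_set, test_set = train_test_split(data)
print(len(train_set), len(test_set))  # 80 20
```

Shuffling before slicing matters when the source data is sorted (for example, all beers first), because an unshuffled slice would give a test set that does not represent the whole.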

Guidelines to generate test data for ML features

Testing the feature
● Test whether the values of a feature lie between the threshold values
● Test whether the feature importance has changed with respect to the previous QA run
● Test whether a feature is unsuitable by measuring RAM usage, inference latency, etc.
● Test/review whether the generated feature violates data compliance requirements
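The first two checks can be automated with very little code. The sketch below is one possible shape for such tests; the helper names, the feature values, the importances, and the 0.05 tolerance are all hypothetical.

```python
def out_of_range(values, low, high):
    """Return the feature values that fall outside the allowed [low, high] range."""
    return [v for v in values if not low <= v <= high]


def importance_changed(previous, current, tolerance=0.05):
    """Return features whose importance moved more than `tolerance` since the last run."""
    return sorted(name for name in current
                  if abs(current[name] - previous.get(name, 0.0)) > tolerance)


# Hypothetical feature values and importances, invented for the example.
alcohol_percent = [4.5, 5.0, 12.5, 97.0]
print(out_of_range(alcohol_percent, low=0.0, high=20.0))  # [97.0]

previous_run = {"alcohol": 0.70, "color": 0.30}
current_run = {"alcohol": 0.50, "color": 0.50}
print(importance_changed(previous_run, current_run))  # ['alcohol', 'color']
```

In a QA pipeline, a non-empty result from either helper would fail the run and prompt a review of the data or the model.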

Image Classification problem statement

Some algorithmic models
The choice depends on the application type.
Examples:
● Decision tree → classification
● Random forest → categorization
● Naive Bayes algorithm → classification
APIs of a few libraries used to develop/test ML models
● TensorFlow
● Cloud Vision API
● Natural Language API
● Google Speech API

Train Classifier - by hyperparameters
Random_brightness = 0
Architecture = inception_v3
Random_crop = 0
Flip_left_right = false
Bottleneck_dir = /tmp/bottleneck
Testing_percentage = 10
Validation_percentage = 10
Learning_rate = 0.01
How_many_training_steps = 4000

Accuracy of the classification models?

Accuracy
Accuracy = (True Positive + True Negative) / Total Predictions
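As a worked example, accuracy is the share of predictions that match the actual label, which covers both true positives and true negatives. The labels below are invented for illustration.

```python
def accuracy(actual, predicted):
    """(true positives + true negatives) / total predictions."""
    correct = sum(1 for a, p in zip(actual, predicted) if a == p)
    return correct / len(actual)


# Hypothetical labels for eight drinks, invented for the example.
actual    = ["beer", "beer", "beer", "wine", "wine", "wine", "wine", "wine"]
predicted = ["beer", "beer", "wine", "wine", "wine", "wine", "beer", "beer"]
print(accuracy(actual, predicted))  # 0.625 (5 of 8 predictions are correct)
```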

Precision
Out of all the predictions predicted as beer, how many are correctly classified as beer?
Precision = True Positive / (True Positive + False Positive)

Recall
Out of all the drinks labeled as beer, how many were correctly predicted?
Recall = True Positive / (True Positive + False Negative)
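Precision and recall differ only in the denominator: precision divides by everything predicted as beer (true positives plus false positives), while recall, in its standard definition, divides by everything that really is beer (true positives plus false negatives). A small sketch with invented labels, treating "beer" as the positive class:

```python
def precision_recall(actual, predicted, positive="beer"):
    """Return (precision, recall) for the given positive class."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == positive and p == positive)
    fp = sum(1 for a, p in pairs if a != positive and p == positive)
    fn = sum(1 for a, p in pairs if a == positive and p != positive)
    return tp / (tp + fp), tp / (tp + fn)


# Hypothetical labels for eight drinks, invented for the example.
actual    = ["beer", "beer", "beer", "wine", "wine", "wine", "wine", "wine"]
predicted = ["beer", "beer", "wine", "wine", "wine", "wine", "beer", "beer"]
precision, recall = precision_recall(actual, predicted)
print(precision)  # 0.5: 2 of the 4 drinks predicted as beer are beer
print(recall)     # 2 of the 3 actual beers were found
```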

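Metrics used for Regression Model
● Root Mean Square Error: the square root of the average squared difference between predicted and actual values. It is used to compare the forecasting errors of different models on a particular dataset, not between datasets, because it depends on the scale of the data.
● Mean Absolute Error: the average absolute difference between predicted and actual values, expressed in the units of the target. (A percentage error is given by the Mean Absolute Percentage Error.)
● Entropy: an impurity measure, used for example to choose splits in a decision tree; it applies to classification rather than regression.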
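The three measures can be computed by hand with the standard library, using their standard definitions. The values below are invented for illustration.

```python
import math


def rmse(actual, predicted):
    """Root Mean Square Error: square root of the mean squared difference."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def mae(actual, predicted):
    """Mean Absolute Error: mean absolute difference, in the target's units."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def entropy(labels):
    """Shannon entropy in bits: 0 for a pure set, 1 for a 50/50 two-class mix."""
    total = len(labels)
    counts = [labels.count(label) for label in set(labels)]
    return -sum((c / total) * math.log2(c / total) for c in counts)


# Hypothetical actual and predicted values, invented for the example.
actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.0, 5.0, 8.0, 12.0]
print(mae(actual, predicted))   # 1.25
print(rmse(actual, predicted))  # square root of 2.75, larger than the MAE
print(entropy(["beer", "beer", "wine", "wine"]))  # 1.0
```

RMSE comes out larger than MAE here because squaring gives the single large error (3.0) more weight.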

Challenges in testing
● Fast machines and processors
● Generating training data
● Generating test data
● Knowing the threshold and testing with new data
● Data filtering/quality of data: enhancing data, preventing overfitting and underfitting

PREREQUISITES
Please complete all the following steps:
● Clone all the following repositories locally:
a. https://github.com/tarunmaini16/beer-wine-classifier
b. https://github.com/tarunmaini16/image-classifier
c. https://github.com/tarunmaini16/android-image-classifier
● Pull the following Docker images (optional):
a. https://cloud.docker.com/u/tarunmaini/repository/docker/tarunmaini/wine-beer-classification
b. https://cloud.docker.com/u/tarunmaini/repository/docker/tarunmaini/image-classifier
● Install Python on your system and the Python plugin in IntelliJ
● Install TensorFlow via the terminal: $ pip install --upgrade "tensorflow==1.9.*"
● Android Studio setup [v3.1+]
● Android device OR virtual emulator (API level = 27/28, target = Android 8.1/9)
● Bring a data cable to connect your mobile device
● ADB setup

Slides with a title only:
9. What are the types of ML?
11. Example
16. Where does a QA step in
19. Validation set
21. Avoid data snooping bias
22. Stratified sampling
23. Avoid underfitting or overfitting
27. TensorFlow
28. Decision tree
29. How good is the model?
34. Confusion matrix
39. In conclusion
42. Snapshots

Editor's notes

“Machine Learning is the field of study that gives computers the ability to learn without being explicitly programmed.” - Arthur Samuel, 1959
Machine Learning is the science of programming computers so they can “learn from data”.
“A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E.” - Tom Mitchell, 1997
Machine learning is an application of artificial intelligence (AI) that provides systems the ability to automatically learn and improve from experience without being explicitly programmed.
Machine learning is a form of AI that enables a system to learn from data rather than through explicit programming. However, machine learning is not a simple process. As the algorithms ingest training data, it is then possible to produce more precise models based on that data.
A machine learning model is the output generated when you train your machine learning algorithm with data. After training, when you provide a model with an input, you will be given an output. For example, a predictive algorithm will create a predictive model. Then, when you provide the predictive model with data, you will receive a prediction based on the data that trained the model.
● Social networking: Facebook automatically recognises faces and suggests friends to tag.
● Banking/finance: fraud detection algorithms that classify fraudulent transactions are in place.
● Mobile: personal assistants, voice-to-text technology.
● Online shopping: recommendations of similar products.
● Search engines: Google's autocomplete suggestions for search.
● Medicine: research on using ML for disease diagnosis, e.g. Google's DeepMind Health.
Machine learning has the potential to automate a large portion of skilled labor, but the degree to which this affects a workforce depends on the level of difficulty involved in the job.
● Education: (1) Algorithms can analyze test results, drastically reducing the leisure time teachers spend on grading. (2) A student's attendance and academic history can help determine gaps in knowledge and learning disabilities.
● Law: J.P. Morgan, for example, uses a software program dubbed COIN (Contract Intelligence) to review documents and previous cases in seconds, work that would otherwise take 360,000 hours.
● Transportation: (1) Rolls-Royce and Google have teamed up to design and launch the world's first self-driving ship by 2020. (2) NASA has successfully launched and landed an autonomous space shuttle.
● Manual labour: driverless trucks work in mining pits in Australia, operated remotely from a distant control center (particularly jobs that involve some element of danger or potential harm, such as work in factories and mining).
● Healthcare: hospitals are currently using AI algorithms to more accurately detect tumors in radiology scans and analyze different moles for skin cancer, and machine learning is being adapted to accelerate research toward a cure for cancer.
● Alexa: voice-activated control of your smart home (dimming lights, closing blinds, locking doors, etc., all at your command).
● Supervised: supervised learning identifies patterns in data given pre-determined features and labeled data.
● Unsupervised: unsupervised learning identifies patterns in data, which is particularly helpful for unlabeled and unstructured data.
● Semi-supervised: a blend of supervised and unsupervised learning. Best in situations where there is some labeled data but not a lot.
● Reinforcement: reinforcement learning provides feedback to the algorithm as it trains; it is essentially experience-driven decision making.
Typical business uses:
● Supervised: recognizing objects in images, predicting financial results, detecting fraud, and evaluating risk.
● Unsupervised: categorizing news, books, and other things; recommending items to customers.
● Semi-supervised: detecting spam, classifying web content, and analyzing speech.
Color, taste (differs with acidity/alcohol content)
https://semanti.ca/blog/?glossary-of-machine-learning-terms
A feature vector is a one-dimensional matrix used to describe a feature of an image. It can describe an entire image (global feature) or a feature present at a location in the image space (local feature).
Bias: the bias is an error from erroneous assumptions in the learning algorithm. High bias can cause an algorithm to miss the relevant relations between features and target outputs (underfitting).
Only thing in the process that has human intervention.
1. Gathering data
2. Preparing that data
3. Choosing a model
4. Training
5. Evaluation
6. Hyperparameter tuning
7. Prediction
Things to explain here: the approach to development and testing isn't traditional here. But does that mean no QA for ML applications? No. The answer is being adaptive enough to learn how to test those predictions. As of now, due to lack of knowledge, data scientists both develop and test the models that they create, which will not work in the long term when applications scale. So the problem at hand is: how do we test predictions? (The challenge for QA.)
Training set: 80%, test set: 20%.
Make sure that your test set meets the following two conditions:
● It is large enough to yield statistically meaningful results.
● It is representative of the data set as a whole. In other words, don't pick a test set with different characteristics than the training set.
Never train on test data. If you are seeing surprisingly good results on your evaluation metrics, it might be a sign that you are accidentally training on the test set. For example, high accuracy might indicate that test data has leaked into the training set.
For example, consider a model that predicts whether an email is spam, using the subject line, email body, and sender's email address as features. We apportion the data into training and test sets, with an 80-20 split. After training, the model achieves 99% precision on both the training set and the test set. We'd expect a lower precision on the test set, so we take another look at the data and discover that many of the examples in the test set are duplicates of examples in the training set (we neglected to scrub duplicate entries for the same spam email from our input database before splitting the data). We've inadvertently trained on some of our test data, and as a result, we're no longer accurately measuring how well our model generalizes to new data.
Data snooping bias: the test set has to be created immediately after receiving the dataset. Otherwise, as humans, we derive patterns from all the data, and this can bias how we train the model. This is called the 'data snooping bias'.
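The duplicate-email problem described above can be caught with a simple check before training. The sketch below is one way to do it; the helper names and the sample emails are hypothetical.

```python
def deduplicate(examples):
    """Drop repeated examples, keeping the first occurrence of each."""
    seen = set()
    unique = []
    for example in examples:
        if example not in seen:
            seen.add(example)
            unique.append(example)
    return unique


def leaked_examples(train_set, test_set):
    """Return test examples that also appear in the training set."""
    train = set(train_set)
    return [example for example in test_set if example in train]


# Hypothetical (subject line, label) pairs; the spam mail appears twice.
emails = [("win a prize", "spam"), ("meeting at 10", "ham"),
          ("win a prize", "spam"), ("lunch?", "ham")]

print(leaked_examples(emails[:2], emails[2:]))  # [('win a prize', 'spam')]
clean = deduplicate(emails)                     # scrub duplicates before splitting
print(leaked_examples(clean[:2], clean[2:]))    # []
```

Running `leaked_examples` as an assertion in the test suite makes a leak fail loudly instead of showing up as suspiciously good metrics.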
A validation dataset is a dataset of examples used to tune the hyperparameters (i.e. the architecture) of a classifier. It is sometimes also called the development set or the "dev set". In artificial neural networks, a hyperparameter is, for example, the number of hidden units. The validation set, as well as the testing set (as mentioned above), should follow the same probability distribution as the training dataset.
In the figure, "Tweak model" means adjusting anything about the model you can dream up, from changing the learning rate, to adding or removing features, to designing a completely new model from scratch. At the end of this workflow, you pick the model that does best on the test set.
Dividing the data set into two sets is a good idea, but not a panacea. You can greatly reduce your chances of overfitting by partitioning the data set into three subsets: training, validation, and test. Use the validation set to evaluate results from the training set. Then use the test set to double-check your evaluation after the model has "passed" the validation set.
In this improved workflow:
1. Pick the model that does best on the validation set.
2. Double-check that model against the test set.
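The improved workflow can be sketched end to end on toy data. Everything here is an assumption for illustration: the alcohol-percentage feature, the ten examples, the 20% fractions, and the `position` hyperparameter, which places a threshold somewhere between the two class means.

```python
def three_way_split(examples, validation_fraction=0.2, test_fraction=0.2):
    """Slice the data into (training, validation, test) subsets."""
    n_val = round(len(examples) * validation_fraction)
    n_test = round(len(examples) * test_fraction)
    return (examples[n_val + n_test:], examples[:n_val],
            examples[n_val:n_val + n_test])


def train(examples, position):
    """Learn a threshold between the class means; `position` is a hyperparameter."""
    beer = [x for x, label in examples if label == "beer"]
    wine = [x for x, label in examples if label == "wine"]
    beer_mean, wine_mean = sum(beer) / len(beer), sum(wine) / len(wine)
    return beer_mean + position * (wine_mean - beer_mean)


def accuracy(threshold, examples):
    """Share of examples where 'above threshold' agrees with the label 'wine'."""
    hits = sum(1 for x, label in examples if (x > threshold) == (label == "wine"))
    return hits / len(examples)


# Hypothetical (alcohol_percent, label) data, already shuffled.
data = [(6.0, "beer"), (11.0, "wine"), (5.0, "beer"), (12.0, "wine"),
        (4.0, "beer"), (13.0, "wine"), (4.5, "beer"), (12.5, "wine"),
        (5.5, "beer"), (11.5, "wine")]
train_set, validation_set, test_set = three_way_split(data)

# 1. Pick the hyperparameter value that does best on the validation set.
best = max([0.0, 0.5, 1.0],
           key=lambda p: accuracy(train(train_set, p), validation_set))
# 2. Double-check that model against the test set, which was never used for tuning.
print(best, accuracy(train(train_set, best), test_set))  # 0.5 1.0
```

The test set is touched exactly once, at the end, so its score stays an honest estimate of how the chosen model generalizes.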
Some of the ways of generating data are:
● In linear regression, make_regression() from sklearn takes several inputs: the number of test data points generated (n_samples), the number of input features (n_features), and the noise level in the output data (noise).
● In clustering, make_blobs() from sklearn can be used to generate clustering data for any number of features (n_features) with corresponding labels.
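To show what the n_samples, n_features, and noise inputs control, here is a standard-library stand-in for generating linear regression test data. It is not scikit-learn's implementation of make_regression(); the function name, weight range, and seed are assumptions for illustration.

```python
import random


def make_linear_data(n_samples, n_features, noise, seed=0):
    """Generate (X, y, weights) where y is a weighted sum of X plus Gaussian noise."""
    rng = random.Random(seed)
    weights = [rng.uniform(-10, 10) for _ in range(n_features)]  # hidden true model
    X, y = [], []
    for _ in range(n_samples):
        row = [rng.gauss(0, 1) for _ in range(n_features)]
        target = sum(w * x for w, x in zip(weights, row)) + rng.gauss(0, noise)
        X.append(row)
        y.append(target)
    return X, y, weights


X, y, weights = make_linear_data(n_samples=50, n_features=3, noise=0.0)
print(len(X), len(X[0]), len(y))  # 50 3 50
```

Because the true weights are known, a tester can check whether a trained regression model recovers them, and can raise `noise` to see how quickly its error metrics degrade.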
Underfitting: a statistical model or a machine learning algorithm is said to underfit when it cannot capture the underlying trend of the data. (It's just like trying to fit into undersized pants!) Underfitting destroys the accuracy of our machine learning model. Its occurrence simply means that our model or algorithm does not fit the data well enough. It usually happens when we have too little data to build an accurate model, or when we try to build a linear model with non-linear data. In such cases the rules of the machine learning model are too simple to describe the data, and therefore the model will probably make a lot of wrong predictions. Underfitting can be reduced by using more data, a more expressive model, or more informative features.
Overfitting: a statistical model is said to be overfitted when it fits the training data too closely (just like fitting ourselves into oversized pants!). When a model is trained that closely, it starts learning from the noise and inaccurate entries in our data set. The model then does not categorize new data correctly, because it has picked up too much detail and noise. Overfitting is more likely with non-parametric and non-linear methods, because these types of machine learning algorithms have more freedom in building the model from the dataset and can therefore build unrealistic models. A solution to avoid overfitting is to use a linear algorithm if we have linear data, or to limit parameters such as the maximal depth if we are using decision trees.
Revise previous stuff: https://dzone.com/articles/testing-features-of-ml-models
What do you think was involved in building this algorithm? Think of it as the way your mind reads new information after it has been fed similar information! In this exercise, we will retrain a MobileNet. MobileNet is a small, efficient convolutional neural network. "Convolutional" just means that the same calculations are performed at each location in the image.
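To make "the same calculations at each location" concrete, here is a small pure-Python sketch. The 3x4 image and the 2x2 vertical-edge kernel are made-up values. Like most neural network libraries, it slides the kernel without flipping it. This is not how MobileNet is implemented, only the core idea.

```python
def convolve2d(image, kernel):
    """Slide `kernel` over `image` (no padding, stride 1) and return the result."""
    k_rows, k_cols = len(kernel), len(kernel[0])
    out_rows = len(image) - k_rows + 1
    out_cols = len(image[0]) - k_cols + 1
    output = []
    for r in range(out_rows):
        row = []
        for c in range(out_cols):
            # The same weighted sum is computed at every location (r, c).
            total = sum(
                image[r + i][c + j] * kernel[i][j]
                for i in range(k_rows)
                for j in range(k_cols)
            )
            row.append(total)
        output.append(row)
    return output


# Hypothetical image: dark on the left (0), bright on the right (1).
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
# Hypothetical kernel that responds where brightness increases left to right.
kernel = [
    [-1, 1],
    [-1, 1],
]

edges = convolve2d(image, kernel)
print(edges)  # [[0, 2, 0], [0, 2, 0]]
```

The non-zero column in the output marks the position of the vertical edge. A trained network learns kernel values like these instead of having them written by hand.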
TensorFlow is used for acquiring data, training models, serving predictions, and refining future results.
Cloud Vision API provides a REST API to understand and extract information from an image. It uses powerful machine learning models to classify images into thousands of categories, detect faces, identify adult content and emotions, perform OCR, and more.
Natural Language API is used to identify parts of speech and to detect multiple types of entities like persons, monuments, etc. It can also perform sentiment analysis. It currently supports three languages: English, Spanish and Japanese.
Speech API is used to convert audio files into text. It is able to identify over 80 languages and their variants, and can work with most audio files.
Description: TensorFlow is an open-source software library for dataflow and differentiable programming across a range of tasks. It is a symbolic math library, and is also used for machine learning applications such as neural networks.
TensorFlow can train and run deep neural networks for handwritten digit classification, image recognition, word embeddings, recurrent neural networks, sequence-to-sequence models for machine translation, natural language processing, and PDE (partial differential equation) based simulations. Best of all, TensorFlow supports production prediction at scale, with the same models used for training.
TensorFlow allows developers to create dataflow graphs: structures that describe how data moves through a graph, or a series of processing nodes. Each node in the graph represents a mathematical operation, and each connection or edge between nodes is a multidimensional data array, or tensor.
To cover: Where does it come from? What is it used for? Where are we using it?
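The dataflow-graph idea can be sketched in plain Python. This is not TensorFlow's API: the Node class and placeholder helper are hypothetical names invented for this illustration, and the values flowing along the edges are plain numbers rather than tensors.

```python
import operator


class Node:
    """One operation in the graph; `inputs` are the nodes that feed it."""

    def __init__(self, op, *inputs, name=""):
        self.op = op
        self.inputs = inputs
        self.name = name

    def evaluate(self, feed):
        # Placeholders (op is None) read their value from the feed dictionary.
        if self.op is None:
            return feed[self.name]
        # Otherwise evaluate the upstream nodes, then apply this node's operation.
        return self.op(*(node.evaluate(feed) for node in self.inputs))


def placeholder(name):
    """Create an input node whose value is supplied when the graph runs."""
    return Node(None, name=name)


# Build the graph for y = w * x + b. Nothing is computed yet.
x = placeholder("x")
w = placeholder("w")
b = placeholder("b")
product = Node(operator.mul, w, x, name="mul")
y = Node(operator.add, product, b, name="add")

# Run the graph by feeding values into the placeholders.
result = y.evaluate({"x": 3.0, "w": 2.0, "b": 1.0})
print(result)  # 7.0
```

Separating graph construction from execution is what lets a framework reuse the same graph for training and for serving predictions.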
Decision trees can be applied to both classification and regression tasks. For regression tasks, decision trees use the MSE instead of the Gini score. Scikit-learn uses the CART algorithm to grow decision trees. The main issue with decision trees is their sensitivity to changes in the training data.
-------------- Random Forest ------------------
Random forest is an ensemble of decision trees. Instead of searching for the best feature among all features when splitting a node, it searches for the best feature among a random subset of features. This introduces more randomness and makes the trees more diverse, which lowers the variance of the ensemble. An important quality of random forests is that they make it easy to measure the relative importance of a feature: a feature is important if the tree nodes that use it reduce impurity a lot on average.
------ Naive Bayes -----------
Naive Bayes is a probabilistic classifier based on Bayes' theorem that assumes the features are independent of each other given the class.
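The two impurity measures mentioned above can be computed by hand. This is a pure-Python sketch with made-up beer/wine labels and numeric targets. Scikit-learn does this internally, so the helper names here are only for illustration.

```python
from collections import Counter


def gini(labels):
    """Gini impurity for classification: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def mse(values):
    """Mean squared error around the mean, the impurity used for regression trees."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def weighted_impurity(left, right, measure):
    """Impurity of a split: the size-weighted average of both child nodes."""
    n = len(left) + len(right)
    return len(left) / n * measure(left) + len(right) / n * measure(right)


# Classification: a mixed node, then a split that separates the classes perfectly.
mixed = ["beer", "beer", "wine", "wine"]
print(gini(mixed))                                                    # 0.5
print(weighted_impurity(["beer", "beer"], ["wine", "wine"], gini))    # 0.0

# Regression: the same idea with numeric targets and MSE.
print(mse([1.0, 3.0]))                                                # 1.0
print(weighted_impurity([1.0], [3.0], mse))                           # 0.0
```

A tree-growing algorithm tries candidate splits and keeps the one with the lowest weighted impurity.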
If you specify a small learning_rate, like 0.005, the training will take longer, but the overall precision might increase.
For example, 'mobilenet_1.0_224' will pick a model that is 17 MB in size and takes 224-pixel input images, while 'mobilenet_0.25_128_quantized' will choose a much less accurate, but smaller and faster, network that's 920 KB on disk and takes 128x128 images.
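Why a small learning rate makes training take longer can be seen with plain gradient descent on a toy function. This is only a sketch: the quadratic loss, the target value 3.0 and the tolerance are assumptions, and the retraining script optimizes a far more complex loss.

```python
def steps_to_converge(learning_rate, start=0.0, target=3.0,
                      tolerance=1e-3, max_steps=100_000):
    """Count gradient descent steps needed to minimize (w - target)^2."""
    w = start
    for step in range(1, max_steps + 1):
        gradient = 2.0 * (w - target)      # derivative of (w - target)^2
        w -= learning_rate * gradient      # move against the gradient
        if abs(w - target) < tolerance:
            return step
    return max_steps                       # did not converge within the budget


fast_steps = steps_to_converge(learning_rate=0.1)
slow_steps = steps_to_converge(learning_rate=0.005)

print("learning_rate=0.1   ->", fast_steps, "steps")
print("learning_rate=0.005 ->", slow_steps, "steps")
```

Each step moves the weight by learning_rate times the gradient, so smaller rates need many more steps to cover the same distance. On real models they can also settle more carefully near a minimum.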
Should we talk about F-beta score? When false positives are OK and false negatives are NOT OK, use recall. E.g. you cannot tell a sick person that he is healthy, but you may tell a healthy person that he is sick and needs a re-test. When false negatives are OK but false positives are not OK, use precision. E.g. an important mail going to spam is wrong; a spam mail in the inbox might be OK.
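A pure-Python sketch of precision, recall and F-beta computed from raw predictions. Recall drops with every false negative and precision drops with every false positive. The labels below are made up for illustration.

```python
def confusion_counts(actual, predicted, positive=1):
    """Return (true positives, false positives, false negatives)."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == positive and p == positive)
    fp = sum(1 for a, p in pairs if a != positive and p == positive)
    fn = sum(1 for a, p in pairs if a == positive and p != positive)
    return tp, fp, fn


def precision(tp, fp):
    """Of everything predicted positive, how much really was positive?"""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp, fn):
    """Of everything really positive, how much did we catch?"""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f_beta(p, r, beta=1.0):
    """Weighted harmonic mean; beta > 1 favours recall, beta < 1 favours precision."""
    if p == 0.0 and r == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * p * r / (b2 * p + r)


# Hypothetical labels: 1 = sick, 0 = healthy.
actual    = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 0, 1, 0, 0, 0, 0, 0]

tp, fp, fn = confusion_counts(actual, predicted)
p, r = precision(tp, fp), recall(tp, fn)
print(f"precision={p:.3f} recall={r:.3f}")
print(f"F1={f_beta(p, r, 1.0):.3f} F2={f_beta(p, r, 2.0):.3f}")
```

Here recall is lower than precision, so F2, which weights recall more heavily, comes out lower than F1.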
RMSE: Used in meteorology, for example, to see how effectively a mathematical model predicts the behavior of the atmosphere. This is a regression metric.
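RMSE is the square root of the mean of the squared differences between observed and predicted values. Here is a minimal sketch, using made-up temperature readings as a stand-in for a weather forecast.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error between two equally long sequences."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical observed vs. forecast temperatures in degrees Celsius.
observed = [20.0, 22.0, 19.0, 24.0]
forecast = [21.0, 22.0, 17.0, 24.0]

print(f"RMSE = {rmse(observed, forecast):.3f}")
```

Because the errors are squared before averaging, one large miss raises RMSE more than several small ones. The result is in the same unit as the predicted value.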
So, we have data for which we are trying to achieve a prediction/output, and we have to choose the best model/algorithm to achieve an accurate prediction. So, we evaluate the model using the different metrics.