SlideShare a Scribd company logo
1 of 18
What is computer
vision?
in robotic SW testing…
Agenda
• Computer vision overview
• How computer vision relates to robotic SW testing?
• Under the hood: pixels, OCR, machine learning
Mika Kaukoranta @mikaukora2
Computer vision
Mika Kaukoranta @mikaukora3
Sub-domains
scene reconstruction, event
detection, video tracking, object
recognition, 3D pose estimation,
learning, indexing, motion
estimation, and image restoration
Related fields
artificial intelligence, solid-state physics,
neurobiology, signal Processing,
mathematics,
Distinctions
computer graphics, image
processing, image analysis,
machine vision, imaging, pattern
recognition, photogammetry
Overview
• Computer vision - an interdisciplinary field that deals with how computers
can be made to gain high-level understanding from digital images or videos
• Image processing - neither require assumptions nor produce
interpretations about the image content
• Machine vision - focus on applications, mainly in manufacturing, e.g.,
vision based robots and systems for vision based inspection
• Imaging - focus on the process of producing images, but sometimes also
deals with processing and analysis of images
Mika Kaukoranta @mikaukora4
Mika Kaukoranta @mikaukora5
object recognition
optical character detection (OCR)
medical imaging
machine vision
Reference
Reference
Reference
Reference
Computer vision in SW testing and
automation
Mika Kaukoranta @mikaukora6
Take screenshot
Analyze image
Control keyboard and
mouse
Computer vision in SW testing and
automation
• Generic instead of application specific approach
• Control over any UI (user interface)
‒ Legacy systems, remote desktop connections, systems that “can’t be
automated”
• Visual inspection (vs. API’s or objects)
• Enabler for machine learning approaches
Mika Kaukoranta @mikaukora7
Mika Kaukoranta @mikaukora8
Computer Vision - Generic approach to any
UI
Reference
Reference
Testing and control approaches
• Record mouse coordinates
‒ Fixed position.
• Template matching
‒ Crop and find match. Fixed UI.
• Object recognition
‒ Detect object positions. Fixed elements.
• Optical character recognition (OCR)
‒ Recognize text elements. Fixed texts.
• Combinations of the above
• Combinations with other approaches such as API access
Mika Kaukoranta @mikaukora9
ClickCoord 200,300
ClickIcon button.png
ClickButton 1
ClickText OK
Discussion
• Do you have systems that are hard to automate?
• Could computer vision help?
Mika Kaukoranta @mikaukora10
• Grayscale image
• Pixels represented as single 8-bit number (0-255)
Pixels in memory
Mika Kaukoranta @mikaukora11
Reference
• RGB image
• Pixels represented as three 8-bit numbers
[0-255, 0-255, 0-255]
Pixels in memory
Mika Kaukoranta @mikaukora12
Reference
Processing steps in OCR
Mika Kaukoranta @mikaukora13
Image capture
Image
preprocessing
Text detection
Character
segmentation
Character
recognition
Found text:
“value:”,
“123”,
“Unit:”,
“euro”
Trained model
Machine learning process
Mika Kaukoranta @mikaukora14
Gather and prepare
training data
Training
Inference (prediction)
“A” is “A”
“A” is “A”
“A” is “A”
“A” is ?
“A” with 87 % probability
• More machine learning
• Automatic testing, e.g. Testar, AET
• Robotic process automation (RPA)
Future development
Mika Kaukoranta @mikaukora15
• Recognize template images from video stream
• Test case passes when image is found
• Can be used for end user video testing, for example
Template matching demo
Mika Kaukoranta @mikaukora16
17
Marker 1 Marker 2 Marker 3 Marker 4 Marker 5 Marker 6 Marker 7
Mika Kaukoranta @mikaukora
Thank you!
Qentinel
www.qentinel.com
Mika Kaukoranta
Mika.kaukoranta@Qentinel.com
Mika Kaukoranta @mikaukora18

More Related Content

What's hot

What's hot (20)

Computer Vision Presentation Artificial Intelligence (AI)
Computer Vision Presentation Artificial Intelligence (AI)Computer Vision Presentation Artificial Intelligence (AI)
Computer Vision Presentation Artificial Intelligence (AI)
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision ppt
Computer vision pptComputer vision ppt
Computer vision ppt
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Ai lecture 03 computer vision
Ai lecture 03 computer visionAi lecture 03 computer vision
Ai lecture 03 computer vision
 
Introduction to Computer Vision.pdf
Introduction to Computer Vision.pdfIntroduction to Computer Vision.pdf
Introduction to Computer Vision.pdf
 
Computer vision ppt
Computer vision pptComputer vision ppt
Computer vision ppt
 
An Introduction to Image Processing and Artificial Intelligence
An Introduction to Image Processing and Artificial IntelligenceAn Introduction to Image Processing and Artificial Intelligence
An Introduction to Image Processing and Artificial Intelligence
 
Computer Vision Introduction
Computer Vision IntroductionComputer Vision Introduction
Computer Vision Introduction
 
Computer vesion
Computer vesionComputer vesion
Computer vesion
 
Object Recognition
Object RecognitionObject Recognition
Object Recognition
 
Cse image processing ppt
Cse image processing pptCse image processing ppt
Cse image processing ppt
 
COM2304: Introduction to Computer Vision & Image Processing
COM2304: Introduction to Computer Vision & Image Processing COM2304: Introduction to Computer Vision & Image Processing
COM2304: Introduction to Computer Vision & Image Processing
 
Computer Vision
Computer VisionComputer Vision
Computer Vision
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision and robotics
Computer vision and roboticsComputer vision and robotics
Computer vision and robotics
 
Image recognition
Image recognitionImage recognition
Image recognition
 

Similar to What is computer vision?

cseimageprocessingppt-170902095009.pdf
cseimageprocessingppt-170902095009.pdfcseimageprocessingppt-170902095009.pdf
cseimageprocessingppt-170902095009.pdf
RaviRenu1
 

Similar to What is computer vision? (20)

ICS1020CV_2022.pdf
ICS1020CV_2022.pdfICS1020CV_2022.pdf
ICS1020CV_2022.pdf
 
AISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the EdgeAISF19 - Unleash Computer Vision at the Edge
AISF19 - Unleash Computer Vision at the Edge
 
Application of image processing.ppt
Application of image processing.pptApplication of image processing.ppt
Application of image processing.ppt
 
ICS1020 CV
ICS1020 CVICS1020 CV
ICS1020 CV
 
Traffic Violation Detector using Object Detection
Traffic Violation Detector using Object DetectionTraffic Violation Detector using Object Detection
Traffic Violation Detector using Object Detection
 
Introduction to Computer Vision - Image formation
Introduction to Computer Vision -  Image formationIntroduction to Computer Vision -  Image formation
Introduction to Computer Vision - Image formation
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
 
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine LearningMakine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear Industry
 
Computer Vision(4).pptx
Computer Vision(4).pptxComputer Vision(4).pptx
Computer Vision(4).pptx
 
Principle of Artificial Intellingence presentation.pptx
Principle of Artificial Intellingence presentation.pptxPrinciple of Artificial Intellingence presentation.pptx
Principle of Artificial Intellingence presentation.pptx
 
Opticalcharacter recognition
Opticalcharacter recognition Opticalcharacter recognition
Opticalcharacter recognition
 
Paper based interaction
Paper based interactionPaper based interaction
Paper based interaction
 
Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Final Report on Optical Character Recognition
Final Report on Optical Character Recognition
 
Introduction
IntroductionIntroduction
Introduction
 
PPT s01-machine vision-s2
PPT s01-machine vision-s2PPT s01-machine vision-s2
PPT s01-machine vision-s2
 
introdaction.pptx
introdaction.pptxintrodaction.pptx
introdaction.pptx
 
cseimageprocessingppt-170902095009.pdf
cseimageprocessingppt-170902095009.pdfcseimageprocessingppt-170902095009.pdf
cseimageprocessingppt-170902095009.pdf
 
Image Processing ppt
Image Processing pptImage Processing ppt
Image Processing ppt
 
Comp4010 Lecture4 AR Tracking and Interaction
Comp4010 Lecture4 AR Tracking and InteractionComp4010 Lecture4 AR Tracking and Interaction
Comp4010 Lecture4 AR Tracking and Interaction
 

More from Qentinel

More from Qentinel (20)

Sap Finug hosted by Qentinel 12.3.2019, esitykset
Sap Finug hosted by Qentinel 12.3.2019, esityksetSap Finug hosted by Qentinel 12.3.2019, esitykset
Sap Finug hosted by Qentinel 12.3.2019, esitykset
 
Qentinel's garage story in Slush 2018
Qentinel's garage story in Slush 2018Qentinel's garage story in Slush 2018
Qentinel's garage story in Slush 2018
 
SAP End-to-end liiketoimintaprosessin testaus
SAP End-to-end liiketoimintaprosessin testausSAP End-to-end liiketoimintaprosessin testaus
SAP End-to-end liiketoimintaprosessin testaus
 
End-to-end huoltoprosessin testaus, IFS Asiakaspäivä
End-to-end huoltoprosessin testaus, IFS AsiakaspäiväEnd-to-end huoltoprosessin testaus, IFS Asiakaspäivä
End-to-end huoltoprosessin testaus, IFS Asiakaspäivä
 
Women in Tech - tukiäly asiakaskokemuksen kumppanina
Women in Tech - tukiäly asiakaskokemuksen kumppaninaWomen in Tech - tukiäly asiakaskokemuksen kumppanina
Women in Tech - tukiäly asiakaskokemuksen kumppanina
 
Writing Readable Test Automation - Qentinel Automation Clinic 1.3.2018
Writing Readable Test Automation - Qentinel Automation Clinic 1.3.2018Writing Readable Test Automation - Qentinel Automation Clinic 1.3.2018
Writing Readable Test Automation - Qentinel Automation Clinic 1.3.2018
 
Ecosystem Automation as a Service - Qentinel Automation Clinic 1.3.2018
Ecosystem Automation as a Service - Qentinel Automation Clinic 1.3.2018Ecosystem Automation as a Service - Qentinel Automation Clinic 1.3.2018
Ecosystem Automation as a Service - Qentinel Automation Clinic 1.3.2018
 
Menesty ekosysteemissä -webinaari 14.11.2017
Menesty ekosysteemissä -webinaari 14.11.2017Menesty ekosysteemissä -webinaari 14.11.2017
Menesty ekosysteemissä -webinaari 14.11.2017
 
Infrastructure As a Code (IAC) Jani Haapala 2017
Infrastructure As a Code (IAC) Jani Haapala 2017Infrastructure As a Code (IAC) Jani Haapala 2017
Infrastructure As a Code (IAC) Jani Haapala 2017
 
GDPR and test data challenge Antti Heimola 20170504
GDPR and test data challenge Antti Heimola 20170504GDPR and test data challenge Antti Heimola 20170504
GDPR and test data challenge Antti Heimola 20170504
 
Asiakaskokemus ekosysteemissä-qentinel-2017-04-27
Asiakaskokemus ekosysteemissä-qentinel-2017-04-27Asiakaskokemus ekosysteemissä-qentinel-2017-04-27
Asiakaskokemus ekosysteemissä-qentinel-2017-04-27
 
Kilpailuetua muutoksessa –webinaari. Miten johdan epävarmuuksilla?
Kilpailuetua muutoksessa –webinaari. Miten johdan epävarmuuksilla?Kilpailuetua muutoksessa –webinaari. Miten johdan epävarmuuksilla?
Kilpailuetua muutoksessa –webinaari. Miten johdan epävarmuuksilla?
 
Etumatkan kolme-taitoa-esko-hannula-20170216
Etumatkan kolme-taitoa-esko-hannula-20170216Etumatkan kolme-taitoa-esko-hannula-20170216
Etumatkan kolme-taitoa-esko-hannula-20170216
 
Asiakaskokemus tulevaisuudessa -webinaari Qentinel 10.1.2017
Asiakaskokemus tulevaisuudessa -webinaari Qentinel 10.1.2017Asiakaskokemus tulevaisuudessa -webinaari Qentinel 10.1.2017
Asiakaskokemus tulevaisuudessa -webinaari Qentinel 10.1.2017
 
Test Automation Nightmares - Antti Heimola, Qentinel
Test Automation Nightmares - Antti Heimola, QentinelTest Automation Nightmares - Antti Heimola, Qentinel
Test Automation Nightmares - Antti Heimola, Qentinel
 
End-to-end testaus eri päätelaitteilla - Antti Heimola
End-to-end testaus eri päätelaitteilla - Antti HeimolaEnd-to-end testaus eri päätelaitteilla - Antti Heimola
End-to-end testaus eri päätelaitteilla - Antti Heimola
 
Testiautomaatio ei ole tekninen ongelma - Kalle Huttunen
Testiautomaatio ei ole tekninen ongelma - Kalle HuttunenTestiautomaatio ei ole tekninen ongelma - Kalle Huttunen
Testiautomaatio ei ole tekninen ongelma - Kalle Huttunen
 
Safety nets with fast feedback loops | Jani haapala 2016-10
Safety nets with fast feedback loops | Jani haapala 2016-10Safety nets with fast feedback loops | Jani haapala 2016-10
Safety nets with fast feedback loops | Jani haapala 2016-10
 
Jos sinulla olisi kaikki tieto - tietäisitkö kaiken? Esko Hannulan esitys 8.9...
Jos sinulla olisi kaikki tieto - tietäisitkö kaiken? Esko Hannulan esitys 8.9...Jos sinulla olisi kaikki tieto - tietäisitkö kaiken? Esko Hannulan esitys 8.9...
Jos sinulla olisi kaikki tieto - tietäisitkö kaiken? Esko Hannulan esitys 8.9...
 
CI Security Scan - Teemu Vesalan esitys 7.6. Testiautomaatioklinkassa
CI Security Scan - Teemu Vesalan esitys 7.6. TestiautomaatioklinkassaCI Security Scan - Teemu Vesalan esitys 7.6. Testiautomaatioklinkassa
CI Security Scan - Teemu Vesalan esitys 7.6. Testiautomaatioklinkassa
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Recently uploaded (20)

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 

What is computer vision?

  • 1. What is computer vision? in robotic SW testing…
  • 2. Agenda • Computer vision overview • How computer vision relates to robotic SW testing? • Under the hood: pixels, OCR, machine learning Mika Kaukoranta @mikaukora2
  • 3. Computer vision Mika Kaukoranta @mikaukora3 Sub-domains scene reconstruction, event detection, video tracking, object recognition, 3D pose estimation, learning, indexing, motion estimation, and image restoration Related fields artificial intelligence, solid-state physics, neurobiology, signal Processing, mathematics, Distinctions computer graphics, image processing, image analysis, machine vision, imaging, pattern recognition, photogammetry
  • 4. Overview • Computer vision - an interdisciplinary field that deals with how computers can be made to gain high-level understanding from digital images or videos • Image processing - neither require assumptions nor produce interpretations about the image content • Machine vision - focus on applications, mainly in manufacturing, e.g., vision based robots and systems for vision based inspection • Imaging - focus on the process of producing images, but sometimes also deals with processing and analysis of images Mika Kaukoranta @mikaukora4
  • 5. Mika Kaukoranta @mikaukora5 object recognition optical character detection (OCR) medical imaging machine vision Reference Reference Reference Reference
  • 6. Computer vision in SW testing and automation Mika Kaukoranta @mikaukora6 Take screenshot Analyze image Control keyboard and mouse
  • 7. Computer vision in SW testing and automation • Generic instead of application specific approach • Control over any UI (user interface) ‒ Legacy systems, remote desktop connections, systems that “can’t be automated” • Visual inspection (vs. API’s or objects) • Enabler for machine learning approaches Mika Kaukoranta @mikaukora7
  • 8. Mika Kaukoranta @mikaukora8 Computer Vision - Generic approach to any UI Reference Reference
  • 9. Testing and control approaches • Record mouse coordinates ‒ Fixed position. • Template matching ‒ Crop and find match. Fixed UI. • Object recognition ‒ Detect object positions. Fixed elements. • Optical character recognition (OCR) ‒ Recognize text elements. Fixed texts. • Combinations of the above • Combinations with other approaches such as API access Mika Kaukoranta @mikaukora9 ClickCoord 200,300 ClickIcon button.png ClickButton 1 ClickText OK
  • 10. Discussion • Do you have systems that are hard to automate? • Could computer vision help? Mika Kaukoranta @mikaukora10
  • 11. • Grayscale image • Pixels represented as single 8-bit number (0-255) Pixels in memory Mika Kaukoranta @mikaukora11 Reference
  • 12. • RGB image • Pixels represented as three 8-bit numbers [0-255, 0-255, 0-255] Pixels in memory Mika Kaukoranta @mikaukora12 Reference
  • 13. Processing steps in OCR Mika Kaukoranta @mikaukora13 Image capture Image preprocessing Text detection Character segmentation Character recognition Found text: “value:”, “123”, “Unit:”, “euro”
  • 14. Trained model Machine learning process Mika Kaukoranta @mikaukora14 Gather and prepare training data Training Inference (prediction) “A” is “A” “A” is “A” “A” is “A” “A” is ? “A” with 87 % probability
  • 15. • More machine learning • Automatic testing, e.g. Testar, AET • Robotic process automation (RPA) Future development Mika Kaukoranta @mikaukora15
  • 16. • Recognize template images from video stream • Test case passes when image is found • Can be used for end user video testing, for example Template matching demo Mika Kaukoranta @mikaukora16
  • 17. 17 Marker 1 Marker 2 Marker 3 Marker 4 Marker 5 Marker 6 Marker 7 Mika Kaukoranta @mikaukora