SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
Biologically-inspired Active Vision System for Object Recognition

                                        Martin Peniak, Davide Marocco
                                           University of Plymouth

                                              Ron Babich, John Tran
                                                NVIDIA Research
                                                      Outline

    1. Introduction
           a. Biological vision vs Computer vision
           b. The role of active perception
           c. Neural networks and Genetic Algorithms
    2. Background
           a. Presentation of related research (Marocco, Floreano, etc.)
    3. Preliminary Experiments
           a. Method (neural networks + genetic algorithms on GPU)
           b. Results (video of evolved controllers)
    4. Conclusions

A long-standing challenge in robotics is the development of a truly robust and general-purpose vision system
suitable for object identification, navigation, and other tasks. An unconventional but promising approach for
tackling this challenge relies on the concept of active perception, inspired by the observation that biological
organisms interact with the world in order to make sense of it. In the context of vision, this argues for a system
that takes in only a small part of the scene at a time (mimicking that captured by the fovea in the human eye),
moving from one such part to another in rapid succession. By leveraging a neural network for control, it is possible
to evolve an active vision system with the desired characteristics.

Prior work has relied on very small arrays of photoreceptors (e.g., 5x5), applied to simple identification tasks such
as distinguishing a triangle from a square. Although valuable as proofs of concept, tackling real-world problems
will require much larger systems backed by much larger neural networks, where the computational cost of training
grows super-linearly. We thus turn to an efficient CUDA implementation, scalable to many GPUs in parallel.

Our system is based on an Elman-type recurrent neural network with a biologically-inspired retina. The neural
network is evolved through a genetic algorithm incorporating the island model, which involves segregated
populations whose members migrate between “islands” only infrequently. This design both facilitates parallel
scaling and improves the quality of the final solution by avoiding convergence to local optima.

The active vision system was required to learn to recognize five different objects from Amsterdam Library of
Object Images (ALOI). These objects were presented to the system during the evolutionary process in 16 different
illuminations and 36 different rotation angles. Every neural network controller was able to explore each of these
variations in parallel on GPU, which made the evolutionary process significantly faster than a multi-threaded CPU
code. At the end of evolution, the controllers with the highest fitness were able to successfully recognize all the
objects within 20 time-steps. Our preliminary results suggest that this system is tolerant to variations in object
rotation, position and scale.

Contenu connexe

Tendances

CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...
CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...
CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...Christy Maver
 
Location, Location, Location - A Framework for Intelligence and Cortical Comp...
Location, Location, Location - A Framework for Intelligence and Cortical Comp...Location, Location, Location - A Framework for Intelligence and Cortical Comp...
Location, Location, Location - A Framework for Intelligence and Cortical Comp...Numenta
 
Have We Missed Half of What the Neocortex Does? by Jeff Hawkins (12/15/2017)
Have We Missed Half of What the Neocortex Does? by Jeff Hawkins (12/15/2017)Have We Missed Half of What the Neocortex Does? by Jeff Hawkins (12/15/2017)
Have We Missed Half of What the Neocortex Does? by Jeff Hawkins (12/15/2017)Numenta
 
Осадчий А.Е. Анализ многомерных магнито- и электроэнцефалографических данных ...
Осадчий А.Е. Анализ многомерных магнито- и электроэнцефалографических данных ...Осадчий А.Е. Анализ многомерных магнито- и электроэнцефалографических данных ...
Осадчий А.Е. Анализ многомерных магнито- и электроэнцефалографических данных ...bigdatabm
 
Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...
Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...
Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...Numenta
 
Have We Missed Half of What the Neocortex Does? A New Predictive Framework ...
 Have We Missed Half of What the Neocortex Does?  A New Predictive Framework ... Have We Missed Half of What the Neocortex Does?  A New Predictive Framework ...
Have We Missed Half of What the Neocortex Does? A New Predictive Framework ...Numenta
 
Brian Simulator (a draft)
Brian Simulator (a draft)Brian Simulator (a draft)
Brian Simulator (a draft)Jimmy Lu
 
Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...
Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...
Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...Numenta
 
Labeling fundus images for classification models
Labeling fundus images for classification modelsLabeling fundus images for classification models
Labeling fundus images for classification modelsPetteriTeikariPhD
 
On the Development of a Brain Simulator
On the Development of a Brain SimulatorOn the Development of a Brain Simulator
On the Development of a Brain SimulatorJimmy Lu
 
The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...
The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...
The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...Numenta
 
2017 07 03_meetup_d
2017 07 03_meetup_d2017 07 03_meetup_d
2017 07 03_meetup_dDana Brophy
 
Human brain project 2010
Human brain project 2010Human brain project 2010
Human brain project 2010Karlos Svoboda
 
UCAmI Presentation Dec.2013, Guanacaste, Costa Rica
UCAmI Presentation Dec.2013, Guanacaste, Costa RicaUCAmI Presentation Dec.2013, Guanacaste, Costa Rica
UCAmI Presentation Dec.2013, Guanacaste, Costa RicaNatalia Díaz Rodríguez
 
Measuring visual acuity and contrast sensitivity by optomotor reflex in rodents
Measuring visual acuity and contrast sensitivity by optomotor reflex in rodentsMeasuring visual acuity and contrast sensitivity by optomotor reflex in rodents
Measuring visual acuity and contrast sensitivity by optomotor reflex in rodentsInsideScientific
 

Tendances (18)

CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...
CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...
CVPR 2020 Workshop: Sparsity in the neocortex, and its implications for conti...
 
Location, Location, Location - A Framework for Intelligence and Cortical Comp...
Location, Location, Location - A Framework for Intelligence and Cortical Comp...Location, Location, Location - A Framework for Intelligence and Cortical Comp...
Location, Location, Location - A Framework for Intelligence and Cortical Comp...
 
Have We Missed Half of What the Neocortex Does? by Jeff Hawkins (12/15/2017)
Have We Missed Half of What the Neocortex Does? by Jeff Hawkins (12/15/2017)Have We Missed Half of What the Neocortex Does? by Jeff Hawkins (12/15/2017)
Have We Missed Half of What the Neocortex Does? by Jeff Hawkins (12/15/2017)
 
Осадчий А.Е. Анализ многомерных магнито- и электроэнцефалографических данных ...
Осадчий А.Е. Анализ многомерных магнито- и электроэнцефалографических данных ...Осадчий А.Е. Анализ многомерных магнито- и электроэнцефалографических данных ...
Осадчий А.Е. Анализ многомерных магнито- и электроэнцефалографических данных ...
 
Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...
Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...
Jeff Hawkins Human Brain Project Summit Keynote: "Location, Location, Locatio...
 
Have We Missed Half of What the Neocortex Does? A New Predictive Framework ...
 Have We Missed Half of What the Neocortex Does?  A New Predictive Framework ... Have We Missed Half of What the Neocortex Does?  A New Predictive Framework ...
Have We Missed Half of What the Neocortex Does? A New Predictive Framework ...
 
Brian Simulator (a draft)
Brian Simulator (a draft)Brian Simulator (a draft)
Brian Simulator (a draft)
 
Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...
Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...
Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortica...
 
Labeling fundus images for classification models
Labeling fundus images for classification modelsLabeling fundus images for classification models
Labeling fundus images for classification models
 
On the Development of a Brain Simulator
On the Development of a Brain SimulatorOn the Development of a Brain Simulator
On the Development of a Brain Simulator
 
The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...
The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...
The Predictive Neuron: How Active Dendrites Enable Spatiotemporal Computation...
 
NEUROINFORMATICS
NEUROINFORMATICSNEUROINFORMATICS
NEUROINFORMATICS
 
Paper presentation report
Paper presentation reportPaper presentation report
Paper presentation report
 
2017 07 03_meetup_d
2017 07 03_meetup_d2017 07 03_meetup_d
2017 07 03_meetup_d
 
Human brain project 2010
Human brain project 2010Human brain project 2010
Human brain project 2010
 
UCAmI Presentation Dec.2013, Guanacaste, Costa Rica
UCAmI Presentation Dec.2013, Guanacaste, Costa RicaUCAmI Presentation Dec.2013, Guanacaste, Costa Rica
UCAmI Presentation Dec.2013, Guanacaste, Costa Rica
 
Measuring visual acuity and contrast sensitivity by optomotor reflex in rodents
Measuring visual acuity and contrast sensitivity by optomotor reflex in rodentsMeasuring visual acuity and contrast sensitivity by optomotor reflex in rodents
Measuring visual acuity and contrast sensitivity by optomotor reflex in rodents
 
Executable Biology Tutorial
Executable Biology TutorialExecutable Biology Tutorial
Executable Biology Tutorial
 

Similaire à Bio-inspired Active Vision System

Bio-inspired Active Vision System
Bio-inspired Active Vision SystemBio-inspired Active Vision System
Bio-inspired Active Vision SystemMartin Peniak
 
GPU Computing for Cognitive Robotics
GPU Computing for Cognitive RoboticsGPU Computing for Cognitive Robotics
GPU Computing for Cognitive RoboticsMartin Peniak
 
Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive and Neuro-Roboti...
Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive and Neuro-Roboti...Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive and Neuro-Roboti...
Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive and Neuro-Roboti...Martin Peniak
 
Real Time Object Detection with Audio Feedback using Yolo v3
Real Time Object Detection with Audio Feedback using Yolo v3Real Time Object Detection with Audio Feedback using Yolo v3
Real Time Object Detection with Audio Feedback using Yolo v3ijtsrd
 
A novel enhanced algorithm for efficient human tracking
A novel enhanced algorithm for efficient human trackingA novel enhanced algorithm for efficient human tracking
A novel enhanced algorithm for efficient human trackingIJICTJOURNAL
 
Biometric Iris Recognition Based on Hybrid Technique
Biometric Iris Recognition Based on Hybrid TechniqueBiometric Iris Recognition Based on Hybrid Technique
Biometric Iris Recognition Based on Hybrid Techniqueijsc
 
Biometric Iris Recognition Based on Hybrid Technique
Biometric Iris Recognition Based on Hybrid Technique  Biometric Iris Recognition Based on Hybrid Technique
Biometric Iris Recognition Based on Hybrid Technique ijsc
 
Analytical Review on the Correlation between Ai and Neuroscience
Analytical Review on the Correlation between Ai and NeuroscienceAnalytical Review on the Correlation between Ai and Neuroscience
Analytical Review on the Correlation between Ai and NeuroscienceIOSR Journals
 
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION  DETECTION AND TRACKING FOR VIDEO SURVEILLANCEHUMAN MOTION  DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCENEHA THADEUS
 
Review of Pose Recognition Systems
Review of Pose Recognition SystemsReview of Pose Recognition Systems
Review of Pose Recognition Systemsvivatechijri
 
Crime Detection using Machine Learning
Crime Detection using Machine LearningCrime Detection using Machine Learning
Crime Detection using Machine LearningIRJET Journal
 
Object Detection and Tracking AI Robot
Object Detection and Tracking AI RobotObject Detection and Tracking AI Robot
Object Detection and Tracking AI RobotIRJET Journal
 
Brain Computer Interface and Artificial Brain: Interfacing Microelectronics a...
Brain Computer Interface and Artificial Brain: Interfacing Microelectronics a...Brain Computer Interface and Artificial Brain: Interfacing Microelectronics a...
Brain Computer Interface and Artificial Brain: Interfacing Microelectronics a...Lk Rigor
 
A LOW COST EEG BASED BCI PROSTHETIC USING MOTOR IMAGERY
A LOW COST EEG BASED BCI PROSTHETIC USING MOTOR IMAGERY A LOW COST EEG BASED BCI PROSTHETIC USING MOTOR IMAGERY
A LOW COST EEG BASED BCI PROSTHETIC USING MOTOR IMAGERY ijitcs
 

Similaire à Bio-inspired Active Vision System (20)

Bio-inspired Active Vision System
Bio-inspired Active Vision SystemBio-inspired Active Vision System
Bio-inspired Active Vision System
 
GPU Computing for Cognitive Robotics
GPU Computing for Cognitive RoboticsGPU Computing for Cognitive Robotics
GPU Computing for Cognitive Robotics
 
Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive and Neuro-Roboti...
Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive and Neuro-Roboti...Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive and Neuro-Roboti...
Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive and Neuro-Roboti...
 
Real Time Object Detection with Audio Feedback using Yolo v3
Real Time Object Detection with Audio Feedback using Yolo v3Real Time Object Detection with Audio Feedback using Yolo v3
Real Time Object Detection with Audio Feedback using Yolo v3
 
A novel enhanced algorithm for efficient human tracking
A novel enhanced algorithm for efficient human trackingA novel enhanced algorithm for efficient human tracking
A novel enhanced algorithm for efficient human tracking
 
Biometric Iris Recognition Based on Hybrid Technique
Biometric Iris Recognition Based on Hybrid TechniqueBiometric Iris Recognition Based on Hybrid Technique
Biometric Iris Recognition Based on Hybrid Technique
 
Biometric Iris Recognition Based on Hybrid Technique
Biometric Iris Recognition Based on Hybrid Technique  Biometric Iris Recognition Based on Hybrid Technique
Biometric Iris Recognition Based on Hybrid Technique
 
Analytical Review on the Correlation between Ai and Neuroscience
Analytical Review on the Correlation between Ai and NeuroscienceAnalytical Review on the Correlation between Ai and Neuroscience
Analytical Review on the Correlation between Ai and Neuroscience
 
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION  DETECTION AND TRACKING FOR VIDEO SURVEILLANCEHUMAN MOTION  DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
HUMAN MOTION DETECTION AND TRACKING FOR VIDEO SURVEILLANCE
 
Review of Pose Recognition Systems
Review of Pose Recognition SystemsReview of Pose Recognition Systems
Review of Pose Recognition Systems
 
Crime Detection using Machine Learning
Crime Detection using Machine LearningCrime Detection using Machine Learning
Crime Detection using Machine Learning
 
Object Detection and Tracking AI Robot
Object Detection and Tracking AI RobotObject Detection and Tracking AI Robot
Object Detection and Tracking AI Robot
 
19
1919
19
 
Bci
BciBci
Bci
 
Bci
BciBci
Bci
 
Visual reasoning
Visual reasoningVisual reasoning
Visual reasoning
 
Brain Computer Interface and Artificial Brain: Interfacing Microelectronics a...
Brain Computer Interface and Artificial Brain: Interfacing Microelectronics a...Brain Computer Interface and Artificial Brain: Interfacing Microelectronics a...
Brain Computer Interface and Artificial Brain: Interfacing Microelectronics a...
 
K0966468
K0966468K0966468
K0966468
 
A LOW COST EEG BASED BCI PROSTHETIC USING MOTOR IMAGERY
A LOW COST EEG BASED BCI PROSTHETIC USING MOTOR IMAGERY A LOW COST EEG BASED BCI PROSTHETIC USING MOTOR IMAGERY
A LOW COST EEG BASED BCI PROSTHETIC USING MOTOR IMAGERY
 
Ijarcce 27
Ijarcce 27Ijarcce 27
Ijarcce 27
 

Plus de Martin Peniak

GPU-accelerated Action Acquisition Through Multiple Time Scales Recurrent Neu...
GPU-accelerated Action Acquisition Through Multiple Time Scales Recurrent Neu...GPU-accelerated Action Acquisition Through Multiple Time Scales Recurrent Neu...
GPU-accelerated Action Acquisition Through Multiple Time Scales Recurrent Neu...Martin Peniak
 
What is Aquila Software Architecture for Cognitive Robotics?
What is Aquila Software Architecture for Cognitive Robotics?What is Aquila Software Architecture for Cognitive Robotics?
What is Aquila Software Architecture for Cognitive Robotics?Martin Peniak
 
Cognitive robotics tools and technology
Cognitive robotics tools and technologyCognitive robotics tools and technology
Cognitive robotics tools and technologyMartin Peniak
 
Fluoridation, the scientific fraud of a century
Fluoridation, the scientific fraud of a centuryFluoridation, the scientific fraud of a century
Fluoridation, the scientific fraud of a centuryMartin Peniak
 
Introduction to humanoid robot iCub, YARP and simulator
Introduction to humanoid robot iCub, YARP and simulatorIntroduction to humanoid robot iCub, YARP and simulator
Introduction to humanoid robot iCub, YARP and simulatorMartin Peniak
 
Co-evolving controller and sensing abilities in a simulated Mars Rover explorer
Co-evolving controller and sensing abilities in a simulated Mars Rover explorerCo-evolving controller and sensing abilities in a simulated Mars Rover explorer
Co-evolving controller and sensing abilities in a simulated Mars Rover explorerMartin Peniak
 
Vedecká evidencia poukazujúca na spojenie hmoty a vedomia
Vedecká evidencia poukazujúca na spojenie hmoty a vedomiaVedecká evidencia poukazujúca na spojenie hmoty a vedomia
Vedecká evidencia poukazujúca na spojenie hmoty a vedomiaMartin Peniak
 
Scientific evidence for a connection between mind and matter
Scientific evidence for a connection between mind and matterScientific evidence for a connection between mind and matter
Scientific evidence for a connection between mind and matterMartin Peniak
 
Introduction to parallel computing using CUDA
Introduction to parallel computing using CUDAIntroduction to parallel computing using CUDA
Introduction to parallel computing using CUDAMartin Peniak
 

Plus de Martin Peniak (10)

GPU-accelerated Action Acquisition Through Multiple Time Scales Recurrent Neu...
GPU-accelerated Action Acquisition Through Multiple Time Scales Recurrent Neu...GPU-accelerated Action Acquisition Through Multiple Time Scales Recurrent Neu...
GPU-accelerated Action Acquisition Through Multiple Time Scales Recurrent Neu...
 
What is Aquila Software Architecture for Cognitive Robotics?
What is Aquila Software Architecture for Cognitive Robotics?What is Aquila Software Architecture for Cognitive Robotics?
What is Aquila Software Architecture for Cognitive Robotics?
 
Cognitive robotics tools and technology
Cognitive robotics tools and technologyCognitive robotics tools and technology
Cognitive robotics tools and technology
 
Aquila 2.0
Aquila 2.0Aquila 2.0
Aquila 2.0
 
Fluoridation, the scientific fraud of a century
Fluoridation, the scientific fraud of a centuryFluoridation, the scientific fraud of a century
Fluoridation, the scientific fraud of a century
 
Introduction to humanoid robot iCub, YARP and simulator
Introduction to humanoid robot iCub, YARP and simulatorIntroduction to humanoid robot iCub, YARP and simulator
Introduction to humanoid robot iCub, YARP and simulator
 
Co-evolving controller and sensing abilities in a simulated Mars Rover explorer
Co-evolving controller and sensing abilities in a simulated Mars Rover explorerCo-evolving controller and sensing abilities in a simulated Mars Rover explorer
Co-evolving controller and sensing abilities in a simulated Mars Rover explorer
 
Vedecká evidencia poukazujúca na spojenie hmoty a vedomia
Vedecká evidencia poukazujúca na spojenie hmoty a vedomiaVedecká evidencia poukazujúca na spojenie hmoty a vedomia
Vedecká evidencia poukazujúca na spojenie hmoty a vedomia
 
Scientific evidence for a connection between mind and matter
Scientific evidence for a connection between mind and matterScientific evidence for a connection between mind and matter
Scientific evidence for a connection between mind and matter
 
Introduction to parallel computing using CUDA
Introduction to parallel computing using CUDAIntroduction to parallel computing using CUDA
Introduction to parallel computing using CUDA
 

Bio-inspired Active Vision System

  • 1. Biologically-inspired Active Vision System for Object Recognition Martin Peniak, Davide Marocco University of Plymouth Ron Babich, John Tran NVIDIA Research Outline 1. Introduction a. Biological vision vs Computer vision b. The role of active perception c. Neural networks and Genetic Algorithms 2. Background a. Presentation of related research (Marocco, Floreano, etc.) 3. Preliminary Experiments a. Method (neural networks + genetic algorithms on GPU) b. Results (video of evolved controllers) 4. Conclusions A long-standing challenge in robotics is the development of a truly robust and general-purpose vision system suitable for object identification, navigation, and other tasks. An unconventional but promising approach for tackling this challenge relies on the concept of active perception, inspired by the observation that biological organisms interact with the world in order to make sense of it. In the context of vision, this argues for a system that takes in only a small part of the scene at a time (mimicking that captured by the fovea in the human eye), moving from one such part to another in rapid succession. By leveraging a neural network for control, it is possible to evolve an active vision system with the desired characteristics. Prior work has relied on very small arrays of photoreceptors (e.g., 5x5), applied to simple identification tasks such as distinguishing a triangle from a square. Although valuable as proofs of concept, tackling real-world problems will require much larger systems backed by much larger neural networks, where the computational cost of training grows super-linearly. We thus turn to an efficient CUDA implementation, scalable to many GPUs in parallel. Our system is based on an Elman-type recurrent neural network with a biologically-inspired retina. The neural network is evolved through a genetic algorithm incorporating the island model, which involves segregated populations whose members migrate between “islands” only infrequently. This design both facilitates parallel scaling and improves the quality of the final solution by avoiding convergence to local optima. The active vision system was required to learn to recognize five different objects from Amsterdam Library of Object Images (ALOI). These objects were presented to the system during the evolutionary process in 16 different illuminations and 36 different rotation angles. Every neural network controller was able to explore each of these variations in parallel on GPU, which made the evolutionary process significantly faster than a multi-threaded CPU code. At the end of evolution, the controllers with the highest fitness were able to successfully recognize all the objects within 20 time-steps. Our preliminary results suggest that this system is tolerant to variations in object rotation, position and scale.