SlideShare a Scribd company logo
1 of 49
Download to read offline
Robot Vision for the Visually Impaired
           Vivek Pradeep, Gerard Medioni, James Weiland


                            presented by
                      Phongsathorn Eakamongul

                        Department of Computer Science
                         Asian Institute of Technology



                           2010, December 7




Phongsathorn (AIT)       Robot Vision for the Visually Impaired   Short Occasion   1 / 18
Outline




1   Abstracts



2   System Description



3   Result




       Phongsathorn (AIT)   Robot Vision for the Visually Impaired   Short Occasion   2 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Abstracts



   head-mounted : wide-field information compare to shoulder or waist-mounted
   design in literature which require body rotations
   stereo-vision
   navigational assistance device
   visual odometry : dense 3D with 2D elevation grids
   metric-topological SLAM
   build vicinity map
   3D traversability analysis to steer subjects away from obstacles in the path
   use microvibration motors provides cues for taking evasive action : they use tactile
   cues instead of audio since the latter impose greater cognitive load on the subject,
   and blind users rely on hearing to perform a wide variety of other tasks
   experiment running at 10 Hz




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired    Short Occasion   3 / 18
Introduction




    visual impairment : need long cane or guide dog
    In US, 109,000 people : use long canes, 7,000 use dog guides
    only 1,500 graduate from dog-guid user program
    Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors




      Phongsathorn (AIT)         Robot Vision for the Visually Impaired     Short Occasion   4 / 18
Introduction




    visual impairment : need long cane or guide dog
    In US, 109,000 people : use long canes, 7,000 use dog guides
    only 1,500 graduate from dog-guid user program
    Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors




      Phongsathorn (AIT)         Robot Vision for the Visually Impaired     Short Occasion   4 / 18
Introduction




    visual impairment : need long cane or guide dog
    In US, 109,000 people : use long canes, 7,000 use dog guides
    only 1,500 graduate from dog-guid user program
    Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors




      Phongsathorn (AIT)         Robot Vision for the Visually Impaired     Short Occasion   4 / 18
Introduction




    visual impairment : need long cane or guide dog
    In US, 109,000 people : use long canes, 7,000 use dog guides
    only 1,500 graduate from dog-guid user program
    Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors




      Phongsathorn (AIT)         Robot Vision for the Visually Impaired     Short Occasion   4 / 18
Introduction




    visual impairment : need long cane or guide dog
    In US, 109,000 people : use long canes, 7,000 use dog guides
    only 1,500 graduate from dog-guid user program
    Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors




      Phongsathorn (AIT)         Robot Vision for the Visually Impaired     Short Occasion   4 / 18
wearable array of microvibration motors provides a tactile cues
and guide user along safe path




         Phongsathorn (AIT)      Robot Vision for the Visually Impaired   Short Occasion   5 / 18
Outline




1   Abstracts



2   System Description



3   Result




       Phongsathorn (AIT)   Robot Vision for the Visually Impaired   Short Occasion   6 / 18
Online SLAM + obstacle detection




     Phongsathorn (AIT)   Robot Vision for the Visually Impaired   Short Occasion   7 / 18
Stereo Vision Odometry




                                       t−1   t−1 t      t−1 t−1  t
   matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be
   computed using three-point algorithm in RANSAC setting
   for robustness, features matching and reprojection errors are measured across
   four views
   Sparse Bundle Adjustment
   feature covariances can be propagated to get motion uncertainty for use in the
   SLAM filter




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   8 / 18
Stereo Vision Odometry




                                       t−1   t−1 t      t−1 t−1  t
   matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be
   computed using three-point algorithm in RANSAC setting
   for robustness, features matching and reprojection errors are measured across
   four views
   Sparse Bundle Adjustment
   feature covariances can be propagated to get motion uncertainty for use in the
   SLAM filter




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   8 / 18
Stereo Vision Odometry




                                       t−1   t−1 t      t−1 t−1  t
   matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be
   computed using three-point algorithm in RANSAC setting
   for robustness, features matching and reprojection errors are measured across
   four views
   Sparse Bundle Adjustment
   feature covariances can be propagated to get motion uncertainty for use in the
   SLAM filter




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   8 / 18
Stereo Vision Odometry




                                       t−1   t−1 t      t−1 t−1  t
   matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be
   computed using three-point algorithm in RANSAC setting
   for robustness, features matching and reprojection errors are measured across
   four views
   Sparse Bundle Adjustment
   feature covariances can be propagated to get motion uncertainty for use in the
   SLAM filter




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   8 / 18
Stereo Vision Odometry




                                       t−1   t−1 t      t−1 t−1  t
   matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be
   computed using three-point algorithm in RANSAC setting
   for robustness, features matching and reprojection errors are measured across
   four views
   Sparse Bundle Adjustment
   feature covariances can be propagated to get motion uncertainty for use in the
   SLAM filter




     Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   8 / 18
SLAM




  Rao-Blackwellised particle filter (RBPF) in FastSLAM framework
  which use KLT and SIFT tracking
  construct 2 maps
       SLAM map : collection of sparse landmarks that propagated every frame to yield
       consistent camera pose estimates, for SLAM purpose only
       traversability map : dense 3D cloud from triangulation




    Phongsathorn (AIT)          Robot Vision for the Visually Impaired      Short Occasion   9 / 18
SLAM




  Rao-Blackwellised particle filter (RBPF) in FastSLAM framework
  which use KLT and SIFT tracking
  construct 2 maps
       SLAM map : collection of sparse landmarks that propagated every frame to yield
       consistent camera pose estimates, for SLAM purpose only
       traversability map : dense 3D cloud from triangulation




    Phongsathorn (AIT)          Robot Vision for the Visually Impaired      Short Occasion   9 / 18
SLAM




  Rao-Blackwellised particle filter (RBPF) in FastSLAM framework
  which use KLT and SIFT tracking
  construct 2 maps
       SLAM map : collection of sparse landmarks that propagated every frame to yield
       consistent camera pose estimates, for SLAM purpose only
       traversability map : dense 3D cloud from triangulation




    Phongsathorn (AIT)          Robot Vision for the Visually Impaired      Short Occasion   9 / 18
SLAM




  Rao-Blackwellised particle filter (RBPF) in FastSLAM framework
  which use KLT and SIFT tracking
  construct 2 maps
       SLAM map : collection of sparse landmarks that propagated every frame to yield
       consistent camera pose estimates, for SLAM purpose only
       traversability map : dense 3D cloud from triangulation




    Phongsathorn (AIT)          Robot Vision for the Visually Impaired      Short Occasion   9 / 18
SLAM




  Rao-Blackwellised particle filter (RBPF) in FastSLAM framework
  which use KLT and SIFT tracking
  construct 2 maps
       SLAM map : collection of sparse landmarks that propagated every frame to yield
       consistent camera pose estimates, for SLAM purpose only
       traversability map : dense 3D cloud from triangulation




    Phongsathorn (AIT)          Robot Vision for the Visually Impaired      Short Occasion   9 / 18
SLAM




  Rao-Blackwellised particle filter (RBPF) in FastSLAM framework
  which use KLT and SIFT tracking
  construct 2 maps
       SLAM map : collection of sparse landmarks that propagated every frame to yield
       consistent camera pose estimates, for SLAM purpose only
       traversability map : dense 3D cloud from triangulation




    Phongsathorn (AIT)          Robot Vision for the Visually Impaired      Short Occasion   9 / 18
Metric-Topological SLAM

        serveral thousands of landmarks environment
        two levels of environment representation
                local, metric (submap) : estimates state information
                         six dimensional camera trajectory st
                         sparse map mt
                         feature observations (KLT/SIFT) z t
                         camera motion estimates u t

RBPF
p(st , mt |z t , u t ) ≈ p(st |z t , u t )   i   p(mt (i)|st , z t , u t )

                         mt (i) : ith landmark in the map represented by N(µi , σ i )
                         each time feature is observed, the corresponding lankmark is updated using EKF
                         RBPF enables us to only update the observed landmark instead of the whole map
                global topological
                         map is represents as a collection of submap

annotated graph
G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt )
                  a

                         i
                          M : annotated submaps
                         Ωt : set of computed submaps
                         b
                         a Λ : coordinate transformations between adjacent maps

             Phongsathorn (AIT)                              Robot Vision for the Visually Impaired   Short Occasion   10 / 18
Metric-Topological SLAM

        serveral thousands of landmarks environment
        two levels of environment representation
                local, metric (submap) : estimates state information
                         six dimensional camera trajectory st
                         sparse map mt
                         feature observations (KLT/SIFT) z t
                         camera motion estimates u t

RBPF
p(st , mt |z t , u t ) ≈ p(st |z t , u t )   i   p(mt (i)|st , z t , u t )

                         mt (i) : ith landmark in the map represented by N(µi , σ i )
                         each time feature is observed, the corresponding lankmark is updated using EKF
                         RBPF enables us to only update the observed landmark instead of the whole map
                global topological
                         map is represents as a collection of submap

annotated graph
G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt )
                  a

                         i
                          M : annotated submaps
                         Ωt : set of computed submaps
                         b
                         a Λ : coordinate transformations between adjacent maps

             Phongsathorn (AIT)                              Robot Vision for the Visually Impaired   Short Occasion   10 / 18
Metric-Topological SLAM

        serveral thousands of landmarks environment
        two levels of environment representation
                local, metric (submap) : estimates state information
                         six dimensional camera trajectory st
                         sparse map mt
                         feature observations (KLT/SIFT) z t
                         camera motion estimates u t

RBPF
p(st , mt |z t , u t ) ≈ p(st |z t , u t )   i   p(mt (i)|st , z t , u t )

                         mt (i) : ith landmark in the map represented by N(µi , σ i )
                         each time feature is observed, the corresponding lankmark is updated using EKF
                         RBPF enables us to only update the observed landmark instead of the whole map
                global topological
                         map is represents as a collection of submap

annotated graph
G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt )
                  a

                         i
                          M : annotated submaps
                         Ωt : set of computed submaps
                         b
                         a Λ : coordinate transformations between adjacent maps

             Phongsathorn (AIT)                              Robot Vision for the Visually Impaired   Short Occasion   10 / 18
Metric-Topological SLAM

        serveral thousands of landmarks environment
        two levels of environment representation
                local, metric (submap) : estimates state information
                         six dimensional camera trajectory st
                         sparse map mt
                         feature observations (KLT/SIFT) z t
                         camera motion estimates u t

RBPF
p(st , mt |z t , u t ) ≈ p(st |z t , u t )   i   p(mt (i)|st , z t , u t )

                         mt (i) : ith landmark in the map represented by N(µi , σ i )
                         each time feature is observed, the corresponding lankmark is updated using EKF
                         RBPF enables us to only update the observed landmark instead of the whole map
                global topological
                         map is represents as a collection of submap

annotated graph
G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt )
                  a

                         i
                          M : annotated submaps
                         Ωt : set of computed submaps
                         b
                         a Λ : coordinate transformations between adjacent maps

             Phongsathorn (AIT)                              Robot Vision for the Visually Impaired   Short Occasion   10 / 18
Metric-Topological SLAM

        serveral thousands of landmarks environment
        two levels of environment representation
                local, metric (submap) : estimates state information
                         six dimensional camera trajectory st
                         sparse map mt
                         feature observations (KLT/SIFT) z t
                         camera motion estimates u t

RBPF
p(st , mt |z t , u t ) ≈ p(st |z t , u t )   i   p(mt (i)|st , z t , u t )

                         mt (i) : ith landmark in the map represented by N(µi , σ i )
                         each time feature is observed, the corresponding lankmark is updated using EKF
                         RBPF enables us to only update the observed landmark instead of the whole map
                global topological
                         map is represents as a collection of submap

annotated graph
G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt )
                  a

                         i
                          M : annotated submaps
                         Ωt : set of computed submaps
                         b
                         a Λ : coordinate transformations between adjacent maps

             Phongsathorn (AIT)                              Robot Vision for the Visually Impaired   Short Occasion   10 / 18
Metric-Topological SLAM

        serveral thousands of landmarks environment
        two levels of environment representation
                local, metric (submap) : estimates state information
                         six dimensional camera trajectory st
                         sparse map mt
                         feature observations (KLT/SIFT) z t
                         camera motion estimates u t

RBPF
p(st , mt |z t , u t ) ≈ p(st |z t , u t )   i   p(mt (i)|st , z t , u t )

                         mt (i) : ith landmark in the map represented by N(µi , σ i )
                         each time feature is observed, the corresponding lankmark is updated using EKF
                         RBPF enables us to only update the observed landmark instead of the whole map
                global topological
                         map is represents as a collection of submap

annotated graph
G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt )
                  a

                         i
                          M : annotated submaps
                         Ωt : set of computed submaps
                         b
                         a Λ : coordinate transformations between adjacent maps

             Phongsathorn (AIT)                              Robot Vision for the Visually Impaired   Short Occasion   10 / 18
Metric-Topological SLAM

        serveral thousands of landmarks environment
        two levels of environment representation
                local, metric (submap) : estimates state information
                         six dimensional camera trajectory st
                         sparse map mt
                         feature observations (KLT/SIFT) z t
                         camera motion estimates u t

RBPF
p(st , mt |z t , u t ) ≈ p(st |z t , u t )   i   p(mt (i)|st , z t , u t )

                         mt (i) : ith landmark in the map represented by N(µi , σ i )
                         each time feature is observed, the corresponding lankmark is updated using EKF
                         RBPF enables us to only update the observed landmark instead of the whole map
                global topological
                         map is represents as a collection of submap

annotated graph
G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt )
                  a

                         i
                          M : annotated submaps
                         Ωt : set of computed submaps
                         b
                         a Λ : coordinate transformations between adjacent maps

             Phongsathorn (AIT)                              Robot Vision for the Visually Impaired   Short Occasion   10 / 18
Metric-Topological SLAM

        serveral thousands of landmarks environment
        two levels of environment representation
                local, metric (submap) : estimates state information
                         six dimensional camera trajectory st
                         sparse map mt
                         feature observations (KLT/SIFT) z t
                         camera motion estimates u t

RBPF
p(st , mt |z t , u t ) ≈ p(st |z t , u t )   i   p(mt (i)|st , z t , u t )

                         mt (i) : ith landmark in the map represented by N(µi , σ i )
                         each time feature is observed, the corresponding lankmark is updated using EKF
                         RBPF enables us to only update the observed landmark instead of the whole map
                global topological
                         map is represents as a collection of submap

annotated graph
G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt )
                  a

                         i
                          M : annotated submaps
                         Ωt : set of computed submaps
                         b
                         a Λ : coordinate transformations between adjacent maps

             Phongsathorn (AIT)                              Robot Vision for the Visually Impaired   Short Occasion   10 / 18
Traversability Map




   5 radius sphere
   multi-surface elevation map : point cloud is quantized into 2D grid




      Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   11 / 18
Traversability Map




   5 radius sphere
   multi-surface elevation map : point cloud is quantized into 2D grid




      Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   11 / 18
Traversability Map




   5 radius sphere
   multi-surface elevation map : point cloud is quantized into 2D grid




      Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   11 / 18
Prediction Motion and Cue Generation




   if magnitude of translation respect to previous position exceeds certain threshold,
   the direction of motion and reference position are updated
   little translation -> no update
   cue generation : most continuous traversable path ( Green color in picture )




     Phongsathorn (AIT)          Robot Vision for the Visually Impaired   Short Occasion   12 / 18
Outline




1   Abstracts



2   System Description



3   Result




       Phongsathorn (AIT)   Robot Vision for the Visually Impaired   Short Occasion   13 / 18
Result


Green : travesible
Red : not




       Phongsathorn (AIT)   Robot Vision for the Visually Impaired   Short Occasion   14 / 18
error of camera frame-to-frame heading (yaw), when compared with
readings from a commercially Inertial Measurement Unit (IMU)




camera motion : slow (< 5 degree/s), medium (5-20 degree/s), fast (20-30 degree/s)




       Phongsathorn (AIT)       Robot Vision for the Visually Impaired   Short Occasion   15 / 18
SLAM result




     Phongsathorn (AIT)   Robot Vision for the Visually Impaired   Short Occasion   16 / 18
Traversability Map




one frame exp
patch that has thickness > 30 cm is labeled as vertical
5 horizontal patches is labeled as traversable
       Phongsathorn (AIT)        Robot Vision for the Visually Impaired   Short Occasion   17 / 18
Experiment



Manually generate cues : wireless remote control




Autonomous generate cues, like group 4




       Phongsathorn (AIT)       Robot Vision for the Visually Impaired   Short Occasion   18 / 18

More Related Content

Similar to Robot Vision for the Visually Impaired

Design and Implementation of Ultrasonic Navigator for Visually Impaired
Design and Implementation of Ultrasonic Navigator for Visually ImpairedDesign and Implementation of Ultrasonic Navigator for Visually Impaired
Design and Implementation of Ultrasonic Navigator for Visually ImpairedDr.SHANTHI K.G
 
IRJET- A Leading Hand for the Blind –A Review
IRJET- A Leading Hand for the Blind –A ReviewIRJET- A Leading Hand for the Blind –A Review
IRJET- A Leading Hand for the Blind –A ReviewIRJET Journal
 
Visually Impaired People Monitoring in a Smart Home using Electronic White Cane
Visually Impaired People Monitoring in a Smart Home using Electronic White CaneVisually Impaired People Monitoring in a Smart Home using Electronic White Cane
Visually Impaired People Monitoring in a Smart Home using Electronic White CaneAIRCC Publishing Corporation
 
VISUALLY IMPAIRED PEOPLE MONITORING IN A SMART HOME USING ELECTRONIC WHITE CANE
VISUALLY IMPAIRED PEOPLE MONITORING IN A SMART HOME USING ELECTRONIC WHITE CANEVISUALLY IMPAIRED PEOPLE MONITORING IN A SMART HOME USING ELECTRONIC WHITE CANE
VISUALLY IMPAIRED PEOPLE MONITORING IN A SMART HOME USING ELECTRONIC WHITE CANEijcsit
 
Sensor Stick walking aid for the blinds
Sensor Stick walking aid for the blindsSensor Stick walking aid for the blinds
Sensor Stick walking aid for the blindsIJRES Journal
 
Robotic assistance for the visually impaired and elderly people
Robotic assistance for the visually impaired and elderly peopleRobotic assistance for the visually impaired and elderly people
Robotic assistance for the visually impaired and elderly peopletheijes
 
Robotic assistance for the visually impaired and elderly people
Robotic assistance for the visually impaired and elderly peopleRobotic assistance for the visually impaired and elderly people
Robotic assistance for the visually impaired and elderly peopletheijes
 
GPS & GSM based Voice Alert System for Blind Person
GPS & GSM based Voice Alert System for Blind PersonGPS & GSM based Voice Alert System for Blind Person
GPS & GSM based Voice Alert System for Blind Personijsrd.com
 
SMART BLIND STICK USING VOICE MODULE
SMART BLIND STICK USING VOICE MODULESMART BLIND STICK USING VOICE MODULE
SMART BLIND STICK USING VOICE MODULEIRJET Journal
 
RASPBERRY PI BASED SMART WALKING STICK FOR VISUALLY IMPAIRED PERSON
RASPBERRY PI BASED SMART WALKING STICK FOR VISUALLY IMPAIRED PERSONRASPBERRY PI BASED SMART WALKING STICK FOR VISUALLY IMPAIRED PERSON
RASPBERRY PI BASED SMART WALKING STICK FOR VISUALLY IMPAIRED PERSONIRJET Journal
 
ARTIFICIAL INTELLIGENCE BASED SMART NAVIGATION SYSTEM FOR BLIND PEOPLE
ARTIFICIAL INTELLIGENCE BASED SMART NAVIGATION SYSTEM FOR BLIND PEOPLEARTIFICIAL INTELLIGENCE BASED SMART NAVIGATION SYSTEM FOR BLIND PEOPLE
ARTIFICIAL INTELLIGENCE BASED SMART NAVIGATION SYSTEM FOR BLIND PEOPLEIRJET Journal
 
Blind Stick Using Ultrasonic Sensor with Voice announcement and GPS tracking
Blind Stick Using Ultrasonic Sensor with Voice announcement and GPS trackingBlind Stick Using Ultrasonic Sensor with Voice announcement and GPS tracking
Blind Stick Using Ultrasonic Sensor with Voice announcement and GPS trackingvivatechijri
 
IRJET- Bemythirdeye- A Smart Electronic Blind Stick with Goggles
IRJET- Bemythirdeye- A Smart Electronic Blind Stick with GogglesIRJET- Bemythirdeye- A Smart Electronic Blind Stick with Goggles
IRJET- Bemythirdeye- A Smart Electronic Blind Stick with GogglesIRJET Journal
 
IRJET- Smart Assistive Device for Visually Impaired
IRJET- Smart Assistive Device for Visually ImpairedIRJET- Smart Assistive Device for Visually Impaired
IRJET- Smart Assistive Device for Visually ImpairedIRJET Journal
 
Final year project report final sem.pdf
Final year project report final sem.pdfFinal year project report final sem.pdf
Final year project report final sem.pdfAmaanAhmed55
 
Walking in Place with Wearable Technology: the development of a system for tr...
Walking in Place with Wearable Technology: the development of a system for tr...Walking in Place with Wearable Technology: the development of a system for tr...
Walking in Place with Wearable Technology: the development of a system for tr...Karel Van Isacker
 
IoT Based Assistive Glove for Visually Impaired People
IoT Based Assistive Glove for Visually Impaired PeopleIoT Based Assistive Glove for Visually Impaired People
IoT Based Assistive Glove for Visually Impaired PeopleIRJET Journal
 
Eye Directive Wheel Chair
Eye Directive Wheel ChairEye Directive Wheel Chair
Eye Directive Wheel ChairDhruv Shah
 

Similar to Robot Vision for the Visually Impaired (20)

Metric Table
Metric TableMetric Table
Metric Table
 
Design and Implementation of Ultrasonic Navigator for Visually Impaired
Design and Implementation of Ultrasonic Navigator for Visually ImpairedDesign and Implementation of Ultrasonic Navigator for Visually Impaired
Design and Implementation of Ultrasonic Navigator for Visually Impaired
 
Smart Walking Stick for Visually Challenged People
Smart Walking Stick for Visually Challenged PeopleSmart Walking Stick for Visually Challenged People
Smart Walking Stick for Visually Challenged People
 
IRJET- A Leading Hand for the Blind –A Review
IRJET- A Leading Hand for the Blind –A ReviewIRJET- A Leading Hand for the Blind –A Review
IRJET- A Leading Hand for the Blind –A Review
 
Visually Impaired People Monitoring in a Smart Home using Electronic White Cane
Visually Impaired People Monitoring in a Smart Home using Electronic White CaneVisually Impaired People Monitoring in a Smart Home using Electronic White Cane
Visually Impaired People Monitoring in a Smart Home using Electronic White Cane
 
VISUALLY IMPAIRED PEOPLE MONITORING IN A SMART HOME USING ELECTRONIC WHITE CANE
VISUALLY IMPAIRED PEOPLE MONITORING IN A SMART HOME USING ELECTRONIC WHITE CANEVISUALLY IMPAIRED PEOPLE MONITORING IN A SMART HOME USING ELECTRONIC WHITE CANE
VISUALLY IMPAIRED PEOPLE MONITORING IN A SMART HOME USING ELECTRONIC WHITE CANE
 
Sensor Stick walking aid for the blinds
Sensor Stick walking aid for the blindsSensor Stick walking aid for the blinds
Sensor Stick walking aid for the blinds
 
Robotic assistance for the visually impaired and elderly people
Robotic assistance for the visually impaired and elderly peopleRobotic assistance for the visually impaired and elderly people
Robotic assistance for the visually impaired and elderly people
 
Robotic assistance for the visually impaired and elderly people
Robotic assistance for the visually impaired and elderly peopleRobotic assistance for the visually impaired and elderly people
Robotic assistance for the visually impaired and elderly people
 
GPS & GSM based Voice Alert System for Blind Person
GPS & GSM based Voice Alert System for Blind PersonGPS & GSM based Voice Alert System for Blind Person
GPS & GSM based Voice Alert System for Blind Person
 
SMART BLIND STICK USING VOICE MODULE
SMART BLIND STICK USING VOICE MODULESMART BLIND STICK USING VOICE MODULE
SMART BLIND STICK USING VOICE MODULE
 
RASPBERRY PI BASED SMART WALKING STICK FOR VISUALLY IMPAIRED PERSON
RASPBERRY PI BASED SMART WALKING STICK FOR VISUALLY IMPAIRED PERSONRASPBERRY PI BASED SMART WALKING STICK FOR VISUALLY IMPAIRED PERSON
RASPBERRY PI BASED SMART WALKING STICK FOR VISUALLY IMPAIRED PERSON
 
ARTIFICIAL INTELLIGENCE BASED SMART NAVIGATION SYSTEM FOR BLIND PEOPLE
ARTIFICIAL INTELLIGENCE BASED SMART NAVIGATION SYSTEM FOR BLIND PEOPLEARTIFICIAL INTELLIGENCE BASED SMART NAVIGATION SYSTEM FOR BLIND PEOPLE
ARTIFICIAL INTELLIGENCE BASED SMART NAVIGATION SYSTEM FOR BLIND PEOPLE
 
Blind Stick Using Ultrasonic Sensor with Voice announcement and GPS tracking
Blind Stick Using Ultrasonic Sensor with Voice announcement and GPS trackingBlind Stick Using Ultrasonic Sensor with Voice announcement and GPS tracking
Blind Stick Using Ultrasonic Sensor with Voice announcement and GPS tracking
 
IRJET- Bemythirdeye- A Smart Electronic Blind Stick with Goggles
IRJET- Bemythirdeye- A Smart Electronic Blind Stick with GogglesIRJET- Bemythirdeye- A Smart Electronic Blind Stick with Goggles
IRJET- Bemythirdeye- A Smart Electronic Blind Stick with Goggles
 
IRJET- Smart Assistive Device for Visually Impaired
IRJET- Smart Assistive Device for Visually ImpairedIRJET- Smart Assistive Device for Visually Impaired
IRJET- Smart Assistive Device for Visually Impaired
 
Final year project report final sem.pdf
Final year project report final sem.pdfFinal year project report final sem.pdf
Final year project report final sem.pdf
 
Walking in Place with Wearable Technology: the development of a system for tr...
Walking in Place with Wearable Technology: the development of a system for tr...Walking in Place with Wearable Technology: the development of a system for tr...
Walking in Place with Wearable Technology: the development of a system for tr...
 
IoT Based Assistive Glove for Visually Impaired People
IoT Based Assistive Glove for Visually Impaired PeopleIoT Based Assistive Glove for Visually Impaired People
IoT Based Assistive Glove for Visually Impaired People
 
Eye Directive Wheel Chair
Eye Directive Wheel ChairEye Directive Wheel Chair
Eye Directive Wheel Chair
 

Recently uploaded

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 

Recently uploaded (20)

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 

Robot Vision for the Visually Impaired

  • 1. Robot Vision for the Visually Impaired Vivek Pradeep, Gerard Medioni, James Weiland presented by Phongsathorn Eakamongul Department of Computer Science Asian Institute of Technology 2010, December 7 Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 1 / 18
  • 2. Outline 1 Abstracts 2 System Description 3 Result Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 2 / 18
  • 3. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 4. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 5. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 6. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 7. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 8. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 9. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 10. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 11. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 12. Abstracts head-mounted : wide-field information compare to shoulder or waist-mounted design in literature which require body rotations stereo-vision navigational assistance device visual odometry : dense 3D with 2D elevation grids metric-topological SLAM build vicinity map 3D traversability analysis to steer subjects away from obstacles in the path use microvibration motors provides cues for taking evasive action : they use tactile cues instead of audio since the latter impose greater cognitive load on the subject, and blind users rely on hearing to perform a wide variety of other tasks experiment running at 10 Hz Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 3 / 18
  • 13. Introduction visual impairment : need long cane or guide dog In US, 109,000 people : use long canes, 7,000 use dog guides only 1,500 graduate from dog-guid user program Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 4 / 18
  • 14. Introduction visual impairment : need long cane or guide dog In US, 109,000 people : use long canes, 7,000 use dog guides only 1,500 graduate from dog-guid user program Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 4 / 18
  • 15. Introduction visual impairment : need long cane or guide dog In US, 109,000 people : use long canes, 7,000 use dog guides only 1,500 graduate from dog-guid user program Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 4 / 18
  • 16. Introduction visual impairment : need long cane or guide dog In US, 109,000 people : use long canes, 7,000 use dog guides only 1,500 graduate from dog-guid user program Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 4 / 18
  • 17. Introduction visual impairment : need long cane or guide dog In US, 109,000 people : use long canes, 7,000 use dog guides only 1,500 graduate from dog-guid user program Electronic travel aids (ETAs), leveraging ultrasonic, laser, or vision sensors Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 4 / 18
  • 18. wearable array of microvibration motors provides a tactile cues and guide user along safe path Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 5 / 18
  • 19. Outline 1 Abstracts 2 System Description 3 Result Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 6 / 18
  • 20. Online SLAM + obstacle detection Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 7 / 18
  • 21. Stereo Vision Odometry t−1 t−1 t t−1 t−1 t matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be computed using three-point algorithm in RANSAC setting for robustness, features matching and reprojection errors are measured across four views Sparse Bundle Adjustment feature covariances can be propagated to get motion uncertainty for use in the SLAM filter Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 8 / 18
  • 22. Stereo Vision Odometry t−1 t−1 t t−1 t−1 t matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be computed using three-point algorithm in RANSAC setting for robustness, features matching and reprojection errors are measured across four views Sparse Bundle Adjustment feature covariances can be propagated to get motion uncertainty for use in the SLAM filter Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 8 / 18
  • 23. Stereo Vision Odometry t−1 t−1 t t−1 t−1 t matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be computed using three-point algorithm in RANSAC setting for robustness, features matching and reprojection errors are measured across four views Sparse Bundle Adjustment feature covariances can be propagated to get motion uncertainty for use in the SLAM filter Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 8 / 18
  • 24. Stereo Vision Odometry t−1 t−1 t t−1 t−1 t matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be computed using three-point algorithm in RANSAC setting for robustness, features matching and reprojection errors are measured across four views Sparse Bundle Adjustment feature covariances can be propagated to get motion uncertainty for use in the SLAM filter Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 8 / 18
  • 25. Stereo Vision Odometry t−1 t−1 t t−1 t−1 t matched correspondences across (PL , PR , PL ) or (PL , PR , PR ) can be computed using three-point algorithm in RANSAC setting for robustness, features matching and reprojection errors are measured across four views Sparse Bundle Adjustment feature covariances can be propagated to get motion uncertainty for use in the SLAM filter Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 8 / 18
  • 26. SLAM Rao-Blackwellised particle filter (RBPF) in FastSLAM framework which use KLT and SIFT tracking construct 2 maps SLAM map : collection of sparse landmarks that propagated every frame to yield consistent camera pose estimates, for SLAM purpose only traversability map : dense 3D cloud from triangulation Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 9 / 18
  • 27. SLAM Rao-Blackwellised particle filter (RBPF) in FastSLAM framework which use KLT and SIFT tracking construct 2 maps SLAM map : collection of sparse landmarks that propagated every frame to yield consistent camera pose estimates, for SLAM purpose only traversability map : dense 3D cloud from triangulation Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 9 / 18
  • 28. SLAM Rao-Blackwellised particle filter (RBPF) in FastSLAM framework which use KLT and SIFT tracking construct 2 maps SLAM map : collection of sparse landmarks that propagated every frame to yield consistent camera pose estimates, for SLAM purpose only traversability map : dense 3D cloud from triangulation Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 9 / 18
  • 29. SLAM Rao-Blackwellised particle filter (RBPF) in FastSLAM framework which use KLT and SIFT tracking construct 2 maps SLAM map : collection of sparse landmarks that propagated every frame to yield consistent camera pose estimates, for SLAM purpose only traversability map : dense 3D cloud from triangulation Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 9 / 18
  • 30. SLAM Rao-Blackwellised particle filter (RBPF) in FastSLAM framework which use KLT and SIFT tracking construct 2 maps SLAM map : collection of sparse landmarks that propagated every frame to yield consistent camera pose estimates, for SLAM purpose only traversability map : dense 3D cloud from triangulation Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 9 / 18
  • 31. SLAM Rao-Blackwellised particle filter (RBPF) in FastSLAM framework which use KLT and SIFT tracking construct 2 maps SLAM map : collection of sparse landmarks that propagated every frame to yield consistent camera pose estimates, for SLAM purpose only traversability map : dense 3D cloud from triangulation Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 9 / 18
  • 32. Metric-Topological SLAM serveral thousands of landmarks environment two levels of environment representation local, metric (submap) : estimates state information six dimensional camera trajectory st sparse map mt feature observations (KLT/SIFT) z t camera motion estimates u t RBPF p(st , mt |z t , u t ) ≈ p(st |z t , u t ) i p(mt (i)|st , z t , u t ) mt (i) : ith landmark in the map represented by N(µi , σ i ) each time feature is observed, the corresponding lankmark is updated using EKF RBPF enables us to only update the observed landmark instead of the whole map global topological map is represents as a collection of submap annotated graph G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt ) a i M : annotated submaps Ωt : set of computed submaps b a Λ : coordinate transformations between adjacent maps Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 10 / 18
  • 33. Metric-Topological SLAM serveral thousands of landmarks environment two levels of environment representation local, metric (submap) : estimates state information six dimensional camera trajectory st sparse map mt feature observations (KLT/SIFT) z t camera motion estimates u t RBPF p(st , mt |z t , u t ) ≈ p(st |z t , u t ) i p(mt (i)|st , z t , u t ) mt (i) : ith landmark in the map represented by N(µi , σ i ) each time feature is observed, the corresponding lankmark is updated using EKF RBPF enables us to only update the observed landmark instead of the whole map global topological map is represents as a collection of submap annotated graph G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt ) a i M : annotated submaps Ωt : set of computed submaps b a Λ : coordinate transformations between adjacent maps Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 10 / 18
  • 34. Metric-Topological SLAM serveral thousands of landmarks environment two levels of environment representation local, metric (submap) : estimates state information six dimensional camera trajectory st sparse map mt feature observations (KLT/SIFT) z t camera motion estimates u t RBPF p(st , mt |z t , u t ) ≈ p(st |z t , u t ) i p(mt (i)|st , z t , u t ) mt (i) : ith landmark in the map represented by N(µi , σ i ) each time feature is observed, the corresponding lankmark is updated using EKF RBPF enables us to only update the observed landmark instead of the whole map global topological map is represents as a collection of submap annotated graph G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt ) a i M : annotated submaps Ωt : set of computed submaps b a Λ : coordinate transformations between adjacent maps Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 10 / 18
  • 35. Metric-Topological SLAM serveral thousands of landmarks environment two levels of environment representation local, metric (submap) : estimates state information six dimensional camera trajectory st sparse map mt feature observations (KLT/SIFT) z t camera motion estimates u t RBPF p(st , mt |z t , u t ) ≈ p(st |z t , u t ) i p(mt (i)|st , z t , u t ) mt (i) : ith landmark in the map represented by N(µi , σ i ) each time feature is observed, the corresponding lankmark is updated using EKF RBPF enables us to only update the observed landmark instead of the whole map global topological map is represents as a collection of submap annotated graph G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt ) a i M : annotated submaps Ωt : set of computed submaps b a Λ : coordinate transformations between adjacent maps Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 10 / 18
  • 36. Metric-Topological SLAM serveral thousands of landmarks environment two levels of environment representation local, metric (submap) : estimates state information six dimensional camera trajectory st sparse map mt feature observations (KLT/SIFT) z t camera motion estimates u t RBPF p(st , mt |z t , u t ) ≈ p(st |z t , u t ) i p(mt (i)|st , z t , u t ) mt (i) : ith landmark in the map represented by N(µi , σ i ) each time feature is observed, the corresponding lankmark is updated using EKF RBPF enables us to only update the observed landmark instead of the whole map global topological map is represents as a collection of submap annotated graph G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt ) a i M : annotated submaps Ωt : set of computed submaps b a Λ : coordinate transformations between adjacent maps Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 10 / 18
  • 37. Metric-Topological SLAM serveral thousands of landmarks environment two levels of environment representation local, metric (submap) : estimates state information six dimensional camera trajectory st sparse map mt feature observations (KLT/SIFT) z t camera motion estimates u t RBPF p(st , mt |z t , u t ) ≈ p(st |z t , u t ) i p(mt (i)|st , z t , u t ) mt (i) : ith landmark in the map represented by N(µi , σ i ) each time feature is observed, the corresponding lankmark is updated using EKF RBPF enables us to only update the observed landmark instead of the whole map global topological map is represents as a collection of submap annotated graph G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt ) a i M : annotated submaps Ωt : set of computed submaps b a Λ : coordinate transformations between adjacent maps Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 10 / 18
  • 38. Metric-Topological SLAM serveral thousands of landmarks environment two levels of environment representation local, metric (submap) : estimates state information six dimensional camera trajectory st sparse map mt feature observations (KLT/SIFT) z t camera motion estimates u t RBPF p(st , mt |z t , u t ) ≈ p(st |z t , u t ) i p(mt (i)|st , z t , u t ) mt (i) : ith landmark in the map represented by N(µi , σ i ) each time feature is observed, the corresponding lankmark is updated using EKF RBPF enables us to only update the observed landmark instead of the whole map global topological map is represents as a collection of submap annotated graph G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt ) a i M : annotated submaps Ωt : set of computed submaps b a Λ : coordinate transformations between adjacent maps Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 10 / 18
  • 39. Metric-Topological SLAM serveral thousands of landmarks environment two levels of environment representation local, metric (submap) : estimates state information six dimensional camera trajectory st sparse map mt feature observations (KLT/SIFT) z t camera motion estimates u t RBPF p(st , mt |z t , u t ) ≈ p(st |z t , u t ) i p(mt (i)|st , z t , u t ) mt (i) : ith landmark in the map represented by N(µi , σ i ) each time feature is observed, the corresponding lankmark is updated using EKF RBPF enables us to only update the observed landmark instead of the whole map global topological map is represents as a collection of submap annotated graph G = ({i M}i∈Ωt , {b Λ}a,b∈Ωt ) a i M : annotated submaps Ωt : set of computed submaps b a Λ : coordinate transformations between adjacent maps Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 10 / 18
  • 40. Traversability Map 5 radius sphere multi-surface elevation map : point cloud is quantized into 2D grid Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 11 / 18
  • 41. Traversability Map 5 radius sphere multi-surface elevation map : point cloud is quantized into 2D grid Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 11 / 18
  • 42. Traversability Map 5 radius sphere multi-surface elevation map : point cloud is quantized into 2D grid Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 11 / 18
  • 43. Prediction Motion and Cue Generation if magnitude of translation respect to previous position exceeds certain threshold, the direction of motion and reference position are updated little translation -> no update cue generation : most continuous traversable path ( Green color in picture ) Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 12 / 18
  • 44. Outline 1 Abstracts 2 System Description 3 Result Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 13 / 18
  • 45. Result Green : travesible Red : not Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 14 / 18
  • 46. error of camera frame-to-frame heading (yaw), when compared with readings from a commercially Inertial Measurement Unit (IMU) camera motion : slow (< 5 degree/s), medium (5-20 degree/s), fast (20-30 degree/s) Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 15 / 18
  • 47. SLAM result Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 16 / 18
  • 48. Traversability Map one frame exp patch that has thickness > 30 cm is labeled as vertical 5 horizontal patches is labeled as traversable Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 17 / 18
  • 49. Experiment Manually generate cues : wireless remote control Autonomous generate cues, like group 4 Phongsathorn (AIT) Robot Vision for the Visually Impaired Short Occasion 18 / 18