SpeeG
A Multimodal Speech- and Gesture-based Text Input Solution
Lode Hoste, Bruno Dumas and Beat Signer

Text-input for set-top boxes




Vrije Universiteit Brussel   SpeeG - Lode Hoste   2
1D Keyboard for Kinect, Chatpad Controller, Virtual Keyboard for Xbox
SwiftKey, 8Pen, EdgeWriter
Dasher, Speech Dasher, SpeeG

Virtual keyboard




Kinect 1D keyboard

Dasher

- Continuous input
- Joystick / Gaze / ...
- Open vocabulary
- Allows imprecise navigation

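To illustrate why a Dasher-style interface tolerates imprecise navigation, here is a toy sketch of the layout idea: candidate characters are stacked in alphabetical order, and each one gets a box whose height is proportional to its probability, so likely characters are large targets. This is not Dasher's actual implementation; the function names `layout_boxes` and `symbol_at` and the example probabilities are hypothetical.

```python
def layout_boxes(probs, top=0.0, bottom=1.0):
    """Split the vertical range [top, bottom) into one box per symbol.

    Symbols are placed in alphabetical order and each box height is
    proportional to the symbol's probability (toy model, assumption).
    """
    total = sum(probs.values())
    if total <= 0:
        raise ValueError("probabilities must sum to a positive value")
    boxes = []
    y = top
    for symbol in sorted(probs):
        height = (bottom - top) * probs[symbol] / total
        boxes.append((symbol, y, y + height))
        y += height
    return boxes


def symbol_at(boxes, y):
    """Return the symbol whose box contains the pointer position y, if any."""
    for symbol, low, high in boxes:
        if low <= y < high:
            return symbol
    return None


# Hypothetical next-character probabilities.
boxes = layout_boxes({"b": 0.25, "a": 0.5, "c": 0.25})
print(boxes)
print(symbol_at(boxes, 0.6))
```

A pointer anywhere in the upper half of the range selects "a" in this example, which is the sense in which a probable character does not require a precise gesture.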
Goals:
- Controller-free
- Text input
- Without training

Used technologies:
- Kinect
- CMU Sphinx
- Dasher

SpeeG




SpeeG Architecture

[Figure: architecture diagram connecting the User, the Speech Recogniser (CMU Sphinx 4), Hand Tracking (Microsoft Kinect and NITE) and the GUI (JDasher); the connections are numbered 1 to 5.]

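The slides do not spell out how the recogniser output reaches the character-level interface. As a minimal sketch under that caveat, one plausible approach is to turn scored word hypotheses into next-character probabilities for the prefix selected so far. The function `next_char_probs`, the hypothesis words and their scores are all hypothetical, and nothing here uses the CMU Sphinx or JDasher APIs.

```python
from collections import defaultdict


def next_char_probs(hypotheses, typed_prefix):
    """Derive next-character probabilities from scored word hypotheses.

    hypotheses: mapping of candidate word -> score (assumed non-negative).
    typed_prefix: the characters already selected for the current word.
    Returns a mapping of next character -> normalised probability,
    in alphabetical order; empty if no hypothesis extends the prefix.
    """
    weights = defaultdict(float)
    for word, score in hypotheses.items():
        if word.startswith(typed_prefix) and len(word) > len(typed_prefix):
            weights[word[len(typed_prefix)]] += score
    total = sum(weights.values())
    if total == 0:
        return {}
    return {ch: w / total for ch, w in sorted(weights.items())}


# Hypothetical recogniser output for one spoken word.
hypotheses = {"watch": 0.5, "wash": 0.25, "what": 0.25}
print(next_char_probs(hypotheses, "w"))
```

In this sketch the hand gesture would only have to choose between the few characters that the spoken word makes likely, rather than between the whole alphabet.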
Evaluation

[Figure: the four evaluated setups: Virtual Keyboard, Kinect Keyboard, Speech-only, SpeeG]

Evaluation

7 (male) users: 23-31y

Sentences 1-3 (DARPA's TIMIT):
- "this was easy for us"
- "he will allow a rare lie"
- "did you eat yet"

Sentences 4-6 (MacKenzie and Soukoreff):
- "my watch fell in the water"
- "the world is a stage"
- "peek out the window"

Performed a quantitative (words per minute and number of errors) and qualitative (feedback and preference) evaluation

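The slides do not state how words per minute and errors were computed. The sketch below shows one common way to obtain both measures, assuming the widespread "five characters per word" convention for WPM and a word-level edit distance for errors. Both choices, and the names `words_per_minute` and `word_errors`, are assumptions rather than the authors' method.

```python
def words_per_minute(transcribed, seconds, chars_per_word=5):
    """WPM using the 'five characters per word' convention (assumption)."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return (len(transcribed) / chars_per_word) * (60.0 / seconds)


def word_errors(reference, transcribed):
    """Word-level Levenshtein distance between two sentences.

    Counts the minimum number of word insertions, deletions and
    substitutions needed to turn `transcribed` into `reference`.
    """
    ref, hyp = reference.split(), transcribed.split()
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


# Hypothetical timing: a 15-character sentence entered in 30 seconds.
print(words_per_minute("did you eat yet", 30.0))                   # 6.0
print(word_errors("the world is a stage", "the word is stage"))    # 2
```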
Virtual keyboard: 6.3 WPM

[Figure: WPM (0 to 10) per sentence (S1 to S6) for Users 1 to 7]

Kinect Keyboard: 1.83 WPM

[Figure: WPM (0.00 to 3.50) per sentence (S1 to S6) for Users 1 to 7; User 7 is marked with an asterisk]

Speech-only: 11 WPM

[Figure: WPM (0 to 40) per sentence (S1 to S6) for Users 1 to 7]

SpeeG: 5.8 WPM

[Figure: WPM (0 to 10) per sentence (S1 to S6) for Users 1 to 7]

SpeeG: 2.6 7.8 WPM

[Figure: WPM (0 to 10) per sentence (S1 to S6) for Users 1 to 7]

Mean WPM per sentence and input device

[Figure: mean WPM (0 to 25) per sentence (S1 to S6) for Controller, Speech only, Kinect only and SpeeG]

Errors per sentence and input device

[Figure: mean number of errors (0 to 10) per sentence (S1 to S6) for Controller, Speech only, Kinect only and SpeeG]

Future work

- Other visualisations
- Smaller gestures
- Dedicated commands (gesture / voice)

SpeeG
A Multimodal Speech- and Gesture-based Text Input Solution
Lode Hoste, Bruno Dumas, Beat Signer

Kinect
- Controller-free text input
- Real-time correction
- Dasher, zoomable interface
  - probabilities
  - alphabetic order
  - character-level

Speech
- Non-native speakers
- Untrained voice recogniser
- 6-12 WPM
- Perceived fastest
- Game-like character
- Novice and experts

Special thanks to Jorn De Baerdenmaeker and Keith Vertaenen

User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Pests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdfPests of Bengal gram_Identification_Dr.UPR.pdf
Pests of Bengal gram_Identification_Dr.UPR.pdf
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 

SpeeG - A Multimodal Speech- and Gesture-based Text Input Solution

  • 1. SpeeG: A Multimodal Speech- and Gesture-based Text Input Solution. Lode Hoste, Bruno Dumas and Beat Signer
  • 2. Text-input for set-top boxes Vrije Universiteit Brussel SpeeG - Lode Hoste 2
  • 5. Text-input for set-top boxes
  • 6. 1D Keyboard for Kinect; Chatpad Controller; Virtual Keyboard for Xbox; SwiftKey; 8Pen; EdgeWriter; Dasher; Speech Dasher; SpeeG
  • 7. Virtual keyboard
  • 8. Kinect 1D keyboard
  • 9. Kinect 1D keyboard
  • 10. 1D Keyboard for Kinect; Chatpad Controller; Virtual Keyboard for Xbox; SwiftKey; 8Pen; EdgeWriter; Dasher; Speech Dasher; SpeeG
  • 11. 1D Keyboard for Kinect; Chatpad Controller; Virtual Keyboard for Xbox; SwiftKey; 8Pen; EdgeWriter; Dasher; Speech Dasher; SpeeG
  • 12. Dasher: Continuous input; Joystick / Gaze / ...; Open vocabulary; Allows imprecise navigation
  • 13. Dasher
  • 14. Goals: Controller-free; Text input; Without training. Used technologies: Kinect; CMU Sphinx; Dasher
  • 15. SpeeG
  • 17. SpeeG Architecture [diagram: User; GUI (JDasher); Speech Recogniser (CMU Sphinx 4); Hand Tracking (Microsoft Kinect and NITE); connections numbered 1 to 5]
  • 18. Evaluation: Virtual Keyboard; Kinect Keyboard; Speech-only; SpeeG
  • 19. Evaluation. 7 (male) users, aged 23-31. Sentences 1-3 (DARPA’s TIMIT): “this was easy for us”, “he will allow a rare lie”, “did you eat yet”. Sentences 4-6 (MacKenzie and Soukoreff): “my watch fell in the water”, “the world is a stage”, “peek out the window”. Performed a quantitative (words per minute and number of errors) and qualitative (feedback and preference) evaluation
  • 20. Virtual keyboard: 6.3 WPM [chart: WPM per sentence (S1-S6) for Users 1-7]
  • 21. Kinect Keyboard: 1.83 WPM [chart: WPM per sentence (S1-S6) for Users 1-7; User 7 is marked with an asterisk]
  • 22. Speech-only: 11 WPM [chart: WPM per sentence (S1-S6) for Users 1-7]
  • 23. SpeeG: 5.8 WPM [chart: WPM per sentence (S1-S6) for Users 1-7]
  • 24. SpeeG: 2.6 7.8 WPM [chart: WPM per sentence (S1-S6) for Users 1-7]
  • 25. Mean WPM per sentence and input device [chart: mean WPM per sentence (S1-S6); input devices: Virtual Keyboard for Xbox, 1D Keyboard for Xbox, Speech-only, SpeeG; legend: Controller, Speech only, Kinect only, SpeeG]
  • 26. Errors per sentence and input device [chart: mean number of errors per sentence (S1-S6); input devices: Virtual Keyboard for Xbox, 1D Keyboard for Xbox, Speech-only, SpeeG; legend: Controller, Speech only, Kinect only, SpeeG]
  • 28. Future work: Other visualisations; Smaller gestures; Dedicated commands (gesture / voice)
  • 30. SpeeG: A Multimodal Speech- and Gesture-based Text Input Solution. Lode Hoste, Bruno Dumas, Beat Signer. Kinect; Speech; Controller-free text input; Non-native speakers; Real-time correction; Untrained voice recogniser; Dasher, zoomable interface (probabilities, alphabetic order, character-level); 6-12 WPM; Perceived fastest; Game-like character; Novice and experts. Vrije Universiteit Brussel. Special thanks to Jorn De Baerdenmaeker and Keith Vertanen
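
The quantitative measures named on slide 19 (words per minute and number of errors) can be illustrated with a short sketch. The slides do not say how either figure was computed, so everything here is an assumption: `words_per_minute` uses the text-entry convention of five characters per word, and `edit_distance` uses character-level Levenshtein distance as a stand-in for an error count. The function names, the 12-second timing and the misrecognised sentence are hypothetical.

```python
def words_per_minute(transcribed: str, seconds: float, chars_per_word: int = 5) -> float:
    """Entry speed, assuming the common convention of 5 characters per word.

    The first character is not timed, hence len(transcribed) - 1.
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return (len(transcribed) - 1) / seconds * 60 / chars_per_word


def edit_distance(target: str, entered: str) -> int:
    """Character-level Levenshtein distance (assumed stand-in for 'errors')."""
    prev = list(range(len(entered) + 1))
    for i, ct in enumerate(target, start=1):
        cur = [i]
        for j, ce in enumerate(entered, start=1):
            cur.append(min(
                prev[j] + 1,               # character missing from the entry
                cur[j - 1] + 1,            # extra character in the entry
                prev[j - 1] + (ct != ce),  # substitution, or match at no cost
            ))
        prev = cur
    return prev[-1]


if __name__ == "__main__":
    # Hypothetical trial: one of the evaluation sentences entered in 12 seconds.
    print(words_per_minute("the world is a stage", 12.0))
    # Hypothetical misrecognition of another evaluation sentence.
    print(edit_distance("peek out the window", "peak out the window"))
```

Counting errors at the word level instead would only require passing lists of words (for example `target.split()`) to `edit_distance`, since it compares items by equality.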