SlideShare une entreprise Scribd logo
1  sur  22
WP 3 Presentation:
Dialogue Manager
Jürgen Geiger
Overview
• Goals
• Achievements
• Open Questions
• List of Publications
04.06.2013 WP 3 Presentation 2
Goals
• Dialogue Manager
– Back-end for HMI
– Control all other modules
• Applications: Games, Reading service, …
• Physiological Monitoring
04.06.2013 WP 3 Presentation 3
Tasks
T3.1 User identification via speech or face recognition
T3.2 Knowledge representation
T3.3 Development of a dialogue system
T3.4 Development and Integration of a game collection
T3.5 Web 2.0 wrapper for web services
T3.6 Integration of further software modules
T3.7 Adaptable behaviour of the robot platform
T3.8 Integration of natural language understanding
T3.9 Physiological monitoring
T3.10 Integration of the physiological monitoring into the dialogue manager
04.06.2013 WP 3 Presentation 4
Deliverables
04.06.2013 WP 3 Presentation 5
Name Due
D3.1 Report on the dialogue manager concept 09/2010
D3.2 Knowledge databases 04/2011
D3.3 Identification System (face & voice) 01/2011
D3.4 Prototype of dialogue manager 04/2011
D3.5 Physiological Monitoring (PM) 02/2011
D3.6 Dialogue system with integrated PM 06/2011
D3.7 Dialogue system updated to user‘s needs 05/2012
D3.8 Final dialogue system with integrated PM 02/2013
Achievements
• Dialogue Manager
– Control of all other modules
– Natural language understanding
• Software modules
– Physiological Monitoring
– User Identification
• Adaptable behaviour
– Emotions
• Physiological Monitoring
04.06.2013 WP 3 Presentation 6
Dialogue Manager: Overview (T3.3, D3.1, D3.4, D3.7, D3.8)
• Central component of the ALIAS robot („brain“)
– Reproduces the basic mechanisms of human thinking
– Decides on the behavior of the robot
– Communicates with all other modules
04.06.2013 WP 3 Presentation 7
Hello
Robot!
TTS
Face Detect
ASR
Robot Control
GUI
Touch Screen
DM Core
Situation Model
Action
Input
CES
Understanding
Physio Monitor
Dialogue Manager: Overview
04.06.2013 WP 3 Presentation 8
• Components
– DM-Core („Brain“)
• NLU-Engine understands human
verbal messages
• Decision-Engine decides on the
behavior
• Based on conceptual event
representations (human thinking)
– DM-Communicator
• Communicates with sensing and
acting modules
• Translates between modules and
DM-Core
Natural Language Understanding
• NLU-Engine (T3.8, D3.2)
Based on Cognesys CES technology
Extracts and processes the conceptual meaning of
verbal messages
Resistent to syntactically or grammatically degraded
informations
Uses knowledge and current situation to identify and
check the practicability of identified statements
• NLU-Knowledge Database (T3.2, D3.2)
World knowledge: understands the world in general,
simulates human memory
Expert knowledge: understands the world of elderly
people and depends on the robots functionality
04.06.2013 WP 3 Presentation 9
Acting and Behavior (T3.8, D3.2)
04.06.2013 WP 3 Presentation 10
• Decision-Engine
Based on Cognesys CES technology
Processes conceptual event representations
like humans do
Uses a situation model like human memory
Situation model
• Represents the currently relevant objects and
their states and modalities
• Represents history of events that constitutes the
current situation
Proactive behavior
• Example: inform the user about new mails, invites the
user to stay in contact with its relatives
Dialogue Management (T3.6)
04.06.2013 WP 3 Presentation 11
• ASR Adapter
– Receives spoken user input as text
– The NLU-Engine processes the text
• GUI Adapter
– Controls the GUI, processes user input
• Menus
• Games, TV, audio books, email …
• Skype call and alarm call control flow
– Synchronizes the GUI menus with BCI masks
• BCI Adapter
– Controls the Brain Computer Interface masks
– Processes user inputs
Dialogue Management
04.06.2013 WP 3 Presentation 12
• TTS Adapter
– Sends text to be spoken to the
Text-To-Speech module
• RD Adapter
– Interface to the robots low-level-controlers
– Controls navigation and movement behavior
– Controls the robots head emotions
– Receives speaker ident information
User identification: speech (T3.1, D3.3)
• Research aspects
– Speaker diarization
– Overlap detection
– Speech activity detection
• Implementation for the robot
04.06.2013 WP 3 Presentation 13
Research aspects
• Speaker diarization
– „Who speaks when?“
– Utilise the output of a speech transcription system to suppress
linguistic variation
• Overlap detection
– Overlapping speech degrades performance
– Detect & handle overlap
• Voice activity detection
04.06.2013 WP 3 Presentation 14
Speaker Recognition : Implementation
• Integrated with DM
• Running permanently
• DM receives name of
speaker
• Used during TTS output
– To call the user by his name
04.06.2013 WP 3 Presentation 15
User Identification: Face (T3.1, D3.3)
• Omnidirectional camera
• Viola & Jones algorithm for face detection
• Fusion with laser-based leg pair detection
• Face identification using Eigenfaces
• Keep eye contact with user
04.06.2013 WP 3 Presentation 16
Gaming with Speech Control (T3.4, D3.8)
• Control game via ASR
• Noughts and crosses
• AI to control computer player
• Touchscreen control also
possible
04.06.2013 WP 3 Presentation 17
Reading Service (T3.5, D3.8)
• Customised GUI
• Based on open-source software
• Functionality:
– Read out e-books
– Recognition from camera
04.06.2013 WP 3 Presentation 18
Display of Emotions (T3.7, D3.8)
• Can ALIAS display emotions?
• 5 basic emotions (Disgust, Fear, Joy, Sadness, Surprise)
• Integrated into Dialogue System
04.06.2013 WP 3 Presentation 19
Disgust Neutral Sadness
Physiological Monitoring (T3.9, T3.10, D3.5, D3.6)
• Vital function monitoring system
• Recording, saving, display of vital function data
– Manual data input
– Data input directly by sensors
• Alarm function for suspicious data values
04.06.2013 WP 3 Presentation 20
Open questions
• Personal data: storage and
usage
– Person ID, physiological monitoring
– Who gets access?
• Learning how to use the robot
– Self-explanatory system
– Systems adapts to the user
• Tablet PC?
04.06.2013 WP 3 Presentation 21
Selected Publications
• J. Geiger, M. Hofmann, B.Schuller and G. Rigoll: "Gait-based Person Identification by Spectral, Cepstral and Energy-
related Audio Features," ICASSP 2013
• J. Geiger, T. Leykauf, T. Rehrl, F. Wallhoff, G. Rigoll: "The Robot ALIAS as a Gaming Platform for Elderly Persons," AAL-
Kongress 2013
• J. Geiger, I. Yenin, T. Rehrl, F. Wallhoff, G. Rigoll: "Display of Emotions with the Robotic Platform ALIAS", AAL-Kongress
2013
• T. Rehrl, J. Geiger, M. Golcar, S. Gentsch, J. Knobloch, G. Rigoll: "The Robot ALIAS as a Database for Health Monitoring
for Elderly People," AAL-Kongress 2013
• T. Rehrl, R. Troncy, A. Bley, S. Ihsen, K. Scheibl, W. Schneider, S. Glende, S. Goetze, J. Kessler, C. Hintermueller, and F.
Wallhoff: “The Ambient Adaptable Living Assistant is Meeting its Users,“ AAL-Forum 2012
• T. Rehrl, J. Blume, A. Bannat, G. Rigoll, and F. Wallhoff: “On-line Learning of Dynamic Gestures for Human-Robot
Interaction,“ KI 2012
• J. Geiger, R. Vipperla, S. Bozonnet, N. Evans, B. Schuller, G. Rigoll: " Convolutive Non-Negative Sparse Coding and New
Features for Speech Overlap Handling in Speaker Diarization", INTERSPEECH 2012
• R. Vipperla, J. Geiger, S. Bozonnet, D. Wang, N. Evans, B. Schuller, G. Rigoll: "Speech Overlap Detection and Attribution
Using Convolutive Non-Negative Sparse Coding", ICASSP 2012
• J. Geiger, M. Lakhal, B. Schuller, and G. Rigoll: “Learning new acoustic events in an HMM-based system using MAP
adaptation,“ INTERSPEECH 2011
• T. Rehrl, J. Blume, J. Geiger, A. Bannat, F. Wallhoff, S. Ihsen, Y. Jeanrenaud, M. Merten, B. Schönebeck, S. Glende, and
C. Nedopil: “ALIAS: Der anpassungsfähige Ambient Living Assistent,“ AAL-Kongress 2011
04.06.2013 WP 3 Presentation 22

Contenu connexe

En vedette

En vedette (7)

Characteristics of highly effective enterprise virtual assistants
Characteristics of highly effective enterprise virtual assistantsCharacteristics of highly effective enterprise virtual assistants
Characteristics of highly effective enterprise virtual assistants
 
16 Quotes that Defined AI and Intelligent Virtual Assistants in 2015
16 Quotes that Defined AI and Intelligent Virtual Assistants in 201516 Quotes that Defined AI and Intelligent Virtual Assistants in 2015
16 Quotes that Defined AI and Intelligent Virtual Assistants in 2015
 
Ai powered personal assistants
Ai powered personal assistantsAi powered personal assistants
Ai powered personal assistants
 
Kevin Shaw at AI Frontiers: AI on the Edge: Bringing Intelligence to Small De...
Kevin Shaw at AI Frontiers: AI on the Edge: Bringing Intelligence to Small De...Kevin Shaw at AI Frontiers: AI on the Edge: Bringing Intelligence to Small De...
Kevin Shaw at AI Frontiers: AI on the Edge: Bringing Intelligence to Small De...
 
Making Intelligent Virtual Assistants a Reality
Making Intelligent Virtual Assistants a RealityMaking Intelligent Virtual Assistants a Reality
Making Intelligent Virtual Assistants a Reality
 
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
 
AI Agent and Chatbot Trends For Enterprises
AI Agent and Chatbot Trends For EnterprisesAI Agent and Chatbot Trends For Enterprises
AI Agent and Chatbot Trends For Enterprises
 

Similaire à ALIAS WP3 Results

MSR2014 opening
MSR2014 openingMSR2014 opening
MSR2014 opening
Sung Kim
 
VOICE-ASSISTANT-IN-PYTHON-pptx.pptx
VOICE-ASSISTANT-IN-PYTHON-pptx.pptxVOICE-ASSISTANT-IN-PYTHON-pptx.pptx
VOICE-ASSISTANT-IN-PYTHON-pptx.pptx
ITB450RUTIKASALUNKHE
 
1 PROGRAM ISEM RESEARCH PAPER FOR APPLIED.docx
1 PROGRAM ISEM RESEARCH PAPER FOR APPLIED.docx1 PROGRAM ISEM RESEARCH PAPER FOR APPLIED.docx
1 PROGRAM ISEM RESEARCH PAPER FOR APPLIED.docx
honey725342
 

Similaire à ALIAS WP3 Results (20)

LoCloud - D5.4: Analysis and Recommendations
LoCloud - D5.4: Analysis and RecommendationsLoCloud - D5.4: Analysis and Recommendations
LoCloud - D5.4: Analysis and Recommendations
 
Behaviometrics: Behavior Modeling from Heterogeneous Sensory Time-Series
Behaviometrics: Behavior Modeling from Heterogeneous Sensory Time-SeriesBehaviometrics: Behavior Modeling from Heterogeneous Sensory Time-Series
Behaviometrics: Behavior Modeling from Heterogeneous Sensory Time-Series
 
Ontology of a temperature sensor
Ontology of a temperature sensorOntology of a temperature sensor
Ontology of a temperature sensor
 
CRC Final Report
CRC Final ReportCRC Final Report
CRC Final Report
 
IRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for BlindsIRJET- Virtual Vision for Blinds
IRJET- Virtual Vision for Blinds
 
MSR2014 opening
MSR2014 openingMSR2014 opening
MSR2014 opening
 
VOICE-ASSISTANT-IN-PYTHON-pptx.pptx
VOICE-ASSISTANT-IN-PYTHON-pptx.pptxVOICE-ASSISTANT-IN-PYTHON-pptx.pptx
VOICE-ASSISTANT-IN-PYTHON-pptx.pptx
 
1 PROGRAM ISEM RESEARCH PAPER FOR APPLIED.docx
1 PROGRAM ISEM RESEARCH PAPER FOR APPLIED.docx1 PROGRAM ISEM RESEARCH PAPER FOR APPLIED.docx
1 PROGRAM ISEM RESEARCH PAPER FOR APPLIED.docx
 
IRJET - Sign Language Converter
IRJET -  	  Sign Language ConverterIRJET -  	  Sign Language Converter
IRJET - Sign Language Converter
 
Wearable technologies: what's brewing in the lab?
Wearable technologies: what's brewing in the lab?Wearable technologies: what's brewing in the lab?
Wearable technologies: what's brewing in the lab?
 
UCIAD overview
UCIAD overviewUCIAD overview
UCIAD overview
 
Subtitling & translation of weblectures by Carlos Turró Ribalta ...
Subtitling & translation of weblectures by Carlos Turró Ribalta              ...Subtitling & translation of weblectures by Carlos Turró Ribalta              ...
Subtitling & translation of weblectures by Carlos Turró Ribalta ...
 
Leveraging Open Standards to Build Highly Extensible Autonomous Systems
Leveraging Open Standards to Build Highly Extensible Autonomous SystemsLeveraging Open Standards to Build Highly Extensible Autonomous Systems
Leveraging Open Standards to Build Highly Extensible Autonomous Systems
 
Mobile user experience conference 2009 - The rise of the mobile context
Mobile user experience conference 2009 - The rise of the mobile contextMobile user experience conference 2009 - The rise of the mobile context
Mobile user experience conference 2009 - The rise of the mobile context
 
DT project.pdf
DT project.pdfDT project.pdf
DT project.pdf
 
Mid-term Review Meeting - WP1
Mid-term Review Meeting - WP1Mid-term Review Meeting - WP1
Mid-term Review Meeting - WP1
 
“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”“SKYE : Voice Based AI Desktop Assistant”
“SKYE : Voice Based AI Desktop Assistant”
 
Personal Voice Assistant using python.pptx
Personal Voice Assistant using python.pptxPersonal Voice Assistant using python.pptx
Personal Voice Assistant using python.pptx
 
introduction-to_mobile_computing 1
 introduction-to_mobile_computing 1 introduction-to_mobile_computing 1
introduction-to_mobile_computing 1
 
School updated
School updatedSchool updated
School updated
 

Dernier

Dernier (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

ALIAS WP3 Results

  • 1. WP 3 Presentation: Dialogue Manager Jürgen Geiger
  • 2. Overview • Goals • Achievements • Open Questions • List of Publications 04.06.2013 WP 3 Presentation 2
  • 3. Goals • Dialogue Manager – Back-end for HMI – Control all other modules • Applications: Games, Reading service, … • Physiological Monitoring 04.06.2013 WP 3 Presentation 3
  • 4. Tasks T3.1 User identification via speech or face recognition T3.2 Knowledge representation T3.3 Development of a dialogue system T3.4 Development and Integration of a game collection T3.5 Web 2.0 wrapper for web services T3.6 Integration of further software modules T3.7 Adaptable behaviour of the robot platform T3.8 Integration of natural language understanding T3.9 Physiological monitoring T3.10 Integration of the physiological monitoring into the dialogue manager 04.06.2013 WP 3 Presentation 4
  • 5. Deliverables 04.06.2013 WP 3 Presentation 5 Name Due D3.1 Report on the dialogue manager concept 09/2010 D3.2 Knowledge databases 04/2011 D3.3 Identification System (face & voice) 01/2011 D3.4 Prototype of dialogue manager 04/2011 D3.5 Physiological Monitoring (PM) 02/2011 D3.6 Dialogue system with integrated PM 06/2011 D3.7 Dialogue system updated to user‘s needs 05/2012 D3.8 Final dialogue system with integrated PM 02/2013
  • 6. Achievements • Dialogue Manager – Control of all other modules – Natural language understanding • Software modules – Physiological Monitoring – User Identification • Adaptable behaviour – Emotions • Physiological Monitoring 04.06.2013 WP 3 Presentation 6
  • 7. Dialogue Manager: Overview (T3.3, D3.1, D3.4, D3.7, D3.8) • Central component of the ALIAS robot („brain“) – Reproduces the basic mechanisms of human thinking – Decides on the behavior of the robot – Communicates with all other modules 04.06.2013 WP 3 Presentation 7 Hello Robot! TTS Face Detect ASR Robot Control GUI Touch Screen DM Core Situation Model Action Input CES Understanding Physio Monitor
  • 8. Dialogue Manager: Overview 04.06.2013 WP 3 Presentation 8 • Components – DM-Core („Brain“) • NLU-Engine understands human verbal messages • Decision-Engine decides on the behavior • Based on conceptual event representations (human thinking) – DM-Communicator • Communicates with sensing and acting modules • Translates between modules and DM-Core
  • 9. Natural Language Understanding • NLU-Engine (T3.8, D3.2) Based on Cognesys CES technology Extracts and processes the conceptual meaning of verbal messages Resistent to syntactically or grammatically degraded informations Uses knowledge and current situation to identify and check the practicability of identified statements • NLU-Knowledge Database (T3.2, D3.2) World knowledge: understands the world in general, simulates human memory Expert knowledge: understands the world of elderly people and depends on the robots functionality 04.06.2013 WP 3 Presentation 9
  • 10. Acting and Behavior (T3.8, D3.2) 04.06.2013 WP 3 Presentation 10 • Decision-Engine Based on Cognesys CES technology Processes conceptual event representations like humans do Uses a situation model like human memory Situation model • Represents the currently relevant objects and their states and modalities • Represents history of events that constitutes the current situation Proactive behavior • Example: inform the user about new mails, invites the user to stay in contact with its relatives
  • 11. Dialogue Management (T3.6) 04.06.2013 WP 3 Presentation 11 • ASR Adapter – Receives spoken user input as text – The NLU-Engine processes the text • GUI Adapter – Controls the GUI, processes user input • Menus • Games, TV, audio books, email … • Skype call and alarm call control flow – Synchronizes the GUI menus with BCI masks • BCI Adapter – Controls the Brain Computer Interface masks – Processes user inputs
  • 12. Dialogue Management 04.06.2013 WP 3 Presentation 12 • TTS Adapter – Sends text to be spoken to the Text-To-Speech module • RD Adapter – Interface to the robots low-level-controlers – Controls navigation and movement behavior – Controls the robots head emotions – Receives speaker ident information
  • 13. User identification: speech (T3.1, D3.3) • Research aspects – Speaker diarization – Overlap detection – Speech activity detection • Implementation for the robot 04.06.2013 WP 3 Presentation 13
  • 14. Research aspects • Speaker diarization – „Who speaks when?“ – Utilise the output of a speech transcription system to suppress linguistic variation • Overlap detection – Overlapping speech degrades performance – Detect & handle overlap • Voice activity detection 04.06.2013 WP 3 Presentation 14
  • 15. Speaker Recognition : Implementation • Integrated with DM • Running permanently • DM receives name of speaker • Used during TTS output – To call the user by his name 04.06.2013 WP 3 Presentation 15
  • 16. User Identification: Face (T3.1, D3.3) • Omnidirectional camera • Viola & Jones algorithm for face detection • Fusion with laser-based leg pair detection • Face identification using Eigenfaces • Keep eye contact with user 04.06.2013 WP 3 Presentation 16
  • 17. Gaming with Speech Control (T3.4, D3.8) • Control game via ASR • Noughts and crosses • AI to control computer player • Touchscreen control also possible 04.06.2013 WP 3 Presentation 17
  • 18. Reading Service (T3.5, D3.8) • Customised GUI • Based on open-source software • Functionality: – Read out e-books – Recognition from camera 04.06.2013 WP 3 Presentation 18
  • 19. Display of Emotions (T3.7, D3.8) • Can ALIAS display emotions? • 5 basic emotions (Disgust, Fear, Joy, Sadness, Surprise) • Integrated into Dialogue System 04.06.2013 WP 3 Presentation 19 Disgust Neutral Sadness
  • 20. Physiological Monitoring (T3.9, T3.10, D3.5, D3.6) • Vital function monitoring system • Recording, saving, display of vital function data – Manual data input – Data input directly by sensors • Alarm function for suspicious data values 04.06.2013 WP 3 Presentation 20
  • 21. Open questions • Personal data: storage and usage – Person ID, physiological monitoring – Who gets access? • Learning how to use the robot – Self-explanatory system – Systems adapts to the user • Tablet PC? 04.06.2013 WP 3 Presentation 21
  • 22. Selected Publications • J. Geiger, M. Hofmann, B.Schuller and G. Rigoll: "Gait-based Person Identification by Spectral, Cepstral and Energy- related Audio Features," ICASSP 2013 • J. Geiger, T. Leykauf, T. Rehrl, F. Wallhoff, G. Rigoll: "The Robot ALIAS as a Gaming Platform for Elderly Persons," AAL- Kongress 2013 • J. Geiger, I. Yenin, T. Rehrl, F. Wallhoff, G. Rigoll: "Display of Emotions with the Robotic Platform ALIAS", AAL-Kongress 2013 • T. Rehrl, J. Geiger, M. Golcar, S. Gentsch, J. Knobloch, G. Rigoll: "The Robot ALIAS as a Database for Health Monitoring for Elderly People," AAL-Kongress 2013 • T. Rehrl, R. Troncy, A. Bley, S. Ihsen, K. Scheibl, W. Schneider, S. Glende, S. Goetze, J. Kessler, C. Hintermueller, and F. Wallhoff: “The Ambient Adaptable Living Assistant is Meeting its Users,“ AAL-Forum 2012 • T. Rehrl, J. Blume, A. Bannat, G. Rigoll, and F. Wallhoff: “On-line Learning of Dynamic Gestures for Human-Robot Interaction,“ KI 2012 • J. Geiger, R. Vipperla, S. Bozonnet, N. Evans, B. Schuller, G. Rigoll: " Convolutive Non-Negative Sparse Coding and New Features for Speech Overlap Handling in Speaker Diarization", INTERSPEECH 2012 • R. Vipperla, J. Geiger, S. Bozonnet, D. Wang, N. Evans, B. Schuller, G. Rigoll: "Speech Overlap Detection and Attribution Using Convolutive Non-Negative Sparse Coding", ICASSP 2012 • J. Geiger, M. Lakhal, B. Schuller, and G. Rigoll: “Learning new acoustic events in an HMM-based system using MAP adaptation,“ INTERSPEECH 2011 • T. Rehrl, J. Blume, J. Geiger, A. Bannat, F. Wallhoff, S. Ihsen, Y. Jeanrenaud, M. Merten, B. Schönebeck, S. Glende, and C. Nedopil: “ALIAS: Der anpassungsfähige Ambient Living Assistent,“ AAL-Kongress 2011 04.06.2013 WP 3 Presentation 22