SlideShare une entreprise Scribd logo
1  sur  23
Télécharger pour lire hors ligne
Expressive Gestures for NAO

  Quoc Anh Le- Catherine Pelachaud
CNRS, LTCI, Telecom-ParisTech, France




     NAO TechDay, 13/06/2012, Paris
Motivation

    Importance of expressive
     gestures [Li et al, 2009]
    • Communicating messages
    • Expressing affective states

    Relation between gesture and
     speech [Kendon, 2004]
    • Two aspects of the same
       process of utterance
    • Complement and supplement

    Believability and life-likeness
    • Robot should communicate in
       a human-like way (emotion,
       persionality, etc) [Fong, 2003]

page 1    NAO TechDay 2012          Le Quoc Anh & Catherine Pelachaud
Objectives

          Generate     communicative gestures for Nao robot
           •    Integrated within an existing platform for virtual agent
           •    Nonverbal behaviors described symbolically
           •    Synchronization (gestures and speech)
           •    Expressivity of gestures

          GVLEX project (Gesture    & Voice for Expressive Reading)
           • Robot tells a story expressively.
           • Partners : LIMSI (linguistic aspects), Aldebaran (robotics),
             Acapela (speech synthesis), Telecom ParisTech
             (expressive gestures)
page 2         NAO TechDay 2012    Le Quoc Anh & Catherine Pelachaud
State of the art

            Several initiatives recently:
             • Salem et Kopp (2012): robot ASIMO, the virtual
                framework MAX, gesture description with MURML.

             • Aaron Holroyd et Charles Rich (2011): robot Melvin,
               motion scripts with BML, simple gestures, feedback to
               synchronize gestures and speech

             • Ng-Thow-Hing et al. (2010): robot ASIMO, gestures
               selection, synchronization between gestures and speech.

             • Nozawa et al. (2006): motion scripts with MPML-HP,
               robot HOAP-1

            Our system: Focus on expressivity and synchronization of
             gestures with speech using a common platform (SAIBA
             compliant [Kopp, 2006]) for Greta and for Nao

page 3         NAO TechDay 2012             Le Quoc Anh & Catherine Pelachaud
Our methodology

          Gesture describes with a     symbolic language (BML)

          Gestural expressivity (amplitude, fluidity, power,
             repetition, speed, stiffness,…)

          Elaboration of    gestures from a storytelling video corpus
             (Martin et al., 2009)

            Execution of the animation by translating into joint values



page 4        NAO TechDay 2012     Le Quoc Anh & Catherine Pelachaud
Problem and Solution

          Using a common framework to control both virtual and
           physical agents raises several problems:
           • Different degrees of freedom
           • Limit of movement space and speed
          Solution:
           • Use the same representation language
             - same algorithm for selecting and planning gestures
             - different algorithm for creating the animation
           • Elaborate one gesture repository for the robot and
             another one for the Greta agent
           • Gesture movement space and velocity specification


page 5      NAO TechDay 2012       Le Quoc Anh & Catherine Pelachaud
Steps

         1.     Build a library of gestures from a corpus of storytelling video: the gesture
                shapes should not be identical (between the human, virtual agent, robot)
                but they have to convey the same meaning.
                                                     GRETA
                                                     System

         2.     Use the GRETA system to generate gestures for Nao
               • Following the SAIBA framework
                  - Two representation languages: FML (Function Markup Language) and BML (Behavior
                    Markup Language)

                   - Three separated modules: plan communicative intents, select and plan gestures, and
                     realize gestures
                                                                            Behavior
                                                               BML
              Intent                 Behavior                               Realizer
Text          Planning       FML     Planning

                                                                BML          Behavior
                                                                             Realizer


page 6           NAO TechDay 2012              Le Quoc Anh & Catherine Pelachaud
Global diagram




         Nao Lexicon                                  Greta Lexicon



                                    LEXICONs


                 Gesture                       Synchronisation
                 Selection                     with AI speech




                 Planification of              Modification of
                 gesture                       gesture
                                               expressivity
                 duration                                          KEYFRAMES
FML                                  BML

page 7       NAO TechDay 2012              Le Quoc Anh & Catherine Pelachaud
Nao Lexicon




   Gesture annotations [Martin et al., 2009]
                                                                     Gesture Lexicon




                                                                   Gesture velocity specification
                                                                   (Minimal duration,Fitt’s Law)
                                                        Position                                                               …
                                                                       000            001            002            010
                                                       (fromto)
                                                                                                                               …
                                                         000            0         0.15:0.18388   0.25:0.28679   0.166:0.2270

                                                                                                                               …
                                                         001       0.15:0.18388        0         0.19:0.19552   0.147:0.2754

                                                                                                                               …
                                                         002       0.25:0.28679   0.19:0.19552        0         1.621;0.3501
Gesture space specification                                                                                                    …
                                                         010       0.166:0.2270   0.147:0.2754   1.621;0.3501        0

                                                                                                                               …
                                                          …            …              …              …              …




page 8       NAO TechDay 2012                  Le Quoc Anh & Catherine Pelachaud
Gesture Specification
            Gesture->Phases->Hands (wrist position, palm orientation, hand shape,...)
             [Kendon, 2004]
            Only stroke phases are specified. Other phases will be generated automatically
             by the system
             1. <gesture id="greeting" category="ICONIC" min_time="1.0“ hand="RIGHT">
             2. <phase type="STROKE-START" twohand="ASSYMMETRIC“ >
             3.       <hand side="RIGHT">
             4.            <vertical_location>YUpperPeriphery</vertical_location>
             5.            <horizontal_location>XPeriphery</horizontal_location>
             6.            <location_distance>ZNear</location_distance>
             7.            <hand_shape>OPEN</handshape>
             8.            <palm_orientation>AWAY</palm_orientation>
             9.       </hand>
             10. </phase>
             11. <phase type="STROKE-END" twohand="ASSYMMETRIC">
             12.        <hand side="RIGHT">
             13.           <vertical_location>YUpperPeriphery</vertical_location>
             14.           <horizontal_location>XExtremePeriphery</horizontal_location>
             15.           <location_distance>ZNear</location_distance>
             16.           <hand_shape>OPEN</handshape>
             17.           <palm_orientation>AWAY</palm_orientation>
             18.        </hand>
             19.</phase>
             20.</gesture>                   An example for a greeting gesture



page 9        NAO TechDay 2012                    Le Quoc Anh & Catherine Pelachaud
Synchronization of gestures with speech

         The stroke phase coincides or precedes emphasized words of the
          speech [McNeill, 1992]

         Gesture stroke phase timing specified by synch points




page 10    NAO TechDay 2012        Le Quoc Anh & Catherine Pelachaud
Synchronization of gestures with speech

           Algorithm
           •     Compute preparation phase
           •     Delete gesture if not enough time
           •     Add a hold phase to fit gesture planned duration
           •     Coarticulation between several gestures
                 - If enough time, retraction phase (ie go back to rest position)


                     Start               end   Start                   end

                 - Otherwise, go from end of stroke to preparation phase of next
                   gesture     S-start   S-end      S-start   S-end



                        Start
                                                                       end
page 11        NAO TechDay 2012          Le Quoc Anh & Catherine Pelachaud
Gestural Expressivity vs. Affective States

          A set of gesture dimensions [Hartmann, 2005]
           • Spatial Extent (SPC): Amplitude of gesture movement
           • Temporal Extent (TMP): Speed of gesture movement
           • Power (PWR): Acceleration of gesture movement
           • Fluidity (FLD): Smoothness and Continuity
           • Repetition (REP): Number of stroke phases in a gesture movement
           • Stiffness (STF): Tension/Flexibility

          Example [Mancini, 2008]

          Affective states     SPC         TMP            FLD             PWR
          Sadness              Low         Low            High            Low
          Happy                High        High           High            High
          Angrily              High        High           Low             High


page 12     NAO TechDay 2012          Le Quoc Anh & Catherine Pelachaud
Spatial Extent (SPC)

      A    real number in the interval [-1 .. 1]
          • Zero corresponds to a neutral behavior
          • -1 corresponds to small and contracted movements
          • 1 corresponds to wide and large movements

       Guarantee    the unchangeability of the meaning
          • Gesture (modifiable dimension, unmodifiable
            dimension)
          • Example: Negation (vertical position is fixed)



page 13    NAO TechDay 2012   Le Quoc Anh & Catherine Pelachaud
Temporal Extent (TMP)

      A    real number in the interval [-1 .. 1]
          • Zero corresponds to a neutral behavior
          • Slow if the value is negative
          • Fast if the value is positive




page 14    NAO TechDay 2012   Le Quoc Anh & Catherine Pelachaud
Power (PWR)

      A    real number in the interval [-1 .. 1]
          • Zero corresponds to a neutral behavior
          • Movements more powerful correspond to higher
            acceleration

       Affect    hand shape (close to open)
          • More relax/open if the value is negative
          • Fist corresponds to 1

       Affect       duration of stroke phase repetitions

page 15    NAO TechDay 2012     Le Quoc Anh & Catherine Pelachaud
Fluidity (FLD)

      A    real number in the interval [-1 .. 1]
          • Zero corresponds to a neutral behavior
          • Higher values allow smooth and continuous
            execution of movements
          • Lower values create discontinuity in the movements

       Not     yet implemented for Nao




page 16    NAO TechDay 2012   Le Quoc Anh & Catherine Pelachaud
Repetition (REP)

       Number  of stroke phase repeats in a gesture
          movement

       Tendency  to rhythmic repeats of specific
          movements

       Each   stroke coincides with a emphasized
          word/words of the speech




page 17    NAO TechDay 2012   Le Quoc Anh & Catherine Pelachaud
Animation Computation & Execution

       Schedule   and plan gestures phases
       Compute expressivity parameters
       Translate symbolic descriptions into joint values
       Execute animation
        • Send timed key-positions to the robot using
          available APIs
        • Animation is obtained by interpolating between joint
          values with robot built-in proprietary procedures.




page 18   NAO TechDay 2012   Le Quoc Anh & Catherine Pelachaud
Example
<bml>                                                        <lexicon>                                              <bml>
<speech id="s1" start="0.0“                                                                                         <speech id="s1" start="0.0“
                                                             <gesture id="hungry" category="BEAT" hand="BOTH">
vce=speaker=Antoine spd=180                                                                                     vce=speaker=Antoine spd=180
Et le troisième dit en colère: vce=speaker=AntoineLoud     <phase type="STROKE" twohand="SYMMETRIC">              Et le troisième dit tristement: vce=speaker=AntoineSad spd=90
spd=200                                                                                                           <tm id="tm1"/>J'ai très faim!
                                                             <hand side="RIGHT">
<tm id="tm1"/>J'ai très faim!                                                                                       </speech>
</speech>                                                    <vertical_location>YCenterCenter</vertical_location>   <gesture id=“hungry" start="s1:tm1" end=“start+1.5" stroke="0.5“>
<gesture id=“hungry" start="s1:tm1" end=“start+1.5"                                                                                           <PWR.value>-1.0</PWR.value>
stroke="0.5“>            <PWR.value>1.0</PWR.value>          <horizontal_location>XCenter</horizontal_location>                               <SPC.value>-0.3</SPC.value>
                         <SPC.value>0.6</SPC.value>          <location_distance>ZMiddle</location_distance>                                   <TMP.value>-0.2</TMP.value>
                         <TMP.value>0.2</TMP.value>                                                                                           <FLD.value>0</FLD.value>
                         <FLD.value>0</FLD.value>            <hand_shape>CLOSED</handshape>                                                   <STF.value>0</STF.value>
                         <STF.value>0</STF.value>                                                                                              <REP.value>0</REP.value>
                                                             <palm_orientation>INWARD</palm_orientation>
                          <REP.value>0</REP.value>
</gesture>                                                   </hand>                                                </gesture>
</bml>                                                                                                              </bml>
                                                             </phase>

                                                             </gesture>

                                                             </lexicon>




                                                           The same gesture prototype
Different expressivity                                                                                                           Different expressivity
     (i.e. Anger)                                                                                                                    (i.e. Sadness)




          page 19                  NAO TechDay 2012                                Le Quoc Anh & Catherine Pelachaud
Video demo: Nao tells a story




page 20   NAO TechDay 2012   Le Quoc Anh & Catherine Pelachaud
Conclusion

         Conclusion
          • A gesture model is designed, implemented for Nao while taking into
            account physical constraints of the robot.
          • Common platform for both virtual agent and robot
          • Expressivity model

         Future work
          • Create gestures with different emotional colour and personal style
          • Validate the model through perceptive evaluations




page 21    NAO TechDay 2012        Le Quoc Anh & Catherine Pelachaud
Acknowledgment

       This work has  been funded by the ANR GVLEX project
       It is supported from members of the laboratory TSI,
        Telecom-ParisTech




page 22   NAO TechDay 2012   Le Quoc Anh & Catherine Pelachaud

Contenu connexe

En vedette

Ryokan Performance Sheet
Ryokan Performance SheetRyokan Performance Sheet
Ryokan Performance SheetCsz Corporation
 
Journée Inter-GDR ISIS et Robotique: Interaction Homme-Robot
Journée Inter-GDR ISIS et Robotique: Interaction Homme-RobotJournée Inter-GDR ISIS et Robotique: Interaction Homme-Robot
Journée Inter-GDR ISIS et Robotique: Interaction Homme-RobotLê Anh
 
Meet RLWdesign 20122011
Meet RLWdesign 20122011Meet RLWdesign 20122011
Meet RLWdesign 20122011robwhittaker
 
Wyniki badania
Wyniki badania Wyniki badania
Wyniki badania dragonzo86
 
STAR Vietnam Presentation
STAR Vietnam PresentationSTAR Vietnam Presentation
STAR Vietnam Presentationnhatquang2506
 

En vedette (6)

Ryokan Performance Sheet
Ryokan Performance SheetRyokan Performance Sheet
Ryokan Performance Sheet
 
Journée Inter-GDR ISIS et Robotique: Interaction Homme-Robot
Journée Inter-GDR ISIS et Robotique: Interaction Homme-RobotJournée Inter-GDR ISIS et Robotique: Interaction Homme-Robot
Journée Inter-GDR ISIS et Robotique: Interaction Homme-Robot
 
Meet RLWdesign 20122011
Meet RLWdesign 20122011Meet RLWdesign 20122011
Meet RLWdesign 20122011
 
Fathers Advice to Son
Fathers Advice to SonFathers Advice to Son
Fathers Advice to Son
 
Wyniki badania
Wyniki badania Wyniki badania
Wyniki badania
 
STAR Vietnam Presentation
STAR Vietnam PresentationSTAR Vietnam Presentation
STAR Vietnam Presentation
 

Similaire à Nao Tech Day

IEEE Humanoids 2011
IEEE Humanoids 2011IEEE Humanoids 2011
IEEE Humanoids 2011Lê Anh
 
SiriusCon 2015 - Breathe Life into Your Designer!
SiriusCon 2015 - Breathe Life into Your Designer!SiriusCon 2015 - Breathe Life into Your Designer!
SiriusCon 2015 - Breathe Life into Your Designer!melbats
 
ACM ICMI Workshop 2012
ACM ICMI Workshop 2012ACM ICMI Workshop 2012
ACM ICMI Workshop 2012Lê Anh
 
EclipseCon Eu 2015 - Breathe life into your Designer!
EclipseCon Eu 2015 - Breathe life into your Designer!EclipseCon Eu 2015 - Breathe life into your Designer!
EclipseCon Eu 2015 - Breathe life into your Designer!melbats
 
Pharo: A Reflective System
Pharo: A Reflective SystemPharo: A Reflective System
Pharo: A Reflective SystemPharo
 
Keynote- SOA & Beyond : Future Computing
Keynote- SOA & Beyond : Future ComputingKeynote- SOA & Beyond : Future Computing
Keynote- SOA & Beyond : Future ComputingWSO2
 
Breathe Life Into Your IDE
Breathe Life Into Your IDEBreathe Life Into Your IDE
Breathe Life Into Your IDEBenoit Combemale
 
Functional solid
Functional solidFunctional solid
Functional solidMatt Stine
 
Affective Computing and Intelligent Interaction (ACII 2011)
Affective Computing and Intelligent Interaction (ACII 2011)Affective Computing and Intelligent Interaction (ACII 2011)
Affective Computing and Intelligent Interaction (ACII 2011)Lê Anh
 
Dynamic Synthesis of Mediators to Support Interoperability in Autonomic Systems
Dynamic Synthesis of Mediators to Support Interoperability in Autonomic SystemsDynamic Synthesis of Mediators to Support Interoperability in Autonomic Systems
Dynamic Synthesis of Mediators to Support Interoperability in Autonomic SystemsAmel Bennaceur
 
Itinerary Website (Web Development Document)
Itinerary Website (Web Development Document)Itinerary Website (Web Development Document)
Itinerary Website (Web Development Document)Traitet Thepbandansuk
 
Why MOVO? Insights into the new beta community.
Why MOVO? Insights into the new beta community.Why MOVO? Insights into the new beta community.
Why MOVO? Insights into the new beta community.Marie-Pier Faucher
 
Event-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesEvent-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesIstvan Rath
 
Semantically-aware Networks and Services for Training and Knowledge Managemen...
Semantically-aware Networks and Services for Training and Knowledge Managemen...Semantically-aware Networks and Services for Training and Knowledge Managemen...
Semantically-aware Networks and Services for Training and Knowledge Managemen...Gilbert Paquette
 
ACII 2011, USA
ACII 2011, USAACII 2011, USA
ACII 2011, USALê Anh
 
Lecture Notes in Computer Science (LNCS)
Lecture Notes in Computer Science (LNCS)Lecture Notes in Computer Science (LNCS)
Lecture Notes in Computer Science (LNCS)Lê Anh
 
Agile Business Analysis - The Key to Effective Requirements on Agile Projects
Agile Business Analysis - The Key to Effective Requirements on Agile ProjectsAgile Business Analysis - The Key to Effective Requirements on Agile Projects
Agile Business Analysis - The Key to Effective Requirements on Agile ProjectsLilian De Munno
 
Layarintroductionfordevelopers 110308080829-phpapp02
Layarintroductionfordevelopers 110308080829-phpapp02Layarintroductionfordevelopers 110308080829-phpapp02
Layarintroductionfordevelopers 110308080829-phpapp02Sami Hamri
 

Similaire à Nao Tech Day (20)

IEEE Humanoids 2011
IEEE Humanoids 2011IEEE Humanoids 2011
IEEE Humanoids 2011
 
SiriusCon 2015 - Breathe Life into Your Designer!
SiriusCon 2015 - Breathe Life into Your Designer!SiriusCon 2015 - Breathe Life into Your Designer!
SiriusCon 2015 - Breathe Life into Your Designer!
 
ACM ICMI Workshop 2012
ACM ICMI Workshop 2012ACM ICMI Workshop 2012
ACM ICMI Workshop 2012
 
EclipseCon Eu 2015 - Breathe life into your Designer!
EclipseCon Eu 2015 - Breathe life into your Designer!EclipseCon Eu 2015 - Breathe life into your Designer!
EclipseCon Eu 2015 - Breathe life into your Designer!
 
Pharo: A Reflective System
Pharo: A Reflective SystemPharo: A Reflective System
Pharo: A Reflective System
 
Keynote- SOA & Beyond : Future Computing
Keynote- SOA & Beyond : Future ComputingKeynote- SOA & Beyond : Future Computing
Keynote- SOA & Beyond : Future Computing
 
Breathe Life Into Your IDE
Breathe Life Into Your IDEBreathe Life Into Your IDE
Breathe Life Into Your IDE
 
Functional solid
Functional solidFunctional solid
Functional solid
 
Affective Computing and Intelligent Interaction (ACII 2011)
Affective Computing and Intelligent Interaction (ACII 2011)Affective Computing and Intelligent Interaction (ACII 2011)
Affective Computing and Intelligent Interaction (ACII 2011)
 
Dynamic Synthesis of Mediators to Support Interoperability in Autonomic Systems
Dynamic Synthesis of Mediators to Support Interoperability in Autonomic SystemsDynamic Synthesis of Mediators to Support Interoperability in Autonomic Systems
Dynamic Synthesis of Mediators to Support Interoperability in Autonomic Systems
 
Itinerary Website (Web Development Document)
Itinerary Website (Web Development Document)Itinerary Website (Web Development Document)
Itinerary Website (Web Development Document)
 
Why MOVO? Insights into the new beta community.
Why MOVO? Insights into the new beta community.Why MOVO? Insights into the new beta community.
Why MOVO? Insights into the new beta community.
 
Event-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling LanguagesEvent-driven Model Transformations in Domain-specific Modeling Languages
Event-driven Model Transformations in Domain-specific Modeling Languages
 
Semantically-aware Networks and Services for Training and Knowledge Managemen...
Semantically-aware Networks and Services for Training and Knowledge Managemen...Semantically-aware Networks and Services for Training and Knowledge Managemen...
Semantically-aware Networks and Services for Training and Knowledge Managemen...
 
ACII 2011, USA
ACII 2011, USAACII 2011, USA
ACII 2011, USA
 
Lecture Notes in Computer Science (LNCS)
Lecture Notes in Computer Science (LNCS)Lecture Notes in Computer Science (LNCS)
Lecture Notes in Computer Science (LNCS)
 
Agile Business Analysis - The Key to Effective Requirements on Agile Projects
Agile Business Analysis - The Key to Effective Requirements on Agile ProjectsAgile Business Analysis - The Key to Effective Requirements on Agile Projects
Agile Business Analysis - The Key to Effective Requirements on Agile Projects
 
Let's talk about... Microservices
Let's talk about... MicroservicesLet's talk about... Microservices
Let's talk about... Microservices
 
My Robot
My RobotMy Robot
My Robot
 
Layarintroductionfordevelopers 110308080829-phpapp02
Layarintroductionfordevelopers 110308080829-phpapp02Layarintroductionfordevelopers 110308080829-phpapp02
Layarintroductionfordevelopers 110308080829-phpapp02
 

Plus de Lê Anh

Spark docker
Spark dockerSpark docker
Spark dockerLê Anh
 
Presentation des outils traitements distribues
Presentation des outils traitements distribuesPresentation des outils traitements distribues
Presentation des outils traitements distribuesLê Anh
 
Automatic vs. human question answering over multimedia meeting recordings
Automatic vs. human question answering over multimedia meeting recordingsAutomatic vs. human question answering over multimedia meeting recordings
Automatic vs. human question answering over multimedia meeting recordingsLê Anh
 
Lap trinh java hieu qua
Lap trinh java hieu quaLap trinh java hieu qua
Lap trinh java hieu quaLê Anh
 
Cahier de charges
Cahier de chargesCahier de charges
Cahier de chargesLê Anh
 
Final report. nguyen ngoc anh.01.07.2013
Final report. nguyen ngoc anh.01.07.2013Final report. nguyen ngoc anh.01.07.2013
Final report. nguyen ngoc anh.01.07.2013Lê Anh
 
Lequocanh
LequocanhLequocanh
LequocanhLê Anh
 
These lequocanh v7
These lequocanh v7These lequocanh v7
These lequocanh v7Lê Anh
 
Applying Computer Vision to Traffic Monitoring System in Vietnam
Applying Computer Vision to Traffic Monitoring System in Vietnam Applying Computer Vision to Traffic Monitoring System in Vietnam
Applying Computer Vision to Traffic Monitoring System in Vietnam Lê Anh
 
Poster WACAI 2012
Poster WACAI 2012Poster WACAI 2012
Poster WACAI 2012Lê Anh
 
ICMI 2012 Workshop on gesture and speech production
ICMI 2012 Workshop on gesture and speech productionICMI 2012 Workshop on gesture and speech production
ICMI 2012 Workshop on gesture and speech productionLê Anh
 
Mid-term thesis report
Mid-term thesis reportMid-term thesis report
Mid-term thesis reportLê Anh
 
Người Ảo
Người ẢoNgười Ảo
Người ẢoLê Anh
 

Plus de Lê Anh (13)

Spark docker
Spark dockerSpark docker
Spark docker
 
Presentation des outils traitements distribues
Presentation des outils traitements distribuesPresentation des outils traitements distribues
Presentation des outils traitements distribues
 
Automatic vs. human question answering over multimedia meeting recordings
Automatic vs. human question answering over multimedia meeting recordingsAutomatic vs. human question answering over multimedia meeting recordings
Automatic vs. human question answering over multimedia meeting recordings
 
Lap trinh java hieu qua
Lap trinh java hieu quaLap trinh java hieu qua
Lap trinh java hieu qua
 
Cahier de charges
Cahier de chargesCahier de charges
Cahier de charges
 
Final report. nguyen ngoc anh.01.07.2013
Final report. nguyen ngoc anh.01.07.2013Final report. nguyen ngoc anh.01.07.2013
Final report. nguyen ngoc anh.01.07.2013
 
Lequocanh
LequocanhLequocanh
Lequocanh
 
These lequocanh v7
These lequocanh v7These lequocanh v7
These lequocanh v7
 
Applying Computer Vision to Traffic Monitoring System in Vietnam
Applying Computer Vision to Traffic Monitoring System in Vietnam Applying Computer Vision to Traffic Monitoring System in Vietnam
Applying Computer Vision to Traffic Monitoring System in Vietnam
 
Poster WACAI 2012
Poster WACAI 2012Poster WACAI 2012
Poster WACAI 2012
 
ICMI 2012 Workshop on gesture and speech production
ICMI 2012 Workshop on gesture and speech productionICMI 2012 Workshop on gesture and speech production
ICMI 2012 Workshop on gesture and speech production
 
Mid-term thesis report
Mid-term thesis reportMid-term thesis report
Mid-term thesis report
 
Người Ảo
Người ẢoNgười Ảo
Người Ảo
 

Dernier

HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 

Dernier (20)

HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Nao Tech Day

  • 1. Expressive Gestures for NAO Quoc Anh Le- Catherine Pelachaud CNRS, LTCI, Telecom-ParisTech, France NAO TechDay, 13/06/2012, Paris
  • 2. Motivation  Importance of expressive gestures [Li et al, 2009] • Communicating messages • Expressing affective states  Relation between gesture and speech [Kendon, 2004] • Two aspects of the same process of utterance • Complement and supplement  Believability and life-likeness • Robot should communicate in a human-like way (emotion, persionality, etc) [Fong, 2003] page 1 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 3. Objectives  Generate communicative gestures for Nao robot • Integrated within an existing platform for virtual agent • Nonverbal behaviors described symbolically • Synchronization (gestures and speech) • Expressivity of gestures  GVLEX project (Gesture & Voice for Expressive Reading) • Robot tells a story expressively. • Partners : LIMSI (linguistic aspects), Aldebaran (robotics), Acapela (speech synthesis), Telecom ParisTech (expressive gestures) page 2 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 4. State of the art  Several initiatives recently: • Salem et Kopp (2012): robot ASIMO, the virtual framework MAX, gesture description with MURML. • Aaron Holroyd et Charles Rich (2011): robot Melvin, motion scripts with BML, simple gestures, feedback to synchronize gestures and speech • Ng-Thow-Hing et al. (2010): robot ASIMO, gestures selection, synchronization between gestures and speech. • Nozawa et al. (2006): motion scripts with MPML-HP, robot HOAP-1  Our system: Focus on expressivity and synchronization of gestures with speech using a common platform (SAIBA compliant [Kopp, 2006]) for Greta and for Nao page 3 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 5. Our methodology  Gesture describes with a symbolic language (BML)  Gestural expressivity (amplitude, fluidity, power, repetition, speed, stiffness,…)  Elaboration of gestures from a storytelling video corpus (Martin et al., 2009)  Execution of the animation by translating into joint values page 4 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 6. Problem and Solution  Using a common framework to control both virtual and physical agents raises several problems: • Different degrees of freedom • Limit of movement space and speed  Solution: • Use the same representation language - same algorithm for selecting and planning gestures - different algorithm for creating the animation • Elaborate one gesture repository for the robot and another one for the Greta agent • Gesture movement space and velocity specification page 5 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 7. Steps 1. Build a library of gestures from a corpus of storytelling video: the gesture shapes should not be identical (between the human, virtual agent, robot) but they have to convey the same meaning. GRETA System 2. Use the GRETA system to generate gestures for Nao • Following the SAIBA framework - Two representation languages: FML (Function Markup Language) and BML (Behavior Markup Language) - Three separated modules: plan communicative intents, select and plan gestures, and realize gestures Behavior BML Intent Behavior Realizer Text Planning FML Planning BML Behavior Realizer page 6 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 8. Global diagram Nao Lexicon Greta Lexicon LEXICONs Gesture Synchronisation Selection with AI speech Planification of Modification of gesture gesture expressivity duration KEYFRAMES FML BML page 7 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 9. Nao Lexicon Gesture annotations [Martin et al., 2009] Gesture Lexicon Gesture velocity specification (Minimal duration,Fitt’s Law) Position … 000 001 002 010 (fromto) … 000 0 0.15:0.18388 0.25:0.28679 0.166:0.2270 … 001 0.15:0.18388 0 0.19:0.19552 0.147:0.2754 … 002 0.25:0.28679 0.19:0.19552 0 1.621;0.3501 Gesture space specification … 010 0.166:0.2270 0.147:0.2754 1.621;0.3501 0 … … … … … … page 8 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 10. Gesture Specification  Gesture->Phases->Hands (wrist position, palm orientation, hand shape,...) [Kendon, 2004]  Only stroke phases are specified. Other phases will be generated automatically by the system 1. <gesture id="greeting" category="ICONIC" min_time="1.0“ hand="RIGHT"> 2. <phase type="STROKE-START" twohand="ASSYMMETRIC“ > 3. <hand side="RIGHT"> 4. <vertical_location>YUpperPeriphery</vertical_location> 5. <horizontal_location>XPeriphery</horizontal_location> 6. <location_distance>ZNear</location_distance> 7. <hand_shape>OPEN</handshape> 8. <palm_orientation>AWAY</palm_orientation> 9. </hand> 10. </phase> 11. <phase type="STROKE-END" twohand="ASSYMMETRIC"> 12. <hand side="RIGHT"> 13. <vertical_location>YUpperPeriphery</vertical_location> 14. <horizontal_location>XExtremePeriphery</horizontal_location> 15. <location_distance>ZNear</location_distance> 16. <hand_shape>OPEN</handshape> 17. <palm_orientation>AWAY</palm_orientation> 18. </hand> 19.</phase> 20.</gesture> An example for a greeting gesture page 9 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 11. Synchronization of gestures with speech  The stroke phase coincides or precedes emphasized words of the speech [McNeill, 1992]  Gesture stroke phase timing specified by synch points page 10 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 12. Synchronization of gestures with speech  Algorithm • Compute preparation phase • Delete gesture if not enough time • Add a hold phase to fit gesture planned duration • Coarticulation between several gestures - If enough time, retraction phase (ie go back to rest position) Start end Start end - Otherwise, go from end of stroke to preparation phase of next gesture S-start S-end S-start S-end Start end page 11 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 13. Gestural Expressivity vs. Affective States  A set of gesture dimensions [Hartmann, 2005] • Spatial Extent (SPC): Amplitude of gesture movement • Temporal Extent (TMP): Speed of gesture movement • Power (PWR): Acceleration of gesture movement • Fluidity (FLD): Smoothness and Continuity • Repetition (REP): Number of stroke phases in a gesture movement • Stiffness (STF): Tension/Flexibility  Example [Mancini, 2008] Affective states SPC TMP FLD PWR Sadness Low Low High Low Happy High High High High Angrily High High Low High page 12 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 14. Spatial Extent (SPC) A real number in the interval [-1 .. 1] • Zero corresponds to a neutral behavior • -1 corresponds to small and contracted movements • 1 corresponds to wide and large movements  Guarantee the unchangeability of the meaning • Gesture (modifiable dimension, unmodifiable dimension) • Example: Negation (vertical position is fixed) page 13 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 15. Temporal Extent (TMP) A real number in the interval [-1 .. 1] • Zero corresponds to a neutral behavior • Slow if the value is negative • Fast if the value is positive page 14 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 16. Power (PWR) A real number in the interval [-1 .. 1] • Zero corresponds to a neutral behavior • Movements more powerful correspond to higher acceleration  Affect hand shape (close to open) • More relax/open if the value is negative • Fist corresponds to 1  Affect duration of stroke phase repetitions page 15 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 17. Fluidity (FLD) A real number in the interval [-1 .. 1] • Zero corresponds to a neutral behavior • Higher values allow smooth and continuous execution of movements • Lower values create discontinuity in the movements  Not yet implemented for Nao page 16 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 18. Repetition (REP)  Number of stroke phase repeats in a gesture movement  Tendency to rhythmic repeats of specific movements  Each stroke coincides with a emphasized word/words of the speech page 17 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 19. Animation Computation & Execution  Schedule and plan gestures phases  Compute expressivity parameters  Translate symbolic descriptions into joint values  Execute animation • Send timed key-positions to the robot using available APIs • Animation is obtained by interpolating between joint values with robot built-in proprietary procedures. page 18 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 20. Example <bml> <lexicon> <bml> <speech id="s1" start="0.0“ <speech id="s1" start="0.0“ <gesture id="hungry" category="BEAT" hand="BOTH"> vce=speaker=Antoine spd=180 vce=speaker=Antoine spd=180 Et le troisième dit en colère: vce=speaker=AntoineLoud <phase type="STROKE" twohand="SYMMETRIC"> Et le troisième dit tristement: vce=speaker=AntoineSad spd=90 spd=200 <tm id="tm1"/>J'ai très faim! <hand side="RIGHT"> <tm id="tm1"/>J'ai très faim! </speech> </speech> <vertical_location>YCenterCenter</vertical_location> <gesture id=“hungry" start="s1:tm1" end=“start+1.5" stroke="0.5“> <gesture id=“hungry" start="s1:tm1" end=“start+1.5" <PWR.value>-1.0</PWR.value> stroke="0.5“> <PWR.value>1.0</PWR.value> <horizontal_location>XCenter</horizontal_location> <SPC.value>-0.3</SPC.value> <SPC.value>0.6</SPC.value> <location_distance>ZMiddle</location_distance> <TMP.value>-0.2</TMP.value> <TMP.value>0.2</TMP.value> <FLD.value>0</FLD.value> <FLD.value>0</FLD.value> <hand_shape>CLOSED</handshape> <STF.value>0</STF.value> <STF.value>0</STF.value> <REP.value>0</REP.value> <palm_orientation>INWARD</palm_orientation> <REP.value>0</REP.value> </gesture> </hand> </gesture> </bml> </bml> </phase> </gesture> </lexicon> The same gesture prototype Different expressivity Different expressivity (i.e. Anger) (i.e. Sadness) page 19 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 21. Video demo: Nao tells a story page 20 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 22. Conclusion  Conclusion • A gesture model is designed, implemented for Nao while taking into account physical constraints of the robot. • Common platform for both virtual agent and robot • Expressivity model  Future work • Create gestures with different emotional colour and personal style • Validate the model through perceptive evaluations page 21 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud
  • 23. Acknowledgment  This work has been funded by the ANR GVLEX project  It is supported from members of the laboratory TSI, Telecom-ParisTech page 22 NAO TechDay 2012 Le Quoc Anh & Catherine Pelachaud