SlideShare une entreprise Scribd logo
1  sur  6
Télécharger pour lire hors ligne
WHITE PAPER




Understanding Voice User Interface
Design




200 - 750 Palladium Dr, Ottawa, Ontario, K2V 1C7, Canada   Tel +1.613.271.8989   Fax +1.613.271.8388   www.pronexus.com
WHITE PAPER




Table of Contents

User                                 3
Task                                 3
Cognitive Load                       4
Persona                              4
Natural Language (Prompting)         4
Natural Language (Understanding)     4
Effective Error Recovery             4
Pronexus Speech and IVR Consulting   5
Summary                              6




                                         2
WHITE PAPER




Controlling a machine by simply talking to it was science fiction only a short time ago. Until recently, this area was
considered to be artificial intelligence. However, with advances in technology, Voice User Interfaces (VUI) have be-
come more commonplace, and people are taking advantage of the value that these hands-free, eyes-free interfaces
provide in many situations.

Creating intuitive and usable VUIs involves coordinating a large number of research-based best practices, with the
unique needs of each project. This white paper explores the following factors that influence VUI design:

 •    User
 •    Task
 •    Cognitive load
 •    Persona
 •    Natural language (prompting)
 •    Natural language (understanding)
 •    Effective error recovery.

User                                                          Task
A VUI should be designed according to user                    It is important to understand exactly what the goal is for
characteristics. Will there be a lot of new one-time users,   a system (e.g. to place a phone call, buy a plane ticket,
e.g., callers to a company? Or will there be mainly           or update your work schedule) and to determine the
frequent users, who interact with the system many times       most efficient and intuitive way to get it done. You need
a day, as in the case of a messaging system?                  to ask questions like:

For novice users, a system has to provide appropriate           •   How does the typical person do this task?
instructions and anticipate their questions well enough
                                                                •   Is there a way to do it in fewer steps?
that they get comfortably to their goal. Prompts for
ordering pizza might include “how many pizzas”; “what           •   How do most people picture this task, i.e., their
size” and “what toppings do you want”.                              “mental model” of the task?

A system that will have frequent expert users won’t need      By carefully mapping out these factors, a VUI can work
as much instruction, can use more complex commands,           with the way people typically approach a problem, rather
and must get tasks done in a minimum number of steps.         than working against it.
For example, an advanced user command might be:
“I want to order one large pepperoni pizza.” This
command gets everything done in a single step.

Ideally, a VUI should provide enough instructions for
novices, while not holding back experts who already know
what they want.
                                                                                                                           3
WHITE PAPER




Cognitive Load
Unlike vision, where your eyes can easily scan a scene       the transaction moving forward, as well as keep the user
repeatedly, speech comes in as a single stream of sound.     interested. For example, make it clear when the user
You must rely on your memory to know what happened a         should respond, and don’t keep them waiting with huge
few seconds, or a few minutes, in the past. This means       chunks of text. Also, restricting or widening the
that VUIs must carefully manage how much pressure is         vocabulary in the system prompts can encourage or
put on the user’s memory. Keeping this “cognitive load” to   discourage the user to do the same, e.g., always using
a reasonable level is crucial in making a VUI easy to use.   exact terms versus allowing synonyms for the same
Giving the user appropriate amounts of feedback, as well     concepts; open-ended versus closed-ended questions.
as splitting up a complex task into an intuitive sequence
of smaller interactions – taking advantage of that “mental
model” – can go a long way to reduce the cognitive load.     Natural Language (Understanding)
                                                             This is where most of the magic happens. Everything up
                                                             to this point helps to make a system easier for the user to
Persona
                                                             understand, and to help the user make appropriate
“Persona” refers to the personality that the system          responses. Anticipating all the different thing a user could
portrays by the choice of voice actor, the tone of the       say is the job of a well designed grammar. The grammar
wording in the prompts, and way the system interacts with    determines what the machine can ‘understand’, and then
the user. These choices depend on the target user group      act upon. An extremely limited grammar may accept only
and the nature of the company hosting the system.            the exact words “yes” and “no”, while a more complex
For example, a restaurant with a beach theme many want       grammar might also correctly accept “yes please”,
a loud “surfer dude” persona for a system taking             “uuuuhh…yup”, or “yeah, baby!” An even more complex
reservations, but a stock brokerage would likely prefer      grammar may also accept “I don’t know”, and then follow
a more quietly polite and professional persona. For many     a different set of prompts to help out the user who needs
applications, a more neutral polite/professional persona     more assistance.
is the best choice.


                                                             Effective Error Recovery
Natural Language (Prompting)                                 Unfortunately, some errors or misunderstandings can oc-
                                                             cur, just as they do in regular conversation. By anticipat-
While most VUIs are not like a conversation with another     ing the most common errors, and giving the user effective
person, by making use of certain aspects of conversation,    and escalating instructions and feedback, the VUI can get
the interaction can be much smoother and more natural        back them back on track as smoothly as possible.
for the user. Appropriate pacing and turn taking can keep
                                                                                                                            4
WHITE PAPER




Examples                                                     Pronexus Speech and IVR Consulting
                                                             The technical side of speech recognition has matured to a
If a user remains silent, apparently not knowing what to
                                                             point where correctly capturing spoken input is very
do, after a short interval the system could give more
                                                             reliable. Now a major challenge is creating intuitive and
detailed instructions:
                                                             efficient VUI to properly harness this technology as a tool
System:        “Please enter your PIN.                       to get real work done in the real world.
User:          <silence>
                                                             Developing an effective VUI involves a number of
System:         “Please enter the four digit personal        activities that a typical software developer may not have
                 identification number that you chose        the time or human-factors expertise to tackle.
                 earlier.”
If a user says something that is not in the system           This is where Pronexus VUI consulting services can help
grammars, leading to a recognition error, the system         you succeed. We can design a new VUI from the ground
should let the user know what kind of input is required:     up, edit and optimize an existing voice application, or
                                                             even adapt an IVR (touch-tone) system to migrate it
System:        “Who do you want to speak to?”                effectively to get all the advantages of a VUI.
User:          “Bob.”
                                                             Pronexus VUI consulting services include:
System:        “Sorry, I couldn’t find that name. Please
                say the first and last name of the person    1. VUI design:
                you want to call.”
                                                                 • Task analysis/requirements gathering
User:          “Bob Jones.”
                                                                 • Interpret requirements into sample dialogs for
                                                                    demonstration (user centered design approach)
The idea is to give the user the appropriate level of
                                                                 • Call flow and dialog design (typically as a Visio®
instructions to keep them moving toward their goal, and
                                                                    and Word® document)
in as natural a manner as possible. In many cases, the
user would not be aware that an error occurred, and will         • Produce a list of prompts (have them profession-
                                                                    ally recorded if desired)
just hear the next prompt in a logical sequence. If things
are clearly not progressing, the VUI can even offer to           • Develop and tune grammars
transfer the user to a live agent before the user gets too
frustrated.
                                                             2. VUI review:

                                                                 • Analyze an existing application's call flow and
                                                                    dialogs for inconsistencies or weaknesses

                                                                 • Usability assessment, to point out likely problems

                                                                                                                         5
WHITE PAPER




    • Produce a report of suggested changes to                              Summary
        optimize VUI (typically as a Visio and Word docu-
                                                                            Some would say that a great VUI is one that people
        ment)
                                                                            don’t remember using – it seamlessly got the job done.
                                                                            A good VUI can help overcome user aversion to
3. Tuning services:                                                         technology because there are no new skills to learn --
                                                                            people already understand how to make a phone call.
    • Analyze an existing application's dialogs and                         Users can perform transactions quickly and easily, as
        grammars                                                            though they had been speaking to a knowledgeable
    • Re-evaluate an existing call flow's success in                        person. When care is taken designing a VUI, both the
        achieving its goal                                                  user and the organization implementing the automation
    • Optimize recognition parameters                                       win: the use has more control over conducting business
                                                                            at their own convenience, while the organization benefits
                                                                            from improved productivity and enhanced service
Pronexus can provide individual services, or can produce                    delivery. With today’s high recognition rates, VUIs have
an entire application.
                                                                            moved from the realm of science fiction, to providing
                                                                            reliable tools that are ready for use right now.




Established in 1994, Pronexus specializes in telephony, speech and database integration technologies. Our flagship product VBVoice IVR software
enables developing feature-rich inbound and outbound IVR solutions that improve customer service and streamline internal processes. Thousands
of companies all over the world use VBVoice in applications varying from auto-attendants to automated payments, fax applications, notifications, polls
and surveys. Part of our business model is to provide customers flexibility to choose between developing an IVR application in house, leveraging
Pronexus’ Professional Services or buying a turn-key IVR application from one of our partners who have integrated VBVoice into their solutions.



200 - 750 Palladium Dr, Ottawa, Ontario, K2V 1C7, Canada           Tel +1.613.271.8989            Fax +1.613.271.8388           www.pronexus.com

Contenu connexe

En vedette

How Natural User Interfaces are changing Human Computer Interaction
How Natural User Interfaces are changing Human Computer InteractionHow Natural User Interfaces are changing Human Computer Interaction
How Natural User Interfaces are changing Human Computer Interaction
Marco Silva
 

En vedette (19)

Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world
 Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world
Evangelizing and Designing Voice User Interface: Adopting VUI in a GUI world
 
The Human Interface
The Human InterfaceThe Human Interface
The Human Interface
 
User Interface
User InterfaceUser Interface
User Interface
 
Voice Design Coaching programa Voice Design Branding
Voice Design Coaching   programa Voice Design BrandingVoice Design Coaching   programa Voice Design Branding
Voice Design Coaching programa Voice Design Branding
 
Touch first and touch free designs
Touch first and touch free designsTouch first and touch free designs
Touch first and touch free designs
 
Natural User Interface Demo based on - 3D Brick Game using Kinect
Natural User Interface Demo based on - 3D Brick Game using KinectNatural User Interface Demo based on - 3D Brick Game using Kinect
Natural User Interface Demo based on - 3D Brick Game using Kinect
 
Designing Voice-Driven Game Experiences | Dave Isbitski
Designing Voice-Driven Game Experiences | Dave IsbitskiDesigning Voice-Driven Game Experiences | Dave Isbitski
Designing Voice-Driven Game Experiences | Dave Isbitski
 
NUI - Natural User Interfaces: nur ein Hype oder die nächste Stufe der Intera...
NUI - Natural User Interfaces: nur ein Hype oder die nächste Stufe der Intera...NUI - Natural User Interfaces: nur ein Hype oder die nächste Stufe der Intera...
NUI - Natural User Interfaces: nur ein Hype oder die nächste Stufe der Intera...
 
How Natural User Interfaces are changing Human Computer Interaction
How Natural User Interfaces are changing Human Computer InteractionHow Natural User Interfaces are changing Human Computer Interaction
How Natural User Interfaces are changing Human Computer Interaction
 
Ride the Wave of Conversational UX
Ride the Wave of Conversational UXRide the Wave of Conversational UX
Ride the Wave of Conversational UX
 
UX Berlin about Movie Her and the future of Voice Interfaces
UX Berlin about Movie Her and the future of Voice InterfacesUX Berlin about Movie Her and the future of Voice Interfaces
UX Berlin about Movie Her and the future of Voice Interfaces
 
Natural User Interface Workshop
Natural User Interface WorkshopNatural User Interface Workshop
Natural User Interface Workshop
 
Voice Driven Type Design
Voice Driven Type DesignVoice Driven Type Design
Voice Driven Type Design
 
NUI WARS: Kinect vs Structure vs Leap vs RealSense vs Myo
NUI WARS: Kinect vs Structure vs Leap vs RealSense vs MyoNUI WARS: Kinect vs Structure vs Leap vs RealSense vs Myo
NUI WARS: Kinect vs Structure vs Leap vs RealSense vs Myo
 
Future voice experience 2024
Future voice experience 2024Future voice experience 2024
Future voice experience 2024
 
Natural user interface
Natural user interfaceNatural user interface
Natural user interface
 
Conversational apps UX best practices
Conversational apps UX best practicesConversational apps UX best practices
Conversational apps UX best practices
 
Converations on conversational Ux
Converations on conversational UxConverations on conversational Ux
Converations on conversational Ux
 
Introduction to Facebook Messenger, Conversational UI & NLP
Introduction to Facebook Messenger, Conversational UI & NLPIntroduction to Facebook Messenger, Conversational UI & NLP
Introduction to Facebook Messenger, Conversational UI & NLP
 

Similaire à Understanding Voice User Interface Design

Dialogue management system
Dialogue management systemDialogue management system
Dialogue management system
Mayank Agarwal
 
It Role State Exploration 7 Nov Illumine
It Role State Exploration 7 Nov  IllumineIt Role State Exploration 7 Nov  Illumine
It Role State Exploration 7 Nov Illumine
ibecome
 
UX Design - Lecture # 2.pptx
UX Design - Lecture # 2.pptxUX Design - Lecture # 2.pptx
UX Design - Lecture # 2.pptx
Zaid227349
 

Similaire à Understanding Voice User Interface Design (20)

Getting Started with Voice UI
Getting Started with Voice UIGetting Started with Voice UI
Getting Started with Voice UI
 
Voice Tech TO #1
Voice Tech TO #1Voice Tech TO #1
Voice Tech TO #1
 
The Laws of User Experience: Making it or breaking it with the UX Factor
The Laws of User Experience: Making it or breaking it with the UX FactorThe Laws of User Experience: Making it or breaking it with the UX Factor
The Laws of User Experience: Making it or breaking it with the UX Factor
 
The Laws of User Experience: Making it or Breaking It with the UX Factor
The Laws of User Experience: Making it or Breaking It with the UX FactorThe Laws of User Experience: Making it or Breaking It with the UX Factor
The Laws of User Experience: Making it or Breaking It with the UX Factor
 
Alex wright mons_workshop_051214
Alex wright mons_workshop_051214Alex wright mons_workshop_051214
Alex wright mons_workshop_051214
 
Dialogue management system
Dialogue management systemDialogue management system
Dialogue management system
 
It Role State Exploration 7 Nov Illumine
It Role State Exploration 7 Nov  IllumineIt Role State Exploration 7 Nov  Illumine
It Role State Exploration 7 Nov Illumine
 
Intro to user experience design
Intro to user experience designIntro to user experience design
Intro to user experience design
 
Hci lec 5,6
Hci lec 5,6Hci lec 5,6
Hci lec 5,6
 
Top 10 jakob nielsen’s phenomenal rules of uiux design for 2022
Top 10 jakob nielsen’s phenomenal rules of uiux design for 2022Top 10 jakob nielsen’s phenomenal rules of uiux design for 2022
Top 10 jakob nielsen’s phenomenal rules of uiux design for 2022
 
Usability Workshop at Lillebaelt Academy
Usability Workshop at Lillebaelt AcademyUsability Workshop at Lillebaelt Academy
Usability Workshop at Lillebaelt Academy
 
DT project.pdf
DT project.pdfDT project.pdf
DT project.pdf
 
Chapter 4
Chapter 4 Chapter 4
Chapter 4
 
UX Design - Lecture # 2.pptx
UX Design - Lecture # 2.pptxUX Design - Lecture # 2.pptx
UX Design - Lecture # 2.pptx
 
When Worlds Collide: Improving the User Experience by Applying Progressive In...
When Worlds Collide: Improving the User Experience by Applying Progressive In...When Worlds Collide: Improving the User Experience by Applying Progressive In...
When Worlds Collide: Improving the User Experience by Applying Progressive In...
 
Importance of UX-UI in Android/iOS Development- Stackon
Importance of UX-UI in Android/iOS Development- StackonImportance of UX-UI in Android/iOS Development- Stackon
Importance of UX-UI in Android/iOS Development- Stackon
 
Usability of UI Design (motivation, heuristics, tools)
Usability of UI Design (motivation, heuristics, tools)Usability of UI Design (motivation, heuristics, tools)
Usability of UI Design (motivation, heuristics, tools)
 
Embry-Riddle Campus Solutions UX Design
Embry-Riddle Campus Solutions UX Design Embry-Riddle Campus Solutions UX Design
Embry-Riddle Campus Solutions UX Design
 
Usability Essentials to Know
Usability Essentials to KnowUsability Essentials to Know
Usability Essentials to Know
 
Usability - what is it & why is it important
Usability - what is it & why is it importantUsability - what is it & why is it important
Usability - what is it & why is it important
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Dernier (20)

Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Understanding Voice User Interface Design

  • 1. WHITE PAPER Understanding Voice User Interface Design 200 - 750 Palladium Dr, Ottawa, Ontario, K2V 1C7, Canada Tel +1.613.271.8989 Fax +1.613.271.8388 www.pronexus.com
  • 2. WHITE PAPER Table of Contents User 3 Task 3 Cognitive Load 4 Persona 4 Natural Language (Prompting) 4 Natural Language (Understanding) 4 Effective Error Recovery 4 Pronexus Speech and IVR Consulting 5 Summary 6 2
  • 3. WHITE PAPER Controlling a machine by simply talking to it was science fiction only a short time ago. Until recently, this area was considered to be artificial intelligence. However, with advances in technology, Voice User Interfaces (VUI) have be- come more commonplace, and people are taking advantage of the value that these hands-free, eyes-free interfaces provide in many situations. Creating intuitive and usable VUIs involves coordinating a large number of research-based best practices, with the unique needs of each project. This white paper explores the following factors that influence VUI design: • User • Task • Cognitive load • Persona • Natural language (prompting) • Natural language (understanding) • Effective error recovery. User Task A VUI should be designed according to user It is important to understand exactly what the goal is for characteristics. Will there be a lot of new one-time users, a system (e.g. to place a phone call, buy a plane ticket, e.g., callers to a company? Or will there be mainly or update your work schedule) and to determine the frequent users, who interact with the system many times most efficient and intuitive way to get it done. You need a day, as in the case of a messaging system? to ask questions like: For novice users, a system has to provide appropriate • How does the typical person do this task? instructions and anticipate their questions well enough • Is there a way to do it in fewer steps? that they get comfortably to their goal. Prompts for ordering pizza might include “how many pizzas”; “what • How do most people picture this task, i.e., their size” and “what toppings do you want”. “mental model” of the task? A system that will have frequent expert users won’t need By carefully mapping out these factors, a VUI can work as much instruction, can use more complex commands, with the way people typically approach a problem, rather and must get tasks done in a minimum number of steps. than working against it. For example, an advanced user command might be: “I want to order one large pepperoni pizza.” This command gets everything done in a single step. Ideally, a VUI should provide enough instructions for novices, while not holding back experts who already know what they want. 3
  • 4. WHITE PAPER Cognitive Load Unlike vision, where your eyes can easily scan a scene the transaction moving forward, as well as keep the user repeatedly, speech comes in as a single stream of sound. interested. For example, make it clear when the user You must rely on your memory to know what happened a should respond, and don’t keep them waiting with huge few seconds, or a few minutes, in the past. This means chunks of text. Also, restricting or widening the that VUIs must carefully manage how much pressure is vocabulary in the system prompts can encourage or put on the user’s memory. Keeping this “cognitive load” to discourage the user to do the same, e.g., always using a reasonable level is crucial in making a VUI easy to use. exact terms versus allowing synonyms for the same Giving the user appropriate amounts of feedback, as well concepts; open-ended versus closed-ended questions. as splitting up a complex task into an intuitive sequence of smaller interactions – taking advantage of that “mental model” – can go a long way to reduce the cognitive load. Natural Language (Understanding) This is where most of the magic happens. Everything up to this point helps to make a system easier for the user to Persona understand, and to help the user make appropriate “Persona” refers to the personality that the system responses. Anticipating all the different thing a user could portrays by the choice of voice actor, the tone of the say is the job of a well designed grammar. The grammar wording in the prompts, and way the system interacts with determines what the machine can ‘understand’, and then the user. These choices depend on the target user group act upon. An extremely limited grammar may accept only and the nature of the company hosting the system. the exact words “yes” and “no”, while a more complex For example, a restaurant with a beach theme many want grammar might also correctly accept “yes please”, a loud “surfer dude” persona for a system taking “uuuuhh…yup”, or “yeah, baby!” An even more complex reservations, but a stock brokerage would likely prefer grammar may also accept “I don’t know”, and then follow a more quietly polite and professional persona. For many a different set of prompts to help out the user who needs applications, a more neutral polite/professional persona more assistance. is the best choice. Effective Error Recovery Natural Language (Prompting) Unfortunately, some errors or misunderstandings can oc- cur, just as they do in regular conversation. By anticipat- While most VUIs are not like a conversation with another ing the most common errors, and giving the user effective person, by making use of certain aspects of conversation, and escalating instructions and feedback, the VUI can get the interaction can be much smoother and more natural back them back on track as smoothly as possible. for the user. Appropriate pacing and turn taking can keep 4
  • 5. WHITE PAPER Examples Pronexus Speech and IVR Consulting The technical side of speech recognition has matured to a If a user remains silent, apparently not knowing what to point where correctly capturing spoken input is very do, after a short interval the system could give more reliable. Now a major challenge is creating intuitive and detailed instructions: efficient VUI to properly harness this technology as a tool System: “Please enter your PIN. to get real work done in the real world. User: <silence> Developing an effective VUI involves a number of System: “Please enter the four digit personal activities that a typical software developer may not have identification number that you chose the time or human-factors expertise to tackle. earlier.” If a user says something that is not in the system This is where Pronexus VUI consulting services can help grammars, leading to a recognition error, the system you succeed. We can design a new VUI from the ground should let the user know what kind of input is required: up, edit and optimize an existing voice application, or even adapt an IVR (touch-tone) system to migrate it System: “Who do you want to speak to?” effectively to get all the advantages of a VUI. User: “Bob.” Pronexus VUI consulting services include: System: “Sorry, I couldn’t find that name. Please say the first and last name of the person 1. VUI design: you want to call.” • Task analysis/requirements gathering User: “Bob Jones.” • Interpret requirements into sample dialogs for demonstration (user centered design approach) The idea is to give the user the appropriate level of • Call flow and dialog design (typically as a Visio® instructions to keep them moving toward their goal, and and Word® document) in as natural a manner as possible. In many cases, the user would not be aware that an error occurred, and will • Produce a list of prompts (have them profession- ally recorded if desired) just hear the next prompt in a logical sequence. If things are clearly not progressing, the VUI can even offer to • Develop and tune grammars transfer the user to a live agent before the user gets too frustrated. 2. VUI review: • Analyze an existing application's call flow and dialogs for inconsistencies or weaknesses • Usability assessment, to point out likely problems 5
  • 6. WHITE PAPER • Produce a report of suggested changes to Summary optimize VUI (typically as a Visio and Word docu- Some would say that a great VUI is one that people ment) don’t remember using – it seamlessly got the job done. A good VUI can help overcome user aversion to 3. Tuning services: technology because there are no new skills to learn -- people already understand how to make a phone call. • Analyze an existing application's dialogs and Users can perform transactions quickly and easily, as grammars though they had been speaking to a knowledgeable • Re-evaluate an existing call flow's success in person. When care is taken designing a VUI, both the achieving its goal user and the organization implementing the automation • Optimize recognition parameters win: the use has more control over conducting business at their own convenience, while the organization benefits from improved productivity and enhanced service Pronexus can provide individual services, or can produce delivery. With today’s high recognition rates, VUIs have an entire application. moved from the realm of science fiction, to providing reliable tools that are ready for use right now. Established in 1994, Pronexus specializes in telephony, speech and database integration technologies. Our flagship product VBVoice IVR software enables developing feature-rich inbound and outbound IVR solutions that improve customer service and streamline internal processes. Thousands of companies all over the world use VBVoice in applications varying from auto-attendants to automated payments, fax applications, notifications, polls and surveys. Part of our business model is to provide customers flexibility to choose between developing an IVR application in house, leveraging Pronexus’ Professional Services or buying a turn-key IVR application from one of our partners who have integrated VBVoice into their solutions. 200 - 750 Palladium Dr, Ottawa, Ontario, K2V 1C7, Canada Tel +1.613.271.8989 Fax +1.613.271.8388 www.pronexus.com