SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
State of the Art/Best Practices in
Speech Technology
Dan Burnett, Director of Speech Technologies
Why speech?


                    Ma Ma




                    Vok Say Oh




2   © Voxeo Corporation
Speech is the natural human interface

      15% of world population has a personal
       computer

      Greater than 60% of world population has a
       mobile phone




© Voxeo Corporation
What is communication?


                                       Your
                                     Customer




                    You
         (Your speech-enabled IVR)




© Voxeo Corporation
Communication is natural?




                      249694




© Voxeo Corporation
But for IVRs . . .


                                            Your
                                          Customer




                     You
      (Your untuned speech-enabled IVR)




© Voxeo Corporation
So why do we tune?

      For better communication, which leads to



                       More satisfied customers
                       Shorter call durations




© Voxeo Corporation
What can we tune?




                      Your untuned speech-enabled IVR



© Voxeo Corporation
What can we tune?




                      Your untuned speech-enabled IVR



© Voxeo Corporation
What we say – prompts

      Goal: naturally reduce variability in caller's
       responses

      Because: predictability simplifies grammars
       and increases recognition accuracy




© Voxeo Corporation
Prompt tuning

        Vocabulary
          •  Use the words your customers use
          •  For sales, say “sales”; For billing, say “billing”; ...
          •  Are you calling to learn more about our products, to
                fix a problem with your bill, or …

        Keep in mind
          •  Speech allows your customer to describe things
             THEIR way rather than to use your internal company
             description
          •  Make it easier for them to do that!


© Voxeo Corporation
Prompt tuning

        Prompt specificity
          •  General: “What would you like?”
          •  More specific: “Which department would you like?”
          •  Precise: “Would you like A, B, C, or something else?”

        Keep in mind
          •  The caller will often use the exact words YOU use




© Voxeo Corporation
Ever heard this before?

        For Sales, press 1
        For Billing, press 2
        For option I can't remember, press 3
        For another option I can't remember, press 4
        For yet another option I can't remember, press 5
        For more of the same, press 6
        Blah blah, press 7
        For help with this menu, press 8
        To hear these options again, press 9
© Voxeo Corporation
Prompt tuning

        Prompt length
          •  Keep it short: less than a few sentences total, only
             one of which asks for input
          •  Or: provide pauses (at least one second long) for
             interruption

        Keep in mind
          •  Speech communication is only natural if it's not
             drawn out
          •  Primacy and recency effects




© Voxeo Corporation
What can we tune?




                      Your untuned speech-enabled IVR



© Voxeo Corporation
What we listen for – grammars

      Goal: Cover everything they are likely to say,
       and nothing more

      Because: Accuracy in grammar coverage
       directly affects recognition accuracy




© Voxeo Corporation
Grammar tuning

        Cover everything they say
          •  Pre- and post- phrases such as please, I would like,
             and thank you
          •  Synonyms such as (for yes/no) yeah, sure, absolutely
             not

        Keep in mind
          •  Recognizers can only hear it if it's in the grammar




© Voxeo Corporation
Grammar tuning

        Include only what they say
          •  Write grammars that don't overgenerate
          •  If matching numbers/digits, only include valid strings
                if at all possible

        Keep in mind
          •  Every unnecessary grammar phrase is a potential
                misrecognition




© Voxeo Corporation
What can we tune?




                      Your untuned speech-enabled IVR



© Voxeo Corporation
How we listen – parameter optimization

      Goal: Optimize recognizer parameter settings
      Because: Better accuracy, of course!




© Voxeo Corporation
Parameter optimization – which parameters?

      Rejection threshold
      Endpointer settings (sensitivity)

      Large grammar parameters




© Voxeo Corporation
Rejection threshold – what is it?

                                                     False
                                                   Rejections

         Misrecognitions




0
                          Rejection Threshold
         100
© Voxeo Corporation
Rejection threshold – what is it?

Cutoff value for the recognizer confidence below      False
 which the speaker's utterance will be rejected
   Rejections

         Misrecognitions




0
                          Rejection Threshold
         100
© Voxeo Corporation
Rejection threshold – total error

                                                     False
                                                   Rejections

         Misrecognitions




0
                          Rejection Threshold
         100
© Voxeo Corporation
Rejection threshold – comparison
                                        ASR
                                      Engine A



                                                    ASR
                                                  Engine B




0
                    Rejection Threshold
            100
© Voxeo Corporation
Rejection threshold – comparison
                                            ASR
                                          Engine A



                                                        ASR
                                                      Engine B




    Optimal thresholds


0
                        Rejection Threshold
            100
© Voxeo Corporation
Rejection threshold – another
comparison

                                                      ASR
                                                    Engine A



                      Optimal thresholds
                         ASR
                                                                Engine B




0
                           Rejection Threshold
                      100
© Voxeo Corporation
Parameter optimization

        Rejection threshold
          •  Generally largest impact on accuracy
          •  Optimum varies across recognition engines
          •  Optimum varies by set of active grammars

        Keep in mind
          •  Optimizing the rejection threshold is the SINGLE
                MOST IMPORTANT parameter tuning you can do




© Voxeo Corporation
Endpointer sensitivity


                                       Your
                                     Customer




                     You
             (Your hard-of-hearing
              speech-enabled IVR)




© Voxeo Corporation
Parameter optimization

        Endpointer sensitivity
          •  Second-largest impact on accuracy
          •  Unnecessarily high and low sensitivity are both bad
          •  Optimum should be set once, checked annually

        Keep in mind
          •  If the recognizer can't hear you, it can't understand
                what you say




© Voxeo Corporation
Parameter optimization

        Large grammar parameters
          •  Typically need to be adjusted if grammar has more
             than 5000 entries
          •  Typically consumes more memory and/or CPU
          •  Vary by ASR engine, so ask

        Keep in mind
          •  If your grammar has many options, your recognizer
                needs to “think” more than the default settings
                usually allow



© Voxeo Corporation
What can we tune?




                      Your untuned speech-enabled IVR



© Voxeo Corporation
Summary – Keep in mind

        Speech allows your customer to describe things THEIR way rather than to
         use your internal company description – make it easy for them!

        The caller will often use the exact words YOU use

        Speech communication is only natural if it's not drawn out

        Recognizers can only hear it if it's in the grammar

        Every unnecessary grammar phrase is a potential misrecognition

        Optimizing the rejection threshold is the SINGLE MOST IMPORTANT
         parameter tuning you can do

        If the recognizer can't hear you, it can't understand what you say

        If your grammar has many options, your recognizer needs to “think” more
         than the default settings usually allow
© Voxeo Corporation
For help




34   © Voxeo Corporation
State of the Art/Best Practices in
Speech Technology
Dan Burnett, Director of Speech Technologies

dburnett@voxeo.com
Voxeo Summit 2010: Best Practices in Speech Technology

Contenu connexe

En vedette

Voxeo Summit Day 2 -What's new in CXP 14
Voxeo Summit Day 2 -What's new in CXP 14Voxeo Summit Day 2 -What's new in CXP 14
Voxeo Summit Day 2 -What's new in CXP 14Voxeo Corp
 
Jodo Mission of Hawaii - June 2012
Jodo Mission of Hawaii - June 2012Jodo Mission of Hawaii - June 2012
Jodo Mission of Hawaii - June 2012Ryan Ozawa
 
Voxeo Summit Day 1 - A view into the Voxeo cloud
Voxeo Summit Day 1 - A view into the Voxeo cloudVoxeo Summit Day 1 - A view into the Voxeo cloud
Voxeo Summit Day 1 - A view into the Voxeo cloudVoxeo Corp
 
Voxeo Summit Day 2 -Voxeo APIs and SDKs
Voxeo Summit Day 2 -Voxeo APIs and SDKsVoxeo Summit Day 2 -Voxeo APIs and SDKs
Voxeo Summit Day 2 -Voxeo APIs and SDKsVoxeo Corp
 
Voxeo Summit Day 1 - Extending your IVR investment to mobile
Voxeo Summit Day 1 - Extending your IVR investment to mobileVoxeo Summit Day 1 - Extending your IVR investment to mobile
Voxeo Summit Day 1 - Extending your IVR investment to mobileVoxeo Corp
 
Voxeo Summit Day 2 - Securing customer interactions
Voxeo Summit Day 2 - Securing customer interactionsVoxeo Summit Day 2 - Securing customer interactions
Voxeo Summit Day 2 - Securing customer interactionsVoxeo Corp
 

En vedette (7)

Voxeo Summit Day 2 -What's new in CXP 14
Voxeo Summit Day 2 -What's new in CXP 14Voxeo Summit Day 2 -What's new in CXP 14
Voxeo Summit Day 2 -What's new in CXP 14
 
Jodo Mission of Hawaii - June 2012
Jodo Mission of Hawaii - June 2012Jodo Mission of Hawaii - June 2012
Jodo Mission of Hawaii - June 2012
 
Voxeo Summit Day 1 - A view into the Voxeo cloud
Voxeo Summit Day 1 - A view into the Voxeo cloudVoxeo Summit Day 1 - A view into the Voxeo cloud
Voxeo Summit Day 1 - A view into the Voxeo cloud
 
Voxeo Summit Day 2 -Voxeo APIs and SDKs
Voxeo Summit Day 2 -Voxeo APIs and SDKsVoxeo Summit Day 2 -Voxeo APIs and SDKs
Voxeo Summit Day 2 -Voxeo APIs and SDKs
 
Voxeo Summit Day 1 - Extending your IVR investment to mobile
Voxeo Summit Day 1 - Extending your IVR investment to mobileVoxeo Summit Day 1 - Extending your IVR investment to mobile
Voxeo Summit Day 1 - Extending your IVR investment to mobile
 
Roman stadium en
Roman stadium enRoman stadium en
Roman stadium en
 
Voxeo Summit Day 2 - Securing customer interactions
Voxeo Summit Day 2 - Securing customer interactionsVoxeo Summit Day 2 - Securing customer interactions
Voxeo Summit Day 2 - Securing customer interactions
 

Similaire à Voxeo Summit 2010: Best Practices in Speech Technology

Voxeo Summit 2010: VoiceObjects Personalization
Voxeo Summit 2010: VoiceObjects PersonalizationVoxeo Summit 2010: VoiceObjects Personalization
Voxeo Summit 2010: VoiceObjects PersonalizationVoxeo Corp
 
Doesthe recognizermatter
Doesthe recognizermatterDoesthe recognizermatter
Doesthe recognizermatterpurplestains88
 
Doesthe recognizermatter
Doesthe recognizermatterDoesthe recognizermatter
Doesthe recognizermatterpurplestains88
 
Voxeo Summit 2010: Prophecy 10 - Unified Self Service
Voxeo Summit 2010: Prophecy 10 - Unified Self ServiceVoxeo Summit 2010: Prophecy 10 - Unified Self Service
Voxeo Summit 2010: Prophecy 10 - Unified Self ServiceVoxeo Corp
 
Shop By Voice Product Overview
Shop By Voice Product OverviewShop By Voice Product Overview
Shop By Voice Product OverviewAlora Chistiakoff
 
Vocal zoom company presentation pp 97 2003 290413
Vocal zoom company presentation pp 97 2003 290413Vocal zoom company presentation pp 97 2003 290413
Vocal zoom company presentation pp 97 2003 290413Yechiel Kurtz
 
Jacada Visual IVR
Jacada Visual IVRJacada Visual IVR
Jacada Visual IVRJacada
 
Voxeo Summit 2010: VoiceObjects Spotlight
Voxeo Summit 2010: VoiceObjects SpotlightVoxeo Summit 2010: VoiceObjects Spotlight
Voxeo Summit 2010: VoiceObjects SpotlightVoxeo Corp
 
Call center solution
Call center solutionCall center solution
Call center solutionKien Nguyen
 
Video Receptionist Flyer
Video Receptionist FlyerVideo Receptionist Flyer
Video Receptionist FlyerWinTech LLC
 

Similaire à Voxeo Summit 2010: Best Practices in Speech Technology (11)

Voxeo Summit 2010: VoiceObjects Personalization
Voxeo Summit 2010: VoiceObjects PersonalizationVoxeo Summit 2010: VoiceObjects Personalization
Voxeo Summit 2010: VoiceObjects Personalization
 
Doesthe recognizermatter
Doesthe recognizermatterDoesthe recognizermatter
Doesthe recognizermatter
 
Doesthe recognizermatter
Doesthe recognizermatterDoesthe recognizermatter
Doesthe recognizermatter
 
Voxeo Summit 2010: Prophecy 10 - Unified Self Service
Voxeo Summit 2010: Prophecy 10 - Unified Self ServiceVoxeo Summit 2010: Prophecy 10 - Unified Self Service
Voxeo Summit 2010: Prophecy 10 - Unified Self Service
 
Shop By Voice Product Overview
Shop By Voice Product OverviewShop By Voice Product Overview
Shop By Voice Product Overview
 
Vocal zoom company presentation pp 97 2003 290413
Vocal zoom company presentation pp 97 2003 290413Vocal zoom company presentation pp 97 2003 290413
Vocal zoom company presentation pp 97 2003 290413
 
Jacada Visual IVR
Jacada Visual IVRJacada Visual IVR
Jacada Visual IVR
 
Voxeo Summit 2010: VoiceObjects Spotlight
Voxeo Summit 2010: VoiceObjects SpotlightVoxeo Summit 2010: VoiceObjects Spotlight
Voxeo Summit 2010: VoiceObjects Spotlight
 
Call center solution
Call center solutionCall center solution
Call center solution
 
Demystifying Speech Application Tuning
Demystifying Speech Application TuningDemystifying Speech Application Tuning
Demystifying Speech Application Tuning
 
Video Receptionist Flyer
Video Receptionist FlyerVideo Receptionist Flyer
Video Receptionist Flyer
 

Plus de Voxeo Corp

Voxeo Summit Day 2 - Voxeo CXP - IVR on Steroids
Voxeo Summit Day 2 - Voxeo CXP - IVR on SteroidsVoxeo Summit Day 2 - Voxeo CXP - IVR on Steroids
Voxeo Summit Day 2 - Voxeo CXP - IVR on SteroidsVoxeo Corp
 
Voxeo Summit Day 2 - Using CXP hotspot analytics
Voxeo Summit Day 2 - Using CXP hotspot analyticsVoxeo Summit Day 2 - Using CXP hotspot analytics
Voxeo Summit Day 2 - Using CXP hotspot analyticsVoxeo Corp
 
Voxeo Summit Day 2 - Real-time communications with WebRTC
Voxeo Summit Day 2 - Real-time communications with WebRTCVoxeo Summit Day 2 - Real-time communications with WebRTC
Voxeo Summit Day 2 - Real-time communications with WebRTCVoxeo Corp
 
Voxeo Summit Day 2 - Voxeo CXP for business users
Voxeo Summit Day 2 - Voxeo CXP for business usersVoxeo Summit Day 2 - Voxeo CXP for business users
Voxeo Summit Day 2 - Voxeo CXP for business usersVoxeo Corp
 
Voxeo Summit Day 2 - Creating raving fans
Voxeo Summit Day 2 - Creating raving fansVoxeo Summit Day 2 - Creating raving fans
Voxeo Summit Day 2 - Creating raving fansVoxeo Corp
 
Voxeo Summit Day 2 - Advanced CCXML topics
Voxeo Summit Day 2 - Advanced CCXML topicsVoxeo Summit Day 2 - Advanced CCXML topics
Voxeo Summit Day 2 - Advanced CCXML topicsVoxeo Corp
 
Voxeo Summit Day 2 - The science of customer obsession
Voxeo Summit Day 2 - The science of customer obsessionVoxeo Summit Day 2 - The science of customer obsession
Voxeo Summit Day 2 - The science of customer obsessionVoxeo Corp
 
Voxeo Summit Day 1 - The Art of The Possible
Voxeo Summit Day 1 - The Art of The PossibleVoxeo Summit Day 1 - The Art of The Possible
Voxeo Summit Day 1 - The Art of The PossibleVoxeo Corp
 
Voxeo Summit Day 1 - Prophecy log search
Voxeo Summit Day 1 - Prophecy log searchVoxeo Summit Day 1 - Prophecy log search
Voxeo Summit Day 1 - Prophecy log searchVoxeo Corp
 
Voxeo Summit Day 1 - Customer experience analytics
Voxeo Summit Day 1 - Customer experience analyticsVoxeo Summit Day 1 - Customer experience analytics
Voxeo Summit Day 1 - Customer experience analyticsVoxeo Corp
 
Voxeo Summit Day 1 - Communications-enabled Business Processes (CEBP)
Voxeo Summit Day 1 - Communications-enabled Business Processes (CEBP)Voxeo Summit Day 1 - Communications-enabled Business Processes (CEBP)
Voxeo Summit Day 1 - Communications-enabled Business Processes (CEBP)Voxeo Corp
 
Voxeo Summit Day 1 - Lessons learned from large scale deployments
Voxeo Summit Day 1 - Lessons learned from large scale deploymentsVoxeo Summit Day 1 - Lessons learned from large scale deployments
Voxeo Summit Day 1 - Lessons learned from large scale deploymentsVoxeo Corp
 
Voxeo Jam Session: What's New in Prophecy 11 and VoiceObjects 11?
Voxeo Jam Session: What's New in Prophecy 11 and VoiceObjects 11?Voxeo Jam Session: What's New in Prophecy 11 and VoiceObjects 11?
Voxeo Jam Session: What's New in Prophecy 11 and VoiceObjects 11?Voxeo Corp
 
How Do You Hear Me Now?
How Do You Hear Me Now?How Do You Hear Me Now?
How Do You Hear Me Now?Voxeo Corp
 
CCXML For Advanced Communications Applications
CCXML For Advanced Communications ApplicationsCCXML For Advanced Communications Applications
CCXML For Advanced Communications ApplicationsVoxeo Corp
 
IPv6 and How It Impacts Communication Applications
IPv6 and How It Impacts Communication ApplicationsIPv6 and How It Impacts Communication Applications
IPv6 and How It Impacts Communication ApplicationsVoxeo Corp
 
7 Critical Success Factors for Outbound IVR
7 Critical Success Factors for Outbound IVR7 Critical Success Factors for Outbound IVR
7 Critical Success Factors for Outbound IVRVoxeo Corp
 
5 Questions When Analyzing Your Analytics Options
5 Questions When Analyzing Your Analytics Options5 Questions When Analyzing Your Analytics Options
5 Questions When Analyzing Your Analytics OptionsVoxeo Corp
 
Serving the Social Customer: Scaling Your Support For Twitter, Facebook and More
Serving the Social Customer: Scaling Your Support For Twitter, Facebook and MoreServing the Social Customer: Scaling Your Support For Twitter, Facebook and More
Serving the Social Customer: Scaling Your Support For Twitter, Facebook and MoreVoxeo Corp
 
Comparative ASR Evaluation - Voxeo - SpeechTEK NY 2010
Comparative ASR Evaluation - Voxeo - SpeechTEK NY 2010Comparative ASR Evaluation - Voxeo - SpeechTEK NY 2010
Comparative ASR Evaluation - Voxeo - SpeechTEK NY 2010Voxeo Corp
 

Plus de Voxeo Corp (20)

Voxeo Summit Day 2 - Voxeo CXP - IVR on Steroids
Voxeo Summit Day 2 - Voxeo CXP - IVR on SteroidsVoxeo Summit Day 2 - Voxeo CXP - IVR on Steroids
Voxeo Summit Day 2 - Voxeo CXP - IVR on Steroids
 
Voxeo Summit Day 2 - Using CXP hotspot analytics
Voxeo Summit Day 2 - Using CXP hotspot analyticsVoxeo Summit Day 2 - Using CXP hotspot analytics
Voxeo Summit Day 2 - Using CXP hotspot analytics
 
Voxeo Summit Day 2 - Real-time communications with WebRTC
Voxeo Summit Day 2 - Real-time communications with WebRTCVoxeo Summit Day 2 - Real-time communications with WebRTC
Voxeo Summit Day 2 - Real-time communications with WebRTC
 
Voxeo Summit Day 2 - Voxeo CXP for business users
Voxeo Summit Day 2 - Voxeo CXP for business usersVoxeo Summit Day 2 - Voxeo CXP for business users
Voxeo Summit Day 2 - Voxeo CXP for business users
 
Voxeo Summit Day 2 - Creating raving fans
Voxeo Summit Day 2 - Creating raving fansVoxeo Summit Day 2 - Creating raving fans
Voxeo Summit Day 2 - Creating raving fans
 
Voxeo Summit Day 2 - Advanced CCXML topics
Voxeo Summit Day 2 - Advanced CCXML topicsVoxeo Summit Day 2 - Advanced CCXML topics
Voxeo Summit Day 2 - Advanced CCXML topics
 
Voxeo Summit Day 2 - The science of customer obsession
Voxeo Summit Day 2 - The science of customer obsessionVoxeo Summit Day 2 - The science of customer obsession
Voxeo Summit Day 2 - The science of customer obsession
 
Voxeo Summit Day 1 - The Art of The Possible
Voxeo Summit Day 1 - The Art of The PossibleVoxeo Summit Day 1 - The Art of The Possible
Voxeo Summit Day 1 - The Art of The Possible
 
Voxeo Summit Day 1 - Prophecy log search
Voxeo Summit Day 1 - Prophecy log searchVoxeo Summit Day 1 - Prophecy log search
Voxeo Summit Day 1 - Prophecy log search
 
Voxeo Summit Day 1 - Customer experience analytics
Voxeo Summit Day 1 - Customer experience analyticsVoxeo Summit Day 1 - Customer experience analytics
Voxeo Summit Day 1 - Customer experience analytics
 
Voxeo Summit Day 1 - Communications-enabled Business Processes (CEBP)
Voxeo Summit Day 1 - Communications-enabled Business Processes (CEBP)Voxeo Summit Day 1 - Communications-enabled Business Processes (CEBP)
Voxeo Summit Day 1 - Communications-enabled Business Processes (CEBP)
 
Voxeo Summit Day 1 - Lessons learned from large scale deployments
Voxeo Summit Day 1 - Lessons learned from large scale deploymentsVoxeo Summit Day 1 - Lessons learned from large scale deployments
Voxeo Summit Day 1 - Lessons learned from large scale deployments
 
Voxeo Jam Session: What's New in Prophecy 11 and VoiceObjects 11?
Voxeo Jam Session: What's New in Prophecy 11 and VoiceObjects 11?Voxeo Jam Session: What's New in Prophecy 11 and VoiceObjects 11?
Voxeo Jam Session: What's New in Prophecy 11 and VoiceObjects 11?
 
How Do You Hear Me Now?
How Do You Hear Me Now?How Do You Hear Me Now?
How Do You Hear Me Now?
 
CCXML For Advanced Communications Applications
CCXML For Advanced Communications ApplicationsCCXML For Advanced Communications Applications
CCXML For Advanced Communications Applications
 
IPv6 and How It Impacts Communication Applications
IPv6 and How It Impacts Communication ApplicationsIPv6 and How It Impacts Communication Applications
IPv6 and How It Impacts Communication Applications
 
7 Critical Success Factors for Outbound IVR
7 Critical Success Factors for Outbound IVR7 Critical Success Factors for Outbound IVR
7 Critical Success Factors for Outbound IVR
 
5 Questions When Analyzing Your Analytics Options
5 Questions When Analyzing Your Analytics Options5 Questions When Analyzing Your Analytics Options
5 Questions When Analyzing Your Analytics Options
 
Serving the Social Customer: Scaling Your Support For Twitter, Facebook and More
Serving the Social Customer: Scaling Your Support For Twitter, Facebook and MoreServing the Social Customer: Scaling Your Support For Twitter, Facebook and More
Serving the Social Customer: Scaling Your Support For Twitter, Facebook and More
 
Comparative ASR Evaluation - Voxeo - SpeechTEK NY 2010
Comparative ASR Evaluation - Voxeo - SpeechTEK NY 2010Comparative ASR Evaluation - Voxeo - SpeechTEK NY 2010
Comparative ASR Evaluation - Voxeo - SpeechTEK NY 2010
 

Dernier

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 

Dernier (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 

Voxeo Summit 2010: Best Practices in Speech Technology

  • 1. State of the Art/Best Practices in Speech Technology Dan Burnett, Director of Speech Technologies
  • 2. Why speech? Ma Ma Vok Say Oh 2 © Voxeo Corporation
  • 3. Speech is the natural human interface   15% of world population has a personal computer   Greater than 60% of world population has a mobile phone © Voxeo Corporation
  • 4. What is communication? Your Customer You (Your speech-enabled IVR) © Voxeo Corporation
  • 5. Communication is natural? 249694 © Voxeo Corporation
  • 6. But for IVRs . . . Your Customer You (Your untuned speech-enabled IVR) © Voxeo Corporation
  • 7. So why do we tune?   For better communication, which leads to  More satisfied customers  Shorter call durations © Voxeo Corporation
  • 8. What can we tune? Your untuned speech-enabled IVR © Voxeo Corporation
  • 9. What can we tune? Your untuned speech-enabled IVR © Voxeo Corporation
  • 10. What we say – prompts   Goal: naturally reduce variability in caller's responses   Because: predictability simplifies grammars and increases recognition accuracy © Voxeo Corporation
  • 11. Prompt tuning   Vocabulary •  Use the words your customers use •  For sales, say “sales”; For billing, say “billing”; ... •  Are you calling to learn more about our products, to fix a problem with your bill, or …   Keep in mind •  Speech allows your customer to describe things THEIR way rather than to use your internal company description •  Make it easier for them to do that! © Voxeo Corporation
  • 12. Prompt tuning   Prompt specificity •  General: “What would you like?” •  More specific: “Which department would you like?” •  Precise: “Would you like A, B, C, or something else?”   Keep in mind •  The caller will often use the exact words YOU use © Voxeo Corporation
  • 13. Ever heard this before?   For Sales, press 1   For Billing, press 2   For option I can't remember, press 3   For another option I can't remember, press 4   For yet another option I can't remember, press 5   For more of the same, press 6   Blah blah, press 7   For help with this menu, press 8   To hear these options again, press 9 © Voxeo Corporation
  • 14. Prompt tuning   Prompt length •  Keep it short: less than a few sentences total, only one of which asks for input •  Or: provide pauses (at least one second long) for interruption   Keep in mind •  Speech communication is only natural if it's not drawn out •  Primacy and recency effects © Voxeo Corporation
  • 15. What can we tune? Your untuned speech-enabled IVR © Voxeo Corporation
  • 16. What we listen for – grammars   Goal: Cover everything they are likely to say, and nothing more   Because: Accuracy in grammar coverage directly affects recognition accuracy © Voxeo Corporation
  • 17. Grammar tuning   Cover everything they say •  Pre- and post- phrases such as please, I would like, and thank you •  Synonyms such as (for yes/no) yeah, sure, absolutely not   Keep in mind •  Recognizers can only hear it if it's in the grammar © Voxeo Corporation
  • 18. Grammar tuning   Include only what they say •  Write grammars that don't overgenerate •  If matching numbers/digits, only include valid strings if at all possible   Keep in mind •  Every unnecessary grammar phrase is a potential misrecognition © Voxeo Corporation
  • 19. What can we tune? Your untuned speech-enabled IVR © Voxeo Corporation
  • 20. How we listen – parameter optimization   Goal: Optimize recognizer parameter settings   Because: Better accuracy, of course! © Voxeo Corporation
  • 21. Parameter optimization – which parameters?   Rejection threshold   Endpointer settings (sensitivity)   Large grammar parameters © Voxeo Corporation
  • 22. Rejection threshold – what is it? False Rejections Misrecognitions 0 Rejection Threshold 100 © Voxeo Corporation
  • 23. Rejection threshold – what is it? Cutoff value for the recognizer confidence below False which the speaker's utterance will be rejected Rejections Misrecognitions 0 Rejection Threshold 100 © Voxeo Corporation
  • 24. Rejection threshold – total error False Rejections Misrecognitions 0 Rejection Threshold 100 © Voxeo Corporation
  • 25. Rejection threshold – comparison ASR Engine A ASR Engine B 0 Rejection Threshold 100 © Voxeo Corporation
  • 26. Rejection threshold – comparison ASR Engine A ASR Engine B Optimal thresholds 0 Rejection Threshold 100 © Voxeo Corporation
  • 27. Rejection threshold – another comparison ASR Engine A Optimal thresholds ASR Engine B 0 Rejection Threshold 100 © Voxeo Corporation
  • 28. Parameter optimization   Rejection threshold •  Generally largest impact on accuracy •  Optimum varies across recognition engines •  Optimum varies by set of active grammars   Keep in mind •  Optimizing the rejection threshold is the SINGLE MOST IMPORTANT parameter tuning you can do © Voxeo Corporation
  • 29. Endpointer sensitivity Your Customer You (Your hard-of-hearing speech-enabled IVR) © Voxeo Corporation
  • 30. Parameter optimization   Endpointer sensitivity •  Second-largest impact on accuracy •  Unnecessarily high and low sensitivity are both bad •  Optimum should be set once, checked annually   Keep in mind •  If the recognizer can't hear you, it can't understand what you say © Voxeo Corporation
  • 31. Parameter optimization   Large grammar parameters •  Typically need to be adjusted if grammar has more than 5000 entries •  Typically consumes more memory and/or CPU •  Vary by ASR engine, so ask   Keep in mind •  If your grammar has many options, your recognizer needs to “think” more than the default settings usually allow © Voxeo Corporation
  • 32. What can we tune? Your untuned speech-enabled IVR © Voxeo Corporation
  • 33. Summary – Keep in mind   Speech allows your customer to describe things THEIR way rather than to use your internal company description – make it easy for them!   The caller will often use the exact words YOU use   Speech communication is only natural if it's not drawn out   Recognizers can only hear it if it's in the grammar   Every unnecessary grammar phrase is a potential misrecognition   Optimizing the rejection threshold is the SINGLE MOST IMPORTANT parameter tuning you can do   If the recognizer can't hear you, it can't understand what you say   If your grammar has many options, your recognizer needs to “think” more than the default settings usually allow © Voxeo Corporation
  • 34. For help 34 © Voxeo Corporation
  • 35. State of the Art/Best Practices in Speech Technology Dan Burnett, Director of Speech Technologies
 dburnett@voxeo.com