2009   |   Westergasfabriek   |   Amsterdam   |   http://eComm.ec
Practical Edge of
Speech Technology

    Moshe Yudkowsky
   www.Disaggregate.com

“Practical” is Relative

  Affordable
  Schedule
  Achievable

Core Technology

Engines:    Speech Recognition (ASR), Text-to-Speech (TTS),
            Biometrics, Thynometrics (emotions)
Analytics:  Data mining, problem discovery

Two 20-second Exercises

   Exercise 1       Exercise 2

   Travel Agency   Twitter Update
    Automated        of eComm
   Reservations     Conference

Lessons

   Exercise 1                  Exercise 2

   Everyone has the same       Highly personal answers
   & simple answers

   Call centers; standard      Dictation; voice search
   device commands

   Speaker Independent         Speaker Dependent or …

Network Hardware for
Speaker Independent

Network-based systems: Your equipment (“Premises”)

Network-based systems: “Hosted”

Local Hardware

Device-based systems
  Local Recognition
  Known text
  Complex, personal text
  [Diagram labels: ASR, Results]

Device-based systems: Hybrid
  Voice to server, data back to device
  Speaker independent (?)
  [Diagram labels: Voice, Results, ASR]

Engines: Speech Recognition (ASR)



Summary:
You can do almost anything — but
the more you do, the more you
pay.

Telephony ASR is excellent:

Inexpensive      “What city?” - “Amsterdam”

Very expensive   “What is wrong with your phone?” - “I dropped it
                 on the floor, and the screen is cracked, and
                 now I can’t see anything.”

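A directed prompt such as “What city?” limits the caller to a short, known list of answers, so whatever text the recognizer returns only has to be matched against that list. The sketch below illustrates that idea under stated assumptions: the ASR engine has already produced a text hypothesis, and the city list, the `match_city` name, and the 0.8 cutoff are all hypothetical and not from the talk.

```python
import difflib

# Hypothetical closed vocabulary for the "What city?" prompt.
CITIES = ["Amsterdam", "Rotterdam", "Utrecht"]

def match_city(hypothesis, cities=CITIES, cutoff=0.8):
    """Return the listed city closest to the recognizer's text, or None."""
    lookup = {city.lower(): city for city in cities}
    hits = difflib.get_close_matches(
        hypothesis.strip().lower(), list(lookup), n=1, cutoff=cutoff
    )
    return lookup[hits[0]] if hits else None

print(match_city("amsterdm"))                    # slightly garbled, still matched
print(match_city("I dropped it on the floor"))   # open-ended speech: no match
```

The open-ended answer falls outside any small list, which is one way to see why the second kind of dialogue costs so much more to handle.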
Cautions

No such thing as “speech to text”
 Speaker dependent comes closest
 Voicemail to text: human assisted
 Some telephone ASR is also human
 assisted





Speaker Dependent

Desktop computers can do excellent
transcription, but still need corrections
Hand-held devices have more
memory & power → better ASR




Engines: Text-to-Speech (TTS)



Summary:
Available in many languages,
reasonable quality, sometimes
difficult to understand.

TTS requires language understanding
and specific jargon translation:
 “Mr.” → “Mister”
 “bbl” → “Be Back Later”
 “287 m” → “about 300 meters”
Custom voices available

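The translations above can be sketched as a small text-normalization pass that runs before the text reaches a TTS engine. This is only an illustration: the `ABBREVIATIONS` table, the function names, and the rule of rounding distances of 100 m or more to the nearest hundred are assumptions, not the speaker's method or any engine's API.

```python
import re

# Illustrative expansion table; a real TTS front end would need far more entries.
ABBREVIATIONS = {"Mr.": "Mister", "bbl": "be back later"}

def round_distance(match):
    """Speak a metre count as a rounded, easier-to-hear figure."""
    metres = int(match.group(1))
    rounded = round(metres, -2) if metres >= 100 else metres
    prefix = "about " if rounded != metres else ""
    return f"{prefix}{rounded} meters"

def normalize(text):
    """Expand abbreviations and metre counts into speakable words."""
    for short, spoken in ABBREVIATIONS.items():
        # Lookarounds keep "bbl" from matching inside a longer word.
        text = re.sub(rf"(?<!\w){re.escape(short)}(?!\w)", spoken, text)
    return re.sub(r"\b(\d+)\s*m\b", round_distance, text)

print(normalize("Mr. Smith: bbl, I am 287 m away"))
```

A lookup table handles fixed jargon, while the distance rule shows why some language understanding is needed: the engine has to decide that “287 m” is a distance and that a listener wants a round number.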
Engines: Biometrics (Speaker Identification,
Speaker Verification, Speaker Characterization)

Summary:
Speaker verification practical but
still rare; speaker identification &
characterization practical and
secret
Speaker Verification (is that really
you?)
 Available, practical
 Rare in the US, more prevalent in
 Australia, Israel, and Canada
 Roadblocks: valid fear; fear of
 biometrics; love of fingerprints;
 only part of complete solution

•Speaker Identification (who are
you?)
•Speaker Characterization (what are
you?)
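
The difference between verification (“is that really you?”) and identification (“who are you?”) can be sketched as a one-to-one check versus a one-to-many search. The example assumes each speaker is already represented by a numeric voiceprint vector; the `ENROLLED` values, the cosine-similarity scoring, and the 0.9 threshold are hypothetical choices for illustration, not a description of any particular biometric product.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy enrolled voiceprints (hypothetical values).
ENROLLED = {"alice": [0.9, 0.1, 0.2], "bob": [0.1, 0.8, 0.3]}

def verify(claimed, sample, threshold=0.9):
    """1:1 check: does the sample match the claimed speaker?"""
    return cosine(ENROLLED[claimed], sample) >= threshold

def identify(sample, threshold=0.9):
    """1:N search: which enrolled speaker, if any, best matches the sample?"""
    name, score = max(
        ((n, cosine(v, sample)) for n, v in ENROLLED.items()),
        key=lambda pair: pair[1],
    )
    return name if score >= threshold else None

sample = [0.85, 0.15, 0.25]
print(verify("alice", sample), verify("bob", sample), identify(sample))
```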





Analytics: Data mining, problem discovery

Summary:
Surprisingly useful, expensive

Not a real-time process
Word searches, “speech to text”
Emotion detection by ASR (swearing)
and by thynometrics (pitch, volume)
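
The two emotion cues above can be sketched as an offline pass over a recorded call: a word check on the transcript and a loudness check on the audio. This is a simplified illustration with assumed inputs: the transcript is taken as already produced by ASR, `FLAGGED_WORDS` and `loud_factor` are hypothetical, and pitch analysis is left out to keep the example short.

```python
import math

FLAGGED_WORDS = {"damn", "hell"}  # hypothetical word list

def rms(samples):
    """Root-mean-square level of the audio samples, a simple volume measure."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def flag_call(transcript, samples, baseline_rms, loud_factor=2.0):
    """Flag a call by words (ASR side) and by volume (thynometrics side)."""
    words = {w.strip(".,!?").lower() for w in transcript.split()}
    return {
        "swearing": bool(words & FLAGGED_WORDS),
        "raised_voice": rms(samples) > loud_factor * baseline_rms,
    }

print(flag_call("Damn, it broke again!", [0.8, -0.8, 0.8, -0.8], baseline_rms=0.2))
```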



About Disaggregate



              Moshe Yudkowsky
              Disaggregate
              2952 W. Fargo
              Chicago, IL 60645
              +1 773 764 8727
              www.Disaggregate.com
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Moshe Yudkowsky's Presentation at Emerging Communication Conference & Awards 2009 Europe

  • 1. 2009 | Westergasfabriek | Amsterdam | http://eComm.ec
  • 2. Practical Edge of Speech Technology. Moshe Yudkowsky, www.Disaggregate.com
  • 3. “Practical” is Relative: Affordable; Schedule; Achievable
  • 4. Core Technology. Engines: Speech Recognition (ASR), Text-to-Speech (TTS), Biometrics, Thynometrics (emotions). Analytics: Data mining, problem discovery
  • 7. Two 20-second Exercises. Exercise 1: Travel Agency Automated Reservations. Exercise 2: Twitter Update of eComm Conference
  • 14. Lessons. Exercise 1: Everyone has the same & simple answers; Call centers; standard device commands; Speaker Independent. Exercise 2: Highly Personal Answers; Dictation; voice search; Speaker Dependent or
  • 19. Device-based systems: Local Recognition (ASR, Results). Known text; complex, personal text
  • 20. Device-based systems: Hybrid (Voice, ASR, Results). Voice to server, data back to device. Speaker independent (?)
  • 21. Engines: Speech Recognition (ASR). Summary: You can do almost anything, but the more you do, the more you pay.
  • 22. Telephony ASR is excellent. Inexpensive: “What city?” - “Amsterdam”. Very expensive: “What is wrong with your phone?” - “I dropped it on the floor, and the screen is cracked, and now I can’t see anything.”
  • 23. Cautions: No such thing as “speech to text”; speaker dependent comes closest. Voicemail to text: human assisted. Some telephone ASR is also human assisted.
  • 24. Speaker Dependent: Desktop computers can do excellent transcription, need corrections. Hand-held devices have more memory & power → better ASR
  • 25. Engines: Text-to-speech (TTS). Summary: Available in many languages, reasonable quality, sometimes difficult to understand.
  • 31. TTS requires language understanding and specific jargon translation: “Mr.” → “Mister”; “bbl” → “Be Back Later”; “287 m” → “about 300 meters”. Custom voices available
  • 32. Engines: Biometrics (Speaker Identification, Speaker Verification, Speaker Characterization). Summary: Speaker verification practical but still rare; speaker identification & characterization practical and secret
  • 33. Speaker Verification (is that really you?): Available, practical. Rare in the US, more prevalent in Australia, Israel, and Canada. Roadblocks: valid fear; fear of biometrics; love of fingerprints; only part of complete solution
  • 34. Speaker Identification (who are you?); Speaker Characterization (what are you?)
  • 35. Analytics: Data mining, problem discovery. Summary: Surprisingly useful, expensive
  • 36. Not a real-time process. Word searches, “speech to text”. Emotion detection by ASR (swearing) and by thynometrics (pitch, volume)
  • 37. About Disaggregate: Moshe Yudkowsky, Disaggregate, 2952 W. Fargo, Chicago, IL 60645, +1 773 764 8727, www.Disaggregate.com
  • 38. Headline Sponsor Platinum Sponsors Gold Sponsors 2009 | Westergasfabriek | Amsterdam | http://eComm.ec
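
The jargon translation on the TTS slide (“Mr.” → “Mister”, “bbl” → “Be Back Later”, “287 m” → “about 300 meters”) can be pictured as a text-normalization pass that runs before the text reaches the synthesizer. The following is only a minimal sketch under stated assumptions: the function names, the lookup tables, and the rule of rounding to one significant digit are hypothetical illustrations, not taken from the talk or from any particular TTS engine.

```python
import re

# Hypothetical lookup tables; the entries mirror the three slide examples.
ABBREVIATIONS = {"Mr.": "Mister"}
JARGON = {"bbl": "be back later"}   # chat jargon; other domains read it differently
UNITS = {"m": "meters"}


def round_to_one_significant_digit(value: int) -> int:
    """Round a non-negative integer to one significant digit (287 -> 300)."""
    magnitude = 10 ** (len(str(value)) - 1)
    return (value + magnitude // 2) // magnitude * magnitude


def normalize_for_tts(text: str) -> str:
    """Rewrite abbreviations, jargon, and measurements into speakable words."""

    def speak_measure(match: re.Match) -> str:
        value = int(match.group(1))
        unit = UNITS[match.group(2)]
        rounded = round_to_one_significant_digit(value)
        # Say "about" only when rounding actually changed the number.
        prefix = "about " if rounded != value else ""
        return f"{prefix}{rounded} {unit}"

    # Step 1: expand "<number> <unit>" pairs, e.g. "287 m".
    unit_pattern = r"\b(\d+)\s*(" + "|".join(re.escape(u) for u in UNITS) + r")\b"
    text = re.sub(unit_pattern, speak_measure, text)

    # Step 2: expand whole-token abbreviations and jargon.
    words = []
    for token in text.split():
        if token in ABBREVIATIONS:
            words.append(ABBREVIATIONS[token])
        elif token.lower() in JARGON:
            words.append(JARGON[token.lower()])
        else:
            words.append(token)
    return " ".join(words)


print(normalize_for_tts("Mr. Smith is 287 m away, bbl"))
```

A fixed table like this is exactly where the "language understanding" requirement bites: “bbl” means "be back later" in chat but "barrel" in the oil industry, so the tables would have to be chosen per application.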

Editor's notes

  1. Other topics: APIs, IDE, Grammar building tools, VUI tools
  2-3. (1) Ask the person next to you a question as if you were an airline reservations system: find out what city he wants to fly to. (2) Ask the person next to you for a Twitter update on the conference.
  4-9. Google, for example, does voicemail transcriptions, poorly.
  10. Practical deployment configurations
  11. The telco server is also hosted. The voice of the user (the “utterance”) must have a good, clean path to the recognition system.
  12. Known text: address book, firmware. Complex: dictation, add-on.
  13. Not practical in the network: who is using the phone?
  14. We have reviewed the hardware and the types of recognition. I will now review some more specific details about recognition.
  15. Not magic. You still have to manage the data; enroll users; deal with users who are locked out; etc.
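
The difference between speaker verification ("is that really you?") and speaker identification ("who are you?"), together with the housekeeping the last note mentions (enrolling users, handling lockouts), can be sketched roughly as below. Everything here is an assumption made for illustration: the `VoiceprintStore` class, the similarity threshold, the failure limit, and the use of plain number lists as stand-ins for the voiceprints a real biometric engine would produce.

```python
import math


def cosine_similarity(a, b):
    """Similarity of two equal-length feature vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class VoiceprintStore:
    """Hypothetical store of enrolled voiceprints with a simple lockout rule."""

    def __init__(self, threshold=0.8, max_failures=3):
        self.threshold = threshold        # assumed acceptance score
        self.max_failures = max_failures  # assumed lockout limit
        self.voiceprints = {}
        self.failures = {}

    def enroll(self, user_id, voiceprint):
        """Register a user; a real system would average several utterances."""
        self.voiceprints[user_id] = list(voiceprint)
        self.failures[user_id] = 0

    def is_locked_out(self, user_id):
        return self.failures.get(user_id, 0) >= self.max_failures

    def unlock(self, user_id):
        """Administrative reset for a locked-out user."""
        self.failures[user_id] = 0

    def verify(self, user_id, sample):
        """Verification: a 1:1 check of a sample against one claimed identity."""
        if user_id not in self.voiceprints or self.is_locked_out(user_id):
            return False
        score = cosine_similarity(self.voiceprints[user_id], sample)
        if score >= self.threshold:
            self.failures[user_id] = 0
            return True
        self.failures[user_id] += 1
        return False

    def identify(self, sample):
        """Identification: a 1:N search over everyone enrolled."""
        best_user, best_score = None, self.threshold
        for user_id, voiceprint in self.voiceprints.items():
            score = cosine_similarity(voiceprint, sample)
            if score >= best_score:
                best_user, best_score = user_id, score
        return best_user


store = VoiceprintStore()
store.enroll("alice", [1.0, 0.0, 0.2])
store.enroll("bob", [0.0, 1.0, 0.1])
print(store.verify("alice", [0.9, 0.1, 0.2]))  # claimed identity, checked 1:1
print(store.identify([0.1, 0.9, 0.1]))         # no claim, searched 1:N
```

Verification needs a claimed identity and answers yes or no, while identification needs no claim at all, which is one way to read why the slides call it practical and secret.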