SlideShare a Scribd company logo
1 of 4
InterSpeech 2012
Home
13th Annual Conference of the International Speech Communication Association
September 9-13, 2012 |
Portland, Oregon
About The
Conference
Program
Grants and
Awards
Sponsors
ISDN Number:
1990-9770
Conference
Poster
Helpful Hotlinks
EventScribe
Attendee Roster
Final Agenda
Organizing Secretariat
Computer-Assisted Language
Learning (CALL) Systems
Overview
Computer-assisted language learning (CALL) provides an effective
learning environment so that students can practice in an interactive
manner using multi-media content, either with the supervision of
teachers or on their own pace in self-learning. The advancement of
speech and language technologies has opened new perspectives on
CALL systems, such as automatic pronunciation assessment and
simulated conversational-style lessons. CALL is also regarded as one
of new and promising applications of speech analysis, recognition and
synthesis. CALL covers a variety of aspects including segmental,
prosodic and lexical features. Modeling non-native speech to correctly
segment/recognize utterances while detecting errors included in them
poses a number of challenges in speech processing. Assessing
intelligibility of non-native speech or proficiency of non-native
speakers is also an important issue. In this tutorial, we will give an
overview on these issues and current solutions. The tutorial is mainly
targeted for speech researchers and engineers interested in CALL, but
also for those engaged in language teaching or learning technology.
First we review speech recognition technologies for pronunciation
learning, specifically pronunciation evaluation and error detection.
Statistical approaches to these problems are formulated, and then
acoustic and pronunciation modeling of non-native speech is described.
Unlike the conventional non-native speech recognition, error detection
capability is required in CALL, thus an effective error prediction
scheme is vitally important. Next, we address prosodic modeling and
evaluation, such as duration, stress and tones, and then the use of
speech synthesis technologies including re-synthesis and morphing.
After the review of basic component technologies, we introduce a
number of practical CALL systems which have been developed as
commercial products or deployed in classrooms, including those in our
universities. Majority of them focus on learning English as a second
language (ESL), but some deal with other languages such as Japanese
and Chinese. We also review databases of non-native speech, which
are necessary to develop CALL systems.
Outline
1. Introduction and Overview (Kawahara)
Review history and category of CALL systems.
2. Segmental aspect and speech recognition technology
(Kawahara)
2.1. Speech analysis for CALL
2.2. Segmentation of non-native speech
2.3. Error detection of non-native speech
2.4. Scoring of non-native speech
2.5. Acoustic model for non-native speech
2.6. Pronunciation model for non-native speech
2.7. Discriminative modeling
3. Prosodic aspect (Minematsu)
3.1. Prosodic deviations found in non-native pronunciation
3.2. Duration modeling & evaluation
3.3. Stress and tone modeling & evaluation
3.4. Intonation modeling & evaluation
4. Speech synthesis technology for CALL (Minematsu)
4.1. Text-to-speech for CALL
4.2. Re-synthesis for CALL
4.3. Morphing for CALL
5. Practical CALL systems (Kawahara)
Review major CALL systems that have been developed and
deployed for learning English and other languages.
6. Database for CALL (Minematsu)
Review major databases of non-native speech, which are
critical resources in developing CALL systems.
Short Biographies
Tatsuya Kawahara is a professor in Academic Center for Computing
and Media Studies and an affiliated professor in School of Informatics,
Kyoto University.
He has also been an invited researcher at ATR and NICT. He was a
visiting researcher at Bell Laboratories from 1995 to 1996. He has
published more than 200 technical papers on speech recognition,
spoken language processing, and spoken dialog systems. He has been
managing several speech-related projects including a free speech
recognition engine Julius (http://julius.sourceforge.jp/) and the
automatic transcription system for the Japanese Parliament (Diet).
From 2003 to 2006, he was a member of IEEE SPS Speech Technical
Committee. From 2011, he is a secretary of IEEE SPS Japan Chapter.
He was a general chair of IEEE Automatic Speech Recognition &
Understanding workshop (ASRU 2007). He has also served as a
tutorial chair of INTERSPEECH 2010 and a local arrangement chair of
ICASSP 2012. He is an editorial board member of Elsevier Journal of
Computer Speech and Language, ACM Transactions on Speech and
Language Processing, and APSIPA Transactions on Signal and
Information. He is a senior member of IEEE.
E-mail: kawahara@i.kyoto-u.ac.jp
Webpage: http://www.ar.media.kyoto-u.ac.jp/members/kawahara/
Nobuaki Minematsu is an associate professor in Graduate School of
Information Science and Technology, the University of Tokyo. He was
a visiting researcher at Royal Institute of Technology, Sweden (KTH)
from 2002 to 2003. He has a very wide interest in speech
communication covering from science to engineering. He has published
more than 200 scientific and technical papers including conference
papers. Those papers are on speech analysis, speech perception, speech
recognition, speech synthesis, language learning systems, etc. He was a
member of the organizing committee of Speech Prosody 2004, L2WS
2010, INTERSPEECH 2010. From 2006, he is a member of SLaTE
(ISCA SIG on Speech and Language Technology in Education). From
2011, he is a treasurer of IEEE SPS Japan Chapter. He has also been
serving as an editorial board member of Acoustic Society of Japan, The
Institute of Electronics, Information and Communication Engineers,
and Information Processing Society of Japan.

More Related Content

What's hot

Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
ankit_saluja
 
Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speech
SubmissionResearchpa
 

What's hot (19)

Résumé
RésuméRésumé
Résumé
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
 
Generations of programming_language.kum_ari11-1-1-1
Generations of programming_language.kum_ari11-1-1-1Generations of programming_language.kum_ari11-1-1-1
Generations of programming_language.kum_ari11-1-1-1
 
IRJET- Communication Aid for Deaf and Dumb People
IRJET- Communication Aid for Deaf and Dumb PeopleIRJET- Communication Aid for Deaf and Dumb People
IRJET- Communication Aid for Deaf and Dumb People
 
Evolution of programinglang
Evolution of programinglangEvolution of programinglang
Evolution of programinglang
 
Lila Prabodh Pragya Interactive course
Lila Prabodh Pragya Interactive courseLila Prabodh Pragya Interactive course
Lila Prabodh Pragya Interactive course
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law companding
 
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...Different valuable tools for Arabic sentiment analysis: a comparative evaluat...
Different valuable tools for Arabic sentiment analysis: a comparative evaluat...
 
call and study skills-1
call and study skills-1call and study skills-1
call and study skills-1
 
Summer Research Project (Anusaaraka) Report
Summer Research Project (Anusaaraka) ReportSummer Research Project (Anusaaraka) Report
Summer Research Project (Anusaaraka) Report
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
A017420108
A017420108A017420108
A017420108
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
 
IS-EUD-2015, Madrid, Spain, 27 May 2015
IS-EUD-2015, Madrid, Spain, 27 May 2015IS-EUD-2015, Madrid, Spain, 27 May 2015
IS-EUD-2015, Madrid, Spain, 27 May 2015
 
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
 
Principal of Programming Language
Principal of Programming Language Principal of Programming Language
Principal of Programming Language
 
A Review on the Cross and Multilingual Information Retrieval
A Review on the Cross and Multilingual Information RetrievalA Review on the Cross and Multilingual Information Retrieval
A Review on the Cross and Multilingual Information Retrieval
 
Voice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social mediaVoice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social media
 
Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speech
 

Viewers also liked

Viewers also liked (9)

Verb phrase
Verb phraseVerb phrase
Verb phrase
 
Verb phrase
Verb phraseVerb phrase
Verb phrase
 
Bees
BeesBees
Bees
 
Best Artist Rep Magazine December 2015 issue
Best Artist Rep Magazine December 2015 issueBest Artist Rep Magazine December 2015 issue
Best Artist Rep Magazine December 2015 issue
 
Learn BEM: CSS Naming Convention
Learn BEM: CSS Naming ConventionLearn BEM: CSS Naming Convention
Learn BEM: CSS Naming Convention
 
How to Build a Dynamic Social Media Plan
How to Build a Dynamic Social Media PlanHow to Build a Dynamic Social Media Plan
How to Build a Dynamic Social Media Plan
 
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika AldabaLightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
 
SEO: Getting Personal
SEO: Getting PersonalSEO: Getting Personal
SEO: Getting Personal
 
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job? Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
 

Similar to CALL (computer Assisted Language)

Calico 2014 intelligent call - def
Calico 2014   intelligent call - defCalico 2014   intelligent call - def
Calico 2014 intelligent call - def
Piet Desmet
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Waqas Tariq
 
Teachbot teaching robot_using_artificial
Teachbot teaching robot_using_artificialTeachbot teaching robot_using_artificial
Teachbot teaching robot_using_artificial
CamillaTonanzi
 
12EEE032- text 2 voice
12EEE032-  text 2 voice12EEE032-  text 2 voice
12EEE032- text 2 voice
Nsaroj kumar
 
EdMedia2013 - Educational Impacts of the Intelligent Integrated Computer-Assi...
EdMedia2013 - Educational Impacts of the Intelligent Integrated Computer-Assi...EdMedia2013 - Educational Impacts of the Intelligent Integrated Computer-Assi...
EdMedia2013 - Educational Impacts of the Intelligent Integrated Computer-Assi...
Harald Wahl
 
A Strong Object Recognition Using Lbp, Ltp And Rlbp
A Strong Object Recognition Using Lbp, Ltp And RlbpA Strong Object Recognition Using Lbp, Ltp And Rlbp
A Strong Object Recognition Using Lbp, Ltp And Rlbp
Rikki Wright
 
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
IJCI JOURNAL
 

Similar to CALL (computer Assisted Language) (20)

Calico 2014 intelligent call - def
Calico 2014   intelligent call - defCalico 2014   intelligent call - def
Calico 2014 intelligent call - def
 
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
English speaking proficiency assessment using speech and electroencephalograp...
English speaking proficiency assessment using speech and electroencephalograp...English speaking proficiency assessment using speech and electroencephalograp...
English speaking proficiency assessment using speech and electroencephalograp...
 
Speech-Recognition.pptx
Speech-Recognition.pptxSpeech-Recognition.pptx
Speech-Recognition.pptx
 
Teachbot teaching robot_using_artificial
Teachbot teaching robot_using_artificialTeachbot teaching robot_using_artificial
Teachbot teaching robot_using_artificial
 
IRJET - Gesture based Communication Recognition System
IRJET -  	  Gesture based Communication Recognition SystemIRJET -  	  Gesture based Communication Recognition System
IRJET - Gesture based Communication Recognition System
 
visH (fin).pptx
visH (fin).pptxvisH (fin).pptx
visH (fin).pptx
 
IRJET- Kinyarwanda Speech Recognition in an Automatic Dictation System for Tr...
IRJET- Kinyarwanda Speech Recognition in an Automatic Dictation System for Tr...IRJET- Kinyarwanda Speech Recognition in an Automatic Dictation System for Tr...
IRJET- Kinyarwanda Speech Recognition in an Automatic Dictation System for Tr...
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
 
12EEE032- text 2 voice
12EEE032-  text 2 voice12EEE032-  text 2 voice
12EEE032- text 2 voice
 
Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
Hidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala languageHidden markov model based part of speech tagger for sinhala language
Hidden markov model based part of speech tagger for sinhala language
 
1.pdf
1.pdf1.pdf
1.pdf
 
EdMedia2013 - Educational Impacts of the Intelligent Integrated Computer-Assi...
EdMedia2013 - Educational Impacts of the Intelligent Integrated Computer-Assi...EdMedia2013 - Educational Impacts of the Intelligent Integrated Computer-Assi...
EdMedia2013 - Educational Impacts of the Intelligent Integrated Computer-Assi...
 
A Strong Object Recognition Using Lbp, Ltp And Rlbp
A Strong Object Recognition Using Lbp, Ltp And RlbpA Strong Object Recognition Using Lbp, Ltp And Rlbp
A Strong Object Recognition Using Lbp, Ltp And Rlbp
 
SECOND LANGUAGE RESEARCH.pptx
SECOND LANGUAGE RESEARCH.pptxSECOND LANGUAGE RESEARCH.pptx
SECOND LANGUAGE RESEARCH.pptx
 
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
Ara--CANINE: Character-Based Pre-Trained Language Model for Arabic Language U...
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language ModelsIRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models
IRJET- Tamil Speech to Indian Sign Language using CMUSphinx Language Models
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

CALL (computer Assisted Language)

  • 1. InterSpeech 2012 Home 13th Annual Conference of the International Speech Communication Association September 9-13, 2012 | Portland, Oregon About The Conference Program Grants and Awards Sponsors ISDN Number: 1990-9770 Conference Poster Helpful Hotlinks EventScribe Attendee Roster Final Agenda Organizing Secretariat Computer-Assisted Language Learning (CALL) Systems Overview Computer-assisted language learning (CALL) provides an effective learning environment so that students can practice in an interactive manner using multi-media content, either with the supervision of teachers or on their own pace in self-learning. The advancement of speech and language technologies has opened new perspectives on CALL systems, such as automatic pronunciation assessment and simulated conversational-style lessons. CALL is also regarded as one of new and promising applications of speech analysis, recognition and synthesis. CALL covers a variety of aspects including segmental, prosodic and lexical features. Modeling non-native speech to correctly segment/recognize utterances while detecting errors included in them poses a number of challenges in speech processing. Assessing intelligibility of non-native speech or proficiency of non-native speakers is also an important issue. In this tutorial, we will give an overview on these issues and current solutions. The tutorial is mainly targeted for speech researchers and engineers interested in CALL, but also for those engaged in language teaching or learning technology. First we review speech recognition technologies for pronunciation learning, specifically pronunciation evaluation and error detection. Statistical approaches to these problems are formulated, and then acoustic and pronunciation modeling of non-native speech is described.
  • 2. Unlike the conventional non-native speech recognition, error detection capability is required in CALL, thus an effective error prediction scheme is vitally important. Next, we address prosodic modeling and evaluation, such as duration, stress and tones, and then the use of speech synthesis technologies including re-synthesis and morphing. After the review of basic component technologies, we introduce a number of practical CALL systems which have been developed as commercial products or deployed in classrooms, including those in our universities. Majority of them focus on learning English as a second language (ESL), but some deal with other languages such as Japanese and Chinese. We also review databases of non-native speech, which are necessary to develop CALL systems. Outline 1. Introduction and Overview (Kawahara) Review history and category of CALL systems. 2. Segmental aspect and speech recognition technology (Kawahara) 2.1. Speech analysis for CALL 2.2. Segmentation of non-native speech 2.3. Error detection of non-native speech 2.4. Scoring of non-native speech 2.5. Acoustic model for non-native speech 2.6. Pronunciation model for non-native speech 2.7. Discriminative modeling 3. Prosodic aspect (Minematsu) 3.1. Prosodic deviations found in non-native pronunciation
  • 3. 3.2. Duration modeling & evaluation 3.3. Stress and tone modeling & evaluation 3.4. Intonation modeling & evaluation 4. Speech synthesis technology for CALL (Minematsu) 4.1. Text-to-speech for CALL 4.2. Re-synthesis for CALL 4.3. Morphing for CALL 5. Practical CALL systems (Kawahara) Review major CALL systems that have been developed and deployed for learning English and other languages. 6. Database for CALL (Minematsu) Review major databases of non-native speech, which are critical resources in developing CALL systems. Short Biographies Tatsuya Kawahara is a professor in Academic Center for Computing and Media Studies and an affiliated professor in School of Informatics, Kyoto University. He has also been an invited researcher at ATR and NICT. He was a visiting researcher at Bell Laboratories from 1995 to 1996. He has published more than 200 technical papers on speech recognition, spoken language processing, and spoken dialog systems. He has been managing several speech-related projects including a free speech recognition engine Julius (http://julius.sourceforge.jp/) and the automatic transcription system for the Japanese Parliament (Diet). From 2003 to 2006, he was a member of IEEE SPS Speech Technical Committee. From 2011, he is a secretary of IEEE SPS Japan Chapter. He was a general chair of IEEE Automatic Speech Recognition & Understanding workshop (ASRU 2007). He has also served as a tutorial chair of INTERSPEECH 2010 and a local arrangement chair of ICASSP 2012. He is an editorial board member of Elsevier Journal of
  • 4. Computer Speech and Language, ACM Transactions on Speech and Language Processing, and APSIPA Transactions on Signal and Information. He is a senior member of IEEE. E-mail: kawahara@i.kyoto-u.ac.jp Webpage: http://www.ar.media.kyoto-u.ac.jp/members/kawahara/ Nobuaki Minematsu is an associate professor in Graduate School of Information Science and Technology, the University of Tokyo. He was a visiting researcher at Royal Institute of Technology, Sweden (KTH) from 2002 to 2003. He has a very wide interest in speech communication covering from science to engineering. He has published more than 200 scientific and technical papers including conference papers. Those papers are on speech analysis, speech perception, speech recognition, speech synthesis, language learning systems, etc. He was a member of the organizing committee of Speech Prosody 2004, L2WS 2010, INTERSPEECH 2010. From 2006, he is a member of SLaTE (ISCA SIG on Speech and Language Technology in Education). From 2011, he is a treasurer of IEEE SPS Japan Chapter. He has also been serving as an editorial board member of Acoustic Society of Japan, The Institute of Electronics, Information and Communication Engineers, and Information Processing Society of Japan.