SlideShare une entreprise Scribd logo
1  sur  15
Télécharger pour lire hors ligne
Reducing Language Barriers for Tourists
Using Handwriting Recognition Enabled
          Mobile Application

• Edgard Chammas, Chafic          Rami Al Hajj Mohamad
          Mokbel
                                 American University of the
  • University of Balamand             Middle East
   • Balamand, Lebanon                 Egaila, Kuwait

Cristina Oprean, Laurence Likforman Sulem, Gérard Chollet
                   TELECOM ParisTech
                       Paris, France

                     ACTEA 2012
INTRODUCTION
• The goal is to reduce the language barriers for tourists
  who do not speak the Arabic language
  – Take a photo of the text using your smart phone and get the
    corresponding information in your own language
• Handwriting Recognition enabled mobile application
  has been developed for this purpose
  – Robust recognition engine to recognize both handwritten and
    printed noisy texts
  – Two specific vocabularies:
     • Villages and Town Names
     • Restaurant Menu entries
HANDWRITING RECOGNITION SYSTEM


    Based on Hidden Markov Model (HMM)

    Feature vectors extracted from the input image
    by a sliding windows characterized by its size,
    overlap and sliding direction
    
        20 parameters and their derivatives are computed
        by automatically dividing the window into 21 cells
        and defining the baseline
    
        The sequence of feature vectors is modeled by
        HMMs
    
        A HMM model is dedicated to each variant of a letter
HANDWRITING RECOGNITION SYSTEM


    Training of the HMMs is done with the Expectation-
    Maximization (EM) algorithm that iteratively estimates
    their parameters guaranteeing a non-decrease of the
    likelihood function with each iteration


    The Viterbi algorithm yields to the identification of the
    written text by determining the most likely sequence.
RECOGNITION OF PRINTED TEXTS
          USING THE HANDWRITING
           RECOGNITION SYSTEM

    Problem 1: Need for an efficient recognition system to
    recognize both handwritten and printed texts

    Hypothesis to be tested: If sufficient data covering an
    extended set of variability in handwritten texts is used to
    train the handwriting recognition system, then this
    system would be useful in recognizing both handwritten
    and printed texts
       
           The printed text can be considered as the median
           form of the corresponding handwritten text
       
           The handwritten texts are variations of the
           corresponding printed texts
RECOGNITION OF PRINTED TEXTS
      USING THE HANDWRITING
       RECOGNITION SYSTEM
• Problem 2: The system should be easily
  adaptable to any vocabulary
  – No additional data need to be collected and a
    retraining performed when changing the vocabulary


• Solution to be tested: Word models are the
  concatenation of letters’ models
  – Any new word model can be automatically built by
    concatenating its letters’ models
EXPERIMENTS AND RESULTS

    Balamand HCM toolkit used for both training and
    recognition using HMMs

    State of the art UOB-ENST system trained on the IFN-
    ENIT database of Tunisian village names
    
        Vocabulary set of 946 names, written by more than 400
        writers. 26000 images are used for training

    The current performance of the system on handwritten
    words is 90.9%




             Sample image from the training IFN/ENIT
EXPERIMENTS AND RESULTS

    Need to evaluate performance on printed texts.

    Target vocabulary: Lebanese village names and restaurant
    menu entries.

    The HMM models of words are obtained by concatenating
    the character HMMs trained on the IFN/ENIT database

    No printed texts have been included in the training sets

    Small test database has been collected:
       
           40 menu entries corresponding to Lebanese
           specialties
       
           40 Lebanese towns and villages
EXPERIMENTS AND RESULTS

    A test set of 240 images is constructed by
    typing in 3 different fonts these menu entries,
    town and village names




    Sample images from the locally collected test databases
EXPERIMENTS AND RESULTS

    Only 2 errors for 120 menu entries images
    tested and 3 errors for the town and village
    names images
    
        2% error rate

    If we consider the top 5 solutions, then this
    system has no error on this printed database
    even when dealing with a completely different
    vocabulary
MOBILE APPLICATION
MOBILE APPLICATION
CONCLUSIONS


    A novel approach combining both handwritten and
    printed texts recognition in a mobile application

    HMM-based handwritten recognition system achieved
    only 2% error rate on typed texts

    High accuracy also maintained with handwritten words
    belonging to a vocabulary different from the one used
    in training the HMM models

    N-best solutions returned by the server to the user
    makes the application robust to recognition errors
FUTURE WORK


    The simultaneous recognition of handwritten and
    printed texts will be further studied

    The effect on the performance of the introduction of
    printed texts in the training set will be measured and
    interpreted

    The geographical positioning provided by the device
    could be sent to the server permitting to guide the
    recognition by limiting the vocabulary in use
Thank You
Questions?

Contenu connexe

Similaire à Reducing Language Barriers for Tourists Using Handwriting Recognition Enabled Mobile Application

Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate
Katia Raichlin-Levi
 
Resumefresh1
Resumefresh1Resumefresh1
Resumefresh1
ncct
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
Diptimaya Sarangi
 
A Survey on Speaker Recognition System
A Survey on Speaker Recognition SystemA Survey on Speaker Recognition System
A Survey on Speaker Recognition System
Vani011
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327
IJMER
 

Similaire à Reducing Language Barriers for Tourists Using Handwriting Recognition Enabled Mobile Application (20)

IRJET - Cognitive based Emotion Analysis of a Child Reading a Book
IRJET -  	  Cognitive based Emotion Analysis of a Child Reading a BookIRJET -  	  Cognitive based Emotion Analysis of a Child Reading a Book
IRJET - Cognitive based Emotion Analysis of a Child Reading a Book
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
40120130406014 2
40120130406014 240120130406014 2
40120130406014 2
 
IRJET- Vocal Code
IRJET- Vocal CodeIRJET- Vocal Code
IRJET- Vocal Code
 
Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate Raichlin et al_Mobile application for Speech Rate
Raichlin et al_Mobile application for Speech Rate
 
Resumefresh1
Resumefresh1Resumefresh1
Resumefresh1
 
BTP paper
BTP paperBTP paper
BTP paper
 
Applying static code analysis for domain-specific languages
Applying static code analysis for domain-specific languagesApplying static code analysis for domain-specific languages
Applying static code analysis for domain-specific languages
 
Automatic attendance system
Automatic attendance systemAutomatic attendance system
Automatic attendance system
 
speech enhancement
speech enhancementspeech enhancement
speech enhancement
 
Canonical Formatted Address Data
Canonical Formatted Address DataCanonical Formatted Address Data
Canonical Formatted Address Data
 
Canonical Formatted Address Data
Canonical Formatted Address DataCanonical Formatted Address Data
Canonical Formatted Address Data
 
Arabic digits speech recognition and speaker identification in noisy environm...
Arabic digits speech recognition and speaker identification in noisy environm...Arabic digits speech recognition and speaker identification in noisy environm...
Arabic digits speech recognition and speaker identification in noisy environm...
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
4. processor
4. processor4. processor
4. processor
 
IRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & AutocorrectionIRJET- Vernacular Language Spell Checker & Autocorrection
IRJET- Vernacular Language Spell Checker & Autocorrection
 
Breaking the language barrier: how do we quickly add multilanguage support in...
Breaking the language barrier: how do we quickly add multilanguage support in...Breaking the language barrier: how do we quickly add multilanguage support in...
Breaking the language barrier: how do we quickly add multilanguage support in...
 
A Survey on Speaker Recognition System
A Survey on Speaker Recognition SystemA Survey on Speaker Recognition System
A Survey on Speaker Recognition System
 
E0502 01 2327
E0502 01 2327E0502 01 2327
E0502 01 2327
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 

Reducing Language Barriers for Tourists Using Handwriting Recognition Enabled Mobile Application

  • 1. Reducing Language Barriers for Tourists Using Handwriting Recognition Enabled Mobile Application • Edgard Chammas, Chafic Rami Al Hajj Mohamad Mokbel American University of the • University of Balamand Middle East • Balamand, Lebanon Egaila, Kuwait Cristina Oprean, Laurence Likforman Sulem, Gérard Chollet TELECOM ParisTech Paris, France ACTEA 2012
  • 2. INTRODUCTION • The goal is to reduce the language barriers for tourists who do not speak the Arabic language – Take a photo of the text using your smart phone and get the corresponding information in your own language • Handwriting Recognition enabled mobile application has been developed for this purpose – Robust recognition engine to recognize both handwritten and printed noisy texts – Two specific vocabularies: • Villages and Town Names • Restaurant Menu entries
  • 3. HANDWRITING RECOGNITION SYSTEM  Based on Hidden Markov Model (HMM)  Feature vectors extracted from the input image by a sliding windows characterized by its size, overlap and sliding direction  20 parameters and their derivatives are computed by automatically dividing the window into 21 cells and defining the baseline  The sequence of feature vectors is modeled by HMMs  A HMM model is dedicated to each variant of a letter
  • 4. HANDWRITING RECOGNITION SYSTEM  Training of the HMMs is done with the Expectation- Maximization (EM) algorithm that iteratively estimates their parameters guaranteeing a non-decrease of the likelihood function with each iteration  The Viterbi algorithm yields to the identification of the written text by determining the most likely sequence.
  • 5. RECOGNITION OF PRINTED TEXTS USING THE HANDWRITING RECOGNITION SYSTEM  Problem 1: Need for an efficient recognition system to recognize both handwritten and printed texts  Hypothesis to be tested: If sufficient data covering an extended set of variability in handwritten texts is used to train the handwriting recognition system, then this system would be useful in recognizing both handwritten and printed texts  The printed text can be considered as the median form of the corresponding handwritten text  The handwritten texts are variations of the corresponding printed texts
  • 6. RECOGNITION OF PRINTED TEXTS USING THE HANDWRITING RECOGNITION SYSTEM • Problem 2: The system should be easily adaptable to any vocabulary – No additional data need to be collected and a retraining performed when changing the vocabulary • Solution to be tested: Word models are the concatenation of letters’ models – Any new word model can be automatically built by concatenating its letters’ models
  • 7. EXPERIMENTS AND RESULTS  Balamand HCM toolkit used for both training and recognition using HMMs  State of the art UOB-ENST system trained on the IFN- ENIT database of Tunisian village names  Vocabulary set of 946 names, written by more than 400 writers. 26000 images are used for training  The current performance of the system on handwritten words is 90.9% Sample image from the training IFN/ENIT
  • 8. EXPERIMENTS AND RESULTS  Need to evaluate performance on printed texts.  Target vocabulary: Lebanese village names and restaurant menu entries.  The HMM models of words are obtained by concatenating the character HMMs trained on the IFN/ENIT database  No printed texts have been included in the training sets  Small test database has been collected:  40 menu entries corresponding to Lebanese specialties  40 Lebanese towns and villages
  • 9. EXPERIMENTS AND RESULTS  A test set of 240 images is constructed by typing in 3 different fonts these menu entries, town and village names Sample images from the locally collected test databases
  • 10. EXPERIMENTS AND RESULTS  Only 2 errors for 120 menu entries images tested and 3 errors for the town and village names images  2% error rate  If we consider the top 5 solutions, then this system has no error on this printed database even when dealing with a completely different vocabulary
  • 13. CONCLUSIONS  A novel approach combining both handwritten and printed texts recognition in a mobile application  HMM-based handwritten recognition system achieved only 2% error rate on typed texts  High accuracy also maintained with handwritten words belonging to a vocabulary different from the one used in training the HMM models  N-best solutions returned by the server to the user makes the application robust to recognition errors
  • 14. FUTURE WORK  The simultaneous recognition of handwritten and printed texts will be further studied  The effect on the performance of the introduction of printed texts in the training set will be measured and interpreted  The geographical positioning provided by the device could be sent to the server permitting to guide the recognition by limiting the vocabulary in use