Accueil
Explorer
Soumettre la recherche
Mettre en ligne
S’identifier
S’inscrire
Publicité
IRJET- On-Screen Translator using NLP and Text Detection
Signaler
IRJET Journal
Suivre
Fast Track Publications
30 Dec 2019
•
0 j'aime
0 j'aime
×
Soyez le premier à aimer ceci
afficher plus
•
15 vues
vues
×
Nombre de vues
0
Sur Slideshare
0
À partir des intégrations
0
Nombre d'intégrations
0
Prochain SlideShare
[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...
Chargement dans ... 3
1
sur
3
Top clipped slide
IRJET- On-Screen Translator using NLP and Text Detection
30 Dec 2019
•
0 j'aime
0 j'aime
×
Soyez le premier à aimer ceci
afficher plus
•
15 vues
vues
×
Nombre de vues
0
Sur Slideshare
0
À partir des intégrations
0
Nombre d'intégrations
0
Télécharger maintenant
Télécharger pour lire hors ligne
Signaler
Ingénierie
https://www.irjet.net/archives/V6/i11/IRJET-V6I11122.pdf
IRJET Journal
Suivre
Fast Track Publications
Publicité
Publicité
Publicité
Recommandé
[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...
IJET - International Journal of Engineering and Techniques
166 vues
•
4 diapositives
IRJET- Vocal Code
IRJET Journal
14 vues
•
4 diapositives
IRJET - Pseudocode to Python Translation using Machine Learning
IRJET Journal
26 vues
•
5 diapositives
IRJET - Speech to Speech Translation using Encoder Decoder Architecture
IRJET Journal
10 vues
•
3 diapositives
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
kevig
53 vues
•
5 diapositives
DOMAIN BASED CHUNKING
kevig
64 vues
•
14 diapositives
Contenu connexe
Présentations pour vous
(18)
Notes on attention mechanism
Khang Pham
•
1.8K vues
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
Hiroki Shimanaka
•
420 vues
Nlp research presentation
Surya Sg
•
198 vues
Natural Language Processing - Research and Application Trends
Shreyas Suresh Rao
•
67 vues
[Paper review] BERT
JEE HYUN PARK
•
1K vues
text summarization using amr
amit nagarkoti
•
843 vues
BERT introduction
Hanwha System / ICT
•
3.2K vues
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
Lifeng (Aaron) Han
•
200 vues
NLP State of the Art | BERT
shaurya uppal
•
2.3K vues
BERT - Part 1 Learning Notes of Senthil Kumar
Senthil Kumar M
•
207 vues
02 15034 neural network
IAESIJEECS
•
15 vues
Cohesive Software Design
ijtsrd
•
26 vues
NLP Project Presentation
Aryak Sengupta
•
2.1K vues
BERT
Sang Hyun Jeon
•
152 vues
Bert
Abdallah Bashir
•
5.2K vues
Isolated word recognition using lpc & vector quantization
eSAT Publishing House
•
457 vues
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
csandit
•
379 vues
IRJET - Analysis of Paraphrase Detection using NLP Techniques
IRJET Journal
•
17 vues
Similaire à IRJET- On-Screen Translator using NLP and Text Detection
(20)
A NEURAL MACHINE LANGUAGE TRANSLATION SYSTEM FROM GERMAN TO ENGLISH
IRJET Journal
•
3 vues
IRJET- Text Extraction from Text Based Image using Android
IRJET Journal
•
39 vues
IRJET- Visual Question Answering using Combination of LSTM and CNN: A Survey
IRJET Journal
•
32 vues
A SMART LANGUAGE TRANSLATION TECHNIQUE USING OCR
IRJET Journal
•
3 vues
IRJET - Threat Prediction using Speech Analysis
IRJET Journal
•
11 vues
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET Journal
•
12 vues
IRJET- ASL Language Translation using ML
IRJET Journal
•
21 vues
IRJET- Factoid Question and Answering System
IRJET Journal
•
16 vues
INDIAN SIGN LANGUAGE TRANSLATION FOR HARD-OF-HEARING AND HARD-OF-SPEAKING COM...
IRJET Journal
•
2 vues
IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET Journal
•
13 vues
Speech To Speech Translation
IRJET Journal
•
3 vues
IRJET- Voice to Code Editor using Speech Recognition
IRJET Journal
•
17 vues
Recent Trends in Translation of Programming Languages using NLP Approaches
IRJET Journal
•
4 vues
IRJET- Speech Based Answer Sheet Evaluation System
IRJET Journal
•
23 vues
IRJET- Voice based Billing System
IRJET Journal
•
46 vues
Handwritten Text Recognition and Translation with Audio
IRJET Journal
•
7 vues
D017232729
IOSR Journals
•
149 vues
Class Diagram Extraction from Textual Requirements Using NLP Techniques
iosrjce
•
258 vues
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IRJET Journal
•
15 vues
Language and Offensive Word Detection
IRJET Journal
•
7 vues
Publicité
Plus de IRJET Journal
(20)
AN INVESTIGATION ON THE EFFICACY OF MARBLE DUST POWDER FOR STRENGTHENING THE ...
IRJET Journal
•
26 vues
A Survey on Person Detection for Social Distancing and Safety Violation Alert...
IRJET Journal
•
8 vues
Thermal Analysis and Design Optimization of Solar Chimney using CFD
IRJET Journal
•
7 vues
Common Skin Disease Diagnosis and Prediction: A Review
IRJET Journal
•
41 vues
A STUDY ON TWITTER SENTIMENT ANALYSIS USING DEEP LEARNING
IRJET Journal
•
22 vues
Band structure of metallic single-walled carbon nanotubes
IRJET Journal
•
5 vues
IoT Based Assistive Glove for Visually Impaired People
IRJET Journal
•
10 vues
A SUPERVISED MACHINE LEARNING APPROACH USING K-NEAREST NEIGHBOR ALGORITHM TO ...
IRJET Journal
•
3 vues
An Overview of Traffic Accident Detection System using IoT
IRJET Journal
•
5 vues
Artificial Neural Network: A brief study
IRJET Journal
•
4 vues
Review Paper on Performance Analysis of LPG Refrigerator
IRJET Journal
•
6 vues
WOLBACHIA PIPIENTIS AND VARROA DESTRUCTOR MITE INFESTATION RATES OF NATIVE VE...
IRJET Journal
•
5 vues
Survey on Automatic Kidney Lesion Detection using Deep Learning
IRJET Journal
•
3 vues
A Review on Safety Management System in School Bus
IRJET Journal
•
10 vues
Design and Validation of a Light Vehicle Tubular Space Frame Chassis
IRJET Journal
•
7 vues
SMART BLIND STICK USING VOICE MODULE
IRJET Journal
•
12 vues
AUTOMATIC HYPODERMIC INFUSION REGULATOR
IRJET Journal
•
7 vues
Mobile Application of Pet Adoption System
IRJET Journal
•
11 vues
Literature Survey on “Crowdfunding Using Blockchain”
IRJET Journal
•
8 vues
Pricing Optimization using Machine Learning
IRJET Journal
•
7 vues
Dernier
(20)
PGM2.pptx
LavanyaVaishnavi2
•
0 vue
International Journal of Information Technology, Modeling and Computing (IJITMC)
Zac Darcy
•
0 vue
HAZARD IDENTIFICATION PRESENTATION.ppt
LeopordLeon
•
0 vue
materi 3.pptx
AnisQusayiriah1
•
0 vue
2011Sulowski_Pres.pdf
DeSouzaSoluesemSegur
•
0 vue
AERODYNAMIC_Kevin Michaels_EN (1).pptx
ShubhasmitaBhuyan1
•
0 vue
Introduction-to-bridge-engineering.pdf
ssuser7204f4
•
0 vue
Presentation%20Garv%20saini.pptx
RajaSaini12
•
0 vue
N__Libya_RE_National_Plan.pdf
Omar30468
•
0 vue
PLANTILLA POWER POINT INGENIERIA INDUSTRIAL
LuisSilva716492
•
0 vue
FINAL B.V.C 8051.pptx
Jayshree873826
•
0 vue
PGM3.pptx
LavanyaVaishnavi2
•
0 vue
MAPA GEOLOGICO 6.pdf
CarlosBustamante343690
•
0 vue
Polyiisoprene.pptx
Luis Tormento
•
0 vue
Final_Report_-_Smart_Door_Lock_PDF.pdf
AimanAnuar6
•
0 vue
VENTILADOR D CT-100.pdf
DavidAaronRodriguezA
•
0 vue
Analog electronic circuits
RajasiSamanta
•
0 vue
E680230530_13J2.pdf
IvanS64
•
0 vue
Information Theory and Coding
VIT-AP University
•
0 vue
presentation.pptx
shamaaslam3
•
0 vue
Publicité
IRJET- On-Screen Translator using NLP and Text Detection
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1167 On-screen Translator using NLP and Text Detection Prof. Vidya G. Shitole1, Mr. Aditya Agashe2 1Assistant Professor, Suman Ramesh Tulsiani Technical Campus, FoE – Kamshet, Maharashtra, India. 2Student, Suman Ramesh Tulsiani Technical Campus, FoE – Kamshet, Maharashtra, India ----------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Newer is the language, hard to understand new words. If you get stuck at any point while readingaparagraph to translate something in the understandable language you need a translation of that particular word. Insteadof going on to the new tab and searching for the translation of the word, if the translation is available on-screen over that word will be comfortable for a person. To translate a word or phrase intoa native language from a different language we can use the application of Natural Language Processing and it can be made reliable using text detection technique. NLPhasasyntax checker to check the grammar of input and also to rebuild input. NLP can be used to format a scattered text. Text detections fetch the text and pass parameters of it to methods to search for the translated form of the input text. It can be from a video frame or webpage. Text can be fetched usingOCR (optical character recognition) method from sensors or by using input images or sources, that to be translated. Key Words: NLP, text detection, Natural language processing, OCR, 1. INTRODUCTION Machine translation depends on a large dataset of sentences or words of both source and target language. Monolingual data is widely used to train languagemodelswhichincreases the performance and fluency of machine translation. We are mainly focusing on back-translation because it operates in a semi-supervised setup. Here both monolingual andbilingual data in target language are available. Detection of text, fetching content from detection, processing input and proving appropriate result as an output will be the format of the procedure. Text detection will take place from the source input like images or frame or text selection, after that we need to read the source text to give it as input to the processing model. After that to process input, we have a model that will use LSTM (Long Short-Term Memory) and RNN to analyse the word from a dataset and which will return the predicted suggestion as an output which will be our final result of the program. 1.1 Text Detection Text detection is basically a recognition of text from the image using various tools. We are passing the images as input from the frame as an image format. Here we are using a Python-tesseract tool. 1.2 NLP (Natural Language Processing) Natural language processing is a very large area in machine learning and deep learning field. It worksintheformatof the input sequence – MODEL – output sequence. 1 Back translation 2 Byte-Pair encoding or Character-based 3 Automatic Post Editing 4 Parallel filtering 5 Deep Word Representation 2. Literature Review Extracting information is concerned to identify similar relevant phrases in textual data. For many applications, extracting entities such as names,places,events,dates,times and prices is a powerful wayofsummarizingtheinformation relevant to a user’s needs. In the case of a domain-specific translation model, the automatic identification of important phrases can increase the accuracy and efficiency of a translation. Phrase detection and backtracking in NLP translation will help in organizing the whole process. OCR text detection will occur as the user give input to the system manually but the translation to the base language will take place automatically. The text detection process is a stepwise process. First, we take an appropriate image input so that it will fit dimensionally in a model for further processes. Then to perform text detection, we use a deep learning model to extract to find out output feature maps from two layers. The first layer is output sigmoid activation which will tell us the probability of region containing text or not. The second output feature map layer will help to decide to bound of text area in an image. This will provide us a frame or window around a text area bounds so that it will mark up the area and traces the text. Now next we take that frame as input and our model will try to grayscale the text area to check traces of higher scores. It will try to match a pattern with a dataset and it will tokenize the word and check in a dataset for appropriate match and return the tokensandusingthose we will get the result as the translated text. 3. System information and architecture This system is mainly designed to help people in translating non-familiar words from other languages to the target
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1168 language (English). The use of RNN and LSTM will help in increasing machine translation source language to target language. Procedure starts from the detection of text, we use OpenCV for that; approach of OpenCV generally claims: 1) apply filters to highlight characters 2) to recognize characters one by one contour detection is applied 3) to identify characters, apply image classification Mostly we work on unstructured data, so there are some prerequisites before moving towards the model-building part. We take input from an image-text to string text. Currently, we are using Python-tesseract a tool for OCR, which is used to extract embedded text from images. Then we will convert that object into a string and pass that to the model to normalize and tokenize. Fig: LSTM layer followed by dense layer Then the model will treat that string according to the rules defined in the model description and layers to format the fetched string. The main requirement of the Seq2Seq model is, it requires both input and output sentences to be converted into an integer sequence of fixed length. But before we do that, let’s visualize the length of the sentences. We will capture the lengths of all the sentences in two separate lists for target language and source language, respectively. Then the model will use LSTM to check the correct relation between previous and next word relationships to return a correct token and then the model will return the translation suggestion and that will be our output. Fig: Two different RNN Here, both, the input and output are sentences. In other words, these sentences are the input and output parameters of a model. This is the basicconceptofSequence-to-Sequence modeling. Fig: text detection in a video frame Here in the above figure, we are using the EAST model to detect text from a video frame. Video captures text at 13 FPS and 720p resolution. The EAST (An Efficient and Effective Accurate Scene Text Detector) achieves approximately78%F-score.Themodel is a fully convolutional neural network made for text detection that will give dense per-pixel predictions of words or text as an output. The post-processing includes the network management system and thresholding on predicted geometric shapes. NLP individual models: 1) p(f|e) Translation model -due to Bayes algorithm reverse ordering of f and e -English sentences having the same meaning as the foreign sentence will be assigned to the higher probability -a bilingual (parallel) corpus for estimationisneeded 2) p(e) The language model -to fluent/grammatical sentences higher probability is assigned -monolingual corpus is needed for estimation Translation model Enter alignment variable ‘a’ which represents alignments between the individual words in the sentence pair p(f|e) = P a p (a, f|e) Alignment probabilities Breaking of sentence into a manageable chunks or words
International Research Journal
of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 11 | Nov 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1169 where ‘ei’ is the Englishword(s)correspondingtotheFrench word ‘fj’ and ‘t(fj|ei)’ is the (conditional) probability of the words being aligned. Fig: System work-flow diagram The system starts working from taking input as an image, it may be from an image or video frame. Then using a tool to extract corpus of text frame from image then converting text to string is the whole procedure to make model acceptable ready input. Afterward, the NLP model will analyze the string, tokenizeandthenitwill return the resulting translation. 4. Future Scope Translator is a thing that is useful toeveryone whoreadsand writes. Deploying as an extension to the web browser and making available to all will help a lot to the new language learners. There are many scopes of improvements to this system, and definitely making this system faster will be our main focus. How to make this is a thing available offline to use is also a part of future development and adding add-on support of multiple regional languagesfortranslation will be the update part of it. Possible directions for future research include:(1)Changing geometry formulation to detect directly curved / curvature text (2) Text detector + text recognizer integration (3) general object detection and suggestions. 4. CONCLUSIONS Detecting text using various tools and methods, analyzing source language and then processing that to return a translated word; Hence, we have designed an on-screen translator system using NLP and there are scopes for improvements. Better use of the application of neural networks can be seen in this application. Fully convolutional networks arefast,end-to-endmodelsfor pixel-wise problems and it helped us a lot. REFERENCES [1] EAST: An Efficient and Accurate Scene Text Detector “Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang”, Submitted on 11 Apr 2017 [2] Foundations of Statistical Natural Language Processing, Manning and Schutze, ch. 13 [3] Mobile Camera Based Text Detection and Translation,” Derek Ma, Qiuhau Lin, Tong Zhang”. [4] http://code.google.com/p/tesseract-ocr/ [5] https://www.pyimagesearch.com/ [6] Natural Language Processing and Machine Translation Encyclopedia of Language and Linguistics, 2nd ed. (ELL2). Machine Translation: Interlingual Methods, “Bonnie J. Dorr, Eduard H. Hovy, Lori S. Levin” [7] Understanding Back-Translation at Scale, “Sergey Edunov, Myle Ott, Michael Auli, David Grangier”. [8] Google ML and deeplearning.ai
Publicité