SlideShare une entreprise Scribd logo
1  sur  5
Télécharger pour lire hors ligne
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3398
Optical Recognition of Handwritten Text
Soham Bhagwat1, Pratik Dharu2, Abhishek Dixit3, Bhadrayu Godbole4
1,2,3,4Dept. of Computer Engineering, P.E.S. Modern College of Engineering, Maharashtra, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - The project focus on the creation of OCR software
for the off-line recognition of handwriting. OCR programs can
recognize printed text with nearly perfect accuracy. The
recognition of handwriting is harder duetothemany different
styles and inconsistent nature of handwriting. Handwritten
text recognition (HTR) is an open field of research and a
relevant problem that helps automatically process historical
documents.
In recent years great advances in deep learning andcomputer
vision have allowed improvements on document and image
processing including HTR. Handwritten text recognitionplays
an important role in the processing of vital information.
Processing of digital files is cheaper than processing
traditional paper files even though a lot of information is
available on paper.
The aim of an OCR software is to convert handwrittentextinto
machine readable formats. Despite such advances in this field,
little has been done to produce open-source projects that
address this problem as well as methods that utilize graphical
process units (GPUs) to speed up the training phase.
Key Words: OCR, HTR, GPU, CNN, BRNN, CTC, DIA, MRZ
1. INTRODUCTION
Optical Character Recognition (OCR) deals with recognition
of different characters from a given input that might include
an image, a real time video, or a manuscript / document. By
using OCR, one can transform the text into a digital format,
thus allowing rapid scanning and digitization of documents
in physical format as well as real time text-recognition (in
case of videos). Similarly, this interpretation of OCR
methodology involves pre-processing of input, text area
detection, application of the best pre-trained models and
finally, detection of text as an output. All of this is made
possible with an offline software UI compatible with a
Windows operating system.
1.1 Problem Definition and Objectives
To develop Optical CharacterRecognition(OCR)software
for the recognition of handwriting using following
algorithms:
- CNN
- BRNN
- CTC
1.2 Project Scope
Presently, OCR is capable in reading screenshots which
has facilitated the transferring of information between
incompatible technologies. By using OCR for handwritten
text manual entries on paper will also be legibletocomputer
systems. Additionally,OCRcanbeusedtoperformDocument
Image Analysis (DIA) by reading and recognizing text in
research, governmental, academic, and business
organizations that are having a large pool of documented,
scanned images. Thirdly, OCR can be used to automate
documentation and security processes at airports by
automatically reading the Machine-Readable Zone (MRZ)
and other relevant parts of a passport. In this way,OCRhasa
scope in a wide range of applications.
1.3 Limitations
Having considered some of the benefits of using OCR
software, it also comes along with its own shortcomings. To
begin with, straight OCR without additional AIortechnology
specifically trained to recognize ID types will lack the
requisite accuracy one needs to deliver a good user
experience. Thus, structuring the extracted/detected data
involves more than just OCR. Secondly, considering pictures
of ID documents - these images usually need to be de-
skewed if the image was not alignedproperlyandreoriented
so that the OCR technology can properly extract the data.
Thus, OCR must combine with image rectification. Lastly,
when there is glare or blurriness in the ID image, the
probability of data extraction mistakes is significantly
higher. Thus, glare and blur can cause mistakes.
2. SOFTWARE REQUIREMENTS SPECIFICATION
Mentioned belowaresomerequirementspecificationsforthe
efficient working of the OCR software.
2.1 Assumptions and Dependencies
Before the commencement of the project, there are some
assumptions that the project works with:
- The input image selected by the user is in a jpg/png
format.
- The text to be detected and recognized is in English.
- The input image provided by the user is upright.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3399
2.2 System Features
- The user should be able to choose and upload an image
of his choice through the file browser window upon clicking
of the upload image button on the software UI.
- The user should be able to get instant output of the
chosen image in the output window, and user should also be
shown intermediate stages in output detection for better
understandinguponclickingofthebuttonsuccessivelyonthe
software UI.
2.3 External Interface Requirements
- The software is a single screen display where the user
uploads an image as per interest.
- The image is cropped, resized and detected text output
is displayed after the user presses indicated buttons.
- The software is designed to run on all PCs having at
least a Windows 8 OS along with Python 3.9 installed.
- The frontendismanagedusingTkinterlibraryofPython
while the backend is handled by os library of Python.
- The software runs on the saved model files that have
been trained on a cloud infrastructure named Google
Colaboratory.
2.3 Non-functional Requirements
- The models have alreadybeentrainedandoptimizedon
Google Colaboratory GPUs beforehand, so the performance
requirements of users have been reduced toaminimum8GB
RAM along witha suitablequadcoreprocessorandminimum
5 GB free HDD to make space for the whole software suite.
- With the above hardware specifications, a user takes
around 30 to 50 seconds to get an output in the provided
output window in the UI.
- As no user data is collected, there aren’t any security
concerns.
- As this software is offline, there isn’t any vulnerability
posed from the network side.
- The traces of data are deleted once the user closes the
UI.
3. SYSTEM DESIGN
3.1 System Architecture
Fig -1: System architecture diagram
3.2 Use Case Diagram
Fig -2: Use-case diagram
3.3 Sequence Diagram
Fig -3: Sequence diagram
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3400
3.4 Component Diagram
Fig -4: Component diagram
4. PROJECT IMPLEMENTATION
4.1 Overview of Project Modules
- Document Detection: This module helps to detect
document present in the image. It further helps in cropping
the image by removing the background so that the only the
document is visible and further resizes the image so as to
make it usable by the other modules.
- Text-area Detection: This module scans over the areas
in the image and makes a rough estimate of text areas that
might be present in the image. It further draws bounding
boxes over the detected text areas in the image.
- Text Recognition: Thismodule scans over thebounding
boxes in the image and gives a rough estimate of recognized
text that might be present in the boundingboxes.Thisisdone
with the help of different pre-trained models.
- Output: This module stores the text that has been
recognized from the image. It further generates an output in
the output window and makes a text file.
4.2 Algorithm for Hough Line Detection
The algorithm for detecting straight lines can be divided into
the following steps:
- Edge detection, e.g. using the Canny edge detector.
- Mapping of edge points to the Hough space and storage
in an accumulator.
- Interpretation of the accumulatortoyieldlinesofinfinite
length. The interpretation is done by thresholding and
possibly other constraints.
- Conversion of infinite lines to finitelines. The finitelines
can then be superimposed back on the original image.
This in turn will help us in detecting a document present in
the image.
This process is used to furthercropandresizetheimagefrom
removing the background in a way that just the text remains.
4.3 Algorithm for Text Recognition
The algorithm for text recognition through different models
can be divided into the following common steps:
- Perform pre-processingon the image by removing noise
and document background using the Hough line detector
mentioned above.
- Detect text areas in the image and draw bounding boxes
over the detected text areas by making use of packages in
Python.
- Use feature extraction/labelling on existing datasets for
un-supervised learning.
- Build different neural networks and train with the pre-
processed data viz. CNN, BRNN, CTC for outputs with varied
accuracies on test data.
- Record observations after using different models and
images and choose the one with the best accuracy for text
recognition.
This will help us in detecting and recognizing text from the
given image document present in the image.
This output will then be shown on the UI for the user along
with a text-file that will be generated.
5. IMAGES OF MODELS
5.1 CTC Model
Fig -5: CTC model summary
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3401
5.2 BRNN Model
Fig -6: BRNN model summary
5.3 CTC Model Graph (Loss)
Graph -1: CTC model loss
5.3 BRNN Model Graph (Loss)
Graph -2: BRNN model loss
5.4 Sample word detection
Fig -7: Sample word detection using pre-processed data
and saved model files
6. IMAGES OF SOFTWARE
6.1 UI screen
Fig -8: Main UI window
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3402
6.2 Input Image
Fig -9: Input image (raw)
6.3 Input Image after Pre-processing
Fig -10: Input image after pre-processing
6.4 Final Output
Fig -11: Final output image in UI
7. CONCLUSIONS
In recent years, great advances in deep learning and
computer vision have allowed improvements on document
and image processing and HTR. Processing of digital files is
cheaper than processing traditional paper files. The aim of
an OCR software is to convert handwrittentextintomachine
readable formats and by making use of data pre-processing
that involved cropping, resizing,normalization,thresholding
of an image and then further splitting of datasets and
applications of models viz CNN, CTC andBRNN,we’reableto
perform Optical Character Recognition of Text through the
provided UI. The aforesaid models can be further optimized
to improve the accuracy of the detected text.
REFERENCES
[1] Rosebrock, A. “Automatically OCR’ing Receipts and
Scans,” PyImageSearch, 2021,
https://pyimagesearch.com/2021/10/27/automatically
-ocring-receipts-and-scans/.
[2] H. Li, R. Yang and X. Chen, "License plate detection using
convolutional neural network," 2017 3rd IEEE
International Conference on Computer and
Communications (ICCC), 2017, pp. 1736-1740, doi:
10.1109/CompComm.2017.8322837.
[3] Schuster, Mike & Paliwal, Kuldip. (1997). Bidirectional
recurrent neural networks. Signal Processing, IEEE
Transactions on. 45. 2673 - 2681. 10.1109/78.650093.
[4] O. Nina, Connectionist Temporal Classification for
Offline Handwritten Text Recognition, BYU Conference
Center, 2016.

Contenu connexe

Similaire à Optical Recognition of Handwritten Text

IRJET - Information Hiding in H.264/AVC using Digital Watermarking
IRJET -  	  Information Hiding in H.264/AVC using Digital WatermarkingIRJET -  	  Information Hiding in H.264/AVC using Digital Watermarking
IRJET - Information Hiding in H.264/AVC using Digital WatermarkingIRJET Journal
 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET Journal
 
IRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind AssistanceIRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind AssistanceIRJET Journal
 
Voice Enable Blind Assistance System -Real time Object Detection
Voice Enable Blind Assistance System -Real time Object DetectionVoice Enable Blind Assistance System -Real time Object Detection
Voice Enable Blind Assistance System -Real time Object DetectionIRJET Journal
 
IRJET- Designing of OCR Tool Box for Decoding Vehicle Number Plate using MATLAB
IRJET- Designing of OCR Tool Box for Decoding Vehicle Number Plate using MATLABIRJET- Designing of OCR Tool Box for Decoding Vehicle Number Plate using MATLAB
IRJET- Designing of OCR Tool Box for Decoding Vehicle Number Plate using MATLABIRJET Journal
 
IRJET- Capsearch - An Image Caption Generation Based Search
IRJET- Capsearch - An Image Caption Generation Based SearchIRJET- Capsearch - An Image Caption Generation Based Search
IRJET- Capsearch - An Image Caption Generation Based SearchIRJET Journal
 
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...IRJET Journal
 
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET Journal
 
IRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe ModelIRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe ModelIRJET Journal
 
Text Recognition, Object Detection and Language Translation App
Text Recognition, Object Detection and Language Translation AppText Recognition, Object Detection and Language Translation App
Text Recognition, Object Detection and Language Translation AppIRJET Journal
 
Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16Schuyler Kennedy
 
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...IRJET Journal
 
Iaetsd a low power and high throughput re-configurable bip for multipurpose a...
Iaetsd a low power and high throughput re-configurable bip for multipurpose a...Iaetsd a low power and high throughput re-configurable bip for multipurpose a...
Iaetsd a low power and high throughput re-configurable bip for multipurpose a...Iaetsd Iaetsd
 
IRJET- Image to Text Conversion using Tesseract
IRJET-  	  Image to Text Conversion using TesseractIRJET-  	  Image to Text Conversion using Tesseract
IRJET- Image to Text Conversion using TesseractIRJET Journal
 
IRJET-Raspberry Pi Based Reader for Blind People
IRJET-Raspberry Pi Based Reader for Blind PeopleIRJET-Raspberry Pi Based Reader for Blind People
IRJET-Raspberry Pi Based Reader for Blind PeopleIRJET Journal
 
Object and Currency Detection for the Visually Impaired
Object and Currency Detection for the Visually ImpairedObject and Currency Detection for the Visually Impaired
Object and Currency Detection for the Visually ImpairedIRJET Journal
 
IRJET- Rice QA using Deep Learning
IRJET- Rice QA using Deep LearningIRJET- Rice QA using Deep Learning
IRJET- Rice QA using Deep LearningIRJET Journal
 
IRJET- Optical Character Recognition for Blind using Raspberry Pi
IRJET- Optical Character Recognition for Blind using Raspberry PiIRJET- Optical Character Recognition for Blind using Raspberry Pi
IRJET- Optical Character Recognition for Blind using Raspberry PiIRJET Journal
 
Convolutional Neural Network Based Real Time Object Detection Using YOLO V4
Convolutional Neural Network Based Real Time Object Detection Using YOLO V4Convolutional Neural Network Based Real Time Object Detection Using YOLO V4
Convolutional Neural Network Based Real Time Object Detection Using YOLO V4IRJET Journal
 

Similaire à Optical Recognition of Handwritten Text (20)

IRJET - Information Hiding in H.264/AVC using Digital Watermarking
IRJET -  	  Information Hiding in H.264/AVC using Digital WatermarkingIRJET -  	  Information Hiding in H.264/AVC using Digital Watermarking
IRJET - Information Hiding in H.264/AVC using Digital Watermarking
 
IRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind PersonIRJET- Survey Paper: Image Reader for Blind Person
IRJET- Survey Paper: Image Reader for Blind Person
 
IRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind AssistanceIRJET- Object Detection and Recognition for Blind Assistance
IRJET- Object Detection and Recognition for Blind Assistance
 
Voice Enable Blind Assistance System -Real time Object Detection
Voice Enable Blind Assistance System -Real time Object DetectionVoice Enable Blind Assistance System -Real time Object Detection
Voice Enable Blind Assistance System -Real time Object Detection
 
IRJET- Designing of OCR Tool Box for Decoding Vehicle Number Plate using MATLAB
IRJET- Designing of OCR Tool Box for Decoding Vehicle Number Plate using MATLABIRJET- Designing of OCR Tool Box for Decoding Vehicle Number Plate using MATLAB
IRJET- Designing of OCR Tool Box for Decoding Vehicle Number Plate using MATLAB
 
IRJET- Capsearch - An Image Caption Generation Based Search
IRJET- Capsearch - An Image Caption Generation Based SearchIRJET- Capsearch - An Image Caption Generation Based Search
IRJET- Capsearch - An Image Caption Generation Based Search
 
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
 
IRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural NetworkIRJET- Object Detection in an Image using Convolutional Neural Network
IRJET- Object Detection in an Image using Convolutional Neural Network
 
IRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe ModelIRJET- Real-Time Object Detection System using Caffe Model
IRJET- Real-Time Object Detection System using Caffe Model
 
Text Recognition, Object Detection and Language Translation App
Text Recognition, Object Detection and Language Translation AppText Recognition, Object Detection and Language Translation App
Text Recognition, Object Detection and Language Translation App
 
Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16Computer-Vision_Integrating-Technology_MOB_17.06.16
Computer-Vision_Integrating-Technology_MOB_17.06.16
 
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
 
Smart Mirror
Smart MirrorSmart Mirror
Smart Mirror
 
Iaetsd a low power and high throughput re-configurable bip for multipurpose a...
Iaetsd a low power and high throughput re-configurable bip for multipurpose a...Iaetsd a low power and high throughput re-configurable bip for multipurpose a...
Iaetsd a low power and high throughput re-configurable bip for multipurpose a...
 
IRJET- Image to Text Conversion using Tesseract
IRJET-  	  Image to Text Conversion using TesseractIRJET-  	  Image to Text Conversion using Tesseract
IRJET- Image to Text Conversion using Tesseract
 
IRJET-Raspberry Pi Based Reader for Blind People
IRJET-Raspberry Pi Based Reader for Blind PeopleIRJET-Raspberry Pi Based Reader for Blind People
IRJET-Raspberry Pi Based Reader for Blind People
 
Object and Currency Detection for the Visually Impaired
Object and Currency Detection for the Visually ImpairedObject and Currency Detection for the Visually Impaired
Object and Currency Detection for the Visually Impaired
 
IRJET- Rice QA using Deep Learning
IRJET- Rice QA using Deep LearningIRJET- Rice QA using Deep Learning
IRJET- Rice QA using Deep Learning
 
IRJET- Optical Character Recognition for Blind using Raspberry Pi
IRJET- Optical Character Recognition for Blind using Raspberry PiIRJET- Optical Character Recognition for Blind using Raspberry Pi
IRJET- Optical Character Recognition for Blind using Raspberry Pi
 
Convolutional Neural Network Based Real Time Object Detection Using YOLO V4
Convolutional Neural Network Based Real Time Object Detection Using YOLO V4Convolutional Neural Network Based Real Time Object Detection Using YOLO V4
Convolutional Neural Network Based Real Time Object Detection Using YOLO V4
 

Plus de IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

Plus de IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Dernier

Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm Systemirfanmechengr
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxk795866
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the weldingMuhammadUzairLiaqat
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgsaravananr517913
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncssuser2ae721
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catcherssdickerson1
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
lifi-technology with integration of IOT.pptx
lifi-technology with integration of IOT.pptxlifi-technology with integration of IOT.pptx
lifi-technology with integration of IOT.pptxsomshekarkn64
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 

Dernier (20)

Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm System
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptx
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the welding
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
lifi-technology with integration of IOT.pptx
lifi-technology with integration of IOT.pptxlifi-technology with integration of IOT.pptx
lifi-technology with integration of IOT.pptx
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 

Optical Recognition of Handwritten Text

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3398 Optical Recognition of Handwritten Text Soham Bhagwat1, Pratik Dharu2, Abhishek Dixit3, Bhadrayu Godbole4 1,2,3,4Dept. of Computer Engineering, P.E.S. Modern College of Engineering, Maharashtra, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - The project focus on the creation of OCR software for the off-line recognition of handwriting. OCR programs can recognize printed text with nearly perfect accuracy. The recognition of handwriting is harder duetothemany different styles and inconsistent nature of handwriting. Handwritten text recognition (HTR) is an open field of research and a relevant problem that helps automatically process historical documents. In recent years great advances in deep learning andcomputer vision have allowed improvements on document and image processing including HTR. Handwritten text recognitionplays an important role in the processing of vital information. Processing of digital files is cheaper than processing traditional paper files even though a lot of information is available on paper. The aim of an OCR software is to convert handwrittentextinto machine readable formats. Despite such advances in this field, little has been done to produce open-source projects that address this problem as well as methods that utilize graphical process units (GPUs) to speed up the training phase. Key Words: OCR, HTR, GPU, CNN, BRNN, CTC, DIA, MRZ 1. INTRODUCTION Optical Character Recognition (OCR) deals with recognition of different characters from a given input that might include an image, a real time video, or a manuscript / document. By using OCR, one can transform the text into a digital format, thus allowing rapid scanning and digitization of documents in physical format as well as real time text-recognition (in case of videos). Similarly, this interpretation of OCR methodology involves pre-processing of input, text area detection, application of the best pre-trained models and finally, detection of text as an output. All of this is made possible with an offline software UI compatible with a Windows operating system. 1.1 Problem Definition and Objectives To develop Optical CharacterRecognition(OCR)software for the recognition of handwriting using following algorithms: - CNN - BRNN - CTC 1.2 Project Scope Presently, OCR is capable in reading screenshots which has facilitated the transferring of information between incompatible technologies. By using OCR for handwritten text manual entries on paper will also be legibletocomputer systems. Additionally,OCRcanbeusedtoperformDocument Image Analysis (DIA) by reading and recognizing text in research, governmental, academic, and business organizations that are having a large pool of documented, scanned images. Thirdly, OCR can be used to automate documentation and security processes at airports by automatically reading the Machine-Readable Zone (MRZ) and other relevant parts of a passport. In this way,OCRhasa scope in a wide range of applications. 1.3 Limitations Having considered some of the benefits of using OCR software, it also comes along with its own shortcomings. To begin with, straight OCR without additional AIortechnology specifically trained to recognize ID types will lack the requisite accuracy one needs to deliver a good user experience. Thus, structuring the extracted/detected data involves more than just OCR. Secondly, considering pictures of ID documents - these images usually need to be de- skewed if the image was not alignedproperlyandreoriented so that the OCR technology can properly extract the data. Thus, OCR must combine with image rectification. Lastly, when there is glare or blurriness in the ID image, the probability of data extraction mistakes is significantly higher. Thus, glare and blur can cause mistakes. 2. SOFTWARE REQUIREMENTS SPECIFICATION Mentioned belowaresomerequirementspecificationsforthe efficient working of the OCR software. 2.1 Assumptions and Dependencies Before the commencement of the project, there are some assumptions that the project works with: - The input image selected by the user is in a jpg/png format. - The text to be detected and recognized is in English. - The input image provided by the user is upright.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3399 2.2 System Features - The user should be able to choose and upload an image of his choice through the file browser window upon clicking of the upload image button on the software UI. - The user should be able to get instant output of the chosen image in the output window, and user should also be shown intermediate stages in output detection for better understandinguponclickingofthebuttonsuccessivelyonthe software UI. 2.3 External Interface Requirements - The software is a single screen display where the user uploads an image as per interest. - The image is cropped, resized and detected text output is displayed after the user presses indicated buttons. - The software is designed to run on all PCs having at least a Windows 8 OS along with Python 3.9 installed. - The frontendismanagedusingTkinterlibraryofPython while the backend is handled by os library of Python. - The software runs on the saved model files that have been trained on a cloud infrastructure named Google Colaboratory. 2.3 Non-functional Requirements - The models have alreadybeentrainedandoptimizedon Google Colaboratory GPUs beforehand, so the performance requirements of users have been reduced toaminimum8GB RAM along witha suitablequadcoreprocessorandminimum 5 GB free HDD to make space for the whole software suite. - With the above hardware specifications, a user takes around 30 to 50 seconds to get an output in the provided output window in the UI. - As no user data is collected, there aren’t any security concerns. - As this software is offline, there isn’t any vulnerability posed from the network side. - The traces of data are deleted once the user closes the UI. 3. SYSTEM DESIGN 3.1 System Architecture Fig -1: System architecture diagram 3.2 Use Case Diagram Fig -2: Use-case diagram 3.3 Sequence Diagram Fig -3: Sequence diagram
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3400 3.4 Component Diagram Fig -4: Component diagram 4. PROJECT IMPLEMENTATION 4.1 Overview of Project Modules - Document Detection: This module helps to detect document present in the image. It further helps in cropping the image by removing the background so that the only the document is visible and further resizes the image so as to make it usable by the other modules. - Text-area Detection: This module scans over the areas in the image and makes a rough estimate of text areas that might be present in the image. It further draws bounding boxes over the detected text areas in the image. - Text Recognition: Thismodule scans over thebounding boxes in the image and gives a rough estimate of recognized text that might be present in the boundingboxes.Thisisdone with the help of different pre-trained models. - Output: This module stores the text that has been recognized from the image. It further generates an output in the output window and makes a text file. 4.2 Algorithm for Hough Line Detection The algorithm for detecting straight lines can be divided into the following steps: - Edge detection, e.g. using the Canny edge detector. - Mapping of edge points to the Hough space and storage in an accumulator. - Interpretation of the accumulatortoyieldlinesofinfinite length. The interpretation is done by thresholding and possibly other constraints. - Conversion of infinite lines to finitelines. The finitelines can then be superimposed back on the original image. This in turn will help us in detecting a document present in the image. This process is used to furthercropandresizetheimagefrom removing the background in a way that just the text remains. 4.3 Algorithm for Text Recognition The algorithm for text recognition through different models can be divided into the following common steps: - Perform pre-processingon the image by removing noise and document background using the Hough line detector mentioned above. - Detect text areas in the image and draw bounding boxes over the detected text areas by making use of packages in Python. - Use feature extraction/labelling on existing datasets for un-supervised learning. - Build different neural networks and train with the pre- processed data viz. CNN, BRNN, CTC for outputs with varied accuracies on test data. - Record observations after using different models and images and choose the one with the best accuracy for text recognition. This will help us in detecting and recognizing text from the given image document present in the image. This output will then be shown on the UI for the user along with a text-file that will be generated. 5. IMAGES OF MODELS 5.1 CTC Model Fig -5: CTC model summary
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3401 5.2 BRNN Model Fig -6: BRNN model summary 5.3 CTC Model Graph (Loss) Graph -1: CTC model loss 5.3 BRNN Model Graph (Loss) Graph -2: BRNN model loss 5.4 Sample word detection Fig -7: Sample word detection using pre-processed data and saved model files 6. IMAGES OF SOFTWARE 6.1 UI screen Fig -8: Main UI window
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 3402 6.2 Input Image Fig -9: Input image (raw) 6.3 Input Image after Pre-processing Fig -10: Input image after pre-processing 6.4 Final Output Fig -11: Final output image in UI 7. CONCLUSIONS In recent years, great advances in deep learning and computer vision have allowed improvements on document and image processing and HTR. Processing of digital files is cheaper than processing traditional paper files. The aim of an OCR software is to convert handwrittentextintomachine readable formats and by making use of data pre-processing that involved cropping, resizing,normalization,thresholding of an image and then further splitting of datasets and applications of models viz CNN, CTC andBRNN,we’reableto perform Optical Character Recognition of Text through the provided UI. The aforesaid models can be further optimized to improve the accuracy of the detected text. REFERENCES [1] Rosebrock, A. “Automatically OCR’ing Receipts and Scans,” PyImageSearch, 2021, https://pyimagesearch.com/2021/10/27/automatically -ocring-receipts-and-scans/. [2] H. Li, R. Yang and X. Chen, "License plate detection using convolutional neural network," 2017 3rd IEEE International Conference on Computer and Communications (ICCC), 2017, pp. 1736-1740, doi: 10.1109/CompComm.2017.8322837. [3] Schuster, Mike & Paliwal, Kuldip. (1997). Bidirectional recurrent neural networks. Signal Processing, IEEE Transactions on. 45. 2673 - 2681. 10.1109/78.650093. [4] O. Nina, Connectionist Temporal Classification for Offline Handwritten Text Recognition, BYU Conference Center, 2016.