SlideShare une entreprise Scribd logo
1  sur  3
Télécharger pour lire hors ligne
Shrinath Janvalkar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 5), April 2014, pp.149-151
www.ijera.com 149 | P a g e
Text Recognition from an Image
Shrinath Janvalkar, Paresh Manjrekar, Sarvesh Pawar, Prof. Laxman Naik
Department Of Computer Engineering1, 2, 3, 4
RMCET(Mumbai University)
Ambav, Devrukh, India1, 2, 3, 4
ABSTRACT
To achieve high speed in data processing it is necessary to convert the analog data into digital data. Storage of
hard copy of any document occupies large space and retrieving of information from that document is time
consuming. Optical character recognition system is an effective way in recognition of printed character. It
provides an easy way to recognize and convert the printed text on image into the editable text. It also increases
the speed of data retrieval from the image. The image which contains characters can be scanned through scanner
and then recognition engine of the OCR system interpret the images and convert images of printed characters
into machine-readable characters [8].It improving the interface between man and machine in many applications.
I. INTRODUCTION
Character recognition is one of the most
interesting areas of pattern recognition and artificial
intelligence. Optical Character Recognition extracts
the relevant information and automatically enters it
into electronic database instead of the conventional
way of manually retyping the text. Optical Character
Recognition is a vast field with a number of varied
applications such as invoice imaging, legal industry,
banking, health care industry etc. OCR is also widely
used in many other fields like Captcha, Institutional
repositories and digital libraries, Optical Music
Recognition without any human correction or human
effort, Automatic number plate recognition and
Handwritten Recognition [6]. It contributes
immensely to the advancement of an automation
process and can improve the interface between man
and machine in numerous applications. Several
research works have been focusing on new
techniques and methods that would reduce the
processing time while providing higher recognition
accuracy. Now it is possible to scan documents as an
image and to make it editable and searchable for
further information processing.
II. OBJECTIVE
The objective of OCR software is to
recognize the text and then convert it to editable
form. Thus, developing computer algorithms to
identify the
character in the text is the principal task of OCR. A
document is first scanned by an optical scanner,
which produces an image form of it that is not
editable. Optical character recognition involves.
Translation of this text image into editable character
codes such as ASCII [4]. The below
diagram shows the processing mechanism of
OCR system (Fig. 1).
Fig. 1. OCR Engine
III. LITERARURE SURVEY
3.1 First generation OCR systems
The first commercialized OCR of this generation was
IBM 1418, which was designed to read a special IBM
font407. The recognition method was template
matching, which compares the character image with a
library of prototype images for each character of each
font [5].
3.2 Second generation OCR systems
Next generation machines were able to recognize
regular machine-printed and hand printed characters.
The character set was limited to numerals and a few
letters and symbols. Such machines appeared in the
middle of 1960s to early 1970s [5].
3.3 Third generation OCR systems
For the third generation of OCR systems, the
challenges were documents of poor quality and large
printed and hand-written character sets. Low cost and
high performance were also important objectives.
Commercial OCR systems with such capabilities
appeared during the decade 1975 to 1985[5].
RESEARCH ARTICLE OPEN ACCESS
Shrinath Janvalkar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 5), April 2014, pp.149-151
www.ijera.com 150 | P a g e
3.4 OCRs Today (Fourth generation OCR systems)
The fourth generation can be characterized by the
OCR of complex documents intermixing with text,
graphics, tables and mathematical symbols,
unconstrained handwritten characters, color
documents, low-quality noisy documents, etc.
Among the commercial products, postal address
readers, and reading aids for the blind are available in
the market [5].
IV. TASK INVOLVED IN OCR
Fig. 2. OCR Processing
The above figure shows different processes which are
done in OCR system (Fig. 2).
4.1 Image acquisition
Input image for OCR system might be acquire by
scanning document or by capturing photograph of
document. This is also known as digitization process
[11].
4.2 Preprocessing
Preprocessing consist series of operations and it used
to enhance an image and make it suitable for
segmentation. Noise get introduced during document
generation. So Proper filter like mean filter, min-max
filter, Gaussian filter etc. may be applied to remove
noise from document.Binarization process converts
gray scale or colored image to black and white
image. To enhance visibility and structural
information of character Binary morphological
operations like opening, closing, thinning, hole filling
etc. may be applied on image. If scanned image is not
be perfectly aligned, so we need to align it by
performing slant angle correction. Input document
may be resized if it is too large in size to reduce
dimensions to improve speed of processing [11].
4.3 Segmentation
Character segmentation performs an operation of
decomposition of an image into Sub images of
individual symbols. It is one of the decision processes
in a system for optical character recognition (OCR).
Its decision that a pattern isolated from the image is
that of a character or some other identifiable unit.
Generally document is processed in hierarchical way.
At first level lines are segmented using row
histogram. From each row, words are extracted using
column histogram and finally characters are extracted
from words. Accuracy of final result is highly
depends on accuracy of segmentation [11].
4.4 Feature extraction
Feature extraction is the important part of any pattern
recognition application. Feature extraction techniques
like Linear Discriminant Analysis (LDA), Principle
Component Analysis (PCA),Independent Component
Analysis (ICA), Chain Code (CC), Scale Invariant
Feature Extraction (SIFT),Gradient based features,
Histogram might be applied to extract the features of
individual characters. These features are used to train
the system [11].
4.5 Classification
When image is provided as input to OCR system, its
features are extracted and given as an input to the
trained classifier like artificial neural network or
support vector machine. Classifiers compare the input
feature with stored pattern and find out the best
matching class for input [11].
4.6 Post processing
This step is not compulsory; it helps to improve the
accuracy of recognition. Syntax analysis, semantic
analysis kind of higher level concepts might be
applied to check the context of recognized character
[11].
V. RESULT: SECTION OF OCR
IMAGE
Input as Image:
Fig. 3. Input to OCR System
Result:
[1] "veo etoyels Ioke oer, end net r`deao,k head fg
suhgestion" very closely together, and that
``deaths's head '' suggestion
[2] "oI gtd genep eaw stsougly markod. serhass lI
was scnw" of his bones very strongly marked.
Perhaps it was fan-
[3] "cifol, hmt I thoOphi ihat ha looser lthe a
knight of old" ciful, but I thought that he
looked like a knight of old
Shrinath Janvalkar et al. Int. Journal of Engineering Research and Applications www.ijera.com
ISSN : 2248-9622, Vol. 4, Issue 4( Version 5), April 2014, pp.149-151
www.ijera.com 151 | P a g e
[4] "wk, was goine into batIta anr snew he Wak
geing so he" who was going into battle and
knew he was going to be
[5] "... apxhn I tcit what an enH aorhi.Mwy ans
suiie unm" ... again, I felt what an
extraordinary and quite un-
[6] "eoaserouk poWer nf attracigHn he had."
conscious power of attraction he had.
VI. APPLICATIONS
Optical character recognition has been
applied to a number of applications. Some of them
have been explained below.
6.1 Legal Industry
OCR is used in Legal industry for digitize
documents, and directly entered to computer
database. Legal professionals can further search
documents required from huge databases by simply
typing a few keywords [6].
6.2 Healthcare
Healthcare professionals always have to deal with
large volumes of forms for each patient, including
insurance forms as well as general health forms. To
keep up with all of this information, it is useful to
input relevant data into an electronic database that
can be accessed as necessary. Form processing tools,
powered by OCR, are able to extract information
from forms and put it into databases, so that every
patient's data is promptly recorded [6].
6.3 Optical Music Recognition
Initially it was aimed towards recognizing printed
sheets which can be edited into playable form with
the help of electronic methods. It has many
applications like processing of different classes of
music, large scale digitization of musical data and
also it can be used for diversity in musical notation
[6].
6.4 Automatic Number Recognition
Automatic number plate recognition is used as a
technique making use of optical character recognition
on images to identify vehicle registration plates. They
are used by various police forces and as a method of
electronic toll collection on pay-per-use roads and
cataloging the movements of traffic or individuals
[6].
6.5 Handwriting Recognition
It is the ability of a computer system which scans the
image of handwritten text by scanner and extracts
only handwritten character from that image [7].
VII. CONCLUTION
Although results of OCR System are not
good, they are not that bad either, indicating that the
OCR technique is not awed. More training data may
improve robustness and accuracy.
REFFERENCES
[1] Optical Character Recognition using Neural
Networks Deepayan Sarkar University of
Wisconsin MadisonECE 539 Project, Fall
2003.
[2] “Evaluation of OCR Algorithms for Images
with Different Spatial Resolutions and Noises”
School of Information Technology and
Engineering Faculty of Engineering University
of Ottawa©Qing Chen, Ottawa, Canada, 2003.
[3] “A Neural Network Implementation of Optical
Character Recognition” Technical Report
Number CSSE10-05 COMP 6600 – Artificial
Intelligence Spring 2009.
[4] “Optical Character Recognition Techniques: A
Survey” Sukhpreet Singh M.tech Student,
Dept. of Computer Engineering, YCOE
Talwandi Sabo BP. India.
[5] OCR System: A Literature Survey
[6] “Survey of OCR Applications” by Amarjot
Singh, ketan bacchuwar, Akshay bhasin.
[7] M.D. Ganis, C.L. Wilson, J.L. Blue, “Neural
network-based systems for handprint OCR
applications” in IEEE Transactions on Image
Processing, 1998, Vol: 7, Issue: 8, p.p. 1097 –
1112.
[8] “Performance Characterization and
Acceleration of Optical Character Recognition
on Handheld Platforms” Sadagopan
Srinivasan, Li Zhao, Lin Sun, Zhen Fang,
Peng Li, Tao Wang,Ravishankar Iyer, Ramesh
Illikkal, Dong LiuIntel Corporation.
[9] “Implementing Optical Character Recognition
on the Android Operating System for Business
Cards” Sonia Bhaskar, Nicholas Lavassar,
Scott GreenEE 368 Digital Image Processing.
[10]“A Comparative analysis of feature extraction
techniques for handwritten character
recognition” Rajbala Tokas1, Aruna Bhadu2
M.Tech*(CS), Swami Keshwanand Institute of
Technology, Jaipur, Rajasthan, India,
M.Tech*(SE) Govt. Engineering College.
[11]“A Literature Review on Hand Written
Character Recognition” by Mansi shah &
Gordhan B Jethava Department of Computer
Science & Engineering Parul Institute of
Technology, Gujarat, India. Information
Technology Department Parul Institute of
Engg. & Technology, Gujarat, India.

Contenu connexe

Tendances

IRJET- Cheque Bounce Detection System using Image Processing
IRJET- Cheque Bounce Detection System using Image ProcessingIRJET- Cheque Bounce Detection System using Image Processing
IRJET- Cheque Bounce Detection System using Image ProcessingIRJET Journal
 
Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Karan Panjwani
 
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Editor IJARCET
 
A Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed TextA Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed Textiosrjce
 
Handwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text ConversionHandwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text Conversionijtsrd
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character RecognitionDurjoy Saha
 
IRJET- Image to Text Conversion using Tesseract
IRJET-  	  Image to Text Conversion using TesseractIRJET-  	  Image to Text Conversion using Tesseract
IRJET- Image to Text Conversion using TesseractIRJET Journal
 
A SURVEY ON DEEP LEARNING METHOD USED FOR CHARACTER RECOGNITION
A SURVEY ON DEEP LEARNING METHOD USED FOR CHARACTER RECOGNITIONA SURVEY ON DEEP LEARNING METHOD USED FOR CHARACTER RECOGNITION
A SURVEY ON DEEP LEARNING METHOD USED FOR CHARACTER RECOGNITIONIJCIRAS Journal
 
Machine learning
Machine learningMachine learning
Machine learningAmit Gupta
 
Optical Character Recognition (OCR) System
Optical Character Recognition (OCR) SystemOptical Character Recognition (OCR) System
Optical Character Recognition (OCR) Systemiosrjce
 
Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Vidyut Singhania
 
OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...
OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...
OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...ijaia
 
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...csandit
 
ARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORK
ARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORKARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORK
ARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORKijaia
 
Handwriting recogntion slides boeing
Handwriting recogntion slides boeingHandwriting recogntion slides boeing
Handwriting recogntion slides boeingTejashree Gharat
 
CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...
CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...
CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...Editor IJMTER
 

Tendances (19)

IRJET- Cheque Bounce Detection System using Image Processing
IRJET- Cheque Bounce Detection System using Image ProcessingIRJET- Cheque Bounce Detection System using Image Processing
IRJET- Cheque Bounce Detection System using Image Processing
 
Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )
 
Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015
 
Ocr 1
Ocr 1Ocr 1
Ocr 1
 
A Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed TextA Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed Text
 
Handwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text ConversionHandwritten Text Recognition and Digital Text Conversion
Handwritten Text Recognition and Digital Text Conversion
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
IRJET- Image to Text Conversion using Tesseract
IRJET-  	  Image to Text Conversion using TesseractIRJET-  	  Image to Text Conversion using Tesseract
IRJET- Image to Text Conversion using Tesseract
 
A SURVEY ON DEEP LEARNING METHOD USED FOR CHARACTER RECOGNITION
A SURVEY ON DEEP LEARNING METHOD USED FOR CHARACTER RECOGNITIONA SURVEY ON DEEP LEARNING METHOD USED FOR CHARACTER RECOGNITION
A SURVEY ON DEEP LEARNING METHOD USED FOR CHARACTER RECOGNITION
 
Machine learning
Machine learningMachine learning
Machine learning
 
Optical Character Recognition (OCR) System
Optical Character Recognition (OCR) SystemOptical Character Recognition (OCR) System
Optical Character Recognition (OCR) System
 
Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Final Report on Optical Character Recognition
Final Report on Optical Character Recognition
 
OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...
OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...
OFFLINE SIGNATURE VERIFICATION SYSTEM FOR BANK CHEQUES USING ZERNIKE MOMENTS,...
 
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
OCR-THE 3 LAYERED APPROACH FOR CLASSIFICATION AND IDENTIFICATION OF TELUGU HA...
 
ARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORK
ARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORKARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORK
ARABIC ONLINE HANDWRITING RECOGNITION USING NEURAL NETWORK
 
Co4201605611
Co4201605611Co4201605611
Co4201605611
 
Handwriting recogntion slides boeing
Handwriting recogntion slides boeingHandwriting recogntion slides boeing
Handwriting recogntion slides boeing
 
Handwritten Character Recognition
Handwritten Character RecognitionHandwritten Character Recognition
Handwritten Character Recognition
 
CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...
CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...
CHARACTER RECOGNITION USING NEURAL NETWORK WITHOUT FEATURE EXTRACTION FOR KAN...
 

En vedette

Image Interpolation Techniques in Digital Image Processing: An Overview
Image Interpolation Techniques in Digital Image Processing: An OverviewImage Interpolation Techniques in Digital Image Processing: An Overview
Image Interpolation Techniques in Digital Image Processing: An OverviewIJERA Editor
 
Performance Simulation Of Photovoltaic System Battery
Performance Simulation Of Photovoltaic System BatteryPerformance Simulation Of Photovoltaic System Battery
Performance Simulation Of Photovoltaic System BatteryIJERA Editor
 
Empirical Determination of Locations of Unstable and Blank Gsm Signal Network...
Empirical Determination of Locations of Unstable and Blank Gsm Signal Network...Empirical Determination of Locations of Unstable and Blank Gsm Signal Network...
Empirical Determination of Locations of Unstable and Blank Gsm Signal Network...IJERA Editor
 
Coupled Inductor Based High Step-Up DC-DC Converter for Multi Input PV System
Coupled Inductor Based High Step-Up DC-DC Converter for Multi Input PV SystemCoupled Inductor Based High Step-Up DC-DC Converter for Multi Input PV System
Coupled Inductor Based High Step-Up DC-DC Converter for Multi Input PV SystemIJERA Editor
 
Evaluating the barriers for enhacing the utilization level of advanced manufa...
Evaluating the barriers for enhacing the utilization level of advanced manufa...Evaluating the barriers for enhacing the utilization level of advanced manufa...
Evaluating the barriers for enhacing the utilization level of advanced manufa...IJERA Editor
 
E-Commerce Application Distributed Operating System
E-Commerce Application Distributed Operating SystemE-Commerce Application Distributed Operating System
E-Commerce Application Distributed Operating SystemIJERA Editor
 
Security Attacks and its Countermeasures in Wireless Sensor Networks
Security Attacks and its Countermeasures in Wireless Sensor NetworksSecurity Attacks and its Countermeasures in Wireless Sensor Networks
Security Attacks and its Countermeasures in Wireless Sensor NetworksIJERA Editor
 
Improving Structural Limitations of Pid Controller For Unstable Processes
Improving Structural Limitations of Pid Controller For Unstable ProcessesImproving Structural Limitations of Pid Controller For Unstable Processes
Improving Structural Limitations of Pid Controller For Unstable ProcessesIJERA Editor
 
Voltage Profile Improvement of distribution system Using Particle Swarm Optim...
Voltage Profile Improvement of distribution system Using Particle Swarm Optim...Voltage Profile Improvement of distribution system Using Particle Swarm Optim...
Voltage Profile Improvement of distribution system Using Particle Swarm Optim...IJERA Editor
 
Growth and Characterizations of Pure and Calcium Doped Cadmium Tartrate Cryst...
Growth and Characterizations of Pure and Calcium Doped Cadmium Tartrate Cryst...Growth and Characterizations of Pure and Calcium Doped Cadmium Tartrate Cryst...
Growth and Characterizations of Pure and Calcium Doped Cadmium Tartrate Cryst...IJERA Editor
 
Samyak Vaidyak Dr. Shriniwas Kashalikar
Samyak Vaidyak Dr. Shriniwas KashalikarSamyak Vaidyak Dr. Shriniwas Kashalikar
Samyak Vaidyak Dr. Shriniwas Kashalikargokhaleajit
 
Las Meteoras De Grecia
Las Meteoras De GreciaLas Meteoras De Grecia
Las Meteoras De Greciasamasada
 
Comida de Navidad
Comida de NavidadComida de Navidad
Comida de NavidadDouce Nieto
 
TREBALLAR AMB EINES WEB2.0 A L’AULA, LA WIKI
TREBALLAR AMB EINES WEB2.0 A L’AULA, LA WIKITREBALLAR AMB EINES WEB2.0 A L’AULA, LA WIKI
TREBALLAR AMB EINES WEB2.0 A L’AULA, LA WIKIFundación Impuls
 

En vedette (20)

Y04408126132
Y04408126132Y04408126132
Y04408126132
 
Image Interpolation Techniques in Digital Image Processing: An Overview
Image Interpolation Techniques in Digital Image Processing: An OverviewImage Interpolation Techniques in Digital Image Processing: An Overview
Image Interpolation Techniques in Digital Image Processing: An Overview
 
Performance Simulation Of Photovoltaic System Battery
Performance Simulation Of Photovoltaic System BatteryPerformance Simulation Of Photovoltaic System Battery
Performance Simulation Of Photovoltaic System Battery
 
Empirical Determination of Locations of Unstable and Blank Gsm Signal Network...
Empirical Determination of Locations of Unstable and Blank Gsm Signal Network...Empirical Determination of Locations of Unstable and Blank Gsm Signal Network...
Empirical Determination of Locations of Unstable and Blank Gsm Signal Network...
 
Coupled Inductor Based High Step-Up DC-DC Converter for Multi Input PV System
Coupled Inductor Based High Step-Up DC-DC Converter for Multi Input PV SystemCoupled Inductor Based High Step-Up DC-DC Converter for Multi Input PV System
Coupled Inductor Based High Step-Up DC-DC Converter for Multi Input PV System
 
Evaluating the barriers for enhacing the utilization level of advanced manufa...
Evaluating the barriers for enhacing the utilization level of advanced manufa...Evaluating the barriers for enhacing the utilization level of advanced manufa...
Evaluating the barriers for enhacing the utilization level of advanced manufa...
 
E-Commerce Application Distributed Operating System
E-Commerce Application Distributed Operating SystemE-Commerce Application Distributed Operating System
E-Commerce Application Distributed Operating System
 
Security Attacks and its Countermeasures in Wireless Sensor Networks
Security Attacks and its Countermeasures in Wireless Sensor NetworksSecurity Attacks and its Countermeasures in Wireless Sensor Networks
Security Attacks and its Countermeasures in Wireless Sensor Networks
 
Image Inpainting
Image InpaintingImage Inpainting
Image Inpainting
 
Improving Structural Limitations of Pid Controller For Unstable Processes
Improving Structural Limitations of Pid Controller For Unstable ProcessesImproving Structural Limitations of Pid Controller For Unstable Processes
Improving Structural Limitations of Pid Controller For Unstable Processes
 
Voltage Profile Improvement of distribution system Using Particle Swarm Optim...
Voltage Profile Improvement of distribution system Using Particle Swarm Optim...Voltage Profile Improvement of distribution system Using Particle Swarm Optim...
Voltage Profile Improvement of distribution system Using Particle Swarm Optim...
 
Growth and Characterizations of Pure and Calcium Doped Cadmium Tartrate Cryst...
Growth and Characterizations of Pure and Calcium Doped Cadmium Tartrate Cryst...Growth and Characterizations of Pure and Calcium Doped Cadmium Tartrate Cryst...
Growth and Characterizations of Pure and Calcium Doped Cadmium Tartrate Cryst...
 
J045046772
J045046772J045046772
J045046772
 
J045075661
J045075661J045075661
J045075661
 
Samyak Vaidyak Dr. Shriniwas Kashalikar
Samyak Vaidyak Dr. Shriniwas KashalikarSamyak Vaidyak Dr. Shriniwas Kashalikar
Samyak Vaidyak Dr. Shriniwas Kashalikar
 
Las Meteoras De Grecia
Las Meteoras De GreciaLas Meteoras De Grecia
Las Meteoras De Grecia
 
Comida de Navidad
Comida de NavidadComida de Navidad
Comida de Navidad
 
Vital Alsar
Vital AlsarVital Alsar
Vital Alsar
 
Sesion 3 prescolar
Sesion 3 prescolarSesion 3 prescolar
Sesion 3 prescolar
 
TREBALLAR AMB EINES WEB2.0 A L’AULA, LA WIKI
TREBALLAR AMB EINES WEB2.0 A L’AULA, LA WIKITREBALLAR AMB EINES WEB2.0 A L’AULA, LA WIKI
TREBALLAR AMB EINES WEB2.0 A L’AULA, LA WIKI
 

Similaire à Z04405149151

Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Editor IJARCET
 
Character Recognition (Devanagari Script)
Character Recognition (Devanagari Script)Character Recognition (Devanagari Script)
Character Recognition (Devanagari Script)IJERA Editor
 
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...ijiert bestjournal
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyEr. Ashish Pandey
 
Opticalcharacter recognition
Opticalcharacter recognition Opticalcharacter recognition
Opticalcharacter recognition Shobhit Saxena
 
Optical Character Recognition Using Python
Optical Character Recognition Using PythonOptical Character Recognition Using Python
Optical Character Recognition Using PythonYogeshIJTSRD
 
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...IRJET Journal
 
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESA STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESijcsitcejournal
 
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREOPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREIRJET Journal
 
IRJET- Photo Optical Character Recognition Model
IRJET- Photo Optical Character Recognition ModelIRJET- Photo Optical Character Recognition Model
IRJET- Photo Optical Character Recognition ModelIRJET Journal
 
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APP
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APPLICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APP
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APPAditya Mishra
 
IRJET- Offline Transcription using AI
IRJET-  	  Offline Transcription using AIIRJET-  	  Offline Transcription using AI
IRJET- Offline Transcription using AIIRJET Journal
 
Smart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PISmart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PIijtsrd
 
Optical Character Recognition from Text Image
Optical Character Recognition from Text ImageOptical Character Recognition from Text Image
Optical Character Recognition from Text ImageEditor IJCATR
 
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...
IRJET- Intelligent Character Recognition of Handwritten Characters using ...IRJET Journal
 

Similaire à Z04405149151 (20)

Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015Volume 2-issue-6-2009-2015
Volume 2-issue-6-2009-2015
 
Character Recognition (Devanagari Script)
Character Recognition (Devanagari Script)Character Recognition (Devanagari Script)
Character Recognition (Devanagari Script)
 
O45018291
O45018291O45018291
O45018291
 
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
BLOB DETECTION TECHNIQUE USING IMAGE PROCESSING FOR IDENTIFICATION OF MACHINE...
 
Optical character recognition IEEE Paper Study
Optical character recognition IEEE Paper StudyOptical character recognition IEEE Paper Study
Optical character recognition IEEE Paper Study
 
D017222226
D017222226D017222226
D017222226
 
Opticalcharacter recognition
Opticalcharacter recognition Opticalcharacter recognition
Opticalcharacter recognition
 
Optical Character Recognition Using Python
Optical Character Recognition Using PythonOptical Character Recognition Using Python
Optical Character Recognition Using Python
 
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
 
50120130406005
5012013040600550120130406005
50120130406005
 
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUESA STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
A STUDY ON OPTICAL CHARACTER RECOGNITION TECHNIQUES
 
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCAREOPTICAL CHARACTER RECOGNITION IN HEALTHCARE
OPTICAL CHARACTER RECOGNITION IN HEALTHCARE
 
IRJET- Photo Optical Character Recognition Model
IRJET- Photo Optical Character Recognition ModelIRJET- Photo Optical Character Recognition Model
IRJET- Photo Optical Character Recognition Model
 
Bj35343348
Bj35343348Bj35343348
Bj35343348
 
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APP
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APPLICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APP
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APP
 
IRJET- Offline Transcription using AI
IRJET-  	  Offline Transcription using AIIRJET-  	  Offline Transcription using AI
IRJET- Offline Transcription using AI
 
Smart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PISmart Assistant for Blind Humans using Rashberry PI
Smart Assistant for Blind Humans using Rashberry PI
 
Optical Character Recognition from Text Image
Optical Character Recognition from Text ImageOptical Character Recognition from Text Image
Optical Character Recognition from Text Image
 
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...IRJET-  	  Intelligent Character Recognition of Handwritten Characters using ...
IRJET- Intelligent Character Recognition of Handwritten Characters using ...
 
Telugu letters dataset and parallel deep convolutional neural network with a...
Telugu letters dataset and parallel deep convolutional neural  network with a...Telugu letters dataset and parallel deep convolutional neural  network with a...
Telugu letters dataset and parallel deep convolutional neural network with a...
 

Dernier

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 

Dernier (20)

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 

Z04405149151

  • 1. Shrinath Janvalkar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 5), April 2014, pp.149-151 www.ijera.com 149 | P a g e Text Recognition from an Image Shrinath Janvalkar, Paresh Manjrekar, Sarvesh Pawar, Prof. Laxman Naik Department Of Computer Engineering1, 2, 3, 4 RMCET(Mumbai University) Ambav, Devrukh, India1, 2, 3, 4 ABSTRACT To achieve high speed in data processing it is necessary to convert the analog data into digital data. Storage of hard copy of any document occupies large space and retrieving of information from that document is time consuming. Optical character recognition system is an effective way in recognition of printed character. It provides an easy way to recognize and convert the printed text on image into the editable text. It also increases the speed of data retrieval from the image. The image which contains characters can be scanned through scanner and then recognition engine of the OCR system interpret the images and convert images of printed characters into machine-readable characters [8].It improving the interface between man and machine in many applications. I. INTRODUCTION Character recognition is one of the most interesting areas of pattern recognition and artificial intelligence. Optical Character Recognition extracts the relevant information and automatically enters it into electronic database instead of the conventional way of manually retyping the text. Optical Character Recognition is a vast field with a number of varied applications such as invoice imaging, legal industry, banking, health care industry etc. OCR is also widely used in many other fields like Captcha, Institutional repositories and digital libraries, Optical Music Recognition without any human correction or human effort, Automatic number plate recognition and Handwritten Recognition [6]. It contributes immensely to the advancement of an automation process and can improve the interface between man and machine in numerous applications. Several research works have been focusing on new techniques and methods that would reduce the processing time while providing higher recognition accuracy. Now it is possible to scan documents as an image and to make it editable and searchable for further information processing. II. OBJECTIVE The objective of OCR software is to recognize the text and then convert it to editable form. Thus, developing computer algorithms to identify the character in the text is the principal task of OCR. A document is first scanned by an optical scanner, which produces an image form of it that is not editable. Optical character recognition involves. Translation of this text image into editable character codes such as ASCII [4]. The below diagram shows the processing mechanism of OCR system (Fig. 1). Fig. 1. OCR Engine III. LITERARURE SURVEY 3.1 First generation OCR systems The first commercialized OCR of this generation was IBM 1418, which was designed to read a special IBM font407. The recognition method was template matching, which compares the character image with a library of prototype images for each character of each font [5]. 3.2 Second generation OCR systems Next generation machines were able to recognize regular machine-printed and hand printed characters. The character set was limited to numerals and a few letters and symbols. Such machines appeared in the middle of 1960s to early 1970s [5]. 3.3 Third generation OCR systems For the third generation of OCR systems, the challenges were documents of poor quality and large printed and hand-written character sets. Low cost and high performance were also important objectives. Commercial OCR systems with such capabilities appeared during the decade 1975 to 1985[5]. RESEARCH ARTICLE OPEN ACCESS
  • 2. Shrinath Janvalkar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 5), April 2014, pp.149-151 www.ijera.com 150 | P a g e 3.4 OCRs Today (Fourth generation OCR systems) The fourth generation can be characterized by the OCR of complex documents intermixing with text, graphics, tables and mathematical symbols, unconstrained handwritten characters, color documents, low-quality noisy documents, etc. Among the commercial products, postal address readers, and reading aids for the blind are available in the market [5]. IV. TASK INVOLVED IN OCR Fig. 2. OCR Processing The above figure shows different processes which are done in OCR system (Fig. 2). 4.1 Image acquisition Input image for OCR system might be acquire by scanning document or by capturing photograph of document. This is also known as digitization process [11]. 4.2 Preprocessing Preprocessing consist series of operations and it used to enhance an image and make it suitable for segmentation. Noise get introduced during document generation. So Proper filter like mean filter, min-max filter, Gaussian filter etc. may be applied to remove noise from document.Binarization process converts gray scale or colored image to black and white image. To enhance visibility and structural information of character Binary morphological operations like opening, closing, thinning, hole filling etc. may be applied on image. If scanned image is not be perfectly aligned, so we need to align it by performing slant angle correction. Input document may be resized if it is too large in size to reduce dimensions to improve speed of processing [11]. 4.3 Segmentation Character segmentation performs an operation of decomposition of an image into Sub images of individual symbols. It is one of the decision processes in a system for optical character recognition (OCR). Its decision that a pattern isolated from the image is that of a character or some other identifiable unit. Generally document is processed in hierarchical way. At first level lines are segmented using row histogram. From each row, words are extracted using column histogram and finally characters are extracted from words. Accuracy of final result is highly depends on accuracy of segmentation [11]. 4.4 Feature extraction Feature extraction is the important part of any pattern recognition application. Feature extraction techniques like Linear Discriminant Analysis (LDA), Principle Component Analysis (PCA),Independent Component Analysis (ICA), Chain Code (CC), Scale Invariant Feature Extraction (SIFT),Gradient based features, Histogram might be applied to extract the features of individual characters. These features are used to train the system [11]. 4.5 Classification When image is provided as input to OCR system, its features are extracted and given as an input to the trained classifier like artificial neural network or support vector machine. Classifiers compare the input feature with stored pattern and find out the best matching class for input [11]. 4.6 Post processing This step is not compulsory; it helps to improve the accuracy of recognition. Syntax analysis, semantic analysis kind of higher level concepts might be applied to check the context of recognized character [11]. V. RESULT: SECTION OF OCR IMAGE Input as Image: Fig. 3. Input to OCR System Result: [1] "veo etoyels Ioke oer, end net r`deao,k head fg suhgestion" very closely together, and that ``deaths's head '' suggestion [2] "oI gtd genep eaw stsougly markod. serhass lI was scnw" of his bones very strongly marked. Perhaps it was fan- [3] "cifol, hmt I thoOphi ihat ha looser lthe a knight of old" ciful, but I thought that he looked like a knight of old
  • 3. Shrinath Janvalkar et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 4( Version 5), April 2014, pp.149-151 www.ijera.com 151 | P a g e [4] "wk, was goine into batIta anr snew he Wak geing so he" who was going into battle and knew he was going to be [5] "... apxhn I tcit what an enH aorhi.Mwy ans suiie unm" ... again, I felt what an extraordinary and quite un- [6] "eoaserouk poWer nf attracigHn he had." conscious power of attraction he had. VI. APPLICATIONS Optical character recognition has been applied to a number of applications. Some of them have been explained below. 6.1 Legal Industry OCR is used in Legal industry for digitize documents, and directly entered to computer database. Legal professionals can further search documents required from huge databases by simply typing a few keywords [6]. 6.2 Healthcare Healthcare professionals always have to deal with large volumes of forms for each patient, including insurance forms as well as general health forms. To keep up with all of this information, it is useful to input relevant data into an electronic database that can be accessed as necessary. Form processing tools, powered by OCR, are able to extract information from forms and put it into databases, so that every patient's data is promptly recorded [6]. 6.3 Optical Music Recognition Initially it was aimed towards recognizing printed sheets which can be edited into playable form with the help of electronic methods. It has many applications like processing of different classes of music, large scale digitization of musical data and also it can be used for diversity in musical notation [6]. 6.4 Automatic Number Recognition Automatic number plate recognition is used as a technique making use of optical character recognition on images to identify vehicle registration plates. They are used by various police forces and as a method of electronic toll collection on pay-per-use roads and cataloging the movements of traffic or individuals [6]. 6.5 Handwriting Recognition It is the ability of a computer system which scans the image of handwritten text by scanner and extracts only handwritten character from that image [7]. VII. CONCLUTION Although results of OCR System are not good, they are not that bad either, indicating that the OCR technique is not awed. More training data may improve robustness and accuracy. REFFERENCES [1] Optical Character Recognition using Neural Networks Deepayan Sarkar University of Wisconsin MadisonECE 539 Project, Fall 2003. [2] “Evaluation of OCR Algorithms for Images with Different Spatial Resolutions and Noises” School of Information Technology and Engineering Faculty of Engineering University of Ottawa©Qing Chen, Ottawa, Canada, 2003. [3] “A Neural Network Implementation of Optical Character Recognition” Technical Report Number CSSE10-05 COMP 6600 – Artificial Intelligence Spring 2009. [4] “Optical Character Recognition Techniques: A Survey” Sukhpreet Singh M.tech Student, Dept. of Computer Engineering, YCOE Talwandi Sabo BP. India. [5] OCR System: A Literature Survey [6] “Survey of OCR Applications” by Amarjot Singh, ketan bacchuwar, Akshay bhasin. [7] M.D. Ganis, C.L. Wilson, J.L. Blue, “Neural network-based systems for handprint OCR applications” in IEEE Transactions on Image Processing, 1998, Vol: 7, Issue: 8, p.p. 1097 – 1112. [8] “Performance Characterization and Acceleration of Optical Character Recognition on Handheld Platforms” Sadagopan Srinivasan, Li Zhao, Lin Sun, Zhen Fang, Peng Li, Tao Wang,Ravishankar Iyer, Ramesh Illikkal, Dong LiuIntel Corporation. [9] “Implementing Optical Character Recognition on the Android Operating System for Business Cards” Sonia Bhaskar, Nicholas Lavassar, Scott GreenEE 368 Digital Image Processing. [10]“A Comparative analysis of feature extraction techniques for handwritten character recognition” Rajbala Tokas1, Aruna Bhadu2 M.Tech*(CS), Swami Keshwanand Institute of Technology, Jaipur, Rajasthan, India, M.Tech*(SE) Govt. Engineering College. [11]“A Literature Review on Hand Written Character Recognition” by Mansi shah & Gordhan B Jethava Department of Computer Science & Engineering Parul Institute of Technology, Gujarat, India. Information Technology Department Parul Institute of Engg. & Technology, Gujarat, India.