Optical Character Recognition (OCR)


Vidyut Singhania

An introduction to the concept and development of Optical Character Recognition.


OPTICAL CHARACTER
RECOGNITION
Divyanshu Sagar
Ahmed Zaid Faizee
Vidyut Singhania

INTRO
1. Ingenious piece of software.
2. Involves the mechanical/electronic
conversion of scanned images of
typewritten/printed text into
machine-encoded/computer-readable text.
3. Heavily used in the
industry.

INTRO ii
• Common method of digitizing printed texts
• Subtle software that is as widely overlooked as it is simple.
• Numerous applications and uses – editing, scanning,
searching, comparison, compact storage and many more!
• OCR is a field of research in pattern recognition, artificial
intelligence and computer vision.

Problem Statement
Ever since Charles Babbage designed his mechanical computing engines in the 19th
century, computing machines have captured the human imagination for numerous reasons, the
primary one being the question of what this collection of nuts, bolts and wires is capable of doing.
Character recognition is one such concept that has held mankind's attention. There
can be no greater testimony to this than the fact that people were already working on
the idea a few decades before John McCarthy coined the term "Artificial
Intelligence".
Today, especially, character recognition plays a very important part in our daily lives; it
is incorporated so subtly that we often forget its presence. Some examples are
its implementation in Microsoft Word, Adobe Acrobat and even pen computing.
Optical Character Recognition (OCR) is the mechanical or electronic conversion of scanned
or photographed images of typewritten or printed text into machine-encoded, computer-readable
text. This text can then be used in numerous ways, ranging from assisting the
visually impaired (text-to-speech) to extracting information from the image, pen computing
and so on. OCR is the result of combining several areas
of technology, such as machine learning, artificial intelligence and neural networks. We
propose to develop a system based on mathematical algorithms and principles that
draw on all of these technologies. That being said, OCR
also depends on a few other factors: the quality of the image taken, the orientation
of the text and the dialect being used. Our paper aims to address these
problems, enabling the application of OCR in numerous new fields as well as in the obvious and
established ones.

Tech Jargon - I
• Pre-processing
Improves the chances of successful
recognition of the image (includes de-skew,
layout analysis and despeckle)
• Character/glyph recognition
• Post-processing
• Application-specific optimization
Tweaking the system to better deal
with specific or different inputs.
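
As one illustration of the pre-processing step, the sketch below shows a very simple form of despeckling. It assumes the scanned page has already been converted to a binary grid (1 = ink, 0 = background) and treats any ink pixel with no ink in its 8 neighbours as noise. The function name `despeckle` and the sample grid are made up for this example; practical systems usually rely on more robust filters.

```python
def despeckle(image):
    """Remove isolated ink pixels from a binary image (1 = ink, 0 = background).

    A pixel counts as a speckle when none of its 8 neighbours is ink.
    The input is left unchanged; a cleaned copy is returned.
    """
    rows, cols = len(image), len(image[0])
    cleaned = [row[:] for row in image]
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1:
                continue
            has_ink_neighbour = any(
                image[nr][nc] == 1
                for nr in range(max(0, r - 1), min(rows, r + 2))
                for nc in range(max(0, c - 1), min(cols, c + 2))
                if (nr, nc) != (r, c)
            )
            if not has_ink_neighbour:
                cleaned[r][c] = 0
    return cleaned


# Hypothetical sample: a 2x2 blob of ink plus one stray pixel at row 2, column 5.
noisy = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 0, 1],
    [0, 0, 0, 0, 0, 0],
]
clean = despeckle(noisy)
```

Here the 2x2 blob survives because each of its pixels touches other ink, while the stray pixel is cleared.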

Tech Jargon - II
Segmentation
Includes two important phases:
1) Obtaining training samples
2) Recognizing new images after
training
Feature Extraction
Features of the character are extracted
and then compared with the glyph
Classification
After the extraction, a neural network is
trained using the training data
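
The ideas on this slide can be sketched in a few lines. The example below is only an assumption about how such a pipeline might look, not the project's actual method: it splits a binary text line into characters using a vertical projection profile (empty columns are gaps), computes zoning features (ink density per grid cell), and compares them with stored glyph templates by squared distance. The nearest-template comparison stands in for the neural network classifier, and all function names, the sample image and the templates are hypothetical.

```python
def segment_columns(image):
    """Split a binary text-line image (1 = ink) into character images.

    Columns without any ink are treated as gaps between characters,
    so this only works when neighbouring characters do not touch.
    """
    rows, cols = len(image), len(image[0])
    column_has_ink = [any(image[r][c] for r in range(rows)) for c in range(cols)]
    segments, start = [], None
    # The trailing False is a sentinel that closes a run ending at the last column.
    for c, has_ink in enumerate(column_has_ink + [False]):
        if has_ink and start is None:
            start = c
        elif not has_ink and start is not None:
            segments.append([row[start:c] for row in image])
            start = None
    return segments


def zoning_features(char_image, zones=2):
    """Return the ink density of each cell in a zones x zones grid."""
    rows, cols = len(char_image), len(char_image[0])
    features = []
    for zr in range(zones):
        for zc in range(zones):
            r0, r1 = zr * rows // zones, (zr + 1) * rows // zones
            c0, c1 = zc * cols // zones, (zc + 1) * cols // zones
            cells = [char_image[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            features.append(sum(cells) / len(cells) if cells else 0.0)
    return features


def nearest_glyph(features, templates):
    """Pick the template label with the smallest squared distance."""
    return min(
        templates,
        key=lambda label: sum((a - b) ** 2 for a, b in zip(features, templates[label])),
    )


# Hypothetical line image holding two characters separated by an empty column.
line = [
    [1, 0, 0, 1, 1],
    [1, 0, 0, 1, 1],
    [1, 0, 0, 0, 0],
    [1, 1, 0, 0, 0],
]
characters = segment_columns(line)
first_features = zoning_features(characters[0])

# Hypothetical glyph templates expressed in the same 2x2 zoning features.
templates = {"L": [1.0, 0.0, 1.0, 0.5], "top_bar": [1.0, 1.0, 0.0, 0.0]}
label = nearest_glyph(first_features, templates)
```

A trained neural network would replace `nearest_glyph` in the classification stage; the zoning features would then serve as its inputs.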

Our Current Progress
• We started with the Neural Networks / Machine Learning
aspect of the project.
• We have implemented univariate and multivariate linear
regression (plain and regularized), gradient descent for
multiple variables, and logistic regression (plain and
regularized).
• Currently, we are studying and working on the
implementation of neural nets using forward propagation.
• We plan on tackling character segmentation and feature
extraction next.
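
Two of the techniques named above can be illustrated briefly. The sketch below is written in Python purely for illustration (the project itself uses GNU Octave) and is not the team's code: `gradient_descent` fits a single-variable linear model by batch gradient descent on squared error, and `forward_propagate` pushes an input through fully connected sigmoid layers. The learning rate, iteration count, sample data and the hand-picked XNOR weights are all assumptions chosen for the example; in practice the weights would be learned.

```python
import math


def sigmoid(z):
    """Logistic function used as the neuron activation."""
    return 1.0 / (1.0 + math.exp(-z))


def gradient_descent(xs, ys, alpha=0.1, iterations=2000):
    """Fit y ~ theta0 + theta1 * x by batch gradient descent on squared error."""
    theta0, theta1 = 0.0, 0.0
    m = len(xs)
    for _ in range(iterations):
        errors = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
        grad0 = sum(errors) / m
        grad1 = sum(e * x for e, x in zip(errors, xs)) / m
        # Update both parameters simultaneously from the same error vector.
        theta0 -= alpha * grad0
        theta1 -= alpha * grad1
    return theta0, theta1


def forward_propagate(inputs, layers):
    """Forward propagation through fully connected sigmoid layers.

    Each layer is a list of weight rows, one row per neuron;
    row[0] is the bias weight, the rest multiply the previous activations.
    """
    activations = list(inputs)
    for weights in layers:
        with_bias = [1.0] + activations
        activations = [
            sigmoid(sum(w * a for w, a in zip(row, with_bias))) for row in weights
        ]
    return activations


# Sample data generated from y = 1 + 2x, so the fit should approach (1, 2).
theta0, theta1 = gradient_descent([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])

# Hand-picked (not trained) weights for a tiny network computing XNOR:
# hidden layer = [AND, NOR], output layer = OR of the two hidden units.
xnor_layers = [
    [[-30.0, 20.0, 20.0], [10.0, -20.0, -20.0]],
    [[-10.0, 20.0, 20.0]],
]
same = forward_propagate([1.0, 1.0], xnor_layers)[0]       # close to 1
different = forward_propagate([0.0, 1.0], xnor_layers)[0]  # close to 0
```

For character recognition the same `forward_propagate` routine would take pixel or feature values as inputs and produce one output activation per character class.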

Technology to be used
• We are using the following technology
platforms:
– GNU Octave
To develop and test the OCR software.
– 5MP HD camera (720p @ 30fps)
To take images for detection

Literature Review
• Microsoft OneNote
• Adobe PDF scanner
• HP scanner

Editor's notes

In 1914, Emanuel Goldberg developed a machine that read characters and converted them into standard telegraph code. Around the same time, Edmund Fournier d'Albe developed the Optophone, a handheld scanner that, when moved across a printed page, produced tones that corresponded to specific letters or characters.
