Learn how intelligent data capture has replaced scanning for archival. Understand how recognition technologies and capture software including advanced OCR, barcodes and regex, combine to extract your important data seamlessly from scans and existing files. The time is now to truly turn your content into data.
6. Recognition
technologies such
as OCR and barcode
recognition can be
used to pull data
from structured or
unstructured scans
or existing files
painlessly.
7. OCR has the greatest impact on the growth
of intelligent data extraction and the
potential continues to grow as the
technologies continue to improve.
8. Barcode recognition offers
the most trustworthy
recognition technology for
data capture and is widely
deployed.
See What Can Barcodes Do for Me?
9. OMR (Optical Mark Recognition)
• capturing human-marked data from
document forms such as surveys and
• continues to improve in accuracy and
demand
ICR (Intelligent Character Recognition)
• handwriting recognition
• not as accurate as OCR
• plays a limited role in some capture systems
• continues to improve in accuracy and
demand
Other Recognition Technologies
10. After the data has been captured (from barcode,
OCR, etc.), pattern matching technology identifies
the key data.
Regular expressions (regex) provide a
fast and powerful method to search,
extract and replace specific data found
within scanned documents.
Regular expressions are essentially a
special text string for describing a
search pattern.You could think of
regular expressions as extremely
powerful wildcards.
See Using Regular Expressions in Document
Management Data Capture and Indexing
11. See Using Regular Expressions in Document
Management Data Capture and Indexing
Regex’s Lookahead , Lookbehind and Line Item Extraction features
go beyond basic zonal OCR and let you identify and extract data from
unstructured documents. These let you search for an identifiable
keyword or string, like “PO Number” and then a word pattern to
identify the desired text to extract.
12. There’s a Mountain
of It!
Here is a partial invoice where you might need to capture the "Catalogue Number“
with line Item extraction technology.
Real World Example
13. So once the key data
has been identified or
“extracted”, how can it
be used?
14. A large single file can be split into multiple files based on information
extracted from barcodes and content.
Split Files
Name Files and Folders
Name files, folders and subfolders with extracted information from the file
or system information.
Route Files
Route the files to another directory (and even create the folder and
subfolder names) using content.
Create indexes from extracted information for the “searchable” fields.
Index
Create PDF Bookmarks
Create PDF bookmarks based on extracted information.
Validation
Data can be validated against business rules to reduce errors .
16. Integration means
sharing the
information with:
• A simple search and retrieval
system
• A Document Management (DM)
system
• An Enterprise Content
Management (ECM) system
• A back-end application such as an
Enterprise Resource Planning
(ERP) system
Molaire sur implant, jbessade — Travail, www.fr.wikipedia.org
17. Henry Schein,
Dentri Dentrix
Enterprise
Dentrix Ascend,
Easy Dx, ental
Viive,
DentalVision,
axiUm
… ImageRamp can share the extracted data with
anyone who can accept a standard XML or CSV
file
Laserfiche
Filenet
MyMedicalRecords
Eaglesoft
Allscripts
Dentrix
CSV or XML
Anyone
Documentum
Epic
19. There’s a Mountain
of It!
If a stack of invoices were scanned at one time, at each unique occurrence of
the Invoice Number, the file could be split and named with the extracted
invoice number. Furthermore, the Invoice Number could be shared with an
AP system.
The Catalogue Numbers could be extracted and shared with an ERP for
inventory purposes.
Remember our Real World Example?
20. So what needs brushing up?
What does the future
hold for intelligent
data capture? digicla, "Be good for your teeth and the will be good for you“.
21. Continued Improvement in Recognition Technologies Including:
Increased Mobility Integration For Smart Phones, Tablets, etc.
Increased Cloud Computing Options
Improved Validation Against Complex Business Rules
Increased Technical Support to Manage the Complexity
• OCR expansion to include services like translation
• Better accuracy of ICR (handwriting recognition)
• Faster, more accurate
Increased Information Governance Issues and Complexity
22. Want to Learn More about Document Imaging
and Capture?