SlideShare a Scribd company logo
1 of 17
Download to read offline
Document Representation Refinement
for Precise Region Description
Christian Clausner, Stefan Pletschacher and
Apostolos Antonacopoulos
PRImA Lab, School of Computing, Science and
Engineering, University of Salford,
United Kingdom
Document Page Regions
DATeCH 2014 2
Segmentation,
Classification
• Region (block, zone): Connected area of a
document image with content of a single
specific type
• Examples: Text, graphic, table
Region Representation
• By geometric objects
– Bounding box
– Stack of rectangles
– Polygon
• By pixels
– Bitmap
– Run-length encoding
DATeCH 2014 3
Need for Precise Region Descriptions
• Precise description is crucial for all but the most
trivial document analysis and recognition
applications
• For performance evaluation:
The loss of quality introduced
by imprecise regions can be
bigger than the variation of
accuracy of the actual
recognition method
DATeCH 2014 4
The Situation
• Trend to more precise descriptions, but…
• Output of state-of-the-artOCR systems:
– Stacks of rectangles (ABBYY FineReader Engine 11)
– Bounding boxes (Tesseract OCR 3.02)
• Popular formats for layout analysis and OCR results:
– ALTO XML (boxes, ellipses, polygons (region level only))
– FineReader XML (stacks of rectangles (region level only))
– PAGE XML (polygons for all levels)
– HOCR (boxes)
DATeCH 2014 5
Refinement through Polygonal Fitting
• Applicable to regions that
have child objects in the
document model
• A typical object hierarchy
contains regions, text lines,
words and glyphs (characters)
• Idea: Tightly wrap a polygon
around the child objects
DATeCH 2014 6
Polygonal Fitting Approach
1. Create bitmasks for the child
objects and transfer them to an
empty bitmap
2. Fill the gaps between the child
objects by a smearing approach
3. Optional: Exclude neighbour
regions
4. Trace the contour of the
foreground and create a polygon
DATeCH 2014 7
1 - Transferring Child Object to Bitmap
• Starting point: Polygonal object (e.g. text line,
word, or glyph)
• Lossless conversion to rectangle based interval
representation
• Transferring the rectangles to the target bitmap
DATeCH 2014 8
2 – Smearing Approach
• Goal: Connect all foreground
components in the bitmap by
filling the gaps in-between
1. Alternatingly fill horizontal and
vertical gaps if they are smaller
than a dynamic threshold
(threshold is increased after
each iteration)
2. If necessary, use diagonal
smearing to connect remaining
components
DATeCH 2014 9
3 – Subtraction of Neighbours
• Optional step to avoid
overlap with adjacent
regions
• Simply erase the
corresponding pixels from
the created bitmap
DATeCH 2014 10
4 – Outline Tracing
• Trace the contour of the
foreground component
in the created bitmap
• Create polygon on-the-
fly by adding points for
each change of direction
(corner)
DATeCH 2014 11
Experiments
• Carried out on a dataset
of contemporary
documents consisting of
scanned magazine and
technical article pages
• Processed with Tesseract
OCR 3.02 (open source)
• Exported to PAGE XML
with and without
refinement
DATeCH 2014 12
DATeCH 2014 13
Original (unrefined) Refined
Results
• Measurement of region overlaps (number and
area)
DATeCH 2014 14
Overlapping
Regions
Overlap Area
(Megapixel)
Original
Outlines
621 (45.8%) 19.9
Refined
Outlines
286 (21.1%) 2.5
Impact on Performance Evaluation
• Real-world scenario
• Measure the performance of Tesseract OCR engine
• Evaluation metrics of previous ICDAR page
segmentation competitions
DATeCH 2014 15
Average success rate using originaloutlines 81.1%
Average success rate using refined outlines 84.5%
Average improvementfor all documents 3.4%
Maximumimprovement 22.9%
Conclusion
• Existing geometric region data can be significantly refined by fitting
precise polygons around child objects
• Validity and impact on real-world scenarios has been shown
• Refinement in performance evaluation helps to eliminate problems
that arise from insufficient geometric descriptions → Concentrate
on real issues of OCR methods
• Positive effect on accuracy of presentation/repurposing systems
(highlighting, cropping, article tracking, etc.)
• Approach used in Aletheia ground truth editor and result viewer
(primaresearch.org/tools)
DATeCH 2014 16
DATeCH 2014 17

More Related Content

What's hot

Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...Subhashis Hazarika
 
Surface reconstruction using point cloud
Surface reconstruction using point cloudSurface reconstruction using point cloud
Surface reconstruction using point cloudishan kossambe
 
GEOPROCESSING IN QGIS
GEOPROCESSING IN QGISGEOPROCESSING IN QGIS
GEOPROCESSING IN QGISSwetha A
 
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationA Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationUniversity of Bari (Italy)
 
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationA Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationUniversity of Bari (Italy)
 
Au 2008 Gs100 1 P Getting Spatial With
Au 2008   Gs100 1 P Getting Spatial WithAu 2008   Gs100 1 P Getting Spatial With
Au 2008 Gs100 1 P Getting Spatial WithRichard Chappell, GISP
 
Spme 2013 segmentation
Spme 2013 segmentationSpme 2013 segmentation
Spme 2013 segmentationQujiang Lei
 
GIS in land suitability mapping
GIS in land suitability mappingGIS in land suitability mapping
GIS in land suitability mappingGlory Enaruvbe
 
Spatial data analysis 1
Spatial data analysis 1Spatial data analysis 1
Spatial data analysis 1Johan Blomme
 
Mar 8 single_map_analysis_1
Mar 8 single_map_analysis_1Mar 8 single_map_analysis_1
Mar 8 single_map_analysis_1dellissimo
 
Geographical information system unit 6
Geographical information  system unit 6Geographical information  system unit 6
Geographical information system unit 6WE-IT TUTORIALS
 
GIS Analysis For Site Remediation
GIS Analysis For Site RemediationGIS Analysis For Site Remediation
GIS Analysis For Site RemediationJoseph Luchette
 
Creating watershed using SRTM DEM
Creating watershed using SRTM DEMCreating watershed using SRTM DEM
Creating watershed using SRTM DEMbajajngadat
 

What's hot (20)

Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
 
Gis Concepts 5/5
Gis Concepts 5/5Gis Concepts 5/5
Gis Concepts 5/5
 
Surface reconstruction using point cloud
Surface reconstruction using point cloudSurface reconstruction using point cloud
Surface reconstruction using point cloud
 
GEOPROCESSING IN QGIS
GEOPROCESSING IN QGISGEOPROCESSING IN QGIS
GEOPROCESSING IN QGIS
 
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationA Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
 
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationA Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
 
Au 2008 Gs100 1 P Getting Spatial With
Au 2008   Gs100 1 P Getting Spatial WithAu 2008   Gs100 1 P Getting Spatial With
Au 2008 Gs100 1 P Getting Spatial With
 
conversion of digital elevation maps to geological information
conversion of digital elevation maps to geological informationconversion of digital elevation maps to geological information
conversion of digital elevation maps to geological information
 
Spme 2013 segmentation
Spme 2013 segmentationSpme 2013 segmentation
Spme 2013 segmentation
 
GIS in land suitability mapping
GIS in land suitability mappingGIS in land suitability mapping
GIS in land suitability mapping
 
Spatial data analysis 1
Spatial data analysis 1Spatial data analysis 1
Spatial data analysis 1
 
Mar 8 single_map_analysis_1
Mar 8 single_map_analysis_1Mar 8 single_map_analysis_1
Mar 8 single_map_analysis_1
 
Geographical information system unit 6
Geographical information  system unit 6Geographical information  system unit 6
Geographical information system unit 6
 
GIS Analysis For Site Remediation
GIS Analysis For Site RemediationGIS Analysis For Site Remediation
GIS Analysis For Site Remediation
 
Spatial Data Model
Spatial Data ModelSpatial Data Model
Spatial Data Model
 
QGIS Tutorial 1
QGIS Tutorial 1QGIS Tutorial 1
QGIS Tutorial 1
 
Creating watershed using SRTM DEM
Creating watershed using SRTM DEMCreating watershed using SRTM DEM
Creating watershed using SRTM DEM
 
QGIS Tutorial 2
QGIS Tutorial 2QGIS Tutorial 2
QGIS Tutorial 2
 
ML whitepaper v0.2
ML whitepaper v0.2ML whitepaper v0.2
ML whitepaper v0.2
 
Graph chi
Graph chiGraph chi
Graph chi
 

Viewers also liked

University library of KU Leuven - Sam Alloing et Demmy Verbecke
University library of KU Leuven - Sam Alloing et Demmy VerbeckeUniversity library of KU Leuven - Sam Alloing et Demmy Verbecke
University library of KU Leuven - Sam Alloing et Demmy VerbeckeIMPACT Centre of Competence
 
Biblioteca Virtual Miguel de Cervantes - Oskarbi Zubiarrain
Biblioteca Virtual Miguel de Cervantes - Oskarbi ZubiarrainBiblioteca Virtual Miguel de Cervantes - Oskarbi Zubiarrain
Biblioteca Virtual Miguel de Cervantes - Oskarbi ZubiarrainIMPACT Centre of Competence
 
2. Interoperability framework and Taverna. Enrique Molla, Succeed Project.
2. Interoperability framework and Taverna. Enrique Molla, Succeed Project. 2. Interoperability framework and Taverna. Enrique Molla, Succeed Project.
2. Interoperability framework and Taverna. Enrique Molla, Succeed Project. IMPACT Centre of Competence
 
7. Technical development at the Meertens Institute. Marc Kemps Snijders.
7. Technical development at the Meertens Institute. Marc Kemps Snijders.7. Technical development at the Meertens Institute. Marc Kemps Snijders.
7. Technical development at the Meertens Institute. Marc Kemps Snijders.IMPACT Centre of Competence
 
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...IMPACT Centre of Competence
 

Viewers also liked (8)

Kennisbank IMPACT by Lotte Wilms
Kennisbank IMPACT by Lotte WilmsKennisbank IMPACT by Lotte Wilms
Kennisbank IMPACT by Lotte Wilms
 
Image Enhancement tools by Lotte Wilms
Image Enhancement tools by Lotte WilmsImage Enhancement tools by Lotte Wilms
Image Enhancement tools by Lotte Wilms
 
University library of KU Leuven - Sam Alloing et Demmy Verbecke
University library of KU Leuven - Sam Alloing et Demmy VerbeckeUniversity library of KU Leuven - Sam Alloing et Demmy Verbecke
University library of KU Leuven - Sam Alloing et Demmy Verbecke
 
Biblioteca Virtual Miguel de Cervantes - Oskarbi Zubiarrain
Biblioteca Virtual Miguel de Cervantes - Oskarbi ZubiarrainBiblioteca Virtual Miguel de Cervantes - Oskarbi Zubiarrain
Biblioteca Virtual Miguel de Cervantes - Oskarbi Zubiarrain
 
2. Interoperability framework and Taverna. Enrique Molla, Succeed Project.
2. Interoperability framework and Taverna. Enrique Molla, Succeed Project. 2. Interoperability framework and Taverna. Enrique Molla, Succeed Project.
2. Interoperability framework and Taverna. Enrique Molla, Succeed Project.
 
CONCERT IMPACT by Lotte Wilms
CONCERT IMPACT by Lotte WilmsCONCERT IMPACT by Lotte Wilms
CONCERT IMPACT by Lotte Wilms
 
7. Technical development at the Meertens Institute. Marc Kemps Snijders.
7. Technical development at the Meertens Institute. Marc Kemps Snijders.7. Technical development at the Meertens Institute. Marc Kemps Snijders.
7. Technical development at the Meertens Institute. Marc Kemps Snijders.
 
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
 

Similar to Datech2014-Session1-Document Representation Refinement for Precise Region Description

Algorithmic Techniques for Parametric Model Recovery
Algorithmic Techniques for Parametric Model RecoveryAlgorithmic Techniques for Parametric Model Recovery
Algorithmic Techniques for Parametric Model RecoveryCurvSurf
 
Dsd int 2014 - data science symposium - application 1 - point clouds, prof. p...
Dsd int 2014 - data science symposium - application 1 - point clouds, prof. p...Dsd int 2014 - data science symposium - application 1 - point clouds, prof. p...
Dsd int 2014 - data science symposium - application 1 - point clouds, prof. p...Deltares
 
Enterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkEnterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkAlpine Data
 
Enterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkEnterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkSpark Summit
 
An Efficient Arabic Text Spotting from Natural Scenes Images
An Efficient Arabic Text Spotting from Natural Scenes ImagesAn Efficient Arabic Text Spotting from Natural Scenes Images
An Efficient Arabic Text Spotting from Natural Scenes ImagesReham Marzouk
 
Global Map Matching using BLE Beacons for Indoor Route and Stay Estimation
Global Map Matching using BLE Beacons for Indoor Route and Stay EstimationGlobal Map Matching using BLE Beacons for Indoor Route and Stay Estimation
Global Map Matching using BLE Beacons for Indoor Route and Stay EstimationDaisuke Yamamoto
 
BarnieMAT
BarnieMATBarnieMAT
BarnieMATNplusT
 
HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010Cloudera, Inc.
 
TcpTunnel CAD
TcpTunnel CADTcpTunnel CAD
TcpTunnel CADaplitop
 
2015-07-08 Paper 38 - ICVS Talk
2015-07-08 Paper 38 - ICVS Talk2015-07-08 Paper 38 - ICVS Talk
2015-07-08 Paper 38 - ICVS TalkThomas Sølund
 
Computer Vision Landscape : Present and Future
Computer Vision Landscape : Present and FutureComputer Vision Landscape : Present and Future
Computer Vision Landscape : Present and FutureSanghamitra Deb
 
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화NAVER Engineering
 
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...Safe Software
 
Big Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsBig Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsHPCC Systems
 

Similar to Datech2014-Session1-Document Representation Refinement for Precise Region Description (20)

2015 10-08 - additive manufacturing software 1
2015 10-08 - additive manufacturing software  12015 10-08 - additive manufacturing software  1
2015 10-08 - additive manufacturing software 1
 
Algorithmic Techniques for Parametric Model Recovery
Algorithmic Techniques for Parametric Model RecoveryAlgorithmic Techniques for Parametric Model Recovery
Algorithmic Techniques for Parametric Model Recovery
 
Dsd int 2014 - data science symposium - application 1 - point clouds, prof. p...
Dsd int 2014 - data science symposium - application 1 - point clouds, prof. p...Dsd int 2014 - data science symposium - application 1 - point clouds, prof. p...
Dsd int 2014 - data science symposium - application 1 - point clouds, prof. p...
 
Enterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkEnterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using Spark
 
Enterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using SparkEnterprise Scale Topological Data Analysis Using Spark
Enterprise Scale Topological Data Analysis Using Spark
 
An Efficient Arabic Text Spotting from Natural Scenes Images
An Efficient Arabic Text Spotting from Natural Scenes ImagesAn Efficient Arabic Text Spotting from Natural Scenes Images
An Efficient Arabic Text Spotting from Natural Scenes Images
 
DaViT.pdf
DaViT.pdfDaViT.pdf
DaViT.pdf
 
Global Map Matching using BLE Beacons for Indoor Route and Stay Estimation
Global Map Matching using BLE Beacons for Indoor Route and Stay EstimationGlobal Map Matching using BLE Beacons for Indoor Route and Stay Estimation
Global Map Matching using BLE Beacons for Indoor Route and Stay Estimation
 
Presentation
PresentationPresentation
Presentation
 
BarnieMAT
BarnieMATBarnieMAT
BarnieMAT
 
HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010HP - Jerome Rolia - Hadoop World 2010
HP - Jerome Rolia - Hadoop World 2010
 
TcpTunnel CAD
TcpTunnel CADTcpTunnel CAD
TcpTunnel CAD
 
2015-07-08 Paper 38 - ICVS Talk
2015-07-08 Paper 38 - ICVS Talk2015-07-08 Paper 38 - ICVS Talk
2015-07-08 Paper 38 - ICVS Talk
 
Computer Vision Landscape : Present and Future
Computer Vision Landscape : Present and FutureComputer Vision Landscape : Present and Future
Computer Vision Landscape : Present and Future
 
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
대용량 데이터 분석을 위한 병렬 Clustering 알고리즘 최적화
 
Facility layout
Facility layoutFacility layout
Facility layout
 
Spatiotemporal analytics
Spatiotemporal analyticsSpatiotemporal analytics
Spatiotemporal analytics
 
LiDAR_Project
LiDAR_ProjectLiDAR_Project
LiDAR_Project
 
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...
Spatial decision support and analytics on a campus scale: bringing GIS, CAD, ...
 
Big Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC SystemsBig Data and Geospatial with HPCC Systems
Big Data and Geospatial with HPCC Systems
 

More from IMPACT Centre of Competence

More from IMPACT Centre of Competence (20)

Session6 01.helmut schmid
Session6 01.helmut schmidSession6 01.helmut schmid
Session6 01.helmut schmid
 
Session1 03.hsian-an wang
Session1 03.hsian-an wangSession1 03.hsian-an wang
Session1 03.hsian-an wang
 
Session7 03.katrien depuydt
Session7 03.katrien depuydtSession7 03.katrien depuydt
Session7 03.katrien depuydt
 
Session7 02.peter kiraly
Session7 02.peter kiralySession7 02.peter kiraly
Session7 02.peter kiraly
 
Session6 04.giuseppe celano
Session6 04.giuseppe celanoSession6 04.giuseppe celano
Session6 04.giuseppe celano
 
Session6 03.sandra young
Session6 03.sandra youngSession6 03.sandra young
Session6 03.sandra young
 
Session6 02.jeremi ochab
Session6 02.jeremi ochabSession6 02.jeremi ochab
Session6 02.jeremi ochab
 
Session5 04.evangelos varthis
Session5 04.evangelos varthisSession5 04.evangelos varthis
Session5 04.evangelos varthis
 
Session5 03.george rehm
Session5 03.george rehmSession5 03.george rehm
Session5 03.george rehm
 
Session5 02.tom derrick
Session5 02.tom derrickSession5 02.tom derrick
Session5 02.tom derrick
 
Session5 01.rutger vankoert
Session5 01.rutger vankoertSession5 01.rutger vankoert
Session5 01.rutger vankoert
 
Session4 04.senka drobac
Session4 04.senka drobacSession4 04.senka drobac
Session4 04.senka drobac
 
Session3 04.arnau baro
Session3 04.arnau baroSession3 04.arnau baro
Session3 04.arnau baro
 
Session3 03.christian clausner
Session3 03.christian clausnerSession3 03.christian clausner
Session3 03.christian clausner
 
Session3 02.kimmo ketunnen
Session3 02.kimmo ketunnenSession3 02.kimmo ketunnen
Session3 02.kimmo ketunnen
 
Session3 01.clemens neudecker
Session3 01.clemens neudeckerSession3 01.clemens neudecker
Session3 01.clemens neudecker
 
Session2 04.ashkan ashkpour
Session2 04.ashkan ashkpourSession2 04.ashkan ashkpour
Session2 04.ashkan ashkpour
 
Session2 03.juri opitz
Session2 03.juri opitzSession2 03.juri opitz
Session2 03.juri opitz
 
Session2 02.christian reul
Session2 02.christian reulSession2 02.christian reul
Session2 02.christian reul
 
Session2 01.emad mohamed
Session2 01.emad mohamedSession2 01.emad mohamed
Session2 01.emad mohamed
 

Recently uploaded

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 

Recently uploaded (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Datech2014-Session1-Document Representation Refinement for Precise Region Description

  • 1. Document Representation Refinement for Precise Region Description Christian Clausner, Stefan Pletschacher and Apostolos Antonacopoulos PRImA Lab, School of Computing, Science and Engineering, University of Salford, United Kingdom
  • 2. Document Page Regions DATeCH 2014 2 Segmentation, Classification • Region (block, zone): Connected area of a document image with content of a single specific type • Examples: Text, graphic, table
  • 3. Region Representation • By geometric objects – Bounding box – Stack of rectangles – Polygon • By pixels – Bitmap – Run-length encoding DATeCH 2014 3
  • 4. Need for Precise Region Descriptions • Precise description is crucial for all but the most trivial document analysis and recognition applications • For performance evaluation: The loss of quality introduced by imprecise regions can be bigger than the variation of accuracy of the actual recognition method DATeCH 2014 4
  • 5. The Situation • Trend to more precise descriptions, but… • Output of state-of-the-artOCR systems: – Stacks of rectangles (ABBYY FineReader Engine 11) – Bounding boxes (Tesseract OCR 3.02) • Popular formats for layout analysis and OCR results: – ALTO XML (boxes, ellipses, polygons (region level only)) – FineReader XML (stacks of rectangles (region level only)) – PAGE XML (polygons for all levels) – HOCR (boxes) DATeCH 2014 5
  • 6. Refinement through Polygonal Fitting • Applicable to regions that have child objects in the document model • A typical object hierarchy contains regions, text lines, words and glyphs (characters) • Idea: Tightly wrap a polygon around the child objects DATeCH 2014 6
  • 7. Polygonal Fitting Approach 1. Create bitmasks for the child objects and transfer them to an empty bitmap 2. Fill the gaps between the child objects by a smearing approach 3. Optional: Exclude neighbour regions 4. Trace the contour of the foreground and create a polygon DATeCH 2014 7
  • 8. 1 - Transferring Child Object to Bitmap • Starting point: Polygonal object (e.g. text line, word, or glyph) • Lossless conversion to rectangle based interval representation • Transferring the rectangles to the target bitmap DATeCH 2014 8
  • 9. 2 – Smearing Approach • Goal: Connect all foreground components in the bitmap by filling the gaps in-between 1. Alternatingly fill horizontal and vertical gaps if they are smaller than a dynamic threshold (threshold is increased after each iteration) 2. If necessary, use diagonal smearing to connect remaining components DATeCH 2014 9
  • 10. 3 – Subtraction of Neighbours • Optional step to avoid overlap with adjacent regions • Simply erase the corresponding pixels from the created bitmap DATeCH 2014 10
  • 11. 4 – Outline Tracing • Trace the contour of the foreground component in the created bitmap • Create polygon on-the- fly by adding points for each change of direction (corner) DATeCH 2014 11
  • 12. Experiments • Carried out on a dataset of contemporary documents consisting of scanned magazine and technical article pages • Processed with Tesseract OCR 3.02 (open source) • Exported to PAGE XML with and without refinement DATeCH 2014 12
  • 13. DATeCH 2014 13 Original (unrefined) Refined
  • 14. Results • Measurement of region overlaps (number and area) DATeCH 2014 14 Overlapping Regions Overlap Area (Megapixel) Original Outlines 621 (45.8%) 19.9 Refined Outlines 286 (21.1%) 2.5
  • 15. Impact on Performance Evaluation • Real-world scenario • Measure the performance of Tesseract OCR engine • Evaluation metrics of previous ICDAR page segmentation competitions DATeCH 2014 15 Average success rate using originaloutlines 81.1% Average success rate using refined outlines 84.5% Average improvementfor all documents 3.4% Maximumimprovement 22.9%
  • 16. Conclusion • Existing geometric region data can be significantly refined by fitting precise polygons around child objects • Validity and impact on real-world scenarios has been shown • Refinement in performance evaluation helps to eliminate problems that arise from insufficient geometric descriptions → Concentrate on real issues of OCR methods • Positive effect on accuracy of presentation/repurposing systems (highlighting, cropping, article tracking, etc.) • Approach used in Aletheia ground truth editor and result viewer (primaresearch.org/tools) DATeCH 2014 16