SlideShare une entreprise Scribd logo
1  sur  55
Télécharger pour lire hors ligne
Handwritten Text Recognition:
Key concepts
PD Dr. Roger Labahn
Computational Intelligence Technology Lab
Mathematical Optimization Group
Institute for Mathematics
University of Rostock
co:op Convention | READ Kickoff 19.01.2016
Handwritten Text Recognition: Key concepts
Introduction
Concepts – Problems – Tasks
Recognition & Training
Interpretation – Decoding
Epilog
co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
Framework – Workflow
...
...
...
• Application: keyword search, transcription, . . .. . .. . .
OUT textual information (words, positions, ...) with alternatives & confidences
⇑⇑⇑
• HTR-Engine
⇑⇑⇑
IN writing images (lines, words, table cells, form fields, ...)
• Layout Analysis: . . .. . .. . . , text blocks
...
...
...
co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
Framework – Workflow
...
...
...
• Application: keyword search, transcription, . . .. . .. . .
OUT textual information (words, positions, ...) with alternatives & confidences
⇑⇑⇑
• HTR-Engine
⇑⇑⇑
IN writing images (lines, words, table cells, form fields, ...)
• Layout Analysis: . . .. . .. . . , text blocks
...
...
...
co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
Alternative recognition strategies
Topological methods
• learn & read graphical substructures of writings
• arcs, lines, curves, holes, ...
HMM based methods
• Hidden Markov Models
• learn & read states while traversing the writing
RNN based methods
• Recurrent Neural Networks
co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
Alternative recognition strategies
Topological methods
• learn & read graphical substructures of writings
• arcs, lines, curves, holes, ...
HMM based methods
• Hidden Markov Models
• learn & read states while traversing the writing
RNN based methods
• Recurrent Neural Networks
co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
Recognition Engine C I T lab & MoU partner PLANET
Decoding textual output
• textual interpretation of recognition results
• matching external requierements / knowledge (dictionaries, language model, ...)
⇑⇑⇑ ⇑⇑⇑
Recognition recognition matrix
• recognition information from image information
• processing standardized writing image
⇑⇑⇑ ⇑⇑⇑
Writing preprocessing standardized writing
• corrections & normalizations
• e.g.: baseline, slant, height, ...
co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
Introduction
Concepts – Problems – Tasks
Segmentation
Context
Language
HTR
Recognition & Training
Interpretation – Decoding
Epilog
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks Roger Labahn | C I T lab
Segmentation ?
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
•
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
•
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
• B
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
• BB
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
• BB.
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
• BB.a
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
• BB.ad
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
• BB.ad␣
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
• BB.ad␣.
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Segmentation ? NONE !
(classical) OCR = Optical Character Recognition
• Reading single characters Sub-images per character ! ?
• B a d ␣ D o ??? a n
Segmentationfree Reading
• processing the entire writing image: word . . .. . .. . . line . . .. . .. . .
• scanning information data sequence (signal) / character sequence
• BB.ad␣.D
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
Image context is essential !
Single segment without context
•
• virtually not (sufficiently) readable
Character sequence without context
•
• virtually not (sufficiently) explainable
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Context Roger Labahn | C I T lab
Image context is essential !
Single segment without context
• u ?? OR n ??
• virtually not (sufficiently) readable
Character sequence without context
• ???
• virtually not (sufficiently) explainable
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Context Roger Labahn | C I T lab
Language context is essential !
Free reading – no restrictions for possible reading results
• BB.ad␣DDolo.auu
• application: figures & general numbers, ...
Comparison against dictionary or keyword
• task: • Read a german city name from a given list !
• Find the name Bad Doberan !
• Bad Doberan
• goal: optimal / possible correspondence
writing / reading result dictionary entry / keyword
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Language Roger Labahn | C I T lab
Language context is essential !
Free reading – no restrictions for possible reading results
• BB.ad␣DDolo.auu
• application: figures & general numbers, ...
Comparison against dictionary or keyword
• task: • Read a german city name from a given list !
• Find the name Bad Doberan !
• Bad Doberan
• goal: optimal / possible correspondence
writing / reading result dictionary entry / keyword
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Language Roger Labahn | C I T lab
Language context is essential !
Free reading – no restrictions for possible reading results
• BB.ad␣DDolo.auu
• application: figures & general numbers, ...
Comparison against dictionary or keyword
• task: • Read a german city name from a given list !
• Find the name Bad Doberan !
• Bad Doberan
• goal: optimal / possible correspondence
writing / reading result dictionary entry / keyword
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Language Roger Labahn | C I T lab
OCR ?
new paradigm – new concepts
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | HTR Roger Labahn | C I T lab
OCR ? HTR !
new paradigm – new concepts new term !
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | HTR Roger Labahn | C I T lab
OCR ? HTR !
new paradigm – new concepts new term !
• HTR Handwritten Text Recognition
• ATR Automatic Text Recognition
• ... ???
co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | HTR Roger Labahn | C I T lab
Introduction
Concepts – Problems – Tasks
Recognition & Training
Feature extraction
Writing processing
Neural Network
Parameter training
Interpretation – Decoding
Epilog
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training Roger Labahn | C I T lab
From pixel values to features
original grey image
Filtering
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
From pixel values to features
original grey image
Filtering
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
From pixel values to features
original grey image
Filtering
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
From pixel values to features
original grey image
Filtering
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
From pixel values to features
original grey image
Filtering
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
From pixel values to features
original grey image
Filtering
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
Collect & remember context !
Writing processing
• scanning in different directions data sequences (signals)
•
Information memory
• neural networks with complex neurons (cells)
• recurrent connections =⇒=⇒=⇒ memory
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Writing processing Roger Labahn | C I T lab
Collect & remember context !
Writing processing
• scanning in different directions data sequences (signals)
•
Information memory
• neural networks with complex neurons (cells)
• recurrent connections =⇒=⇒=⇒ memory
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Writing processing Roger Labahn | C I T lab
Complex cells
co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Neural Network Roger Labahn | C I T lab
Complex cells – memory by recurrent connections
6
?
??

co:op Convention | READ Kickoff HTR Key Concepts | Recognition  Training | Neural Network Roger Labahn | C I T lab
Hierarchical Neuronal Networks
co:op Convention | READ Kickoff HTR Key Concepts | Recognition  Training | Neural Network Roger Labahn | C I T lab
From feature input to network output
(Figure from GRAVES, SCHMIDHUBER: Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks)
co:op Convention | READ Kickoff HTR Key Concepts | Recognition  Training | Neural Network Roger Labahn | C I T lab
From feature input to network output
co:op Convention | READ Kickoff HTR Key Concepts | Recognition  Training | Neural Network Roger Labahn | C I T lab
Parameter training: Machine Learning
Theory
• objective: optimally adapt parameters in cells  along network connections
• idea: train the network with learning data samples
• optimization: minimize error (network output vs. sample target) over training data
Practice: impression of large application cases
• 104
network cells
• 106
trainable parameters
• 104
learning data samples (writing images)
• 150 training epochs each processing every sample once
• 4 weeks training from the scratch
co:op Convention | READ Kickoff HTR Key Concepts | Recognition  Training | Parameter training Roger Labahn | C I T lab
Learning data . . .. . .. . .
• . . .. . .. . . labeled training samples ground truth
HTR: writing images with correct text
• . . .. . .. . . the more the better . . .. . .. . . BUT:
start with realistic (reasonable) number improve while working
• . . .. . .. . . represent all project data . . .. . .. . . BUT:
start with HTR (networks) from similar collections  corpora
• . . .. . .. . . contribute to general HTR engine improvement:
put into network repository for specific application cases
co:op Convention | READ Kickoff HTR Key Concepts | Recognition  Training | Parameter training Roger Labahn | C I T lab
Introduction
Concepts – Problems – Tasks
Recognition  Training
Interpretation – Decoding
Network output
Decoding
Epilog
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding Roger Labahn | C I T lab
Channel probabilities
Pre-conditions
• (abstract) alphabet of (abstract) characters
• text composed of exactly these characters
• alphabet characters ⇐⇒⇐⇒⇐⇒ network output neurons channels
• example: digits, uppercase letters, lowercase letters, special characters ␣-
• much more general: any symbol unit learnable from training data
• current (large) application case: up to 150 character channels
• independent from (natural) language – reading/writing direction – understanding
Network output
probability of (character) channel at writing (image) position
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Network output Roger Labahn | C I T lab
Confidence Matrix – recognition / perception matrix
. B D a d l o u ␣
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Network output Roger Labahn | C I T lab
Expression matching
• restrict to
permissible words dictionary
keyword(s) construct(s) regular expression
• consider
character confidences probability measure
or their negative logarithms distance measure
Algorithmic method
• compare confidence matrix against any permissible expression
• use extremely fast algorithm: Dynamic Programming
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
Expression matching
• restrict to
permissible words dictionary
keyword(s) construct(s) regular expression
• consider
character confidences probability measure
or their negative logarithms distance measure
Algorithmic method
• compare confidence matrix against any permissible expression
• use extremely fast algorithm: Dynamic Programming
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
Decoding
Objective – Result
• permissible expression(s) with best matching to recognition output
• best matching ⇐⇒⇐⇒⇐⇒ maximal probability ⇐⇒⇐⇒⇐⇒ minimum distance
• best alternatives ranked by measure (probability / distance)
Practice: impression of actual application cases
• only decoding on pre-processed lines
• searching 1 keyword in 10.500 lines (433 pages) : 2 - 3 sec. average
• reading 1 page against 11.650 word dictionary: 8 - 9 sec. average
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
Decoding
Objective – Result
• permissible expression(s) with best matching to recognition output
• best matching ⇐⇒⇐⇒⇐⇒ maximal probability ⇐⇒⇐⇒⇐⇒ minimum distance
• best alternatives ranked by measure (probability / distance)
Practice: impression of actual application cases
• only decoding on pre-processed lines
• searching 1 keyword in 10.500 lines (433 pages) : 2 - 3 sec. average
• reading 1 page against 11.650 word dictionary: 8 - 9 sec. average
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
Dynamic Programming
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
Dynamic Programming
co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
Introduction
Concepts – Problems – Tasks
Recognition  Training
Interpretation – Decoding
Epilog
co:op Convention | READ Kickoff HTR Key Concepts | Epilog Roger Labahn | C I T lab
Results
from C I T lab’s contribution to ICDAR’s HTRtS-2015 contest
WER = 0%
CER = 0%
who has with or without right the temporary possession of it : and
who has with or without right the temporary possession of it : and
WER = 17%
CER = 4%
 operation of this act is spent upon Titius only , 
 operation of this act isspeut upon Titius only , 
WER = 67%
CER = 52%
of the said first issue : the amount of such second consequently gap/ to the
of the and put feet the without of such ; said uitrquunity be the
WER = 80%
CER = 17%
for a simple personal Injury the Offender ’ s punish=
For on simple personal injury the offenders punish .
2. Examples of test line images of increasing difficulty. The reference transcript and the CITlab system hypothesis are displayed (in this order) below
h image. The corresponding WER and CER figures are also shown on the right of each image.
the lines with crossed-out word can be transcribed as the
ine shows. Finally, we can see that sometimes, if the line
a large WER but a low CER, the transcript can be more
ul than if the WER is lower and the CER higher (see third
[4] A. Graves, M. Liwicki, S. Fern´andez, R. Bertolami, H. Bunke, and
J. Schmidhuber, “A Novel Connectionist System for Unconstrained
Handwriting Recognition,” IEEE Tr. PAMI, vol. 31, no. 5, pp. 855–
868, 2009.
(Figure from SÁNCHEZ, TOSELLI, ROMERO, VIDAL: ICDAR2015 Competition HTRtS: Handwritten Text Recognition on the tranScriptorium Dataset)
co:op Convention | READ Kickoff HTR Key Concepts | Epilog Roger Labahn | C I T lab
Thanks . . .. . .. . .
C I T lab Group – URO MoU Partner
PLANET intgelligent systems GmbH
EU Funding
Recognition  Enrichment of Archival Documents
. . .. . .. . . for your attention!
co:op Convention | READ Kickoff HTR Key Concepts | Epilog Roger Labahn | C I T lab
co:op Convention | READ Kickoff HTR Key Concepts | Epilog Roger Labahn | C I T lab

Contenu connexe

Tendances

Frontiers of Natural Language Processing
Frontiers of Natural Language ProcessingFrontiers of Natural Language Processing
Frontiers of Natural Language ProcessingSebastian Ruder
 
Transfer Learning for Natural Language Processing
Transfer Learning for Natural Language ProcessingTransfer Learning for Natural Language Processing
Transfer Learning for Natural Language ProcessingSebastian Ruder
 
How to get started with R programming
How to get started with R programmingHow to get started with R programming
How to get started with R programmingRamon Salazar
 
The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022NU_I_TODALAB
 
Ir 1 lec 7
Ir 1 lec 7Ir 1 lec 7
Ir 1 lec 7alaa223
 
Can functional programming be liberated from static typing?
Can functional programming be liberated from static typing?Can functional programming be liberated from static typing?
Can functional programming be liberated from static typing?Vsevolod Dyomkin
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...alessio_ferrari
 
How to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningHow to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningLena Shakurova
 
Natural Language Processing in Practice
Natural Language Processing in PracticeNatural Language Processing in Practice
Natural Language Processing in PracticeVsevolod Dyomkin
 
Intelligent Search
Intelligent SearchIntelligent Search
Intelligent SearchTed Dunning
 
Arcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedArcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedarcomem
 
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Daniel Valcarce
 
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an OverviewNatural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overviewalessio_ferrari
 
Crash-course in Natural Language Processing
Crash-course in Natural Language ProcessingCrash-course in Natural Language Processing
Crash-course in Natural Language ProcessingVsevolod Dyomkin
 

Tendances (18)

Frontiers of Natural Language Processing
Frontiers of Natural Language ProcessingFrontiers of Natural Language Processing
Frontiers of Natural Language Processing
 
Transfer Learning for Natural Language Processing
Transfer Learning for Natural Language ProcessingTransfer Learning for Natural Language Processing
Transfer Learning for Natural Language Processing
 
NLP Project Full Cycle
NLP Project Full CycleNLP Project Full Cycle
NLP Project Full Cycle
 
How to get started with R programming
How to get started with R programmingHow to get started with R programming
How to get started with R programming
 
The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022The VoiceMOS Challenge 2022
The VoiceMOS Challenge 2022
 
Ir 1 lec 7
Ir 1 lec 7Ir 1 lec 7
Ir 1 lec 7
 
Spatial LDA
Spatial LDASpatial LDA
Spatial LDA
 
Aspects of NLP Practice
Aspects of NLP PracticeAspects of NLP Practice
Aspects of NLP Practice
 
Arabic question answering ‫‬
Arabic question answering ‫‬Arabic question answering ‫‬
Arabic question answering ‫‬
 
Can functional programming be liberated from static typing?
Can functional programming be liberated from static typing?Can functional programming be liberated from static typing?
Can functional programming be liberated from static typing?
 
Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...Natural language processing for requirements engineering: ICSE 2021 Technical...
Natural language processing for requirements engineering: ICSE 2021 Technical...
 
How to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningHow to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learning
 
Natural Language Processing in Practice
Natural Language Processing in PracticeNatural Language Processing in Practice
Natural Language Processing in Practice
 
Intelligent Search
Intelligent SearchIntelligent Search
Intelligent Search
 
Arcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedArcomem training entities-and-events_advanced
Arcomem training entities-and-events_advanced
 
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
 
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an OverviewNatural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
 
Crash-course in Natural Language Processing
Crash-course in Natural Language ProcessingCrash-course in Natural Language Processing
Crash-course in Natural Language Processing
 

En vedette

Monitoring IO performance with iostat and pt-diskstats
Monitoring IO performance with iostat and pt-diskstatsMonitoring IO performance with iostat and pt-diskstats
Monitoring IO performance with iostat and pt-diskstatsBen Mildren
 
Treasures of the National Library of Myanmar
Treasures of the National Library of MyanmarTreasures of the National Library of Myanmar
Treasures of the National Library of MyanmarMya OO
 
Final Project presentation on Image processing based intelligent traffic cont...
Final Project presentation on Image processing based intelligent traffic cont...Final Project presentation on Image processing based intelligent traffic cont...
Final Project presentation on Image processing based intelligent traffic cont...Louise Antonio
 
Real time image processing ppt
Real time image processing pptReal time image processing ppt
Real time image processing pptashwini.jagdhane
 

En vedette (6)

Monitoring IO performance with iostat and pt-diskstats
Monitoring IO performance with iostat and pt-diskstatsMonitoring IO performance with iostat and pt-diskstats
Monitoring IO performance with iostat and pt-diskstats
 
Treasures of the National Library of Myanmar
Treasures of the National Library of MyanmarTreasures of the National Library of Myanmar
Treasures of the National Library of Myanmar
 
Final Project presentation on Image processing based intelligent traffic cont...
Final Project presentation on Image processing based intelligent traffic cont...Final Project presentation on Image processing based intelligent traffic cont...
Final Project presentation on Image processing based intelligent traffic cont...
 
Real time image processing ppt
Real time image processing pptReal time image processing ppt
Real time image processing ppt
 
Text Detection and Recognition
Text Detection and RecognitionText Detection and Recognition
Text Detection and Recognition
 
Image processing ppt
Image processing pptImage processing ppt
Image processing ppt
 

Similaire à co:op-READ-Convention Marburg - Roger Labahn

Building Named Entity Recognition Models Efficiently using NERDS
Building Named Entity Recognition Models Efficiently using NERDSBuilding Named Entity Recognition Models Efficiently using NERDS
Building Named Entity Recognition Models Efficiently using NERDSSujit Pal
 
gPBL - Reading Assistant for Blind - Working Progress
gPBL - Reading Assistant for Blind - Working Progress gPBL - Reading Assistant for Blind - Working Progress
gPBL - Reading Assistant for Blind - Working Progress Chanon Khongprasongsiri
 
2R-3KS03-OOP_UNIT-I (Part-A)_2023-24.pptx
2R-3KS03-OOP_UNIT-I (Part-A)_2023-24.pptx2R-3KS03-OOP_UNIT-I (Part-A)_2023-24.pptx
2R-3KS03-OOP_UNIT-I (Part-A)_2023-24.pptxGauravGamer2
 
Using Static Binary Analysis To Find Vulnerabilities And Backdoors in Firmware
Using Static Binary Analysis To Find Vulnerabilities And Backdoors in FirmwareUsing Static Binary Analysis To Find Vulnerabilities And Backdoors in Firmware
Using Static Binary Analysis To Find Vulnerabilities And Backdoors in FirmwareLastline, Inc.
 
Beyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLPBeyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLPMENGSAYLOEM1
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingTed Xiao
 
Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Alia Hamwi
 
Assessment of OCR quality and font identification in historical documents
Assessment of OCR quality and font identification in historical documentsAssessment of OCR quality and font identification in historical documents
Assessment of OCR quality and font identification in historical documentsAnshul Gupta
 
A Static Type Analyzer of Untyped Ruby Code for Ruby 3
A Static Type Analyzer of Untyped Ruby Code for Ruby 3A Static Type Analyzer of Untyped Ruby Code for Ruby 3
A Static Type Analyzer of Untyped Ruby Code for Ruby 3mametter
 
DSL Construction rith Ruby
DSL Construction rith RubyDSL Construction rith Ruby
DSL Construction rith RubyThoughtWorks
 
Topic01 intro.post
Topic01 intro.postTopic01 intro.post
Topic01 intro.postSree Devi
 
NLP for Everyday People
NLP for Everyday PeopleNLP for Everyday People
NLP for Everyday PeopleRebecca Bilbro
 
Knowledge_Based_Systems_Siemens
Knowledge_Based_Systems_SiemensKnowledge_Based_Systems_Siemens
Knowledge_Based_Systems_SiemensVinay Bhat
 
Finding local lessons in software engineering
Finding local lessons in software engineeringFinding local lessons in software engineering
Finding local lessons in software engineeringCS, NcState
 
Machine Learning Pipelines
Machine Learning PipelinesMachine Learning Pipelines
Machine Learning Pipelinesjeykottalam
 
OUTDATED Text Mining 3/5: String Processing
OUTDATED Text Mining 3/5: String ProcessingOUTDATED Text Mining 3/5: String Processing
OUTDATED Text Mining 3/5: String ProcessingFlorian Leitner
 

Similaire à co:op-READ-Convention Marburg - Roger Labahn (20)

Building Named Entity Recognition Models Efficiently using NERDS
Building Named Entity Recognition Models Efficiently using NERDSBuilding Named Entity Recognition Models Efficiently using NERDS
Building Named Entity Recognition Models Efficiently using NERDS
 
gPBL - Reading Assistant for Blind - Working Progress
gPBL - Reading Assistant for Blind - Working Progress gPBL - Reading Assistant for Blind - Working Progress
gPBL - Reading Assistant for Blind - Working Progress
 
2R-3KS03-OOP_UNIT-I (Part-A)_2023-24.pptx
2R-3KS03-OOP_UNIT-I (Part-A)_2023-24.pptx2R-3KS03-OOP_UNIT-I (Part-A)_2023-24.pptx
2R-3KS03-OOP_UNIT-I (Part-A)_2023-24.pptx
 
Using Static Binary Analysis To Find Vulnerabilities And Backdoors in Firmware
Using Static Binary Analysis To Find Vulnerabilities And Backdoors in FirmwareUsing Static Binary Analysis To Find Vulnerabilities And Backdoors in Firmware
Using Static Binary Analysis To Find Vulnerabilities And Backdoors in Firmware
 
Beyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLPBeyond the Symbols: A 30-minute Overview of NLP
Beyond the Symbols: A 30-minute Overview of NLP
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language Processing
 
Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)
 
Information Extraction
Information ExtractionInformation Extraction
Information Extraction
 
Assessment of OCR quality and font identification in historical documents
Assessment of OCR quality and font identification in historical documentsAssessment of OCR quality and font identification in historical documents
Assessment of OCR quality and font identification in historical documents
 
A Static Type Analyzer of Untyped Ruby Code for Ruby 3
A Static Type Analyzer of Untyped Ruby Code for Ruby 3A Static Type Analyzer of Untyped Ruby Code for Ruby 3
A Static Type Analyzer of Untyped Ruby Code for Ruby 3
 
Deep Learning Summit (DLS01-4)
Deep Learning Summit (DLS01-4)Deep Learning Summit (DLS01-4)
Deep Learning Summit (DLS01-4)
 
DSL Construction rith Ruby
DSL Construction rith RubyDSL Construction rith Ruby
DSL Construction rith Ruby
 
Topic01 intro.post
Topic01 intro.postTopic01 intro.post
Topic01 intro.post
 
NLP from scratch
NLP from scratch NLP from scratch
NLP from scratch
 
NLP for Everyday People
NLP for Everyday PeopleNLP for Everyday People
NLP for Everyday People
 
GDSC NYCU | 如何建立自己的開源專案
 GDSC NYCU | 如何建立自己的開源專案 GDSC NYCU | 如何建立自己的開源專案
GDSC NYCU | 如何建立自己的開源專案
 
Knowledge_Based_Systems_Siemens
Knowledge_Based_Systems_SiemensKnowledge_Based_Systems_Siemens
Knowledge_Based_Systems_Siemens
 
Finding local lessons in software engineering
Finding local lessons in software engineeringFinding local lessons in software engineering
Finding local lessons in software engineering
 
Machine Learning Pipelines
Machine Learning PipelinesMachine Learning Pipelines
Machine Learning Pipelines
 
OUTDATED Text Mining 3/5: String Processing
OUTDATED Text Mining 3/5: String ProcessingOUTDATED Text Mining 3/5: String Processing
OUTDATED Text Mining 3/5: String Processing
 

Plus de ICARUS - International Centre for Archival Research

Plus de ICARUS - International Centre for Archival Research (20)

ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
ICARUS-Meeting #20 | The Age of Digital Technology: Documents, Archives and S...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
ICARUS-Meeting #17 | Transparency - Accessibility – Dialogue. How a creative ...
 

Dernier

❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicinesherlingomez2
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Mohammad Khajehpour
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 

Dernier (20)

❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 

co:op-READ-Convention Marburg - Roger Labahn

  • 1. Handwritten Text Recognition: Key concepts PD Dr. Roger Labahn Computational Intelligence Technology Lab Mathematical Optimization Group Institute for Mathematics University of Rostock co:op Convention | READ Kickoff 19.01.2016
  • 2. Handwritten Text Recognition: Key concepts Introduction Concepts – Problems – Tasks Recognition & Training Interpretation – Decoding Epilog co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
  • 3. Framework – Workflow ... ... ... • Application: keyword search, transcription, . . .. . .. . . OUT textual information (words, positions, ...) with alternatives & confidences ⇑⇑⇑ • HTR-Engine ⇑⇑⇑ IN writing images (lines, words, table cells, form fields, ...) • Layout Analysis: . . .. . .. . . , text blocks ... ... ... co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
  • 4. Framework – Workflow ... ... ... • Application: keyword search, transcription, . . .. . .. . . OUT textual information (words, positions, ...) with alternatives & confidences ⇑⇑⇑ • HTR-Engine ⇑⇑⇑ IN writing images (lines, words, table cells, form fields, ...) • Layout Analysis: . . .. . .. . . , text blocks ... ... ... co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
  • 5. Alternative recognition strategies Topological methods • learn & read graphical substructures of writings • arcs, lines, curves, holes, ... HMM based methods • Hidden Markov Models • learn & read states while traversing the writing RNN based methods • Recurrent Neural Networks co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
  • 6. Alternative recognition strategies Topological methods • learn & read graphical substructures of writings • arcs, lines, curves, holes, ... HMM based methods • Hidden Markov Models • learn & read states while traversing the writing RNN based methods • Recurrent Neural Networks co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
  • 7. Recognition Engine C I T lab & MoU partner PLANET Decoding textual output • textual interpretation of recognition results • matching external requierements / knowledge (dictionaries, language model, ...) ⇑⇑⇑ ⇑⇑⇑ Recognition recognition matrix • recognition information from image information • processing standardized writing image ⇑⇑⇑ ⇑⇑⇑ Writing preprocessing standardized writing • corrections & normalizations • e.g.: baseline, slant, height, ... co:op Convention | READ Kickoff HTR Key Concepts | Introduction Roger Labahn | C I T lab
  • 8. Introduction Concepts – Problems – Tasks Segmentation Context Language HTR Recognition & Training Interpretation – Decoding Epilog co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks Roger Labahn | C I T lab
  • 9. Segmentation ? (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 10. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 11. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • B co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 12. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • BB co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 13. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • BB. co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 14. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • BB.a co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 15. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • BB.ad co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 16. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • BB.ad␣ co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 17. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • BB.ad␣. co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 18. Segmentation ? NONE ! (classical) OCR = Optical Character Recognition • Reading single characters Sub-images per character ! ? • B a d ␣ D o ??? a n Segmentationfree Reading • processing the entire writing image: word . . .. . .. . . line . . .. . .. . . • scanning information data sequence (signal) / character sequence • BB.ad␣.D co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Segmentation Roger Labahn | C I T lab
  • 19. Image context is essential ! Single segment without context • • virtually not (sufficiently) readable Character sequence without context • • virtually not (sufficiently) explainable co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Context Roger Labahn | C I T lab
  • 20. Image context is essential ! Single segment without context • u ?? OR n ?? • virtually not (sufficiently) readable Character sequence without context • ??? • virtually not (sufficiently) explainable co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Context Roger Labahn | C I T lab
  • 21. Language context is essential ! Free reading – no restrictions for possible reading results • BB.ad␣DDolo.auu • application: figures & general numbers, ... Comparison against dictionary or keyword • task: • Read a german city name from a given list ! • Find the name Bad Doberan ! • Bad Doberan • goal: optimal / possible correspondence writing / reading result dictionary entry / keyword co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Language Roger Labahn | C I T lab
  • 22. Language context is essential ! Free reading – no restrictions for possible reading results • BB.ad␣DDolo.auu • application: figures & general numbers, ... Comparison against dictionary or keyword • task: • Read a german city name from a given list ! • Find the name Bad Doberan ! • Bad Doberan • goal: optimal / possible correspondence writing / reading result dictionary entry / keyword co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Language Roger Labahn | C I T lab
  • 23. Language context is essential ! Free reading – no restrictions for possible reading results • BB.ad␣DDolo.auu • application: figures & general numbers, ... Comparison against dictionary or keyword • task: • Read a german city name from a given list ! • Find the name Bad Doberan ! • Bad Doberan • goal: optimal / possible correspondence writing / reading result dictionary entry / keyword co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | Language Roger Labahn | C I T lab
  • 24. OCR ? new paradigm – new concepts co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | HTR Roger Labahn | C I T lab
  • 25. OCR ? HTR ! new paradigm – new concepts new term ! co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | HTR Roger Labahn | C I T lab
  • 26. OCR ? HTR ! new paradigm – new concepts new term ! • HTR Handwritten Text Recognition • ATR Automatic Text Recognition • ... ??? co:op Convention | READ Kickoff HTR Key Concepts | Concepts – Problems – Tasks | HTR Roger Labahn | C I T lab
  • 27. Introduction Concepts – Problems – Tasks Recognition & Training Feature extraction Writing processing Neural Network Parameter training Interpretation – Decoding Epilog co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training Roger Labahn | C I T lab
  • 28. From pixel values to features original grey image Filtering co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
  • 29. From pixel values to features original grey image Filtering co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
  • 30. From pixel values to features original grey image Filtering co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
  • 31. From pixel values to features original grey image Filtering co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
  • 32. From pixel values to features original grey image Filtering co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
  • 33. From pixel values to features original grey image Filtering co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Feature extraction Roger Labahn | C I T lab
  • 34. Collect & remember context ! Writing processing • scanning in different directions data sequences (signals) • Information memory • neural networks with complex neurons (cells) • recurrent connections =⇒=⇒=⇒ memory co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Writing processing Roger Labahn | C I T lab
  • 35. Collect & remember context ! Writing processing • scanning in different directions data sequences (signals) • Information memory • neural networks with complex neurons (cells) • recurrent connections =⇒=⇒=⇒ memory co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Writing processing Roger Labahn | C I T lab
  • 36. Complex cells co:op Convention | READ Kickoff HTR Key Concepts | Recognition & Training | Neural Network Roger Labahn | C I T lab
  • 37. Complex cells – memory by recurrent connections 6 ? ?? co:op Convention | READ Kickoff HTR Key Concepts | Recognition Training | Neural Network Roger Labahn | C I T lab
  • 38. Hierarchical Neuronal Networks co:op Convention | READ Kickoff HTR Key Concepts | Recognition Training | Neural Network Roger Labahn | C I T lab
  • 39. From feature input to network output (Figure from GRAVES, SCHMIDHUBER: Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks) co:op Convention | READ Kickoff HTR Key Concepts | Recognition Training | Neural Network Roger Labahn | C I T lab
  • 40. From feature input to network output co:op Convention | READ Kickoff HTR Key Concepts | Recognition Training | Neural Network Roger Labahn | C I T lab
  • 41. Parameter training: Machine Learning Theory • objective: optimally adapt parameters in cells along network connections • idea: train the network with learning data samples • optimization: minimize error (network output vs. sample target) over training data Practice: impression of large application cases • 104 network cells • 106 trainable parameters • 104 learning data samples (writing images) • 150 training epochs each processing every sample once • 4 weeks training from the scratch co:op Convention | READ Kickoff HTR Key Concepts | Recognition Training | Parameter training Roger Labahn | C I T lab
  • 42. Learning data . . .. . .. . . • . . .. . .. . . labeled training samples ground truth HTR: writing images with correct text • . . .. . .. . . the more the better . . .. . .. . . BUT: start with realistic (reasonable) number improve while working • . . .. . .. . . represent all project data . . .. . .. . . BUT: start with HTR (networks) from similar collections corpora • . . .. . .. . . contribute to general HTR engine improvement: put into network repository for specific application cases co:op Convention | READ Kickoff HTR Key Concepts | Recognition Training | Parameter training Roger Labahn | C I T lab
  • 43. Introduction Concepts – Problems – Tasks Recognition Training Interpretation – Decoding Network output Decoding Epilog co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding Roger Labahn | C I T lab
  • 44. Channel probabilities Pre-conditions • (abstract) alphabet of (abstract) characters • text composed of exactly these characters • alphabet characters ⇐⇒⇐⇒⇐⇒ network output neurons channels • example: digits, uppercase letters, lowercase letters, special characters ␣- • much more general: any symbol unit learnable from training data • current (large) application case: up to 150 character channels • independent from (natural) language – reading/writing direction – understanding Network output probability of (character) channel at writing (image) position co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Network output Roger Labahn | C I T lab
  • 45. Confidence Matrix – recognition / perception matrix . B D a d l o u ␣ co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Network output Roger Labahn | C I T lab
  • 46. Expression matching • restrict to permissible words dictionary keyword(s) construct(s) regular expression • consider character confidences probability measure or their negative logarithms distance measure Algorithmic method • compare confidence matrix against any permissible expression • use extremely fast algorithm: Dynamic Programming co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
  • 47. Expression matching • restrict to permissible words dictionary keyword(s) construct(s) regular expression • consider character confidences probability measure or their negative logarithms distance measure Algorithmic method • compare confidence matrix against any permissible expression • use extremely fast algorithm: Dynamic Programming co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
  • 48. Decoding Objective – Result • permissible expression(s) with best matching to recognition output • best matching ⇐⇒⇐⇒⇐⇒ maximal probability ⇐⇒⇐⇒⇐⇒ minimum distance • best alternatives ranked by measure (probability / distance) Practice: impression of actual application cases • only decoding on pre-processed lines • searching 1 keyword in 10.500 lines (433 pages) : 2 - 3 sec. average • reading 1 page against 11.650 word dictionary: 8 - 9 sec. average co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
  • 49. Decoding Objective – Result • permissible expression(s) with best matching to recognition output • best matching ⇐⇒⇐⇒⇐⇒ maximal probability ⇐⇒⇐⇒⇐⇒ minimum distance • best alternatives ranked by measure (probability / distance) Practice: impression of actual application cases • only decoding on pre-processed lines • searching 1 keyword in 10.500 lines (433 pages) : 2 - 3 sec. average • reading 1 page against 11.650 word dictionary: 8 - 9 sec. average co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
  • 50. Dynamic Programming co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
  • 51. Dynamic Programming co:op Convention | READ Kickoff HTR Key Concepts | Interpretation – Decoding | Decoding Roger Labahn | C I T lab
  • 52. Introduction Concepts – Problems – Tasks Recognition Training Interpretation – Decoding Epilog co:op Convention | READ Kickoff HTR Key Concepts | Epilog Roger Labahn | C I T lab
  • 53. Results from C I T lab’s contribution to ICDAR’s HTRtS-2015 contest WER = 0% CER = 0% who has with or without right the temporary possession of it : and who has with or without right the temporary possession of it : and WER = 17% CER = 4% operation of this act is spent upon Titius only , operation of this act isspeut upon Titius only , WER = 67% CER = 52% of the said first issue : the amount of such second consequently gap/ to the of the and put feet the without of such ; said uitrquunity be the WER = 80% CER = 17% for a simple personal Injury the Offender ’ s punish= For on simple personal injury the offenders punish . 2. Examples of test line images of increasing difficulty. The reference transcript and the CITlab system hypothesis are displayed (in this order) below h image. The corresponding WER and CER figures are also shown on the right of each image. the lines with crossed-out word can be transcribed as the ine shows. Finally, we can see that sometimes, if the line a large WER but a low CER, the transcript can be more ul than if the WER is lower and the CER higher (see third [4] A. Graves, M. Liwicki, S. Fern´andez, R. Bertolami, H. Bunke, and J. Schmidhuber, “A Novel Connectionist System for Unconstrained Handwriting Recognition,” IEEE Tr. PAMI, vol. 31, no. 5, pp. 855– 868, 2009. (Figure from SÁNCHEZ, TOSELLI, ROMERO, VIDAL: ICDAR2015 Competition HTRtS: Handwritten Text Recognition on the tranScriptorium Dataset) co:op Convention | READ Kickoff HTR Key Concepts | Epilog Roger Labahn | C I T lab
  • 54. Thanks . . .. . .. . . C I T lab Group – URO MoU Partner PLANET intgelligent systems GmbH EU Funding Recognition Enrichment of Archival Documents . . .. . .. . . for your attention! co:op Convention | READ Kickoff HTR Key Concepts | Epilog Roger Labahn | C I T lab
  • 55. co:op Convention | READ Kickoff HTR Key Concepts | Epilog Roger Labahn | C I T lab