SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
Succeed 
WP3 – Validation and take-up of tools 
Katrien Depuydt (INL) –Stefan Eickeler, Sebastian Kirch, (IAIS) 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Objectives 
Many tools and linguistic resources were developed in research and development programs supporting the digitisation of cultural heritage 
Still, too few are used in the productive environments 
Succeed’s approach to support the take-up of these tools: 
1.Identify existing tools and resources 
2.Identify libraries willing to use and evaluate tools 
3.Define criteria to validate and evaluate tools 
4.Provide training material for tools 
5.Provide support to libraries using and evaluating tools 
6.Blueprint for validation and take-up of tools 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Survey of tools 
Training material 
Evaluation
1. SURVEY AND SELECTION OF TOOLS
Survey of tools 
Brief description and goals 
Produce a survey of existing 
tools 
ground truth data and 
lexicon data for digitisation 
Select candidate tools for implementation at cultural heritage institutions 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Survey of tools 
Methodology used to achieve the objectives 
1.Taxonomy for categorisation based on a simplified digitisation workflow 
2.Definition of attributes e.g. how a tool can be used in the digitisation process 
3.Online Spreadsheet to collect and organise tools 
4.Assessment and further selection into a shortlist of tools 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Selection of tools 
First selection: knock-out criteria (three steps) 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante. 
Further selection: (expertise partners)
Task 1 Survey of tools 
Summary of outcomes 
Categorised list of 213 research and commercial tools 
Available in an online database and frequently updated 
Shortlist with the most relevant tools based on a quality assessment 
An overview of existing ground truth material and lexicon data has been produced. http://impact.dlsi.ua.es/digitisation/tools-resources/tools-for-text-digitisation/ 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
2. VALIDATION PARAMETERS
1st Project Review – WP3 
Validation parameters 
Brief description and goals 
Define validation parameters and procedures for the implementation of tools in productive environments (per task carried out by using a tool) 
Validate each tool (or group of tools) based on these criteria 
Work out evaluation work plans and test scenarios in cooperation with libraries based on their requirements 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Validation parameters 
Methodology used to achieve the objectives 
1.Definition of evaluation template structure 
2.Tool selection by libraries 
3.Creation and compilation of evaluation material Separate evaluation forms per task/tool type & common usability evaluation form 
4.Distribution of evaluation material to participating libraries 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
1st Project Review – WP3 
Validation parameters 
Summary of outcomes 
Described evaluation procedures and produced 9 evaluation forms per task 
Worked out evaluation and test scenarios as a “work plan” together with the participating libraries 
Blueprint for take-up and validation 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
3. TAKE-UP SUPPORT
Take-up support 
Brief description and goals 
Support the integration, take-up and validation of digitisation tools and resources 
Tool implementation at four participant libraries and nine external libraries (16 potential external libraries at the start of the project > 9 retained) 
Assistance for the adaptation/application of the tools to specific domains and/or languages 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Take-up support 
Methodology used to achieve the objectives 
1.Each library installs, on average, two tools and tests their performance and usability in a productive environment according to the predifined validation criteria 
2.Some consortium libraries will test existing linguistic resources for enhancement of textual information retrieval 
3.The technical partners (IAIS, INL, PSNC, UA) will provide online assistance for the adaptation of the tools to specific domains and languages 
4.The technical partners will report on the results based on the information provided by the libraries 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
External Libraries 
Library Country Selected Tools 
Wielkopolska Biblioteka Cyfrowa Poland - Scan Tailor 
- JHOVE2 
- Image Magick 
General Historical Library of 
Salamanca 
Spain - Gimp 
- Omnipage 
Wroclaw University Library Poland - Scan Tailor 
- Tesseract OCR 
University Library of Bratislava Slovak 
Republic 
- Scan Tailor 
- ImageMagick 
National Library of Finland Finland - Newspaper segmentation 
- Korrektor 
- Document Deskewer 
Library of the University of Granada Spain - Scan Tailor 
- Alchemy API 
University Library of Leuven Belgium - Abbyy FRE 
- NERT 
University Library of Antwerp Belgium - NE Attestation tool, 
- NLTK (NE), 
- Stanford (NE) 
University Library of Darmstadt Germany - Newspaper segmentation 
- Korrektor 
- Document Deskewer 
Internal Libraries 
Library Country Selected Tools 
Biblioteca Virtual Miguel de Cervantes Spain - Abbyy FRE 
- Geometric correction: Page Curl 
- COBaLT 
- Lexicon as Webservice 
Bibliotèque nationale de France France - DBPedia Spotlight 
- Evaluation Tool for OCR 
- Lexicon as Webservice 
Koninklijke Bibliotheek Netherlands - Lexicon as Webservice 
- NLTK 
- NERT 
The British Library United 
Kingdom 
- Evaluation Tool for OCR 
- Stanford (NE) 
- Lexicon as Webservice 
Take-up support 
Summary of outcomes 
 Involved 9 external libraries in the 
project to perform tool evaluation, 
each of them committed to evaluate at 
least 2 tools 
 Collected libraries’ digitisation 
requirements 
 Consulted libraries in defining 
interesting use cases for evaluation 
 Provided remote assistance for the 
take-up of tools selected by the 
libraries 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Take-up support 
Remote assistance for technical support: Assistance for the integration and adaptation of the tools to specific domains, languages and use cases 
Implementation studies (final report): Elaboration of blueprint on validation and take-up process for tools and resources 
Case studies from the implementation experiences produced 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
4: TRAINING
Training 
Brief description and goals 
Produce documentation and training material for the tools to be validated. They must help the participating libraries to take-up the tools in their productive environment. 
Provide training on specific tools to external stakeholders. 
Organise on-site training workshops depending on libraries requirements 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Training 
Methodology used to achieve the objectives 
1.Document structure of training material 
2.Tool selection by libraries 
3.Distribution of Work: WP 3 partners according to expertise and knowledge with the selected tools 
4.Creation and compilation of training material 
5.Distribution of training material to participating libraries 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Training 
Summary of outcomes 
Prepared training materials for 19 tools (separate document, online SCORM + DigitWiki) 
Organized TPDL tutorial attracting experts from digital libraries from around the world 
Participation in hackathons 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
5. CONCLUSIONS
Conclusions 
Evaluation work of each participating library > Presentations! 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Conclusions 
Blueprint for evaluation General recommendations for evaluation by libraries: 
a.Translate requirements into detailed use case (including detailed description of data + data format) 
b.Acquire or produce test data 
c.Determine tools 
d.Produce work plan 
e.Verify use case with internal and external experts (Tool providers, CoC) If no test data can be produced, adapt use case If plan breaks down in too many steps, adapt use case If necessary, change tool selection 
f.Documentation of the evaluation (evaluation forms) 
g.Use experienced technical staff 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Conclusions 
Blueprint for evaluation General recommendations for tool providers: 
a.Provide a clear description of the purpose of the tool 
b.Provide a clear description of the formats the tool can handle 
c.Provide a clear description of the type of material the tool can handle with reasonable results; provide information on performance where possible 
d.Provide a clear step by step description of the complete procedure that should be followed to get the best possible result, including training and tuning of parameters. 
e.Provide compact documentation if possible 
f.Minimize interdependency of parts of documentation 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
Thank you! 
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.

Contenu connexe

En vedette

University library of KU Leuven - Sam Alloing et Demmy Verbecke
University library of KU Leuven - Sam Alloing et Demmy VerbeckeUniversity library of KU Leuven - Sam Alloing et Demmy Verbecke
University library of KU Leuven - Sam Alloing et Demmy VerbeckeIMPACT Centre of Competence
 
IMPACT Final Event 26-06-2012 - The IMPACT Centre of Competence by Rafael Car...
IMPACT Final Event 26-06-2012 - The IMPACT Centre of Competence by Rafael Car...IMPACT Final Event 26-06-2012 - The IMPACT Centre of Competence by Rafael Car...
IMPACT Final Event 26-06-2012 - The IMPACT Centre of Competence by Rafael Car...IMPACT Centre of Competence
 
Impact Centre of Competence presentation at CERL 2014 by Tomasz Parkola (PSNC)
Impact Centre of Competence presentation at CERL 2014 by Tomasz Parkola (PSNC)Impact Centre of Competence presentation at CERL 2014 by Tomasz Parkola (PSNC)
Impact Centre of Competence presentation at CERL 2014 by Tomasz Parkola (PSNC)IMPACT Centre of Competence
 
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...IMPACT Centre of Competence
 
Datech2014 - Session 2 - An approach to Unsupervised Historical Text Normalis...
Datech2014 - Session 2 - An approach to Unsupervised Historical Text Normalis...Datech2014 - Session 2 - An approach to Unsupervised Historical Text Normalis...
Datech2014 - Session 2 - An approach to Unsupervised Historical Text Normalis...IMPACT Centre of Competence
 
Datech2014-Session1-Document Representation Refinement for Precise Region Des...
Datech2014-Session1-Document Representation Refinement for Precise Region Des...Datech2014-Session1-Document Representation Refinement for Precise Region Des...
Datech2014-Session1-Document Representation Refinement for Precise Region Des...IMPACT Centre of Competence
 
Datech2014 - Session 4 - Construction of Text Digitization System for Nôm His...
Datech2014 - Session 4 - Construction of Text Digitization System for Nôm His...Datech2014 - Session 4 - Construction of Text Digitization System for Nôm His...
Datech2014 - Session 4 - Construction of Text Digitization System for Nôm His...IMPACT Centre of Competence
 
Impact centre of_competence_for_workshop_ocr_rouen_march_2011[1]
Impact centre of_competence_for_workshop_ocr_rouen_march_2011[1]Impact centre of_competence_for_workshop_ocr_rouen_march_2011[1]
Impact centre of_competence_for_workshop_ocr_rouen_march_2011[1]IMPACT Centre of Competence
 
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...IMPACT Centre of Competence
 
IMPACT Final Event 26-06-2012 - Automated metadata extraction from title page...
IMPACT Final Event 26-06-2012 - Automated metadata extraction from title page...IMPACT Final Event 26-06-2012 - Automated metadata extraction from title page...
IMPACT Final Event 26-06-2012 - Automated metadata extraction from title page...IMPACT Centre of Competence
 
Succeed final conference - The interoperability of digitisation platforms
Succeed final conference - The interoperability of digitisation platformsSucceed final conference - The interoperability of digitisation platforms
Succeed final conference - The interoperability of digitisation platformsIMPACT Centre of Competence
 
IMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Centre of Competence
 
6. Digital Humanities Innovation Lab (LINHD). Clara Martínez Cantón
6.  Digital Humanities Innovation Lab (LINHD). Clara Martínez Cantón6.  Digital Humanities Innovation Lab (LINHD). Clara Martínez Cantón
6. Digital Humanities Innovation Lab (LINHD). Clara Martínez CantónIMPACT Centre of Competence
 

En vedette (18)

University library of KU Leuven - Sam Alloing et Demmy Verbecke
University library of KU Leuven - Sam Alloing et Demmy VerbeckeUniversity library of KU Leuven - Sam Alloing et Demmy Verbecke
University library of KU Leuven - Sam Alloing et Demmy Verbecke
 
IMPACT Final Event 26-06-2012 - The IMPACT Centre of Competence by Rafael Car...
IMPACT Final Event 26-06-2012 - The IMPACT Centre of Competence by Rafael Car...IMPACT Final Event 26-06-2012 - The IMPACT Centre of Competence by Rafael Car...
IMPACT Final Event 26-06-2012 - The IMPACT Centre of Competence by Rafael Car...
 
Impact Centre of Competence presentation at CERL 2014 by Tomasz Parkola (PSNC)
Impact Centre of Competence presentation at CERL 2014 by Tomasz Parkola (PSNC)Impact Centre of Competence presentation at CERL 2014 by Tomasz Parkola (PSNC)
Impact Centre of Competence presentation at CERL 2014 by Tomasz Parkola (PSNC)
 
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
 
Datech2014 - Session 2 - An approach to Unsupervised Historical Text Normalis...
Datech2014 - Session 2 - An approach to Unsupervised Historical Text Normalis...Datech2014 - Session 2 - An approach to Unsupervised Historical Text Normalis...
Datech2014 - Session 2 - An approach to Unsupervised Historical Text Normalis...
 
Datech2014-Session1-Document Representation Refinement for Precise Region Des...
Datech2014-Session1-Document Representation Refinement for Precise Region Des...Datech2014-Session1-Document Representation Refinement for Precise Region Des...
Datech2014-Session1-Document Representation Refinement for Precise Region Des...
 
Image Enhancement tools by Lotte Wilms
Image Enhancement tools by Lotte WilmsImage Enhancement tools by Lotte Wilms
Image Enhancement tools by Lotte Wilms
 
Datech2014 - Session 4 - Construction of Text Digitization System for Nôm His...
Datech2014 - Session 4 - Construction of Text Digitization System for Nôm His...Datech2014 - Session 4 - Construction of Text Digitization System for Nôm His...
Datech2014 - Session 4 - Construction of Text Digitization System for Nôm His...
 
Impact centre of_competence_for_workshop_ocr_rouen_march_2011[1]
Impact centre of_competence_for_workshop_ocr_rouen_march_2011[1]Impact centre of_competence_for_workshop_ocr_rouen_march_2011[1]
Impact centre of_competence_for_workshop_ocr_rouen_march_2011[1]
 
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
Datech2014 - Session 5 - Wittgenstein’s Nachlass: WiTTFind and Wittgenstein A...
 
National library of the netherlands judith rog
National library of the netherlands   judith rogNational library of the netherlands   judith rog
National library of the netherlands judith rog
 
IMPACT Final Event 26-06-2012 - Automated metadata extraction from title page...
IMPACT Final Event 26-06-2012 - Automated metadata extraction from title page...IMPACT Final Event 26-06-2012 - Automated metadata extraction from title page...
IMPACT Final Event 26-06-2012 - Automated metadata extraction from title page...
 
Succeed final conference - The interoperability of digitisation platforms
Succeed final conference - The interoperability of digitisation platformsSucceed final conference - The interoperability of digitisation platforms
Succeed final conference - The interoperability of digitisation platforms
 
IMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens NeudeckerIMPACT Interoperability Framework - Clemens Neudecker
IMPACT Interoperability Framework - Clemens Neudecker
 
6. Digital Humanities Innovation Lab (LINHD). Clara Martínez Cantón
6.  Digital Humanities Innovation Lab (LINHD). Clara Martínez Cantón6.  Digital Humanities Innovation Lab (LINHD). Clara Martínez Cantón
6. Digital Humanities Innovation Lab (LINHD). Clara Martínez Cantón
 
UA - GT Aligner - ICoC
UA - GT Aligner - ICoCUA - GT Aligner - ICoC
UA - GT Aligner - ICoC
 
Libnova ICoC
Libnova ICoCLibnova ICoC
Libnova ICoC
 
BVC - Semantic Web - ICoC
BVC - Semantic Web - ICoCBVC - Semantic Web - ICoC
BVC - Semantic Web - ICoC
 

Similaire à Succeed Validation and Take up of Tools - Katrien Depuydt

Presentation of the paper “A survey of resources for introducing coding into ...
Presentation of the paper “A survey of resources for introducing coding into ...Presentation of the paper “A survey of resources for introducing coding into ...
Presentation of the paper “A survey of resources for introducing coding into ...Grial - University of Salamanca
 
A survey of resources for introducing coding into schools
A survey of resources for introducing coding into schoolsA survey of resources for introducing coding into schools
A survey of resources for introducing coding into schoolsGrial - University of Salamanca
 
The Sci-GaIA project
The Sci-GaIA projectThe Sci-GaIA project
The Sci-GaIA projectBruce Becker
 
Digital Curator Vocational Education Europe: Project Objectives
Digital Curator Vocational Education Europe: Project ObjectivesDigital Curator Vocational Education Europe: Project Objectives
Digital Curator Vocational Education Europe: Project ObjectivesDigCurV
 
Experimental Workflow Development in Digitisation
Experimental Workflow Development in DigitisationExperimental Workflow Development in Digitisation
Experimental Workflow Development in Digitisationcneudecker
 
Education success factors
Education success factorsEducation success factors
Education success factorsManuel Canabal
 
Ontological Infrastructure for Interoperable Research Information Systems: HE...
Ontological Infrastructure for Interoperable Research Information Systems: HE...Ontological Infrastructure for Interoperable Research Information Systems: HE...
Ontological Infrastructure for Interoperable Research Information Systems: HE...Diego López-de-Ipiña González-de-Artaza
 
Pipers outline and overview 2014_generic
Pipers outline and overview 2014_genericPipers outline and overview 2014_generic
Pipers outline and overview 2014_genericNikolay Stoyanov
 
Intelligent tools-mitja-jermol-2013-bali-7 may2013
Intelligent tools-mitja-jermol-2013-bali-7 may2013Intelligent tools-mitja-jermol-2013-bali-7 may2013
Intelligent tools-mitja-jermol-2013-bali-7 may2013MediaMixerCommunity
 
Opening session - CAPFITOGEN Programme introduction
Opening session - CAPFITOGEN Programme introductionOpening session - CAPFITOGEN Programme introduction
Opening session - CAPFITOGEN Programme introductionMauricio Parra Quijano
 
Project Overview_MINKE
Project Overview_MINKEProject Overview_MINKE
Project Overview_MINKEMinkeProject
 
European Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 AnnouncementEuropean Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 AnnouncementIvica Crnkovic
 
Subject-specific International Accreditation for Technical Profiles
Subject-specific International  Accreditation for Technical ProfilesSubject-specific International  Accreditation for Technical Profiles
Subject-specific International Accreditation for Technical ProfilesLvivPolytechnic
 
TAROT summerschool slides 2013 - Italy
TAROT summerschool slides 2013 - ItalyTAROT summerschool slides 2013 - Italy
TAROT summerschool slides 2013 - ItalyTanja Vos
 
ICT and Education Lessons Learned from the LLP Peter Birch EDEN conference
ICT and Education Lessons Learned from the LLP Peter Birch EDEN conferenceICT and Education Lessons Learned from the LLP Peter Birch EDEN conference
ICT and Education Lessons Learned from the LLP Peter Birch EDEN conferencePeter Birch
 

Similaire à Succeed Validation and Take up of Tools - Katrien Depuydt (20)

Presentation of the paper “A survey of resources for introducing coding into ...
Presentation of the paper “A survey of resources for introducing coding into ...Presentation of the paper “A survey of resources for introducing coding into ...
Presentation of the paper “A survey of resources for introducing coding into ...
 
A survey of resources for introducing coding into schools
A survey of resources for introducing coding into schoolsA survey of resources for introducing coding into schools
A survey of resources for introducing coding into schools
 
A survey of resources for introducing coding into schools
A survey of resources for introducing coding into schoolsA survey of resources for introducing coding into schools
A survey of resources for introducing coding into schools
 
The Sci-GaIA project
The Sci-GaIA projectThe Sci-GaIA project
The Sci-GaIA project
 
SpeakApps presentation
SpeakApps  presentationSpeakApps  presentation
SpeakApps presentation
 
Caenti Huelva2007 Wp6 Catalyse Content
Caenti Huelva2007 Wp6 Catalyse ContentCaenti Huelva2007 Wp6 Catalyse Content
Caenti Huelva2007 Wp6 Catalyse Content
 
Digital Curator Vocational Education Europe: Project Objectives
Digital Curator Vocational Education Europe: Project ObjectivesDigital Curator Vocational Education Europe: Project Objectives
Digital Curator Vocational Education Europe: Project Objectives
 
Experimental Workflow Development in Digitisation
Experimental Workflow Development in DigitisationExperimental Workflow Development in Digitisation
Experimental Workflow Development in Digitisation
 
Education success factors
Education success factorsEducation success factors
Education success factors
 
Ontological Infrastructure for Interoperable Research Information Systems: HE...
Ontological Infrastructure for Interoperable Research Information Systems: HE...Ontological Infrastructure for Interoperable Research Information Systems: HE...
Ontological Infrastructure for Interoperable Research Information Systems: HE...
 
Pipers outline and overview 2014_generic
Pipers outline and overview 2014_genericPipers outline and overview 2014_generic
Pipers outline and overview 2014_generic
 
Intelligent tools-mitja-jermol-2013-bali-7 may2013
Intelligent tools-mitja-jermol-2013-bali-7 may2013Intelligent tools-mitja-jermol-2013-bali-7 may2013
Intelligent tools-mitja-jermol-2013-bali-7 may2013
 
Opening session - CAPFITOGEN Programme introduction
Opening session - CAPFITOGEN Programme introductionOpening session - CAPFITOGEN Programme introduction
Opening session - CAPFITOGEN Programme introduction
 
Project Overview_MINKE
Project Overview_MINKEProject Overview_MINKE
Project Overview_MINKE
 
European Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 AnnouncementEuropean Conference on Software Architecture - ECSA 2015 Announcement
European Conference on Software Architecture - ECSA 2015 Announcement
 
Subject-specific International Accreditation for Technical Profiles
Subject-specific International  Accreditation for Technical ProfilesSubject-specific International  Accreditation for Technical Profiles
Subject-specific International Accreditation for Technical Profiles
 
VALEU-X Expert Forum
VALEU-X Expert ForumVALEU-X Expert Forum
VALEU-X Expert Forum
 
TAROT summerschool slides 2013 - Italy
TAROT summerschool slides 2013 - ItalyTAROT summerschool slides 2013 - Italy
TAROT summerschool slides 2013 - Italy
 
Presentation of the TACCLE3 Coding European Project
Presentation of the TACCLE3 Coding European ProjectPresentation of the TACCLE3 Coding European Project
Presentation of the TACCLE3 Coding European Project
 
ICT and Education Lessons Learned from the LLP Peter Birch EDEN conference
ICT and Education Lessons Learned from the LLP Peter Birch EDEN conferenceICT and Education Lessons Learned from the LLP Peter Birch EDEN conference
ICT and Education Lessons Learned from the LLP Peter Birch EDEN conference
 

Plus de IMPACT Centre of Competence

Plus de IMPACT Centre of Competence (20)

Session6 01.helmut schmid
Session6 01.helmut schmidSession6 01.helmut schmid
Session6 01.helmut schmid
 
Session1 03.hsian-an wang
Session1 03.hsian-an wangSession1 03.hsian-an wang
Session1 03.hsian-an wang
 
Session7 03.katrien depuydt
Session7 03.katrien depuydtSession7 03.katrien depuydt
Session7 03.katrien depuydt
 
Session7 02.peter kiraly
Session7 02.peter kiralySession7 02.peter kiraly
Session7 02.peter kiraly
 
Session6 04.giuseppe celano
Session6 04.giuseppe celanoSession6 04.giuseppe celano
Session6 04.giuseppe celano
 
Session6 03.sandra young
Session6 03.sandra youngSession6 03.sandra young
Session6 03.sandra young
 
Session6 02.jeremi ochab
Session6 02.jeremi ochabSession6 02.jeremi ochab
Session6 02.jeremi ochab
 
Session5 04.evangelos varthis
Session5 04.evangelos varthisSession5 04.evangelos varthis
Session5 04.evangelos varthis
 
Session5 03.george rehm
Session5 03.george rehmSession5 03.george rehm
Session5 03.george rehm
 
Session5 02.tom derrick
Session5 02.tom derrickSession5 02.tom derrick
Session5 02.tom derrick
 
Session5 01.rutger vankoert
Session5 01.rutger vankoertSession5 01.rutger vankoert
Session5 01.rutger vankoert
 
Session4 04.senka drobac
Session4 04.senka drobacSession4 04.senka drobac
Session4 04.senka drobac
 
Session3 04.arnau baro
Session3 04.arnau baroSession3 04.arnau baro
Session3 04.arnau baro
 
Session3 03.christian clausner
Session3 03.christian clausnerSession3 03.christian clausner
Session3 03.christian clausner
 
Session3 02.kimmo ketunnen
Session3 02.kimmo ketunnenSession3 02.kimmo ketunnen
Session3 02.kimmo ketunnen
 
Session3 01.clemens neudecker
Session3 01.clemens neudeckerSession3 01.clemens neudecker
Session3 01.clemens neudecker
 
Session2 04.ashkan ashkpour
Session2 04.ashkan ashkpourSession2 04.ashkan ashkpour
Session2 04.ashkan ashkpour
 
Session2 03.juri opitz
Session2 03.juri opitzSession2 03.juri opitz
Session2 03.juri opitz
 
Session2 02.christian reul
Session2 02.christian reulSession2 02.christian reul
Session2 02.christian reul
 
Session2 01.emad mohamed
Session2 01.emad mohamedSession2 01.emad mohamed
Session2 01.emad mohamed
 

Dernier

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 

Dernier (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

Succeed Validation and Take up of Tools - Katrien Depuydt

  • 1. Succeed WP3 – Validation and take-up of tools Katrien Depuydt (INL) –Stefan Eickeler, Sebastian Kirch, (IAIS) Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 2. Objectives Many tools and linguistic resources were developed in research and development programs supporting the digitisation of cultural heritage Still, too few are used in the productive environments Succeed’s approach to support the take-up of these tools: 1.Identify existing tools and resources 2.Identify libraries willing to use and evaluate tools 3.Define criteria to validate and evaluate tools 4.Provide training material for tools 5.Provide support to libraries using and evaluating tools 6.Blueprint for validation and take-up of tools Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 3. Survey of tools Training material Evaluation
  • 4. 1. SURVEY AND SELECTION OF TOOLS
  • 5. Survey of tools Brief description and goals Produce a survey of existing tools ground truth data and lexicon data for digitisation Select candidate tools for implementation at cultural heritage institutions Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 6. Survey of tools Methodology used to achieve the objectives 1.Taxonomy for categorisation based on a simplified digitisation workflow 2.Definition of attributes e.g. how a tool can be used in the digitisation process 3.Online Spreadsheet to collect and organise tools 4.Assessment and further selection into a shortlist of tools Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 7. Selection of tools First selection: knock-out criteria (three steps) Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante. Further selection: (expertise partners)
  • 8. Task 1 Survey of tools Summary of outcomes Categorised list of 213 research and commercial tools Available in an online database and frequently updated Shortlist with the most relevant tools based on a quality assessment An overview of existing ground truth material and lexicon data has been produced. http://impact.dlsi.ua.es/digitisation/tools-resources/tools-for-text-digitisation/ Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 10. 1st Project Review – WP3 Validation parameters Brief description and goals Define validation parameters and procedures for the implementation of tools in productive environments (per task carried out by using a tool) Validate each tool (or group of tools) based on these criteria Work out evaluation work plans and test scenarios in cooperation with libraries based on their requirements Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 11. Validation parameters Methodology used to achieve the objectives 1.Definition of evaluation template structure 2.Tool selection by libraries 3.Creation and compilation of evaluation material Separate evaluation forms per task/tool type & common usability evaluation form 4.Distribution of evaluation material to participating libraries Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 12. 1st Project Review – WP3 Validation parameters Summary of outcomes Described evaluation procedures and produced 9 evaluation forms per task Worked out evaluation and test scenarios as a “work plan” together with the participating libraries Blueprint for take-up and validation Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 14. Take-up support Brief description and goals Support the integration, take-up and validation of digitisation tools and resources Tool implementation at four participant libraries and nine external libraries (16 potential external libraries at the start of the project > 9 retained) Assistance for the adaptation/application of the tools to specific domains and/or languages Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 15. Take-up support Methodology used to achieve the objectives 1.Each library installs, on average, two tools and tests their performance and usability in a productive environment according to the predifined validation criteria 2.Some consortium libraries will test existing linguistic resources for enhancement of textual information retrieval 3.The technical partners (IAIS, INL, PSNC, UA) will provide online assistance for the adaptation of the tools to specific domains and languages 4.The technical partners will report on the results based on the information provided by the libraries Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 16. External Libraries Library Country Selected Tools Wielkopolska Biblioteka Cyfrowa Poland - Scan Tailor - JHOVE2 - Image Magick General Historical Library of Salamanca Spain - Gimp - Omnipage Wroclaw University Library Poland - Scan Tailor - Tesseract OCR University Library of Bratislava Slovak Republic - Scan Tailor - ImageMagick National Library of Finland Finland - Newspaper segmentation - Korrektor - Document Deskewer Library of the University of Granada Spain - Scan Tailor - Alchemy API University Library of Leuven Belgium - Abbyy FRE - NERT University Library of Antwerp Belgium - NE Attestation tool, - NLTK (NE), - Stanford (NE) University Library of Darmstadt Germany - Newspaper segmentation - Korrektor - Document Deskewer Internal Libraries Library Country Selected Tools Biblioteca Virtual Miguel de Cervantes Spain - Abbyy FRE - Geometric correction: Page Curl - COBaLT - Lexicon as Webservice Bibliotèque nationale de France France - DBPedia Spotlight - Evaluation Tool for OCR - Lexicon as Webservice Koninklijke Bibliotheek Netherlands - Lexicon as Webservice - NLTK - NERT The British Library United Kingdom - Evaluation Tool for OCR - Stanford (NE) - Lexicon as Webservice Take-up support Summary of outcomes  Involved 9 external libraries in the project to perform tool evaluation, each of them committed to evaluate at least 2 tools  Collected libraries’ digitisation requirements  Consulted libraries in defining interesting use cases for evaluation  Provided remote assistance for the take-up of tools selected by the libraries Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 17. Take-up support Remote assistance for technical support: Assistance for the integration and adaptation of the tools to specific domains, languages and use cases Implementation studies (final report): Elaboration of blueprint on validation and take-up process for tools and resources Case studies from the implementation experiences produced Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 19. Training Brief description and goals Produce documentation and training material for the tools to be validated. They must help the participating libraries to take-up the tools in their productive environment. Provide training on specific tools to external stakeholders. Organise on-site training workshops depending on libraries requirements Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 20. Training Methodology used to achieve the objectives 1.Document structure of training material 2.Tool selection by libraries 3.Distribution of Work: WP 3 partners according to expertise and knowledge with the selected tools 4.Creation and compilation of training material 5.Distribution of training material to participating libraries Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 21. Training Summary of outcomes Prepared training materials for 19 tools (separate document, online SCORM + DigitWiki) Organized TPDL tutorial attracting experts from digital libraries from around the world Participation in hackathons Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 23. Conclusions Evaluation work of each participating library > Presentations! Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 24. Conclusions Blueprint for evaluation General recommendations for evaluation by libraries: a.Translate requirements into detailed use case (including detailed description of data + data format) b.Acquire or produce test data c.Determine tools d.Produce work plan e.Verify use case with internal and external experts (Tool providers, CoC) If no test data can be produced, adapt use case If plan breaks down in too many steps, adapt use case If necessary, change tool selection f.Documentation of the evaluation (evaluation forms) g.Use experienced technical staff Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 25. Conclusions Blueprint for evaluation General recommendations for tool providers: a.Provide a clear description of the purpose of the tool b.Provide a clear description of the formats the tool can handle c.Provide a clear description of the type of material the tool can handle with reasonable results; provide information on performance where possible d.Provide a clear step by step description of the complete procedure that should be followed to get the best possible result, including training and tuning of parameters. e.Provide compact documentation if possible f.Minimize interdependency of parts of documentation Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
  • 26. Thank you! Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.