Europe’s Beginnings through the Looking Glass: Publishing Historical Documents on the Web Using EVT
Roberto Rosselli Del Turco - Università di Torino, roberto.rossellidelturco@unito.it
Chiara Di Pietro - Università di Pisa, dipi.chiara@gmail.com
Raffaele Masotti - Università di Pisa, raffaele.masotti@gmail.com
Florentina Armaselu - CVCE, florentina.armaselu@cvce.eu
Lars Wieneke - CVCE, lars.wieneke@cvce.eu
www.cvce.eu
5. Overview of the WEU-DIPLO project
1. Goal: XML-TEI encoding, corpus analysis and Web publication of institutional documents of the W.E.U. (Western European Union):
• Topics: armament production, standardization and control in the period from 1954 to 1982;
• Source: Archives nationales de Luxembourg, W.E.U. collection.
2. Initial format:
• digitized versions (JPEG) of typewritten materials (one file per page).
3. Size:
Category          Documents                         Pages
                  Total   EN   FR   FR proc.*       Total    EN    FR   FR proc.*
Note                 89   43   46          37         395   191   204         155
Minutes              30   15   15          15         256   138   118         118
Memorandum            3    1    2           2          16     7     9           9
Study                 2    0    2           1          12     0    12           8
Discourse             1    0    1           0           4     0     4           0
Draft protocol        2    1    1           0           4     2     2           0
Total               127   60   67          55         687   338   349         290
*proc. = processed
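As a quick arithmetic check on the size table, the short sketch below recomputes the column totals and derives the share of the French material that has been processed. The figures are copied from the table; the variable and function names are illustrative only and are not part of the project.

```python
# Figures copied from the size table.
# Each entry: (documents, pages), each as (total, EN, FR, FR processed).
ROWS = {
    "Note":           ((89, 43, 46, 37), (395, 191, 204, 155)),
    "Minutes":        ((30, 15, 15, 15), (256, 138, 118, 118)),
    "Memorandum":     ((3, 1, 2, 2),     (16, 7, 9, 9)),
    "Study":          ((2, 0, 2, 1),     (12, 0, 12, 8)),
    "Discourse":      ((1, 0, 1, 0),     (4, 0, 4, 0)),
    "Draft protocol": ((2, 1, 1, 0),     (4, 2, 2, 0)),
}


def column_totals(index):
    """Sum the four columns for documents (index 0) or pages (index 1)."""
    return tuple(sum(row[index][i] for row in ROWS.values()) for i in range(4))


def share(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


doc_totals = column_totals(0)
page_totals = column_totals(1)

# Share of French documents and pages already processed.
fr_docs_processed = share(doc_totals[3], doc_totals[2])
fr_pages_processed = share(page_totals[3], page_totals[2])

print(doc_totals, page_totals)
print(fr_docs_processed, fr_pages_processed)
```

The recomputed totals match the "Total" row, and roughly four fifths of the French documents and pages had been processed.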
6. Overview of the WEU-DIPLO project: workflow
13. EVT experiments
(Partial) customisation:
• General layout: folder structure, image renaming.
• EVT Transformer: builder pack (XSLT)
o added/modified templates for transforming specific patterns (headers, footers, paragraphs); layout not fully supported (e.g. sections, subsections, paragraph indentation).
• EVT Viewer: CSS
o added/modified statements to support the visualisation of specific patterns in the browser (alignment, text decoration, colour of headers and footers, etc.).
• Manual modification
o XML-TEI input: page breaks linked to the facsimile images;
o transformation output: HTML output changed to support particular features (Text-Link, HotSpot); this should not occur in the real workflow.
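To illustrate what "page breaks linked to the facsimile images" can look like in practice, here is a minimal sketch that reads the page breaks of a TEI document and lists the image each one points to. It assumes the link is expressed with the standard TEI facs attribute on pb elements; the slides do not say which mechanism the project actually used, and the sample document and file names are hypothetical.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"

# Hypothetical sample: two pages, each <pb/> pointing at a page image.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text>
    <body>
      <pb n="1" facs="images/doc001_p001.jpg"/>
      <p>First page of the transcription.</p>
      <pb n="2" facs="images/doc001_p002.jpg"/>
      <p>Second page of the transcription.</p>
    </body>
  </text>
</TEI>"""


def page_image_links(tei_xml):
    """Return (page number, image path) pairs for every <pb/> in document order."""
    root = ET.fromstring(tei_xml)
    return [(pb.get("n"), pb.get("facs")) for pb in root.iter(f"{{{TEI_NS}}}pb")]


def pages_without_image(tei_xml):
    """Return the page numbers whose <pb/> has no facs attribute."""
    return [n for n, facs in page_image_links(tei_xml) if not facs]


for number, image in page_image_links(SAMPLE):
    print(number, image)
```

A check such as pages_without_image could be run before the transformation step, so that missing links are found automatically rather than by inspecting the published edition.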
15. EVT adaptation – towards a TEI-based publication framework – types of documents/features
1. Goal:
• publishing different types of documents on European integration history on the CVCE’s Web site.
2. Types of documents (for most of them, high-quality multilingual transcriptions are available in TXT, RTF and SRT formats):
• treaties;
• administrative documents (minutes, notes, memoranda);
• press articles;
• handwritten notes;
• letters;
• video and audio archives.
3. Types of features to be implemented (r = required, o = optional):
• side-by-side facsimile/transcription (replicating the original with more or less fidelity) (r);
• multipanel alignment (r);
• text-image link (o);
• zooming (r);
• HotSpot (o), etc.
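The required/optional distinction in the feature list can be captured in a small data structure. The sketch below is only an illustration of that idea: the feature names come from the slide, while the Feature class and the enabled_features function are hypothetical and do not describe EVT's or the CVCE's actual configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    name: str
    required: bool  # True for (r), False for (o)


# Feature names and (r)/(o) flags as listed on the slide.
FEATURES = [
    Feature("side-by-side facsimile/transcription", True),
    Feature("multipanel alignment", True),
    Feature("text-image link", False),
    Feature("zooming", True),
    Feature("HotSpot", False),
]


def enabled_features(selected_optional=()):
    """Return all required features plus the optional ones a project selects."""
    wanted = set(selected_optional)
    optional_names = {f.name for f in FEATURES if not f.required}
    unknown = wanted - optional_names
    if unknown:
        raise ValueError(f"Not an optional feature: {sorted(unknown)}")
    return [f.name for f in FEATURES if f.required or f.name in wanted]


print(enabled_features())
print(enabled_features(["HotSpot"]))
```

Under this reading, a project that selects nothing still gets the three required features, and optional ones are added per edition.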
16. EVT adaptation – towards a TEI-based publication framework – manuscript note (Werner corpus)
18. EVT adaptation – towards a TEI-based publication framework – architecture, workflow
[Figures: General architecture; General workflow]
19. Future work
1. Identification of features to be implemented in the digital editions:
• visualisation;
• search.
2. Publication framework design:
• core / plugin;
• optional / project specific.
3. Implementation of the module for XML-TEI conversion (potential adaptation of OxGarage for batch processing).
4. Implementation/integration into the existing CVCE architecture:
• Back End;
• Front End.
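The XML-TEI conversion module is described only as future work, so the following is a rough sketch of what batch conversion from plain-text transcriptions might involve, not the planned implementation. It does not use OxGarage; it builds a minimal TEI document with Python's standard library, treating blank lines as paragraph boundaries. The function names and the header wording are assumptions made for the example.

```python
import re
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)  # serialise TEI as the default namespace


def _el(parent, tag, text=None):
    """Append a TEI-namespaced child element and return it."""
    child = ET.SubElement(parent, f"{{{TEI_NS}}}{tag}")
    child.text = text
    return child


def text_to_tei(title, plain_text):
    """Wrap a plain-text transcription in a minimal TEI document."""
    root = ET.Element(f"{{{TEI_NS}}}TEI")
    file_desc = _el(_el(root, "teiHeader"), "fileDesc")
    _el(_el(file_desc, "titleStmt"), "title", title)
    _el(_el(file_desc, "publicationStmt"), "p", "Unpublished draft.")
    _el(_el(file_desc, "sourceDesc"), "p", "Converted from a plain-text transcription.")
    body = _el(_el(root, "text"), "body")
    # Blank lines separate paragraphs; line wraps inside a paragraph are joined.
    for block in re.split(r"\n\s*\n", plain_text):
        paragraph = " ".join(block.split())
        if paragraph:
            _el(body, "p", paragraph)
    return ET.tostring(root, encoding="unicode")


def convert_batch(documents):
    """Convert a mapping of {name: plain text} into {name: TEI string}."""
    return {name: text_to_tei(name, text) for name, text in documents.items()}


result = convert_batch({"note-001": "First paragraph\nwrapped.\n\nSecond paragraph."})
print(result["note-001"])
```

A real module would also need to handle the RTF and SRT inputs mentioned earlier and richer structure such as headers, footers and page breaks, which this sketch leaves out.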
20. Conclusion
EVT framework:
• flexible enough to support different types of documents in European integration history;
• possibility to compare original and transcription (of interest for researchers in European integration studies);
• different degrees of fidelity to the original can be envisaged (balancing manual and automatic processing).
EVT adaptation:
• minimise the amount of manual intervention in the XML-TEI documents;
• publication framework with a modular architecture to allow gradual development and customisation according to the needs of the projects.