SlideShare une entreprise Scribd logo
1  sur  68
INTERDISCIPLINARY CHALLENGE USING TEI FOR
BUILDING MULTILINGUAL DIGITAL CORPORA
A MAGHREB DH CASE STUDY
Mokhtar Ben Henda
Digital Humanities Institute – Beirut
American University of Beirut
2-6 March 2015
The general scope
 Creating a DH community of practice in
the Maghreb region
 Treat multilingual and interdisciplinary
resources
 Acquire new research and editing habits with
SH digital resources
 Using TEI as technical solution
 For its collegiate dynamics, open to non-
Western linguistic diversity
 For its accuracy to maintain, enrich and
document open guidelines on diversity
 For its ability to ensure the interoperability of
all produced resources
Delimiters
Multilingualism as a core question
Hybridization of
oral and written languages
in the Maghreb
Languages hybridization
 Use of classical or literary (scholar) Arabic in formal contexts :
 Administration, politics, media, education, religion…
 Dialects in daily life (trade, traditions, …)
 “Mother” tongue, “native” language ?
 Use of Arabic dialects:
 Morocco : Haḍarī (Fez, Rabat), Bedui (Marrakech), ‘Arūbī (west)
 Algeria: wahrani (West), Ksantini (East), Dziri (Center)
 Tunsia: Beldi (Tunis), Sahli (Center), Bedui (South)
 Ibn Žaldūn, al-Muqaddima, p.984 : (1377) «... Aboriginal Language
[Algerian] has taken over the Arabic language. The result is a mixed
language, in which Berber dominates and in which North African
dialects are farthest from Classical Arabic ».
Writing hybridization
A calendar page for November 1, 1895
with Ottoman-Turkish, Hebrew, Latin, Arabic scripts
Turkish and
Arabic manuscript
dictionary. 1201
AH [1786/87 AD].
Writing hybridization
Notary records in Arabic and French
(1342 Hg/1923 AD)
Strong claims of Berber as a
national language
Berber languages
 It is estimated that there are between 14 and 25 million speakers of
Berber languages
Tachelhit 3 million Morocco
Tamazight 3 million Morocco
Kabyle 2.5 to 6 million Algeria
Tarifit 1.7 million Morocco
Tachawit 1.4 million Algeria
Tamajaq 640,000 Niger
Tamasheq (Tuareg) 281,000 Mali
Tamajeq 250,000 Niger
Nafusi 167,000 Libya
Tamasheq has five varieties.
Tachelhit has a dialect called Susiua.
Tamazight has two major varieties: Central Altas and Southern Oran.
Tarifit has at least two dialects, one of which may be a separate language
Kabyle has two dialects: Greater and Lesser Kabyle.
Berber writing systems
 Berber languages are written with
Roman, Arabic and Tifinagh alphabets
The female name "Saida" written in tifinagh Berber language poetry in Arabic script
with its translation in French
Berber writing systems
 Kabylian manuscript of the XVIII Century
Berber « Rosetta » stone
Berber « Rosetta stone » from Dugga (Tunisia)
Phoenician and Tifinagh scripts (British Museum)
The Tifinagh alphabet is
thought to have been
derived from the Phoenician
alphabet.
Berber spreading in Maghreb
 60% in Morocco, 50% in Algeria, 2% in Tunisia
Berber recognition as a national language
 Algeria: constitutional amendment (2002)
Berber recognition as a national language
 Morocco « Constitutional amendment, July 2011)
IRCAM
IRCAM
 Tunisia
 New constitution 2014: recognizing minorities without nominating them
 NGOs acting for a constitutional amendment
So what ? … (hypothesis)
 A significant amount of resources in various
languages and writing systems have been
accumulated throughout years
 It does not exist today vast digital corpora of
texts in vernacular idioms and scripts (Unlike
many traditional textual resources : printed
documents and medieval manuscripts),
 But, since constitutional amendments
(Algeria and Morocco), there is a growing
awareness that these resources should be
collected, studied and digitized
A large reservoir of
unrecorded oral
heritage
A large reservoir of
popular music
A large reservoir of
multilingual city
printing
A large amount of
untreated notary
records
A large quantity of
manuscripts to be
digitized
How to subscribe to this challenge ?
 Need for a strategy
 Need for a working frame
 Need for competencies
 Need for data resources
 Need for … incentives !
Incentives
 DH general context in France
 THATCamp Paris 2010 : HD Manifest
 …
 III. Declaration
 5. We, professionals of the digital humanities, are building a
community of practice that is solidary, open, welcoming and
freely accessible.
 6. We are a community without borders. We are a multilingual
and multidisciplinary community.
 7. Our objectives are the advancement of knowledge, the
improvement of research quality in our disciplines, the
enrichment of knowledge and of collective patrimony, in the
academic sphere and beyond it.
 8. We call for the integration of digital culture in the definition
of the general culture of the twenty-first century.
http://tcp.hypotheses.org/411
Incentives
 France: 2014
Incentives
 Research projects experience
 BNFB (Bibliothèque Numérique Franco-
Berbère) : OIF/FFI Project - 2011 (100 000 €)
 Enrich and update, with automated procedures,
records repositories that describe and make
available documents in Berber languages
(Timbuktu manuscripts)
 IDEFI-CréaTIC: Unesco Chair-ITEN (2012) :
Innovative Training (5,2 million €)
 ISO standards:
 E-Learning vocabulary standards (SC36/WG1)
 E-Learning MLR (Metadata for Learning
standard)
 E-learning MLO (Metadata for learning
opportunities) (Afnor/GT1 mirror)
Incentives
 Research background
 Information and Communication Studies
 Multilingual data encoding standards
 Digital data processing
 Research affiliations
 Universities (Algeria, France, Morocco,
Tunisia)
 Research Laboratories
 Professional associations
 International Organizations
Key incentive
 Support from international TEI referees
 Laurent ROMARY : director of DARIAH & member of
the TEI council
 Lu BURNARD: Oxford Univ., godfather of TEI
Guidelines & consultant TGIR Huma-Num, France
Laurent ROMARY
Lu BURNARD
The roadmap
 Building partnerships
 Fund raising
 Building work environment
 Work organization
 Implementation, production and evaluation
Partnerships
 Individuals and institutions
Fund raising
 OIF/FFI (2010-2013)
 100 000 €
 BNFB project
 ISCC/CNRS (2012-2013)
 10 000 €
 HumanitéDigitMaghreb
 LEDEN, MSH Paris-Nord (2014-2015)
 3000 €
 TEI/MEI Euromed
 Chaire Unesco-ITEN (provisional)
 HD-MUREN
Work organization (structuring)
SH Corpus:
Document structures modeling
Poetry
Theatre Stories
Music Manuscript Speech
Fables
Others
Video
Peintings
Work environment
 Preferring
 A data description system ?
 Document centered system
 Document structuring
 to
 A Data base management
system
 Flat scanning
 Referencing metadata
Work environment
 oXygen
Work environment
 oXygen
 oXygen XML Editor supports a large number of text encodings.
 Bidirectional Text Support in Text Editing Mode (Unicode BIDI
Algorithm)
 Editing right-to left
XML documents
 Ability to create and
edit XML content in
Arabic
Work environment
http://www.tei-c.org/Roma/
Work environment
http://www.tei-c.org/oxgarage/
Work environment
 Image Markup Tool
Work environment
 Xaira (XML search engine)
 Explore a corpus language:
 Search words, forms, phrases or XML tags
 Analyze the way how they are used (frequency, stats, forms)
Work environment
 Omeka CMS platform
Work plan (provisional)
 Sharing tasks and responsibilities
 Describing roles
 Training sessions
 Resource analysis
 Structuring (segmentation)
 TEI/XML Encoding
 Collections organization
 Referencing (TeiHeader)
 DC indexing (metadata harvesting)
 Exploitation
 Content research
 Communication
Work organization (sharing)
SH Specialists
XML/TEI technicians
Info-Com. specialists
Arts & SH Corpus
Work organization (structuring)
 SHS SPECIALISTS identify logical structure of documents
according to the genres (Novel, poem, theatre, story, painting,
music, …)
Work organization (encoding)
 XML/TEI TECHNICIANS convert data models prepared by SH
specialists into TEI schemas using specialized software
XML files
Work organization (indexing)
 INFORMATION SPECIALISTS (librarians) add referencing data
(TEIHeader) and description metadata in the Data base
CMS / DB
Some examples (TEIVerse)
 Identify within a poem
 Verse metric
 Different types of verse
 Different verse clusters (verses,
quatrains)
 Patterns of stanzas
 Rhythm
 ...
The granularity may vary between large segments of
text and very tiny character strings
Some examples (TEIVerse)
<text>
<body>
<head>Heureux qui, comme Ulysse, a fait un beau voyage
</head>
<lg>
<l>Heureux qui, comme Ulysse, a fait un beau voyage </l>
<l> Ou comme cestuy-là qui conquit la toison </l>
<l> Et puis est retourné, plein d'usage et raison </l>
<l> Vivre entre ses parents le reste de son âge ! </l>
</lg>
<lg>
<l> Quand reverrai-je, hélas, de mon petit village </l>
<l> Fumer la cheminée, et en quelle saison </l>
<l> Reverrai-je le clos de ma pauvre maison </l>
<l> Qui m'est une province, et beaucoup davantage ?
</lg>
<lg>
<l> Plus me plaît le séjour qu'ont bâti mes aïeux </l>
<l> Que des palais Romains le front audacieux </l>
<l> Plus que le marbre dur me plaît l'ardoise fine : </l>
</lg>
<lg>
<l> Plus mon Loir gaulois, que le Tibre latin </l>
<l> Plus mon petit Liré, que le mont Palatin </l>
<l> Et plus que l'air marin la douceur angevine </l>
</lg>
</body>
</text>
<teiHeader>
<fileDesc>
<titleStmt>
<Title>Heureux qui comme
Ulysse</title>
<respStmt>
<resp>Auteur</resp>
<Name>Joacquim Du
Belay</name>
</respStmt>
</titleStmt>
<sourceDesc>
<Bibl>Recueil Les regrets
</bibl>
</sourceDesc>
</fileDesc>
</teiHeader>
<text>…
Some examples (TEIDrama)
 Identify within a piece of theatre:
Interpretation and production
Speech on stage
Stage direction (“didascalies”)
Types of exposure
Dénouement
Prologue
 …
 Interventions et indications scéniques
<sp who=“P”>
<speaker>Paul</speaker>
<p>Il fait chaud
aujourd’hui…</p>
</sp>
<stage>Paul allume une
cigarette</stage>
<surface> : a page, a stela, all objects
with inscription
 Surface contains zones and lines
 It has coordinates
<zone>: An area of the surface
determined arbitrarily for editorial
needs. Areas can overlap: overlaping is
defined according to spacial
coordinates
 Can contain <line>
 Disposes of coordinates
<line> : a continuum of text clearly
identified by editor
 Can contain text and <zone>
 Without coordinates
Some examples (TEIManuscript)
Some examples (TEIManuscript)
Some examples (TEIManuscript)
<sourceDoc>
<surface ulx="0" uly="0" lrx="200" lry="300">
<zone ulx="10" uly="43" lrx="185" lry="84"
rotate="0”>
<zone>
<line rend="right"> 1 April 2009</line>
</zone>
<line>Fed Birds in the park today.</line>
<line>Might write an articleabout</line>
<line>the Thick-billed Warbler.</line>
</zone>
<zone ulx="9" uly="20" lrx="70" lry="60"
rotate="90”>
<line>Samaria is a Greek</line>
<line>brand of water that</line>
<line>comes from the natural</line>
<line>springs of Stilos, in</line>
<line>Crete</line>
</zone>
</surface>
</sourceDoc>
<surface
ulx="0"
uly="0"
lrx="700"
lry="1000">
<!-- ... -->
</surface>
<zone
ulx="93"
uly="681"
lrx="967"
lry="1568">
<graphic url=« gb.jpg"/>
</zone>
Some examples (TEIManuscript)
<surface xml:id=“s1” ulx="0" uly="0"
lrx="50" lry="50">
<zone xml:id=“s1-z1” ulx="1" uly="1"
lrx="10" lry="10">
<line>Poem</line>
<!-- ... -->
<line>the head</line>
</zone>
<zone xml:id=“s1-z2” ulx="4" uly="4"
lrx="20" lry="20">
<surface xml:id=“s2” ulx="0" uly="0"
lrx="100" lry="100">
<zone xml:id=“s2-z1” ulx="10"
uly="10" lrx="90" lry="95"> Spring has just
set in here, and the weather […] a steamer
</zone>
</surface>
</zone>
</surface>
s1
s1-z1
s1-z2
s2
s2-z1
Some examples (Graph)
 Image Markup Tool
Some examples (Graph)
<teiHeader>
<fileDesc>
<titleStmt>
<title>The Image Markup Logo</title>
</titleStmt>
<publicationStmt>
<p></p>
</publicationStmt>
<sourceDesc>
<p>377 x 259</p>
</sourceDesc>
</fileDesc>
<encodingDesc>
</encodingDesc>
</teiHeader>
Some examples (Graph)
 Image Markup Tool
Some encoding examples (TEISpeech)
 Encoding speech according to a
« Timeline »
 Possibility to describe different
speech phenomena of oral
communication and sounds
during a speech
 pauses, words overlaps, tonal
changes, vocal expressions (coughs,
laughs, sneezing, …), gestures etc.
© The Music Encoding Initiative. http://music-encoding.org/
Some examples (MEI)
A tunisian song encoded in TEI <TeiHeader>
Métadonnées(TEIHeader)Espacedenom
A tunisian song encoded in TEI <text>
‫الليل‬ ‫في‬ ‫الياسمينة‬ ‫تحت‬
‫نسمة‬‫والورد‬‫محاذيني‬
‫األغصان‬‫تــــميل‬ ‫عليا‬
‫تمسحلي‬‫دمعة‬ ‫في‬‫عيني‬
‫تحت‬‫الياسمينة‬‫اتكيت‬
‫عدلت‬‫وغنيت‬ ‫الـــعود‬
‫وتناطر‬‫وبكيت‬ ‫دمعي‬
‫تفكرتك‬‫كنت‬ ‫كيف‬‫تجيني‬
‫جنينة‬‫النوار‬ ‫مزينها‬
‫فاحت‬‫األزهار‬ ‫ريحت‬ ‫من‬
‫تفكرتك‬‫النار‬ ‫شعــــلت‬
‫عملت‬‫لهليبة‬‫في‬‫قليبي‬
‫متوحش‬‫محتار‬ ‫وحدي‬
‫ال‬‫أطيار‬ ‫حس‬ ‫ال‬ ‫و‬ ‫قمرة‬
‫كان‬‫األشجار‬ ‫ع‬ ‫النسمة‬
‫توانس‬‫فيا‬‫وتواسيني‬
Stanza(Quatrain)
A tunisian song encoded in MEI <music>
A Arabic/French fable encoded with Oxygen
59
A TEI-based critical apparat of an Arabic document
Titre du document : ‫ّم‬‫ج‬‫المن‬ ‫بن‬ ‫يحيى‬ ‫رسالة‬ [Epître de Yahya ibn al-Munajjm sur la musique]
Nature : Fichier Word (converti en TEI P5 XML par Oxgarage)
Langue : Arabe
Genre littéraire : Catalogue de manuscrit de musique
Réalisation : Anas GHRAB
Schéma TEI : All
Extrait du bloc
<TEIHeader> :
<teiHeader>
<fileDesc xml:idx="001">
<titleStmt>
<title type="full">
<title type="main">‫ّم‬‫ج‬‫المن‬ ‫بن‬ ‫يحيى‬ ‫/<رسالة‬title>
<title type="sub">Titre translitéré</title>
<title type="alt">Epître de Yahya ibn al-Munajjm sur la musique</title>
</title>
<author>
<persName>
<forename>‫/<يحيى‬forename>
<genName>‫يحيى‬ ‫/<بن‬genName>
<surname>‫ّم‬‫ج‬‫المن‬ ‫/<بن‬surname>
<addName></addName>
</persName>
<date>...
<placeName>
<settlement type="city">...</settlement>
<country>...</country>
</placeName>
</date>
<date>...
<placeName>
<settlement type="city">...</settlement>
<country>...</country>
</placeName>
</date>
</author>
<respStmt>
<resp>‫/<تحقيق‬resp>
<persName>‫غراب‬ ‫/<أنس‬persName>
</respStmt>
</titleStmt>
<publicationStmt>
Extrait du bloc
<Text>
<text>
<front>
<div type="abstract">
<head>‫الكتاب‬ ‫محتوى‬ ‫/<وصف‬head>
</div>
<div type="contents">
<head>‫الكتاب‬ ‫/<فهرس‬head>
</div>
</front>
<body>
<p xml:idx="001">
ِ‫م‬‫ِي‬‫ح‬‫ه‬‫ر‬‫ال‬ ِ‫ن‬‫َا‬‫م‬ْ‫هح‬‫ر‬‫ال‬ ِ ‫ه‬‫اَّلل‬ ِ‫م‬ ْ‫س‬ِ‫ب‬
</p>
<p xml:idx="002">
‫ال‬ ‫ِي‬‫ف‬ ّ َ‫اَّلل‬ِ‫ب‬ ‫ِد‬‫ض‬َ‫ت‬‫ُع‬‫م‬‫ال‬ َ‫منين‬ْ‫ؤ‬ُ‫م‬‫ال‬ ِ‫ِير‬‫م‬َ‫أ‬ ‫ى‬َ‫َول‬‫م‬ ِ‫م‬ِّ‫ج‬َ‫ن‬ُ‫م‬‫ال‬ ‫َى‬‫ي‬ْ‫َح‬‫ي‬ ُ‫بن‬ ّ‫ي‬ِ‫ل‬َ‫ع‬ ُ‫بن‬ ‫َى‬‫ي‬ْ‫َح‬‫ي‬ ُ‫ة‬َ‫ال‬ َ‫س‬ِ‫ر‬‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬
</p>
<p xml:idx="003">
ْ‫ي‬َ‫ل‬َ‫ع‬ َ‫ون‬ُ‫َك‬‫ي‬ ْ‫ن‬َ‫أ‬ ُ‫ب‬ِ‫ج‬َ‫ي‬ ‫َا‬‫م‬َ‫و‬ ،‫ِي‬ّ‫ن‬َ‫غ‬ُ‫م‬‫ال‬ َ‫ة‬َ‫ف‬ِ‫ص‬ ‫َا‬‫ذ‬َ‫ه‬ َ‫ل‬ْ‫ب‬َ‫ق‬ ‫ِي‬‫ذ‬‫ه‬‫ال‬ ‫َا‬‫ن‬ِ‫ب‬‫َا‬‫ت‬ِ‫ك‬ ‫ِي‬‫ف‬ ‫َا‬‫ن‬ْ‫ر‬َ‫َك‬‫ذ‬ ْ‫َد‬‫ق‬ ُ‫ل‬‫و‬ُ‫َق‬‫ن‬َ‫م‬ ‫َا‬‫ن‬ْ‫ف‬َ‫ص‬َ‫و‬َ‫و‬ ،ِ‫ه‬َ‫ر‬ْ‫م‬َ‫أ‬ َ‫اآلن‬ ُ‫ِف‬‫ص‬َ‫ن‬َ‫و‬ ،َ‫ك‬ِ‫ل‬َ‫ذ‬ ْ‫ِن‬‫م‬ ً‫ة‬َ‫ي‬‫َا‬‫ه‬ِ‫ن‬ ِ‫ه‬‫ِي‬‫ف‬ ‫ا‬
ٍ‫ع‬ِ‫ب‬ْ‫ص‬‫ا‬ َ‫ع‬ِ‫ق‬‫َا‬‫و‬َ‫م‬َ‫و‬ ، ُ‫ِف‬‫ل‬َ‫ت‬ْ‫َخ‬‫ي‬َ‫و‬ ‫َا‬‫ه‬ْ‫ن‬ِ‫م‬ ُ‫ِف‬‫ل‬َ‫ت‬ْ‫َأ‬‫ي‬ ‫َا‬‫م‬َ‫و‬ ،‫َا‬‫ه‬َ‫د‬َ‫د‬َ‫ع‬َ‫و‬ ِ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ٍ‫ع‬ِ‫ب‬ْ‫ص‬‫ا‬ٍ‫َر‬‫ت‬َ‫و‬ ْ‫ِن‬‫م‬ٍ‫َر‬‫ت‬َ‫و‬َ‫و‬ ،ُ‫ه‬‫ا‬‫ه‬‫م‬ َ‫س‬ ‫َا‬‫م‬ ُ‫ن‬ِّ‫ي‬َ‫ب‬ُ‫ن‬َ‫و‬ ،ٍ‫ن‬‫َا‬‫ت‬ ْ‫َس‬‫د‬ ِّ‫ل‬ُ‫ك‬ ْ‫ِن‬‫م‬ ٍ‫ة‬َ‫م‬ْ‫غ‬َ‫ن‬ ِّ‫ل‬ُ‫ك‬ َ‫ع‬ِ‫ض‬ْ‫و‬َ‫م‬
‫ِي‬‫ل‬ِ‫ص‬ْ‫و‬َ‫م‬‫ال‬ َ‫م‬‫ِي‬‫ه‬‫َا‬‫ر‬ْ‫ب‬‫ا‬ ُ‫بن‬ ُ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬«‫َى‬‫ر‬ْ‫َج‬‫م‬‫ال‬»ْ‫ُس‬‫و‬‫ال‬ ‫َى‬‫ر‬ْ‫َج‬‫م‬ِ‫ب‬ ‫َا‬‫ه‬َ‫ض‬ْ‫ع‬َ‫ب‬ َ‫م‬ َ‫َس‬‫ر‬ ‫ِي‬‫ت‬‫ه‬‫ال‬ ِ‫ت‬‫َا‬‫و‬ْ‫ص‬َ‫األ‬ ‫ِي‬‫ف‬،ِ‫َر‬‫ص‬ْ‫ن‬ِ‫ب‬‫ال‬ ‫َى‬‫ر‬ْ‫َج‬‫م‬ِ‫ب‬ ‫َا‬‫ه‬َ‫ض‬ْ‫ع‬َ‫ب‬َ‫و‬ ،‫ى‬َ‫ط‬
ْ‫هن‬‫م‬ِ‫م‬ ‫ِه‬‫ئ‬‫َا‬‫ر‬َ‫ظ‬ُ‫ن‬َ‫و‬ َ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬ َ‫ل‬ْ‫ث‬ِ‫م‬ ِّ‫ي‬ِ‫ب‬َ‫ر‬َ‫ع‬‫ال‬ ِ‫ء‬‫َا‬‫ن‬ِ‫غ‬‫ال‬ ِ‫ب‬‫َا‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫ْن‬‫ي‬َ‫ب‬ ‫َا‬‫م‬ َ‫ف‬ َ‫َِل‬‫ت‬ْ‫َاخ‬‫و‬
<app>
<lem>َ‫ع‬َ‫م‬َ‫ج‬</lem>
<rdg wit="#Ram">‫/<جميع‬rdg>
</app>
َ‫د‬َ‫ع‬ ‫ِي‬‫ف‬ ِ‫ء‬‫َا‬‫م‬َ‫د‬ُ‫الق‬ ِ‫ة‬َ‫ف‬ِ‫س‬ َ‫ََل‬‫ف‬‫ال‬ َ‫ِن‬‫م‬ ‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬‫ال‬ ِ‫ب‬‫َا‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫ْن‬‫ي‬َ‫ب‬َ‫و‬ ،ِ‫َل‬‫م‬َ‫ع‬‫َال‬‫و‬ ِ‫ة‬َ‫ع‬‫َا‬‫ن‬ِّ‫ص‬‫ال‬ِ‫ب‬ َ‫م‬ْ‫ِل‬‫ع‬‫ال‬َ‫َان‬‫ك‬ ْ‫ذ‬ِ‫إ‬ ِ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ ِ‫د‬‫ه‬‫ن‬َ‫أ‬ َ‫ُون‬‫م‬َ‫ع‬ْ‫ز‬َ‫ي‬ ُ‫ه‬ُ‫َاؤ‬‫ر‬َ‫ظ‬ُ‫ن‬َ‫و‬ ُ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬
،َ‫ة‬َ‫ر‬ َ‫َش‬‫ع‬ ‫ِي‬‫ن‬‫َا‬‫م‬َ‫ث‬ ‫َا‬‫ه‬‫ه‬‫ن‬َ‫أ‬ َ‫ُون‬‫م‬َ‫ع‬ْ‫ز‬َ‫ي‬ ‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬‫ال‬ ُ‫َاب‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫َان‬‫ك‬َ‫و‬ ٌ‫ر‬ ْ‫َش‬‫ع‬ َ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬
<app>
<lem>.َ‫م‬‫َا‬‫غ‬ْ‫ن‬َ‫َاأل‬‫و‬ َ‫َان‬‫ح‬ْ‫ل‬َ‫األ‬ ُ‫ع‬َ‫م‬ْ‫َج‬‫ي‬ ‫َا‬‫م‬ِ‫ل‬ ٌ‫ح‬ْ‫ر‬ َ‫ش‬ ‫َا‬‫ه‬َ‫د‬ْ‫ع‬َ‫ب‬َ‫و‬</lem>
<rdg wit="#Ram"> ‫يجمع‬ ‫ما‬ ‫شرح‬ ‫وبعده‬‫االلحار‬‫واالفهام‬ </rdg>
<rdg wit="#BM"> ‫يجتمع‬ ‫بما‬ ‫شرح‬ ‫ونقدمه‬‫االلحار‬‫واالفهام‬ </rdg>
<rdg wit="#Zak">‫واألنغام‬ ‫األلحان‬ ‫يجتمع‬ ‫ما‬ ‫شرح‬ ‫/<ونقدم‬rdg>
</app>
Critical diagnosis
 Facing many problems
 DH & TEI are very new concepts for academic community
(resistance)
 Vertical clustering of disciplines in the same university
 Some cultural and linguistic sensitiveness observed between
partners
 Need for more training and interdisciplinary
research
 Nomadic training team
 Group publishing
The stylistic notes by Rétif de la Bretonne: a hypertext before digital era
Henri HUDRISIER (1), Sofia BEZINA (2), Hichem ISMAIL (3), Rachid ZGHIBI (4), Sihem
ZGHIDI (5), Loula Abd-elrazak (6), Laurent ROMARY (7), Mokhtar BEN HENDA (8),
Arnaud LABORDERIE (9), Ghislaine AZEMARD (10)
General orientation
 A strategy of support and orientation
 French translation of the full version of "TEI P5
Guidelines" (1500 pages counting appendices
including about 600 pages are translated to
date)
 An Arabic TEI P5 version is also under attempt
Rachid ZGHIBI, Tunis
Henri HUDRISIER, PAris8
General orientation
 Further research projects
 2014-2015: Project & MEI TEI-Euromed
(Encoding with MEI Maghreb traditional
music)
Sylvaine MARTIN,
LEDEN, MSH Paris Nord
General orientation
 Further research projects
 2012: The Unesco Chair-ITEN:
“Innovation, Transmission, Edition
Numérique”
 Bringing academic and entrepreneurial
partners together to discuss the
contemporary changes education
related to the effects of digital
innovation
 2012: IDEFI-CréaTIC project
 budget of 5,2 M€ on 8 years
 17 training programs
 24 workshops open to more than 500
students
HD-Muren
 Implement and test an environment where different stakeholders
in the academic communities learn together about new
computing technologies and their impact on the humanities
 DH & TEI at school?
General orientation
 More positioning within international
communities of practice in the DH
 Humanistica (2014 membership)
 ThatCamps: St Malo, 2013 / Lyon 2014 /
Beirut, 2015 / Paris 2015)
Thank you for your attention

Contenu connexe

Similaire à TEI for building multilingual corpora

Europeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia
 
03_ For a sociology of translator For a sociology of translator.ppt
03_ For a sociology of translator For a sociology of translator.ppt03_ For a sociology of translator For a sociology of translator.ppt
03_ For a sociology of translator For a sociology of translator.pptnzroghani
 
Promoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language TechnologyPromoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language Technologytechiaith
 
Building the Biblissima Observatory
Building the Biblissima ObservatoryBuilding the Biblissima Observatory
Building the Biblissima ObservatoryEquipex Biblissima
 
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...Equipex Biblissima
 
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...SSHOC
 
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...OpenEdition
 
Laura Welcher - The Rosetta Project and The Language Commons
Laura Welcher - The Rosetta Project and The Language CommonsLaura Welcher - The Rosetta Project and The Language Commons
Laura Welcher - The Rosetta Project and The Language Commonslongnow
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in EuropeanaAntoine Isaac
 
Biblissima and data modelling - EuropeanaTech 2015 Paris
Biblissima and data modelling - EuropeanaTech 2015 ParisBiblissima and data modelling - EuropeanaTech 2015 Paris
Biblissima and data modelling - EuropeanaTech 2015 ParisEquipex Biblissima
 
Challenges and opportunities of aggregating multiple databases in Europeana
Challenges and opportunities of aggregating multiple databases in EuropeanaChallenges and opportunities of aggregating multiple databases in Europeana
Challenges and opportunities of aggregating multiple databases in EuropeanaDavid Haskiya
 
Europeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology IndustryAntoine Isaac
 
TraduXio project - Cosi10
TraduXio project - Cosi10TraduXio project - Cosi10
TraduXio project - Cosi10PhilippeLacour
 
LIBER. The power of Association
LIBER. The power of AssociationLIBER. The power of Association
LIBER. The power of AssociationLIBER Europe
 
(Un)writing the histories of Humanities Computing(s)
(Un)writing the histories of Humanities Computing(s)(Un)writing the histories of Humanities Computing(s)
(Un)writing the histories of Humanities Computing(s)Edward Vanhoutte
 
Learning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology EngineeringLearning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology Engineeringbutest
 
Plurilingualism and intercomprehension teaching forum 2012
Plurilingualism and intercomprehension teaching forum 2012Plurilingualism and intercomprehension teaching forum 2012
Plurilingualism and intercomprehension teaching forum 2012Tita Beaven
 
Digital text as a phenomenon of culture
Digital text as a phenomenon of cultureDigital text as a phenomenon of culture
Digital text as a phenomenon of cultureTeodora Petkova
 

Similaire à TEI for building multilingual corpora (20)

Europeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conferenceEuropeana Regia presentation at eChallenges 2011 conference
Europeana Regia presentation at eChallenges 2011 conference
 
03_ For a sociology of translator For a sociology of translator.ppt
03_ For a sociology of translator For a sociology of translator.ppt03_ For a sociology of translator For a sociology of translator.ppt
03_ For a sociology of translator For a sociology of translator.ppt
 
Promoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language TechnologyPromoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language Technology
 
Building the Biblissima Observatory
Building the Biblissima ObservatoryBuilding the Biblissima Observatory
Building the Biblissima Observatory
 
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
Biblissima: an Observatory for the Written Cultural Heritage of the Middle Ag...
 
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
Sshoc kick off meeting - 1.4.4 SSHOC: Where Researchers (and their Research I...
 
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
Du Literary and linguistic computing aux Digital Humanities : retour sur 40 a...
 
Laura Welcher - The Rosetta Project and The Language Commons
Laura Welcher - The Rosetta Project and The Language CommonsLaura Welcher - The Rosetta Project and The Language Commons
Laura Welcher - The Rosetta Project and The Language Commons
 
Multilingual challenges in Europeana
Multilingual challenges in EuropeanaMultilingual challenges in Europeana
Multilingual challenges in Europeana
 
Biblissima and data modelling - EuropeanaTech 2015 Paris
Biblissima and data modelling - EuropeanaTech 2015 ParisBiblissima and data modelling - EuropeanaTech 2015 Paris
Biblissima and data modelling - EuropeanaTech 2015 Paris
 
Challenges and opportunities of aggregating multiple databases in Europeana
Challenges and opportunities of aggregating multiple databases in EuropeanaChallenges and opportunities of aggregating multiple databases in Europeana
Challenges and opportunities of aggregating multiple databases in Europeana
 
Europeana 2019 - Connect Communities
Europeana 2019 - Connect CommunitiesEuropeana 2019 - Connect Communities
Europeana 2019 - Connect Communities
 
Challenges for the Language Technology Industry
Challenges for the Language Technology IndustryChallenges for the Language Technology Industry
Challenges for the Language Technology Industry
 
TraduXio project - Cosi10
TraduXio project - Cosi10TraduXio project - Cosi10
TraduXio project - Cosi10
 
LIBER. The power of Association
LIBER. The power of AssociationLIBER. The power of Association
LIBER. The power of Association
 
(Un)writing the histories of Humanities Computing(s)
(Un)writing the histories of Humanities Computing(s)(Un)writing the histories of Humanities Computing(s)
(Un)writing the histories of Humanities Computing(s)
 
Introducing DARIAH
Introducing DARIAHIntroducing DARIAH
Introducing DARIAH
 
Learning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology EngineeringLearning and Text Analysis for Ontology Engineering
Learning and Text Analysis for Ontology Engineering
 
Plurilingualism and intercomprehension teaching forum 2012
Plurilingualism and intercomprehension teaching forum 2012Plurilingualism and intercomprehension teaching forum 2012
Plurilingualism and intercomprehension teaching forum 2012
 
Digital text as a phenomenon of culture
Digital text as a phenomenon of cultureDigital text as a phenomenon of culture
Digital text as a phenomenon of culture
 

Plus de Mokhtar Ben Henda

Webinaire de formation sur les REL
Webinaire de formation sur les RELWebinaire de formation sur les REL
Webinaire de formation sur les RELMokhtar Ben Henda
 
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...Mokhtar Ben Henda
 
Modalités & outils d’évaluation dans l’apprentissages en ligne
Modalités & outils d’évaluation dans l’apprentissages en ligneModalités & outils d’évaluation dans l’apprentissages en ligne
Modalités & outils d’évaluation dans l’apprentissages en ligneMokhtar Ben Henda
 
Écosystème d’ensemble d’un dispositif de formation hybride
Écosystème d’ensemble d’un dispositif de formation hybride Écosystème d’ensemble d’un dispositif de formation hybride
Écosystème d’ensemble d’un dispositif de formation hybride Mokhtar Ben Henda
 
Évaluation & Valorisation des résultats de la recherche
Évaluation & Valorisation des résultats de la rechercheÉvaluation & Valorisation des résultats de la recherche
Évaluation & Valorisation des résultats de la rechercheMokhtar Ben Henda
 
Modeling a Framework for Production and Dissemination of Open Educational Res...
Modeling a Framework for Production and Dissemination of Open Educational Res...Modeling a Framework for Production and Dissemination of Open Educational Res...
Modeling a Framework for Production and Dissemination of Open Educational Res...Mokhtar Ben Henda
 
Transformation Numerique à l'UEL (Vietnam)
Transformation Numerique à l'UEL (Vietnam)Transformation Numerique à l'UEL (Vietnam)
Transformation Numerique à l'UEL (Vietnam)Mokhtar Ben Henda
 
Classe inversée au TLU, Hanoï
Classe inversée au TLU, HanoïClasse inversée au TLU, Hanoï
Classe inversée au TLU, HanoïMokhtar Ben Henda
 
Formation Hybride au TLU, Hanoï
Formation Hybride au TLU, HanoïFormation Hybride au TLU, Hanoï
Formation Hybride au TLU, HanoïMokhtar Ben Henda
 
Gender-Equality_Scenarios.pptx
Gender-Equality_Scenarios.pptxGender-Equality_Scenarios.pptx
Gender-Equality_Scenarios.pptxMokhtar Ben Henda
 
Normes technologiques internationales : potentiels d’Innovation et de créativ...
Normes technologiques internationales : potentiels d’Innovation et de créativ...Normes technologiques internationales : potentiels d’Innovation et de créativ...
Normes technologiques internationales : potentiels d’Innovation et de créativ...Mokhtar Ben Henda
 
Modeling 30% online & 70% face-to-face learning
Modeling 30% online & 70% face-to-face learningModeling 30% online & 70% face-to-face learning
Modeling 30% online & 70% face-to-face learningMokhtar Ben Henda
 
Webinaires AUF ENS-Luang Prabang Laos
Webinaires AUF ENS-Luang Prabang LaosWebinaires AUF ENS-Luang Prabang Laos
Webinaires AUF ENS-Luang Prabang LaosMokhtar Ben Henda
 
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...Mokhtar Ben Henda
 
Open education between the specifications of Creative Commons and Copyright
Open education between the specifications of Creative Commons and CopyrightOpen education between the specifications of Creative Commons and Copyright
Open education between the specifications of Creative Commons and CopyrightMokhtar Ben Henda
 
Les fondamentaux de la classe inversée
Les fondamentaux de la classe inverséeLes fondamentaux de la classe inversée
Les fondamentaux de la classe inverséeMokhtar Ben Henda
 
L’intégrité académique en relation avec l’offre de formation d’un établisseme...
L’intégrité académique en relation avec l’offre de formation d’un établisseme...L’intégrité académique en relation avec l’offre de formation d’un établisseme...
L’intégrité académique en relation avec l’offre de formation d’un établisseme...Mokhtar Ben Henda
 
Animation des activités d'une classe virtuelle
Animation des activités d'une classe virtuelleAnimation des activités d'une classe virtuelle
Animation des activités d'une classe virtuelleMokhtar Ben Henda
 

Plus de Mokhtar Ben Henda (20)

Webinaire de formation sur les REL
Webinaire de formation sur les RELWebinaire de formation sur les REL
Webinaire de formation sur les REL
 
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
La norme technique comme catalyseur de transferts : la Francophonie à l’oeuvr...
 
Modalités & outils d’évaluation dans l’apprentissages en ligne
Modalités & outils d’évaluation dans l’apprentissages en ligneModalités & outils d’évaluation dans l’apprentissages en ligne
Modalités & outils d’évaluation dans l’apprentissages en ligne
 
Écosystème d’ensemble d’un dispositif de formation hybride
Écosystème d’ensemble d’un dispositif de formation hybride Écosystème d’ensemble d’un dispositif de formation hybride
Écosystème d’ensemble d’un dispositif de formation hybride
 
Évaluation & Valorisation des résultats de la recherche
Évaluation & Valorisation des résultats de la rechercheÉvaluation & Valorisation des résultats de la recherche
Évaluation & Valorisation des résultats de la recherche
 
Modeling a Framework for Production and Dissemination of Open Educational Res...
Modeling a Framework for Production and Dissemination of Open Educational Res...Modeling a Framework for Production and Dissemination of Open Educational Res...
Modeling a Framework for Production and Dissemination of Open Educational Res...
 
Transformation Numerique à l'UEL (Vietnam)
Transformation Numerique à l'UEL (Vietnam)Transformation Numerique à l'UEL (Vietnam)
Transformation Numerique à l'UEL (Vietnam)
 
Classe inversée au TLU, Hanoï
Classe inversée au TLU, HanoïClasse inversée au TLU, Hanoï
Classe inversée au TLU, Hanoï
 
Classe virtuelle au TLU
Classe virtuelle au TLUClasse virtuelle au TLU
Classe virtuelle au TLU
 
Formation Hybride au TLU, Hanoï
Formation Hybride au TLU, HanoïFormation Hybride au TLU, Hanoï
Formation Hybride au TLU, Hanoï
 
Ressouces Éducatives
Ressouces ÉducativesRessouces Éducatives
Ressouces Éducatives
 
Gender-Equality_Scenarios.pptx
Gender-Equality_Scenarios.pptxGender-Equality_Scenarios.pptx
Gender-Equality_Scenarios.pptx
 
Normes technologiques internationales : potentiels d’Innovation et de créativ...
Normes technologiques internationales : potentiels d’Innovation et de créativ...Normes technologiques internationales : potentiels d’Innovation et de créativ...
Normes technologiques internationales : potentiels d’Innovation et de créativ...
 
Modeling 30% online & 70% face-to-face learning
Modeling 30% online & 70% face-to-face learningModeling 30% online & 70% face-to-face learning
Modeling 30% online & 70% face-to-face learning
 
Webinaires AUF ENS-Luang Prabang Laos
Webinaires AUF ENS-Luang Prabang LaosWebinaires AUF ENS-Luang Prabang Laos
Webinaires AUF ENS-Luang Prabang Laos
 
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
مقاربة تحليلية للموارد التعليمية المفتوحة بموجب تراخيص المشاع الإبداعي و علاق...
 
Open education between the specifications of Creative Commons and Copyright
Open education between the specifications of Creative Commons and CopyrightOpen education between the specifications of Creative Commons and Copyright
Open education between the specifications of Creative Commons and Copyright
 
Les fondamentaux de la classe inversée
Les fondamentaux de la classe inverséeLes fondamentaux de la classe inversée
Les fondamentaux de la classe inversée
 
L’intégrité académique en relation avec l’offre de formation d’un établisseme...
L’intégrité académique en relation avec l’offre de formation d’un établisseme...L’intégrité académique en relation avec l’offre de formation d’un établisseme...
L’intégrité académique en relation avec l’offre de formation d’un établisseme...
 
Animation des activités d'une classe virtuelle
Animation des activités d'une classe virtuelleAnimation des activités d'une classe virtuelle
Animation des activités d'une classe virtuelle
 

Dernier

Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Vipesco
 
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verifiedDelhi Call girls
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfSkillCertProExams
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxraffaeleoman
 
Digital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of DrupalDigital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of DrupalFabian de Rijk
 
My Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle BaileyMy Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle Baileyhlharris
 
Causes of poverty in France presentation.pptx
Causes of poverty in France presentation.pptxCauses of poverty in France presentation.pptx
Causes of poverty in France presentation.pptxCamilleBoulbin1
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaKayode Fayemi
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfSenaatti-kiinteistöt
 
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Delhi Call girls
 
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...Pooja Nehwal
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar TrainingKylaCullinane
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIINhPhngng3
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatmentnswingard
 
lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.lodhisaajjda
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...amilabibi1
 
Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoKayode Fayemi
 

Dernier (18)

Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Noida Escorts | 100% verified
 
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdfAWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
AWS Data Engineer Associate (DEA-C01) Exam Dumps 2024.pdf
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
 
Digital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of DrupalDigital collaboration with Microsoft 365 as extension of Drupal
Digital collaboration with Microsoft 365 as extension of Drupal
 
My Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle BaileyMy Presentation "In Your Hands" by Halle Bailey
My Presentation "In Your Hands" by Halle Bailey
 
Causes of poverty in France presentation.pptx
Causes of poverty in France presentation.pptxCauses of poverty in France presentation.pptx
Causes of poverty in France presentation.pptx
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
 
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
Busty Desi⚡Call Girls in Sector 51 Noida Escorts >༒8448380779 Escort Service-...
 
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
Aesthetic Colaba Mumbai Cst Call girls 📞 7738631006 Grant road Call Girls ❤️-...
 
ICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdfICT role in 21st century education and it's challenges.pdf
ICT role in 21st century education and it's challenges.pdf
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar Training
 
Dreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio IIIDreaming Music Video Treatment _ Project & Portfolio III
Dreaming Music Video Treatment _ Project & Portfolio III
 
Dreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video TreatmentDreaming Marissa Sánchez Music Video Treatment
Dreaming Marissa Sánchez Music Video Treatment
 
lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.lONG QUESTION ANSWER PAKISTAN STUDIES10.
lONG QUESTION ANSWER PAKISTAN STUDIES10.
 
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
Bring back lost lover in USA, Canada ,Uk ,Australia ,London Lost Love Spell C...
 
Uncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac FolorunsoUncommon Grace The Autobiography of Isaac Folorunso
Uncommon Grace The Autobiography of Isaac Folorunso
 

TEI for building multilingual corpora

  • 1. INTERDISCIPLINARY CHALLENGE USING TEI FOR BUILDING MULTILINGUAL DIGITAL CORPORA A MAGHREB DH CASE STUDY Mokhtar Ben Henda Digital Humanities Institute – Beirut American University of Beirut 2-6 March 2015
  • 2. The general scope  Creating a DH community of practice in the Maghreb region  Treat multilingual and interdisciplinary resources  Acquire new research and editing habits with SH digital resources  Using TEI as technical solution  For its collegiate dynamics, open to non- Western linguistic diversity  For its accuracy to maintain, enrich and document open guidelines on diversity  For its ability to ensure the interoperability of all produced resources
  • 3. Delimiters Multilingualism as a core question Hybridization of oral and written languages in the Maghreb
  • 4. Languages hybridization  Use of classical or literary (scholar) Arabic in formal contexts :  Administration, politics, media, education, religion…  Dialects in daily life (trade, traditions, …)  “Mother” tongue, “native” language ?  Use of Arabic dialects:  Morocco : Haḍarī (Fez, Rabat), Bedui (Marrakech), ‘Arūbī (west)  Algeria: wahrani (West), Ksantini (East), Dziri (Center)  Tunsia: Beldi (Tunis), Sahli (Center), Bedui (South)  Ibn Žaldūn, al-Muqaddima, p.984 : (1377) «... Aboriginal Language [Algerian] has taken over the Arabic language. The result is a mixed language, in which Berber dominates and in which North African dialects are farthest from Classical Arabic ».
  • 5. Writing hybridization A calendar page for November 1, 1895 with Ottoman-Turkish, Hebrew, Latin, Arabic scripts Turkish and Arabic manuscript dictionary. 1201 AH [1786/87 AD].
  • 6. Writing hybridization Notary records in Arabic and French (1342 Hg/1923 AD) Strong claims of Berber as a national language
  • 7. Berber languages  It is estimated that there are between 14 and 25 million speakers of Berber languages Tachelhit 3 million Morocco Tamazight 3 million Morocco Kabyle 2.5 to 6 million Algeria Tarifit 1.7 million Morocco Tachawit 1.4 million Algeria Tamajaq 640,000 Niger Tamasheq (Tuareg) 281,000 Mali Tamajeq 250,000 Niger Nafusi 167,000 Libya Tamasheq has five varieties. Tachelhit has a dialect called Susiua. Tamazight has two major varieties: Central Altas and Southern Oran. Tarifit has at least two dialects, one of which may be a separate language Kabyle has two dialects: Greater and Lesser Kabyle.
  • 8. Berber writing systems  Berber languages are written with Roman, Arabic and Tifinagh alphabets The female name "Saida" written in tifinagh Berber language poetry in Arabic script with its translation in French
  • 9. Berber writing systems  Kabylian manuscript of the XVIII Century
  • 10. Berber « Rosetta » stone Berber « Rosetta stone » from Dugga (Tunisia) Phoenician and Tifinagh scripts (British Museum) The Tifinagh alphabet is thought to have been derived from the Phoenician alphabet.
  • 11. Berber spreading in Maghreb  60% in Morocco, 50% in Algeria, 2% in Tunisia
  • 12. Berber recognition as a national language  Algeria: constitutional amendment (2002)
  • 13. Berber recognition as a national language  Morocco « Constitutional amendment, July 2011)
  • 14. IRCAM
  • 15. IRCAM
  • 16.  Tunisia  New constitution 2014: recognizing minorities without nominating them  NGOs acting for a constitutional amendment
  • 17.
  • 18. So what ? … (hypothesis)  A significant amount of resources in various languages and writing systems have been accumulated throughout years  It does not exist today vast digital corpora of texts in vernacular idioms and scripts (Unlike many traditional textual resources : printed documents and medieval manuscripts),  But, since constitutional amendments (Algeria and Morocco), there is a growing awareness that these resources should be collected, studied and digitized A large reservoir of unrecorded oral heritage A large reservoir of popular music A large reservoir of multilingual city printing A large amount of untreated notary records A large quantity of manuscripts to be digitized
  • 19. How to subscribe to this challenge ?  Need for a strategy  Need for a working frame  Need for competencies  Need for data resources  Need for … incentives !
  • 20. Incentives  DH general context in France  THATCamp Paris 2010 : HD Manifest  …  III. Declaration  5. We, professionals of the digital humanities, are building a community of practice that is solidary, open, welcoming and freely accessible.  6. We are a community without borders. We are a multilingual and multidisciplinary community.  7. Our objectives are the advancement of knowledge, the improvement of research quality in our disciplines, the enrichment of knowledge and of collective patrimony, in the academic sphere and beyond it.  8. We call for the integration of digital culture in the definition of the general culture of the twenty-first century. http://tcp.hypotheses.org/411
  • 22. Incentives  Research projects experience  BNFB (Bibliothèque Numérique Franco- Berbère) : OIF/FFI Project - 2011 (100 000 €)  Enrich and update, with automated procedures, records repositories that describe and make available documents in Berber languages (Timbuktu manuscripts)  IDEFI-CréaTIC: Unesco Chair-ITEN (2012) : Innovative Training (5,2 million €)  ISO standards:  E-Learning vocabulary standards (SC36/WG1)  E-Learning MLR (Metadata for Learning standard)  E-learning MLO (Metadata for learning opportunities) (Afnor/GT1 mirror)
  • 23. Incentives  Research background  Information and Communication Studies  Multilingual data encoding standards  Digital data processing  Research affiliations  Universities (Algeria, France, Morocco, Tunisia)  Research Laboratories  Professional associations  International Organizations
  • 24. Key incentive  Support from international TEI referees  Laurent ROMARY : director of DARIAH & member of the TEI council  Lu BURNARD: Oxford Univ., godfather of TEI Guidelines & consultant TGIR Huma-Num, France Laurent ROMARY Lu BURNARD
  • 25.
  • 26. The roadmap  Building partnerships  Fund raising  Building work environment  Work organization  Implementation, production and evaluation
  • 28. Fund raising  OIF/FFI (2010-2013)  100 000 €  BNFB project  ISCC/CNRS (2012-2013)  10 000 €  HumanitéDigitMaghreb  LEDEN, MSH Paris-Nord (2014-2015)  3000 €  TEI/MEI Euromed  Chaire Unesco-ITEN (provisional)  HD-MUREN
  • 29. Work organization (structuring) SH Corpus: Document structures modeling Poetry Theatre Stories Music Manuscript Speech Fables Others Video Peintings
  • 30. Work environment  Preferring  A data description system ?  Document centered system  Document structuring  to  A Data base management system  Flat scanning  Referencing metadata
  • 32. Work environment  oXygen  oXygen XML Editor supports a large number of text encodings.  Bidirectional Text Support in Text Editing Mode (Unicode BIDI Algorithm)  Editing right-to left XML documents  Ability to create and edit XML content in Arabic
  • 36. Work environment  Xaira (XML search engine)  Explore a corpus language:  Search words, forms, phrases or XML tags  Analyze the way how they are used (frequency, stats, forms)
  • 38. Work plan (provisional)  Sharing tasks and responsibilities  Describing roles  Training sessions  Resource analysis  Structuring (segmentation)  TEI/XML Encoding  Collections organization  Referencing (TeiHeader)  DC indexing (metadata harvesting)  Exploitation  Content research  Communication
  • 39. Work organization (sharing) SH Specialists XML/TEI technicians Info-Com. specialists Arts & SH Corpus
  • 40. Work organization (structuring)  SHS SPECIALISTS identify logical structure of documents according to the genres (Novel, poem, theatre, story, painting, music, …)
  • 41. Work organization (encoding)  XML/TEI TECHNICIANS convert data models prepared by SH specialists into TEI schemas using specialized software XML files
  • 42. Work organization (indexing)  INFORMATION SPECIALISTS (librarians) add referencing data (TEIHeader) and description metadata in the Data base CMS / DB
  • 43. Some examples (TEIVerse)  Identify within a poem  Verse metric  Different types of verse  Different verse clusters (verses, quatrains)  Patterns of stanzas  Rhythm  ... The granularity may vary between large segments of text and very tiny character strings
  • 44. Some examples (TEIVerse) <text> <body> <head>Heureux qui, comme Ulysse, a fait un beau voyage </head> <lg> <l>Heureux qui, comme Ulysse, a fait un beau voyage </l> <l> Ou comme cestuy-là qui conquit la toison </l> <l> Et puis est retourné, plein d'usage et raison </l> <l> Vivre entre ses parents le reste de son âge ! </l> </lg> <lg> <l> Quand reverrai-je, hélas, de mon petit village </l> <l> Fumer la cheminée, et en quelle saison </l> <l> Reverrai-je le clos de ma pauvre maison </l> <l> Qui m'est une province, et beaucoup davantage ? </lg> <lg> <l> Plus me plaît le séjour qu'ont bâti mes aïeux </l> <l> Que des palais Romains le front audacieux </l> <l> Plus que le marbre dur me plaît l'ardoise fine : </l> </lg> <lg> <l> Plus mon Loir gaulois, que le Tibre latin </l> <l> Plus mon petit Liré, que le mont Palatin </l> <l> Et plus que l'air marin la douceur angevine </l> </lg> </body> </text> <teiHeader> <fileDesc> <titleStmt> <Title>Heureux qui comme Ulysse</title> <respStmt> <resp>Auteur</resp> <Name>Joacquim Du Belay</name> </respStmt> </titleStmt> <sourceDesc> <Bibl>Recueil Les regrets </bibl> </sourceDesc> </fileDesc> </teiHeader> <text>…
  • 45. Some examples (TEIDrama)  Identify within a piece of theatre: Interpretation and production Speech on stage Stage direction (“didascalies”) Types of exposure Dénouement Prologue  …  Interventions et indications scéniques <sp who=“P”> <speaker>Paul</speaker> <p>Il fait chaud aujourd’hui…</p> </sp> <stage>Paul allume une cigarette</stage>
  • 46. <surface> : a page, a stela, all objects with inscription  Surface contains zones and lines  It has coordinates <zone>: An area of the surface determined arbitrarily for editorial needs. Areas can overlap: overlaping is defined according to spacial coordinates  Can contain <line>  Disposes of coordinates <line> : a continuum of text clearly identified by editor  Can contain text and <zone>  Without coordinates Some examples (TEIManuscript)
  • 48. Some examples (TEIManuscript) <sourceDoc> <surface ulx="0" uly="0" lrx="200" lry="300"> <zone ulx="10" uly="43" lrx="185" lry="84" rotate="0”> <zone> <line rend="right"> 1 April 2009</line> </zone> <line>Fed Birds in the park today.</line> <line>Might write an articleabout</line> <line>the Thick-billed Warbler.</line> </zone> <zone ulx="9" uly="20" lrx="70" lry="60" rotate="90”> <line>Samaria is a Greek</line> <line>brand of water that</line> <line>comes from the natural</line> <line>springs of Stilos, in</line> <line>Crete</line> </zone> </surface> </sourceDoc> <surface ulx="0" uly="0" lrx="700" lry="1000"> <!-- ... --> </surface> <zone ulx="93" uly="681" lrx="967" lry="1568"> <graphic url=« gb.jpg"/> </zone>
  • 49. Some examples (TEIManuscript) <surface xml:id=“s1” ulx="0" uly="0" lrx="50" lry="50"> <zone xml:id=“s1-z1” ulx="1" uly="1" lrx="10" lry="10"> <line>Poem</line> <!-- ... --> <line>the head</line> </zone> <zone xml:id=“s1-z2” ulx="4" uly="4" lrx="20" lry="20"> <surface xml:id=“s2” ulx="0" uly="0" lrx="100" lry="100"> <zone xml:id=“s2-z1” ulx="10" uly="10" lrx="90" lry="95"> Spring has just set in here, and the weather […] a steamer </zone> </surface> </zone> </surface> s1 s1-z1 s1-z2 s2 s2-z1
  • 50. Some examples (Graph)  Image Markup Tool
  • 51. Some examples (Graph) <teiHeader> <fileDesc> <titleStmt> <title>The Image Markup Logo</title> </titleStmt> <publicationStmt> <p></p> </publicationStmt> <sourceDesc> <p>377 x 259</p> </sourceDesc> </fileDesc> <encodingDesc> </encodingDesc> </teiHeader>
  • 52. Some examples (Graph)  Image Markup Tool
  • 53. Some encoding examples (TEISpeech)  Encoding speech according to a « Timeline »  Possibility to describe different speech phenomena of oral communication and sounds during a speech  pauses, words overlaps, tonal changes, vocal expressions (coughs, laughs, sneezing, …), gestures etc.
  • 54. © The Music Encoding Initiative. http://music-encoding.org/ Some examples (MEI)
  • 55. A tunisian song encoded in TEI <TeiHeader> Métadonnées(TEIHeader)Espacedenom
  • 56. A tunisian song encoded in TEI <text> ‫الليل‬ ‫في‬ ‫الياسمينة‬ ‫تحت‬ ‫نسمة‬‫والورد‬‫محاذيني‬ ‫األغصان‬‫تــــميل‬ ‫عليا‬ ‫تمسحلي‬‫دمعة‬ ‫في‬‫عيني‬ ‫تحت‬‫الياسمينة‬‫اتكيت‬ ‫عدلت‬‫وغنيت‬ ‫الـــعود‬ ‫وتناطر‬‫وبكيت‬ ‫دمعي‬ ‫تفكرتك‬‫كنت‬ ‫كيف‬‫تجيني‬ ‫جنينة‬‫النوار‬ ‫مزينها‬ ‫فاحت‬‫األزهار‬ ‫ريحت‬ ‫من‬ ‫تفكرتك‬‫النار‬ ‫شعــــلت‬ ‫عملت‬‫لهليبة‬‫في‬‫قليبي‬ ‫متوحش‬‫محتار‬ ‫وحدي‬ ‫ال‬‫أطيار‬ ‫حس‬ ‫ال‬ ‫و‬ ‫قمرة‬ ‫كان‬‫األشجار‬ ‫ع‬ ‫النسمة‬ ‫توانس‬‫فيا‬‫وتواسيني‬ Stanza(Quatrain)
  • 57. A tunisian song encoded in MEI <music>
  • 58. A Arabic/French fable encoded with Oxygen 59
  • 59. A TEI-based critical apparat of an Arabic document Titre du document : ‫ّم‬‫ج‬‫المن‬ ‫بن‬ ‫يحيى‬ ‫رسالة‬ [Epître de Yahya ibn al-Munajjm sur la musique] Nature : Fichier Word (converti en TEI P5 XML par Oxgarage) Langue : Arabe Genre littéraire : Catalogue de manuscrit de musique Réalisation : Anas GHRAB Schéma TEI : All Extrait du bloc <TEIHeader> : <teiHeader> <fileDesc xml:idx="001"> <titleStmt> <title type="full"> <title type="main">‫ّم‬‫ج‬‫المن‬ ‫بن‬ ‫يحيى‬ ‫/<رسالة‬title> <title type="sub">Titre translitéré</title> <title type="alt">Epître de Yahya ibn al-Munajjm sur la musique</title> </title> <author> <persName> <forename>‫/<يحيى‬forename> <genName>‫يحيى‬ ‫/<بن‬genName> <surname>‫ّم‬‫ج‬‫المن‬ ‫/<بن‬surname> <addName></addName> </persName> <date>... <placeName> <settlement type="city">...</settlement> <country>...</country> </placeName> </date> <date>... <placeName> <settlement type="city">...</settlement> <country>...</country> </placeName> </date> </author> <respStmt> <resp>‫/<تحقيق‬resp> <persName>‫غراب‬ ‫/<أنس‬persName> </respStmt> </titleStmt> <publicationStmt> Extrait du bloc <Text> <text> <front> <div type="abstract"> <head>‫الكتاب‬ ‫محتوى‬ ‫/<وصف‬head> </div> <div type="contents"> <head>‫الكتاب‬ ‫/<فهرس‬head> </div> </front> <body> <p xml:idx="001"> ِ‫م‬‫ِي‬‫ح‬‫ه‬‫ر‬‫ال‬ ِ‫ن‬‫َا‬‫م‬ْ‫هح‬‫ر‬‫ال‬ ِ ‫ه‬‫اَّلل‬ ِ‫م‬ ْ‫س‬ِ‫ب‬ </p> <p xml:idx="002"> ‫ال‬ ‫ِي‬‫ف‬ ّ َ‫اَّلل‬ِ‫ب‬ ‫ِد‬‫ض‬َ‫ت‬‫ُع‬‫م‬‫ال‬ َ‫منين‬ْ‫ؤ‬ُ‫م‬‫ال‬ ِ‫ِير‬‫م‬َ‫أ‬ ‫ى‬َ‫َول‬‫م‬ ِ‫م‬ِّ‫ج‬َ‫ن‬ُ‫م‬‫ال‬ ‫َى‬‫ي‬ْ‫َح‬‫ي‬ ُ‫بن‬ ّ‫ي‬ِ‫ل‬َ‫ع‬ ُ‫بن‬ ‫َى‬‫ي‬ْ‫َح‬‫ي‬ ُ‫ة‬َ‫ال‬ َ‫س‬ِ‫ر‬‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬ </p> <p xml:idx="003"> ْ‫ي‬َ‫ل‬َ‫ع‬ َ‫ون‬ُ‫َك‬‫ي‬ ْ‫ن‬َ‫أ‬ ُ‫ب‬ِ‫ج‬َ‫ي‬ ‫َا‬‫م‬َ‫و‬ ،‫ِي‬ّ‫ن‬َ‫غ‬ُ‫م‬‫ال‬ َ‫ة‬َ‫ف‬ِ‫ص‬ ‫َا‬‫ذ‬َ‫ه‬ َ‫ل‬ْ‫ب‬َ‫ق‬ ‫ِي‬‫ذ‬‫ه‬‫ال‬ ‫َا‬‫ن‬ِ‫ب‬‫َا‬‫ت‬ِ‫ك‬ ‫ِي‬‫ف‬ ‫َا‬‫ن‬ْ‫ر‬َ‫َك‬‫ذ‬ ْ‫َد‬‫ق‬ ُ‫ل‬‫و‬ُ‫َق‬‫ن‬َ‫م‬ ‫َا‬‫ن‬ْ‫ف‬َ‫ص‬َ‫و‬َ‫و‬ ،ِ‫ه‬َ‫ر‬ْ‫م‬َ‫أ‬ َ‫اآلن‬ ُ‫ِف‬‫ص‬َ‫ن‬َ‫و‬ ،َ‫ك‬ِ‫ل‬َ‫ذ‬ ْ‫ِن‬‫م‬ ً‫ة‬َ‫ي‬‫َا‬‫ه‬ِ‫ن‬ ِ‫ه‬‫ِي‬‫ف‬ ‫ا‬ ٍ‫ع‬ِ‫ب‬ْ‫ص‬‫ا‬ َ‫ع‬ِ‫ق‬‫َا‬‫و‬َ‫م‬َ‫و‬ ، ُ‫ِف‬‫ل‬َ‫ت‬ْ‫َخ‬‫ي‬َ‫و‬ ‫َا‬‫ه‬ْ‫ن‬ِ‫م‬ ُ‫ِف‬‫ل‬َ‫ت‬ْ‫َأ‬‫ي‬ ‫َا‬‫م‬َ‫و‬ ،‫َا‬‫ه‬َ‫د‬َ‫د‬َ‫ع‬َ‫و‬ ِ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ٍ‫ع‬ِ‫ب‬ْ‫ص‬‫ا‬ٍ‫َر‬‫ت‬َ‫و‬ ْ‫ِن‬‫م‬ٍ‫َر‬‫ت‬َ‫و‬َ‫و‬ ،ُ‫ه‬‫ا‬‫ه‬‫م‬ َ‫س‬ ‫َا‬‫م‬ ُ‫ن‬ِّ‫ي‬َ‫ب‬ُ‫ن‬َ‫و‬ ،ٍ‫ن‬‫َا‬‫ت‬ ْ‫َس‬‫د‬ ِّ‫ل‬ُ‫ك‬ ْ‫ِن‬‫م‬ ٍ‫ة‬َ‫م‬ْ‫غ‬َ‫ن‬ ِّ‫ل‬ُ‫ك‬ َ‫ع‬ِ‫ض‬ْ‫و‬َ‫م‬ ‫ِي‬‫ل‬ِ‫ص‬ْ‫و‬َ‫م‬‫ال‬ َ‫م‬‫ِي‬‫ه‬‫َا‬‫ر‬ْ‫ب‬‫ا‬ ُ‫بن‬ ُ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬«‫َى‬‫ر‬ْ‫َج‬‫م‬‫ال‬»ْ‫ُس‬‫و‬‫ال‬ ‫َى‬‫ر‬ْ‫َج‬‫م‬ِ‫ب‬ ‫َا‬‫ه‬َ‫ض‬ْ‫ع‬َ‫ب‬ َ‫م‬ َ‫َس‬‫ر‬ ‫ِي‬‫ت‬‫ه‬‫ال‬ ِ‫ت‬‫َا‬‫و‬ْ‫ص‬َ‫األ‬ ‫ِي‬‫ف‬،ِ‫َر‬‫ص‬ْ‫ن‬ِ‫ب‬‫ال‬ ‫َى‬‫ر‬ْ‫َج‬‫م‬ِ‫ب‬ ‫َا‬‫ه‬َ‫ض‬ْ‫ع‬َ‫ب‬َ‫و‬ ،‫ى‬َ‫ط‬ ْ‫هن‬‫م‬ِ‫م‬ ‫ِه‬‫ئ‬‫َا‬‫ر‬َ‫ظ‬ُ‫ن‬َ‫و‬ َ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬ َ‫ل‬ْ‫ث‬ِ‫م‬ ِّ‫ي‬ِ‫ب‬َ‫ر‬َ‫ع‬‫ال‬ ِ‫ء‬‫َا‬‫ن‬ِ‫غ‬‫ال‬ ِ‫ب‬‫َا‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫ْن‬‫ي‬َ‫ب‬ ‫َا‬‫م‬ َ‫ف‬ َ‫َِل‬‫ت‬ْ‫َاخ‬‫و‬ <app> <lem>َ‫ع‬َ‫م‬َ‫ج‬</lem> <rdg wit="#Ram">‫/<جميع‬rdg> </app> َ‫د‬َ‫ع‬ ‫ِي‬‫ف‬ ِ‫ء‬‫َا‬‫م‬َ‫د‬ُ‫الق‬ ِ‫ة‬َ‫ف‬ِ‫س‬ َ‫ََل‬‫ف‬‫ال‬ َ‫ِن‬‫م‬ ‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬‫ال‬ ِ‫ب‬‫َا‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫ْن‬‫ي‬َ‫ب‬َ‫و‬ ،ِ‫َل‬‫م‬َ‫ع‬‫َال‬‫و‬ ِ‫ة‬َ‫ع‬‫َا‬‫ن‬ِّ‫ص‬‫ال‬ِ‫ب‬ َ‫م‬ْ‫ِل‬‫ع‬‫ال‬َ‫َان‬‫ك‬ ْ‫ذ‬ِ‫إ‬ ِ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ ِ‫د‬‫ه‬‫ن‬َ‫أ‬ َ‫ُون‬‫م‬َ‫ع‬ْ‫ز‬َ‫ي‬ ُ‫ه‬ُ‫َاؤ‬‫ر‬َ‫ظ‬ُ‫ن‬َ‫و‬ ُ‫َاق‬‫ح‬ ْ‫س‬ِ‫إ‬ ،َ‫ة‬َ‫ر‬ َ‫َش‬‫ع‬ ‫ِي‬‫ن‬‫َا‬‫م‬َ‫ث‬ ‫َا‬‫ه‬‫ه‬‫ن‬َ‫أ‬ َ‫ُون‬‫م‬َ‫ع‬ْ‫ز‬َ‫ي‬ ‫َى‬‫ق‬‫ِي‬‫س‬‫ُو‬‫م‬‫ال‬ ُ‫َاب‬‫ح‬ْ‫ص‬َ‫أ‬ َ‫َان‬‫ك‬َ‫و‬ ٌ‫ر‬ ْ‫َش‬‫ع‬ َ‫م‬َ‫غ‬‫ه‬‫ن‬‫ال‬ <app> <lem>.َ‫م‬‫َا‬‫غ‬ْ‫ن‬َ‫َاأل‬‫و‬ َ‫َان‬‫ح‬ْ‫ل‬َ‫األ‬ ُ‫ع‬َ‫م‬ْ‫َج‬‫ي‬ ‫َا‬‫م‬ِ‫ل‬ ٌ‫ح‬ْ‫ر‬ َ‫ش‬ ‫َا‬‫ه‬َ‫د‬ْ‫ع‬َ‫ب‬َ‫و‬</lem> <rdg wit="#Ram"> ‫يجمع‬ ‫ما‬ ‫شرح‬ ‫وبعده‬‫االلحار‬‫واالفهام‬ </rdg> <rdg wit="#BM"> ‫يجتمع‬ ‫بما‬ ‫شرح‬ ‫ونقدمه‬‫االلحار‬‫واالفهام‬ </rdg> <rdg wit="#Zak">‫واألنغام‬ ‫األلحان‬ ‫يجتمع‬ ‫ما‬ ‫شرح‬ ‫/<ونقدم‬rdg> </app>
  • 60. Critical diagnosis  Facing many problems  DH & TEI are very new concepts for academic community (resistance)  Vertical clustering of disciplines in the same university  Some cultural and linguistic sensitiveness observed between partners  Need for more training and interdisciplinary research  Nomadic training team  Group publishing The stylistic notes by Rétif de la Bretonne: a hypertext before digital era Henri HUDRISIER (1), Sofia BEZINA (2), Hichem ISMAIL (3), Rachid ZGHIBI (4), Sihem ZGHIDI (5), Loula Abd-elrazak (6), Laurent ROMARY (7), Mokhtar BEN HENDA (8), Arnaud LABORDERIE (9), Ghislaine AZEMARD (10)
  • 61.
  • 62. General orientation  A strategy of support and orientation  French translation of the full version of "TEI P5 Guidelines" (1500 pages counting appendices including about 600 pages are translated to date)  An Arabic TEI P5 version is also under attempt Rachid ZGHIBI, Tunis Henri HUDRISIER, PAris8
  • 63. General orientation  Further research projects  2014-2015: Project & MEI TEI-Euromed (Encoding with MEI Maghreb traditional music) Sylvaine MARTIN, LEDEN, MSH Paris Nord
  • 64. General orientation  Further research projects  2012: The Unesco Chair-ITEN: “Innovation, Transmission, Edition Numérique”  Bringing academic and entrepreneurial partners together to discuss the contemporary changes education related to the effects of digital innovation  2012: IDEFI-CréaTIC project  budget of 5,2 M€ on 8 years  17 training programs  24 workshops open to more than 500 students
  • 65.
  • 66. HD-Muren  Implement and test an environment where different stakeholders in the academic communities learn together about new computing technologies and their impact on the humanities  DH & TEI at school?
  • 67. General orientation  More positioning within international communities of practice in the DH  Humanistica (2014 membership)  ThatCamps: St Malo, 2013 / Lyon 2014 / Beirut, 2015 / Paris 2015)
  • 68. Thank you for your attention