RA TRAINING DAY
GRF Corpus project
Sign in to the project
• Get your user account and log in to https://grfcorpus.teamworkpm.net/
Get the software
• Download the software from http://tla.mpi.nl/tools/tla-tools/elan/
• Or download it from the project page
ELAN working environment
• An ELAN project consists of 2 files:
  • the .etf template file
  • the source audio file
• Download both files from Teamwork:
  1) your personal audio file, as per your task
  2) the standard .etf template file
Create your new project
• File -> New: select your audio file (wav/mp3) and the .etf template
• The annotation work consists of 2 parts:
  1) segmentation
  2) transcription
Segmentation 1
• Options -> Segmentation Mode
• Listen first: the recording contains several participants
Segmentation 2
• Start with the Speaker1 - Sentence tier
• Segment each speaker separately and fine-tune the boundaries
• Delete, move, merge and split segments as needed
Transcription 1
• Options -> Transcription Mode
• Select Speech
Transcription 2
• Listen and type
Transcription 3
• This phase:
1st copy of segmentation
• Options -> Annotation Mode
• Tier -> Create Annotations on Dependent Tiers
• Speech -> JyutPing, Translation
More transcription
• Use Annotation Mode or Transcription Mode to enter text
• For JyutPing transcription, use the website http://hktv.cc/hp/cantonesetojyutping/
• Pay attention to spaces
Tokenizing
• Tier -> Tokenize Tier: JyutPing -> Words
• Adjust segments while pressing Alt
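
Why spaces matter for this step can be sketched in a few lines of Python. This is only an illustration, assuming the tokenizer splits the JyutPing text on whitespace (ELAN's delimiter is configurable, so your setup may differ). The function name `tokenize` and the sample strings are made up for the example.

```python
def tokenize(jyutping_line: str) -> list[str]:
    """Split one JyutPing annotation into word tokens on whitespace."""
    return jyutping_line.split()


# Illustrative sample strings, not taken from the corpus.
clean = "nei5 hou2 maa3"
missing_space = "nei5hou2 maa3"

print(tokenize(clean))          # ['nei5', 'hou2', 'maa3']
print(tokenize(missing_space))  # ['nei5hou2', 'maa3']
```

A missing space merges two syllables into one token on the Words tier, which then has to be split by hand.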
2nd copy of segmentation
• Tier -> Create Annotations on Dependent Tiers
• Words -> English Gloss, IPA, Language
• Language has a Controlled Vocabulary: E, C, P, ?
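
If Language values are ever exported or edited outside ELAN, a small check like the following could flag entries that fall outside the controlled vocabulary. It is a minimal sketch: the four codes come from the slide, while the function name and the sample list are hypothetical.

```python
# The four codes listed on the slide for the Language tier.
ALLOWED_LANGUAGE_CODES = {"E", "C", "P", "?"}


def invalid_language_codes(values):
    """Return (index, value) pairs for entries outside the controlled vocabulary."""
    return [(i, v) for i, v in enumerate(values) if v not in ALLOWED_LANGUAGE_CODES]


# Hypothetical Language-tier values: one lowercase typo and one empty entry.
sample = ["C", "C", "E", "c", "", "?"]
print(invalid_language_codes(sample))  # [(3, 'c'), (4, '')]
```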
Last 2 Tiers
• Code switching types
  • Annotation mode
  • Select a section with your mouse and double-click
  • Choose an option
• Translation
  • Annotation mode or Transcription mode
  • Ctrl+Enter or Configure Verbal Unit Tier
More participants
• Recreate the tier structure for each participant
• Tier -> Add New Participant -> OK
• Take a break and repeat the whole transcription process
• Save your work often
• Try using a mouse
Finish
• Upload the saved .eaf file to Teamwork and set the task to complete
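
Before uploading, it can help to confirm that no tier was left with empty annotations. The sketch below relies on the fact that an .eaf file is XML with `TIER` and `ANNOTATION_VALUE` elements. The embedded sample document, its tier contents, and the file name `my_task.eaf` are placeholders, so treat this as a rough check rather than a project requirement.

```python
import xml.etree.ElementTree as ET

# Tiny stand-in for a real .eaf file; tier names and text are placeholders.
SAMPLE_EAF = """<ANNOTATION_DOCUMENT>
  <TIER TIER_ID="Speech" PARTICIPANT="Speaker1">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1" TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>sample text</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
  <TIER TIER_ID="Translation" PARENT_REF="Speech" PARTICIPANT="Speaker1">
    <ANNOTATION>
      <REF_ANNOTATION ANNOTATION_ID="a2" ANNOTATION_REF="a1">
        <ANNOTATION_VALUE></ANNOTATION_VALUE>
      </REF_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>"""


def empty_annotation_counts(root):
    """Map each tier ID to (total annotations, annotations with no text)."""
    counts = {}
    for tier in root.iter("TIER"):
        values = [(v.text or "").strip() for v in tier.iter("ANNOTATION_VALUE")]
        counts[tier.get("TIER_ID")] = (len(values), sum(1 for v in values if not v))
    return counts


root = ET.fromstring(SAMPLE_EAF)
print(empty_annotation_counts(root))  # {'Speech': (1, 0), 'Translation': (1, 1)}

# For a real file (hypothetical name): root = ET.parse("my_task.eaf").getroot()
```

A tier whose second number is greater than zero still has untranscribed annotations.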
