Part-of-Speech Tagging of Northern Sotho: Disambiguating Polysemous Function Words
1. Part-of-Speech tagging of Northern Sotho: Disambiguating polysemous function words. Gertrud Faaß, Ulrich Heid, Elsabé Taljard, DJ Prinsloo
5. Noun class system
Cl. no. | Class prefix (CP) | Example
1 / 2 | mo- / ba- | mosadi ‘woman’ / basadi ‘women’
1a / 2b | Ø- / bo- | malome ‘uncle’ / bomalome ‘uncle & co’
3 | mo- | monwana ‘finger’
4 | me- | menwana ‘fingers’
5 | le- | lebone ‘light’
6 | ma- | mabone ‘lights’
7 | se- | selepe ‘axe’
8 | di- | dilepe ‘axes’
9 | N- / Ø- | mpša ‘dog’ / hlogo ‘head’
10 | diN- / di- | dimpša ‘dogs’ / dihlogo ‘heads’
14 | bo- | bodulo ‘residence’
(6) | ma- | madulo ‘residences’
15 | go- | go ruta ‘to learn’
16 | fa- | fase ‘below’
17 | go- | godimo ‘above’
18 | mo- | morago ‘behind’
N- | N- / Ø- | ga-ntle ‘outside’ / pele ‘in front’
(24) | ga- | gare ‘middle’
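The table shows that several class prefixes have the same surface form in more than one class. As a minimal sketch (not the tagger described in this talk), the rows can be indexed by prefix to list the forms that are ambiguous on their own. The names `NOUN_CLASSES`, `classes_by_prefix` and `AMBIGUOUS` are hypothetical, and the sketch assumes a subset of the table: the rows labelled (6), N- and (24) are left out.

```python
from collections import defaultdict

# (class label, class prefix, example) rows taken from the noun class table.
# Assumption: the rows labelled (6), N- and (24) are omitted for simplicity.
NOUN_CLASSES = [
    ("1", "mo-", "mosadi"),
    ("2", "ba-", "basadi"),
    ("1a", "Ø-", "malome"),
    ("2b", "bo-", "bomalome"),
    ("3", "mo-", "monwana"),
    ("4", "me-", "menwana"),
    ("5", "le-", "lebone"),
    ("6", "ma-", "mabone"),
    ("7", "se-", "selepe"),
    ("8", "di-", "dilepe"),
    ("9", "N-", "mpša"),
    ("9", "Ø-", "hlogo"),
    ("10", "diN-", "dimpša"),
    ("10", "di-", "dihlogo"),
    ("14", "bo-", "bodulo"),
    ("15", "go-", "go ruta"),
    ("16", "fa-", "fase"),
    ("17", "go-", "godimo"),
    ("18", "mo-", "morago"),
]


def classes_by_prefix(rows):
    """Map each prefix form to the list of class labels that use it."""
    index = defaultdict(list)
    for label, prefix, _example in rows:
        index[prefix].append(label)
    return dict(index)


# Prefix forms that occur in more than one class cannot be resolved
# from the form alone.
AMBIGUOUS = {
    prefix: labels
    for prefix, labels in classes_by_prefix(NOUN_CLASSES).items()
    if len(labels) > 1
}

for prefix, labels in AMBIGUOUS.items():
    print(prefix, "->", ", ".join(labels))
```

In this subset, mo- is shared by classes 1, 3 and 18, and go- by classes 15 and 17, so the form alone does not identify the class.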
13. Descriptive State of the Art: tagsets and tools
Authors | No. of tags | Noun class yes/no | Tool?
Van Rooy and Pretorius (2003) | 106 | no | no
De Schryver and De Pauw (2007) | 56 | no | yes
Kotzé (several, e.g. 2008) | partial | no | yes
Taljard et al. (2008) | 141/262 | yes | no
This paper | 25/141 | yes | yes
22. Effects of size of training corpus: adding more training data is not necessary.