SlideShare une entreprise Scribd logo
1  sur  6
Télécharger pour lire hors ligne
Utilizing Agglutinative Features in Japanese-Uighur Machine Translation
           Muhtar MAHSUT, Yasuhiro OGAWA, Kazue SUGINO, Yasuyoshi INAGAKI
                                      Graduate School of Engineering, Nagoya University
                                                Furo-cho, Chikusa-ku, Nagoya
                                                           Japan
                                       {muhtar, yasuhiro}@inagaki.nuie.nagoya-u.ac.jp

                                                              Abstract
Japanese and Uighur languages are agglutinative languages and they have many syntactical and morphological similarities. And
roughly speaking, we can translate Japanese into Uighur sequentially by replacing Japanese words with corresponding Uighur ones
after morphological analysis. However, we should translate agglutinated suffixes carefully to make correct translation, because they
play important roles on both languages. In this paper, we pay attention to them and propose a Japanese-Uighur machine translation
utilizing the agglutinative features of both languages. To deal with the agglutinative features, we use the derivational grammar, which
makes the similarities clearer between both languages. This makes our system proposed here simple and systematical. We have
implemented the machine translation system and evaluated how effectively our system works.
                                                            Keywords
agglutinative languages, machine translation, derivational grammar, Japanese, Uighur
                                                               similarities between Japanese and Uighur through using
                     1 Introduction                            the derivational grammar.
Japanese and Uighur have many syntactical similarities,        In this paper, we propose a Japanese-Uighur machine
especially with respect to the word order in the sentence.     translation system based on the derivational grammar.
This observation suggests that we can translate Japanese       First, we explain the similarities between them and show
into Uighur in such a manner as word for word replacing        some illustrative examples of word-for-word translation.
after the morphological analysis of input sentences. In        As a matter of course, the fact that Japanese and Uighur
addition, both languages have morphological similarities,      are different languages does not allow us to apply the
which is a significant feature of agglutinative languages.     word-for-word translation in a straightforward way. In
However, there is a grammatical problem, that is, Uighur       fact, a straightforward word-for-word Japanese-Uighur
grammar says that Uighur has no conjugations, while            translation sometimes yields unnatural Uighur sentences,
traditional Japanese grammar, which is often called school     which is caused by wrong combination of Uighur
grammar, says that Japanese verbs do conjugate.                morphemes. We introduce an idea of morpheme
Therefore the word-for-word translation based on school        replacement to solve this problem. When unnatural
grammar needs to deal with verbal conjugations of              Uighur morpheme sequence occurs, we replace an
Japanese, in which the translation of verbal endings is        inappropriate Uighur morpheme with an appropriate one
very complicated to describe.                                  determined by the information concerning morphemes
On the other hand, it is said that agglutinative language      appearing before and after it. This leads us to succeed in
does not conjugate, and that verbal variants are derived       generating more natural Uighur sentences.
from invariable stems by mechanically appending affixes        Thus we have implemented the translation system by
to them. These two views contradict to each other.             using a Japanese morphological analyser MAJO (Ogawa
To resolve this problem, G. N. Kiyose proposed the             et al., 1998) which we have developed by the derivational
derivational grammar (Kiyose, 1995) based on the               grammar approach. MAJO originally keeps the dictionary
phonological approach originated by Bloch (1946). This         information as the list of 3-tuples <Japanese morpheme,
grammar claims that Japanese verbs do not conjugate and        part-of-speech, meaning>. For translation, we replace the
Japanese verbal variants are made up by appending              3-tuples with <Japanese morpheme, part-of-speech,
suffixes to verbal stems. Because verbal conjugations          Uighur morpheme >. As a result of it, outputs of the
need not to be considered, the derivational grammar            morphological analysis by using MAJO become word-for-
makes morphological analysis quite simple.                     word translation from Japanese to Uighur. Then we apply
Since the derivational grammar pays attention to               morpheme replacement to Uighur morphemes of MAJO’s
agglutinative features of languages, it is applicable to not   outputs and acquire a translated Uighur sentence.
only Japanese but also other agglutinative languages such      Our approach is not limited to Japanese-Uighur translation.
as Uighur. So applying the derivational grammar to the         We can apply our approach to translations between other
two languages, we can make not only syntactical but also       agglutinative languages        successively, since the
morphological similarities between them clearer. This          derivational grammar approach treats to agglutinative
approach also makes the translation simp ler and more          features of languages.
systematic.                                                    This paper is organized as follows: Agglutinative
Recently, there are many literatures that discuss              similarities between Japanese and Uighur with examples
translation between agglutinative languages such as            of translations between them are shown in Section 2.
Japanese and Korean (Kim et al., 1998; Lee et al., 1990).      Section 3 points out some problems of word-for-word
They use only syntactical similarities between them, but       translation. In Section 4, we introduce morpheme
they can not make use of morphological similarities. Our       replacement rules to solve them. We describe our
approach systematizes both syntactical and morphological       implementation of Japanese-Uighur translation system
karega tobirawo aketa                                 tobirawo karega aketa



           kare ga tobira wo             ake     ta              tobira wo        kare ga         ake   ta


             u      Ø      ixik    ni       aq   di                ixik    ni       u      Ø       aq   di




                            u ixikni aqdi                                         ixikni u aqdi


                            Figure 1: Word-for-word translation from Japanese to Uighur

based on the derivational grammar in Section 5. Finally,       This causes a serious problem for word-for-word
we evaluate our system in Section 6.                           translation.
                                                               We overcome this problem by using the derivational
  2 Agglutinative Similarities between                         grammar, which claims that Japanese verbs do not
         Japanese and Uighur                                   conjugate, and that appending suffixes to verbal stems
                                                               makes up Japanese verbal variants. We also apply the
2.1 Syntactical Similarities                                   derivational grammar to Uighur. This makes their
                                                               morphological similarities clear as well as the syntactical
Both Japanese and Uighur languages are agglutinative and       ones.
they often refer to such the languages as free word order
languages. In Japanese, for example, we can say “karega        2.2.1 Union consonant and union vowel
tobirawo aketa” as well as “tobirawo karega aketa”.            The invariable part of a verb is called a verbal stem. For
Both sentences mean “he opened the door” in English.           example, ‘tabe-’ in “tabe-ru” [eat] and ‘kak-’ in “kak-
What does allow such the word order changes keeping the        u” [write] are both verbal stems in Japanese. There are
equivalent meaning? The answer should be found in the          two sorts of verbal stems : the one ending with a vowel,
function of case suffices. The dependency relation of a        e.g. ‘tabe-’ and the one ending with a consonant, e.g.
noun to other words, that is the role which plays in a         ‘kak-’. We call the former a vowel stem and the latter a
sentence, is indicated by case suffixes. The Japanese case     consonant stem, respectively.
suffix ‘-ga’ indicates the subjective noun of a sentence.      A verbal phrase consists of a verbal stem and some verbal
That is, ‘-ga’ makes the nominative case. The case suffix      suffixes. The verbal suffixes of “tabe-ru” and “kak-u”
‘-wo’ makes the accusative case and it indicates that the      are ‘-ru ’ and ‘-u’, respectively. The verbal suffix ‘-ru’
prepositive noun is the object of the verb in a sentence. So   follows only a vowel stem and the suffix ‘-u’ follows only
case suffixes make it possible to understand sentences         a consonant stem. The derivational grammar says those
even if the position of “karega” and “tobirawo” are            suffixes are the environmental variants of the suffix
exchanged.                                                     ‘-(r)u’. The consonant ‘r’ appears when it is appended to
Uighur language has the same property. Uighur accusative       a vowel stem and disappears when it is appended to a
case marker is ‘-ni’ and it corresponds to Japanese case       consonant stem. A consonant of this kind is called a union
suffix ‘-wo’. But in Uighur language the nominative case       consonant.
is often indicated by the zero-form. We show it by ‘Ø’.        Verbal phrases “tabe-na-i” [do not eat] and “kak-ana-i”
This does not mean that Uighur language has no                 [do not write] represent negative actions. The negative
nominative case suffixes.                                      verbal suffixes of them are ‘-na-’ and ‘-ana-’,
Similar by to Japanese, Uighur case suffixes make it           respectively. Those suffixes are also the variants of the
possible to exchange word order with no change of              suffix ‘-(a)na-’. The vowel ‘a’ appears when it is
meaning. Thus, we can translate both Japanese sentences        appended to a consonant stem and disappears when it is
“karega tobirawo aketa” and “tobirawo karega                   appended to a vowel stem. A vowel of this kind is called a
aketa” into Uighur in the manner of word-for-word as           union vowel.
shown in Figure 1. Both of “u ixikni aqdi” and “ixikni u
aqdi” are natural Uighur sentences that mean “he opened
                                                                      meaning              Japanese           Uighur
the door”.
This observation means that the case suffixes play the          causative               -(s)ase-             -guz-
essential roles in Japanese and Uighur, and should be           passive                 -(r)are-             -(i)l-
treated carefully in Japanese-Uighur translation.               potential               -(r)e-               -(y)ala-
                                                                polite                  -(i)mas-                -
2.2 Morphological Similarities                                  negative                -(a)na-              -ma-
It has been considered that there is a morphological            desiderative            -(i)ta-              -gu-
difference between Japanese and Uighur. That is, although
Uighur has no verbal conjugations, the traditional
Japanese grammar says that Japanese verbs do conjugate.        Table 1: Derivational suffixes in Japanese and Uighur
form                       meaning                     Japanese            Uighur
                       finite        non-perfective                       -(r)u                -[i]du
                       form          perfective                           -(i)ta               -di
                     participle      non-perfective                       -(r)u                -[i]digan
                       form          perfective                           -(i)ta               -gan
                                     perfective copulative                -(i)te               -(i)p
                     converb         provisional conditional              -(r)eba              -sa
                       form          negative copulative                  -(a)zu               -mastin
                                     simultaneous                         -(i)nagara           -gaq
                    imperative       affirmative                          -e,-ro               -gin
                      form           negative                             -(r)una              -magin

                                  Table 2: Syntactical suffixes in Japanese and Uighur

The derivational grammar summarizes these discussions            In addition, there are some derivational suffixes other than
as the following two rules:                                      these in Uighur. For example, a Uighur suffix ‘-(i)x-’
                                                                 indicates cooperative meaning. If ‘kutra-’ [congratulate]
Connection rule 1: When the suffix beginning with a              follows ‘-(i)x-’, it forms ‘kutrax-’ [congratulate each
    union consonant is appended to a consonant stem,             other]. In Japanese similar meaning indicated by ‘-(i)aw-’,
    the union consonant disappears.                              which is not a single suffix but is combination of a
Connection rule 2: When the suffix beginning with a              syntactical suffix ‘-(i)’ and a verb ‘aw -’. However, for
    union vowel is appended to a vowel stem, the union           translation, Japanese ‘-(i)aw-’ matches Uighur
    vowel disappears.                                            derivational suffix ‘-(i)x-’, so we take ‘-(i)aw-’ as a
                                                                 derivational suffix. Similarly, we consider some
2.2.2 Derivational Suffixes and Syntactical Suffixes             compound morphemes as derivational suffixes, such as
As shown in the examples of the above section, the               ‘-(i)teir-’ and ‘-(i)tutuar-’. Both suffixes correspond to
negative verbal suffix ‘-(a)na-’ can be followed by other        Uighur suffix ‘-(i)wat -’.
verbal suffixes, for example, ‘-i’ in “kak-ana-i” [do not        Here, we should note that Uighur has another connection
write]. These examples show that the verbal suffix               rule. This shows a change of a phoneme instead of
appended to verbal stems derives a new stem, which is            disappearing. For example, when the syntactical suffix
called a secondary stem. A suffix that derives a new stem        ‘-[i]digan’, which means non-perfective participle,
is called a derivational suffix. For example, “kak-ana-”         follows a consonant stem ‘yaz-’, a phoneme ‘i’ appears
                                                                 and “yazidigan” is formed. But if it follows a vowel stem
is a secondary stem and ‘-(a)na-’ is a derivational suffix.
                                                                 ‘yasa-’, ‘i’ changes to ‘y’ and “yasaydigan” is formed.
Other verbal suffixes that do not derive new stems are
                                                                 We named such a phoneme a union half vowel. It varies
called syntactical suffixes. They make various verbal            according to the following rule:
forms, which are classified into 4 forms: fin ite, participle,
converb and imperative. For example, the suffix ‘-(r)u’ in       Connection rule 3: When a suffix beginning with a
“tabe-ru” is a syntactical suffix and it forms a finite form.        union half vowel is appended to a consonant stem,
Japanese verbal stems can be followed by several suffixes,           the union half vowel becomes ‘i’, otherwise it
but syntactical suffixes are always appended last. Note              becomes ‘y’.
that Japanese has strict rules for order of verbal suffixes,
although the word order in Japanese sentence is rather           3 Problems for Japanese-Uighur Translation
flexible.                                                        There are m  any similarities between Japanese and Uighur
                                                                 but there are also some linguistic differences between
2.2.3 Applying the derivational grammar to Uighur                them. Those differences cause serious problems to the
Uighur verbal phrases have the similar features to               word-for-word translation. In this section, some examples
Japanese. We can also apply the derivational grammar to          illustrate the problems. Our solutions to these problems
Uighur language. A Uighur passive derivational suffix is         are descried in section 4.
‘-(i)l-’, where ‘(i)’ refers a union vowel. When ‘-(i)l-’
follows a vowel stem ‘yasa-’ [create], a union vowel ‘(i)’
                                                                 3.1 Problems of verbal suffixes
disappears and ‘yasal-’ [be created] yields.
Let's consider a Uighur verbal stem ‘ az-’ [write]. In
                                          y                      Table 2 shows the correspondences between Japanese and
order to express a causative meaning, ‘yaz-’ is followed         Uighur syntactical suffixes. As you see, Japanese
by a causative suffix ‘-guz-’ and forms a new stem               participle suffixes are the same form as finite suffixes
‘yazguz-’ [cause to write], that is, ‘-guz-’ is a                while Uighur ones are not. For example, Japanese
derivational suffix. In addition, ‘-di’ completes the verbal     perfective syntactical suffix ‘-(i)ta’ is used for both finite
phrase “yazguzdi” by appending to the new stem. Both             and participle forms. In Uighur, however, they are
Japanese and Uighur have several derivational suffixes           different forms as shown in Figure 2. For this reason, to
and the correspondence between them is summarized in             translate ‘-(i)ta’, we have to decide which of ‘-di’ or
Table 2.                                                         ‘-gan’ is the correct translation.
Finite form                                        Participle form

                        karega tobirawo aketa.                                     karega aketa tobira


              kare     ga     tobira wo       ake   ta     .                kare     ga      ake     ta   tobira



                u      Ø        ixik    ni    aq    di     .                 u        Ø      aq    -gan    ixik



                                 u ixikni aqdi.                                           u aqgan ixik
                            [he opened the door.]                                [the door that he opened]


                                 Figure 2: The difference about finite form and participle form

                                                                “köwrük din ötidu”. Japanese case suffix ‘wo’ is
3.2 Problems of case suffixes                                   translated into ‘din’. The suffix ‘w o’ usually indicates the
We have already explained that for Japanese and Uighur          object, but sometimes it indicates the place, while Uighur
case suffixes specify the role of noun phrases in the           case suffix ‘ni’ and ‘din’ indicates the object and the place,
sentences . This fact allows the flexible word order in the     respectively. So we have to choose ‘ni’ or ‘din’ according
sentences of these languages. But there does not exist one-     to the role of ‘  wo’ in the sentence. In a fact, the case
to-one correspondence between case suffixes of Japanese         suffix ‘wo’ that indicates a place often appears together
and those of Uighur. For example, “gomiwo suteru” is            with such the motional verbs like “tooru ” [pass] and
translated into “éhlétni tökidu”, where Japanese case           “wataru” [cross]. This suggests we can choose ‘ or      ni’
suffix ‘wo’ is translated into Uighur case suffix ‘ni’. On      ‘din’ according to the verbs which the noun phrases with
the other hand, the translation of “hasiwo wataru” is           ‘wo’ depends on.
                                                                We have investigated the ambiguities of case suffixes by
                                                                using IPA Lexicon of the Japanese Language for
Japanese                                                        Computers (basic Japanese verbs) (IPAL, 1987), which
  suffix translated Uighur suffixes and its number total        contains 861 Japanese verbs and 3473 example sentences.
             Ø        ning     gé                               We have translated the Japanese case suffixes that
  ga       3637          77     5                        3719   appeared in the sentences into Uighur ones and counted
          97.7%       2.0%  0.3%                                the number of them. Table 3 shows the result. ‘Ø/ni’ in
             ni       Ø/ni    din others fault                  the second row means that ‘wo’ in the sentence is
  wo       1566         320   118     57   47            2108   translated in ‘ni’ but it is usually omitted in the
          74.3%      15.2%  5.6% 2.7% 2.2%                      corresponding Uighur sentences. Some of example
             gé          dé   din    Ø fault                    sentences can not be translated directly and we classified
   ni      1183         294    81     16   35            1609   such cases as faults.
          73.5%      18.3%  5.0% 1.0% 2.2%                      We do not deal with Japanese case suffix ‘no’, which is
             dé       bilén                                     usually translated Uighur case suffix ‘ning’, because ‘no’
   de        638         14                               652   is usually used in such a way “A no B” [B of A] and it is
          97.8%       2.2%                                      related little to verbs. We do not deal with Japanese case
            dép      Ø/dép  bilén                               suffix ‘ya’, too, because of the same reason.
   to        193         46   146                         385
          50.1%      12.0% 37.9%                                                   4 Suffix Adjustment
             din      fault                                     Now let us proceed to our discussions on realization of
 kara        289          5                               294   word-for-word translation from Japanese to Uighur. The
          98.3%       1.7%                                      facts we revealed so far show that the problems to be
              gé                                                solved here is how to decide verbal or case suffix
   he        236                                          236   correspondences correctly. To overcome these problems,
          100%                                                  we adopt a method to assign the default Uighur suffix to
             din                                                each Japanese suffix and then to substitute a well fitted
 yori         14                                           14   suffix for an unnatural one under replacement rules. Since
                                                                a verbal stem and a following verbal suffix affect each
          100%
                                                                other, we can choose an appropriate suffix by knowing the
Table 3: Statistical correspondence between Japanese            right and left words. On the other hand, the verb which the
                                                                noun phrase with a case suffix depends on affects the
                and Uighur case suffixes
                                                                suffix. So we need to decide the correct case suffix
Japanese        default         prepositive                postpositive                 new        new parts
           suffix        suffix             word                       word                     suffix      -of-speech
                                               *              end of sentence                   -ydu        finite suffix
           -(r)u         -ydigan               *             punctuation mark                   -ydu        finite suffix
                                               *          sentence-final particle               -ydu        finite suffix
                                               *                 end of sentence                 -di        finite suffix
           -(i)ta         -gan                 *                punctuation mark                 -di        finite suffix
                                               *              sentence-final particle            -di        finite suffix

                                       Table 4: Replacement table of verbal suffixes

considering the verbs depended. Thus, we propose two               ‘ni’ is more popular than the locative one. But the locative
types of rules for suffix replacement.                             case ‘ni’ is less related to verbs than the dative one. So we
                                                                   translate ‘ni’ to ‘dé’ as default and it is replaced with ‘gé’
4.1 Replacement Table of Verbal Suffixes                           if the verb needs an indirect object.
For verbal suffixes, we make a replacement rule shown in           In addition, we consider not ‘dép’ but ‘bilén’ as a default
Table 4. A Japanese suffix in the left most columns is not         Uighur suffix for Japanese suffix ‘to’, which has two
used for the replacement but for the purpose of helping to         meanings. One is to change the preceding sentence to a
understand. A default suffix in the second column is a             quotative clause and it corresponds to ‘dép’. Another is
Uighur morpheme , which is translated directly from the            called the comitative case, which expresses a co-operant
corresponding Japanese suffix in the first column. A               or accompanist, and it corresponds to ‘bilén’. Traditional
prepositive word and a postpositive word represent a               Japanese grammar says that both designations of ‘to’ are
condition for replacement of the default suffix. If the            case suffixes. But the derivational grammar treats the
default suffix appears between the prepositive and                 former as a conjunctive particle and only the latter as a
postpositive words, it should be replaced by a new suffix          case suffix. So we consider it is not the problem of
in the fifth column. The mark *(don't care) denotes that a         translation but the one of morphological analysis to
replacement rule does not require a prepositive or                 distinguish two type of ‘to’. We use the morphological
postpositive word. Since words of a sentence are tagged            analyzer that tags only the latter as a case suffix. So we
parts of speech by the morphological analysis, we need a           decide ‘bilén’ as a default for case suffix ‘to’.
part of speech of the new suffix in the right most columns.
                                                                     5 Machine translation from Japanese into
4.2 Replacement Rule for Case Suffixes                                               Uighur
There are often several phrases between a noun phrase              We have implemented the Japanese-Uighur machine
and a verbal phrase on which it depends. So we can not             translation system. Our system consists of four modules:
use a replacement rules similar to above and need another          MAJO, the two replacement modules and the morpheme
type of rules for case suffixes. We use dependency                 connection system. MAJO is a morphological analyser of
relation and add the case pattern data to the Uighur verb,         Japanese based on the derivational grammar, and its
such as <öt-{wo/-din}>. The Uighur verb ‘öt-’ has a pair           dictionary consists of 3-tuples <Japanese morpheme, part-
of Japanese case suffix and Uighur one and it is a                 of-speech, meaning>. For the translation, we replaced the
replacement rule for case suffixes. In translation, if a           3-tuples with <Japanese morpheme, part-of-speech,
Uighur verb that has such a pair appears in the sentence,          Uighur morpheme >. Therefore outputs of MAJO become
the translation system searches the noun phrase depending          word-for-word translations for input sentences.
on the verb and having the Japanese suffix contained in            Here, a Japanese input sentence is “tukutta hasiwo
the pair. Then the old Uighur suffix is replaced by the new        watatta.” [crossed the bridge that one constructed.]
one that is in the pair.                                           shown in figure 3. Firstly, MAJO divides it into Japanese
For each Japanese case suffix, we assume the most                  words and yields a sequence of equivalent Uighur words.
possible one determined by Table 3 as a default Uighur             Secondly, replacement rules of verbal suffixes are applied
suffix. But, exceptionally, we adopt ‘dé’ as a default             to those Uighur suffixes if they match the conditions on
Uighur suffix for a Japanese suffix ‘ni’. The reason is the        the replacement table. In the example, there are two ‘-tta’
following. The Japanese case suffix ‘ni’ often indicates a
                                                                   in the input sentence and MAJO translates both into
position of space or time and it is said to represent the
                                                                   ‘-gan’. At this point, last ‘-gan’ matches the replacement
locative case. In such a case ‘ni’ should be translated
Uighur suffix ‘dé’ which designates the locative case. On          rule and it is replaced with ‘-di’. Thirdly, case suffixes are
the other hand, ‘ni’ also denotes the dative case and              replaced if the verbs that they depend have the
corresponds to ‘gé’. Table 3 shows that the dative case            replacement rules satisfied the condition. In the example,


               Japanese case suffix        ga      wo         ni      de        to      kara           he   yori
               default Uighur suffix       Ø       ni      dé         dé       bilén      din          gé    din

                                       Table 5: Default translation for case suffixes
input sentence                                tukutta hasiwo watatta.


             Morphological analysis

                                             tukur-      -ta        hasi       -wo       watar-      -ta      .
           Word-for-word translation

                                              sal-       -gan      köwrük       -ni         öt-      -gan     .
                 Replacement 1
                                              sal-       -gan      köwrük       -ni         öt-       -di     .
                 Replacement 2
                                              sal-       -gan      köwrük      -din         öt-       -di     .
             Morpheme connection


                output sentence                                    salgan köwrükdin ötdi.

                            Figure 3: Translation system and an example of translation

the first step translates ‘wo’ into ‘ ni’. But the verb
                          -         -                           develop a word selection method. The replacement table
‘watar’ on which the noun phrase “hasiwo” depends has           of our system describe only suffixes and parts of speech.
a replacement rule <watar, consonant-verb, öt-{wo/              If we expand it to word meanings, we could solve
-din}>. So ‘-ni’ is replaced with ‘-din’. Finally, the          ambiguities of word meaning.
morpheme      connection system connects Uighur                 We have discussed only case suffixes, but there are other
morphemes according to the connection rules and                 suffixes in Japanese, like ‘mo’ and ‘sika’. We know
generates a Uighur output sentence.                             some of them correspond to Uighur suffixes. For example,
                                                                Uighur suffix ‘mu’ has similar role of Japanese ‘mo’. So
                    5 Experiments                               we should investigate their correspondence further.
We used 136 sentences that include 306 verbal phrases           We are now making bigger size of experiments of our
(254 different patterns) to evaluate performance of our         translation system and collecting m replacement rules.
                                                                                                    ore
Japanese-Uighur machine translation system. We                  We are aiming to make our system fit for practical use.
constructed a Japanese-Uighur dictionary that had about
13,000 words including 3,800 verbs. We compared                 Bibliographical References
translation results between the system with/without the         Bloch, B. (1946). Studies in Colloquial Japanese, Part I,
replacement rules for verbal suffixes. As a result, the           Inflection, Journals of the American Oriental Society
system translated 119 verbal phrases correctly without the        66.
replacement rules, while the system with the replacement        IPA L (1987). IPA Lexicon of the Japanese Language for
rules translated 212 verbal phrases correctly. Thus, the          Computers (Basic Verbs), The Information-technology
precisions of translation about verbal phrases improved           Promotion Agency.
from 38.9% to 69.3%.                                            Kim, J. and Lee, J. and Lee, G. (1998). Generation of
We also evaluate translation of 295 case suffixes appeared        Korean Predicates Based on Modality-Feature Ordering
the 136 sentences. The simple system translated 257               and Lexicalizing Table in Japanese-Korean Machine
suffixes correctly, while our system translates 293 case          Translation, Journal of Natural Language Processing
suffixes correctly. The precisions of translation for case        5(2), pp.3-24.
suffixes improved from 87.1% to 99.3%.                          Kiyose, G. N. (1995). Japanese grammar --A new
                                                                  approach--, Kyoto university press.
                     6 Conclusion                               Lee, H. and Nakajima, M. and Agui, T. (1990). A
                                                                  Japanese-Korean Predicate Translation System Using
In this paper, we proposed a Japanese-Uighur translation          the Semantic Correspondence between Their Auxiliary
system. Our system is based on the derivational grammar           Predicative Expressions, Transactions of Information
and has succeeded in systematic word-for-word                     Processing Society of Japan 31(6), pp.801-809.
translation. In addition, it can generate a natural Uighur      Ogawa, Y. and Muhtar and Toyama, K. and Inagaki, Y.
sentence by using replacement rules.                              (1998).     Derivational   Grammar      Approach     to
Our system has two modules for replacements and now               Morphological Analysis of Japanese Sentences,
we are going to unify them. In addition, it does not take         PRICAI'98, LNAI 1531, pp. 424-435.
account of amb iguities of word meaning. For example,
Japanese syntactical suffix ‘-(r)are-’ has three meanings,
which are passive, potential and honorific, but we now
consider it only as a passive suffix. Therefore, we need to

Contenu connexe

Tendances

Caplow Tibetan Prosody
Caplow Tibetan ProsodyCaplow Tibetan Prosody
Caplow Tibetan Prosodyktolbert
 
AUTO CORRECTION OF SETSWANA REAL-WORD ERRORS
AUTO CORRECTION OF SETSWANA REAL-WORD ERRORSAUTO CORRECTION OF SETSWANA REAL-WORD ERRORS
AUTO CORRECTION OF SETSWANA REAL-WORD ERRORSijnlc
 
Principles And Parameter Of Universal Grammar
Principles And Parameter Of Universal GrammarPrinciples And Parameter Of Universal Grammar
Principles And Parameter Of Universal GrammarDr. Cupid Lucid
 
Assignment on morphology
Assignment on morphologyAssignment on morphology
Assignment on morphologyLinda Midy
 
Sadia Qamar (Assignment)
Sadia Qamar (Assignment)Sadia Qamar (Assignment)
Sadia Qamar (Assignment)Dr. Cupid Lucid
 
Translation Techniques from English into Romanian and Russin
Translation  Techniques from English into Romanian and RussinTranslation  Techniques from English into Romanian and Russin
Translation Techniques from English into Romanian and RussinElena Shapa
 
Untranslatability in translation
Untranslatability in translation Untranslatability in translation
Untranslatability in translation Mohsine Mahraj
 
morphology and syntax
morphology and syntaxmorphology and syntax
morphology and syntaxguest39e240
 
Translatioin manual - dr.enani
Translatioin manual - dr.enaniTranslatioin manual - dr.enani
Translatioin manual - dr.enaniAST-School
 
Anaphors and Pronominals in Tiv: Government-Binding Approach
Anaphors and Pronominals in Tiv: Government-Binding ApproachAnaphors and Pronominals in Tiv: Government-Binding Approach
Anaphors and Pronominals in Tiv: Government-Binding ApproachIsaac Kuna
 
Lecture 05 sonia
Lecture 05 soniaLecture 05 sonia
Lecture 05 soniasonianad
 

Tendances (20)

Caplow Tibetan Prosody
Caplow Tibetan ProsodyCaplow Tibetan Prosody
Caplow Tibetan Prosody
 
AUTO CORRECTION OF SETSWANA REAL-WORD ERRORS
AUTO CORRECTION OF SETSWANA REAL-WORD ERRORSAUTO CORRECTION OF SETSWANA REAL-WORD ERRORS
AUTO CORRECTION OF SETSWANA REAL-WORD ERRORS
 
Principles And Parameter Of Universal Grammar
Principles And Parameter Of Universal GrammarPrinciples And Parameter Of Universal Grammar
Principles And Parameter Of Universal Grammar
 
Assignment on morphology
Assignment on morphologyAssignment on morphology
Assignment on morphology
 
Syntax lecture
Syntax lectureSyntax lecture
Syntax lecture
 
1 2-1 grammar
1 2-1 grammar1 2-1 grammar
1 2-1 grammar
 
Sadia Qamar (Assignment)
Sadia Qamar (Assignment)Sadia Qamar (Assignment)
Sadia Qamar (Assignment)
 
N410197100
N410197100N410197100
N410197100
 
Translation Techniques from English into Romanian and Russin
Translation  Techniques from English into Romanian and RussinTranslation  Techniques from English into Romanian and Russin
Translation Techniques from English into Romanian and Russin
 
Untranslatability in translation
Untranslatability in translation Untranslatability in translation
Untranslatability in translation
 
morphology and syntax
morphology and syntaxmorphology and syntax
morphology and syntax
 
D0321023027
D0321023027D0321023027
D0321023027
 
80% of Qur'anic Words (Classified word lists for easy memorization)
80% of Qur'anic Words (Classified word lists for easy memorization)80% of Qur'anic Words (Classified word lists for easy memorization)
80% of Qur'anic Words (Classified word lists for easy memorization)
 
Translatioin manual - dr.enani
Translatioin manual - dr.enaniTranslatioin manual - dr.enani
Translatioin manual - dr.enani
 
Anaphors and Pronominals in Tiv: Government-Binding Approach
Anaphors and Pronominals in Tiv: Government-Binding ApproachAnaphors and Pronominals in Tiv: Government-Binding Approach
Anaphors and Pronominals in Tiv: Government-Binding Approach
 
Words and translation
Words and translationWords and translation
Words and translation
 
Writing
WritingWriting
Writing
 
Syntax turn paper
Syntax turn paperSyntax turn paper
Syntax turn paper
 
Lecture 05 sonia
Lecture 05 soniaLecture 05 sonia
Lecture 05 sonia
 
Additive morpheme
Additive morphemeAdditive morpheme
Additive morpheme
 

En vedette

Msft Top10 Business Practicesfor Es Data Centers April09
Msft Top10 Business Practicesfor Es Data Centers April09Msft Top10 Business Practicesfor Es Data Centers April09
Msft Top10 Business Practicesfor Es Data Centers April09hutuworm
 
Amazon的分布式Key Value存储系统(Dynamo)的论文
Amazon的分布式Key Value存储系统(Dynamo)的论文Amazon的分布式Key Value存储系统(Dynamo)的论文
Amazon的分布式Key Value存储系统(Dynamo)的论文yiditushe
 
en_The_Amazing_Quran
en_The_Amazing_Quranen_The_Amazing_Quran
en_The_Amazing_Qurantughchi
 
Happiness Is A Voyage
Happiness Is A VoyageHappiness Is A Voyage
Happiness Is A Voyagefoonkok
 
Nce2文本
Nce2文本Nce2文本
Nce2文本Qu Yi
 
3 Character Education Workshop
3 Character Education Workshop3 Character Education Workshop
3 Character Education Workshopwctang
 
organic information design
organic information designorganic information design
organic information designrosa qin
 

En vedette (8)

Msft Top10 Business Practicesfor Es Data Centers April09
Msft Top10 Business Practicesfor Es Data Centers April09Msft Top10 Business Practicesfor Es Data Centers April09
Msft Top10 Business Practicesfor Es Data Centers April09
 
Amazon的分布式Key Value存储系统(Dynamo)的论文
Amazon的分布式Key Value存储系统(Dynamo)的论文Amazon的分布式Key Value存储系统(Dynamo)的论文
Amazon的分布式Key Value存储系统(Dynamo)的论文
 
en_The_Amazing_Quran
en_The_Amazing_Quranen_The_Amazing_Quran
en_The_Amazing_Quran
 
Happiness Is A Voyage
Happiness Is A VoyageHappiness Is A Voyage
Happiness Is A Voyage
 
Nce2文本
Nce2文本Nce2文本
Nce2文本
 
3 Character Education Workshop
3 Character Education Workshop3 Character Education Workshop
3 Character Education Workshop
 
organic information design
organic information designorganic information design
organic information design
 
noruz
noruznoruz
noruz
 

Similaire à MTS-2001-Mahsut

All About Japanese Verbs
All About Japanese VerbsAll About Japanese Verbs
All About Japanese VerbsKireinaNeko
 
Japanese esl
Japanese eslJapanese esl
Japanese eslRyguy786
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...kevig
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...kevig
 
A Comparison of the Use of Adjectives and Onomatopoeic Words for Describing E...
A Comparison of the Use of Adjectives and Onomatopoeic Words for Describing E...A Comparison of the Use of Adjectives and Onomatopoeic Words for Describing E...
A Comparison of the Use of Adjectives and Onomatopoeic Words for Describing E...Tim Schwartz
 
Mastery in Japanese Conjunctions among Indonesian Learners of Japanese
Mastery in Japanese Conjunctions among Indonesian Learners of JapaneseMastery in Japanese Conjunctions among Indonesian Learners of Japanese
Mastery in Japanese Conjunctions among Indonesian Learners of JapaneseQUESTJOURNAL
 
Passive voice
Passive voicePassive voice
Passive voiceSaudiStar
 
lesson 1 syntax 2021.pdf
lesson 1 syntax 2021.pdflesson 1 syntax 2021.pdf
lesson 1 syntax 2021.pdfMOHAMEDAADDI
 
M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...
M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...
M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...kevig
 
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...kevig
 
A Comparative Study of Light Verbs in Tamil and Tulu
A Comparative Study of Light Verbs in Tamil and TuluA Comparative Study of Light Verbs in Tamil and Tulu
A Comparative Study of Light Verbs in Tamil and Tuluijtsrd
 

Similaire à MTS-2001-Mahsut (20)

All About Japanese Verbs
All About Japanese VerbsAll About Japanese Verbs
All About Japanese Verbs
 
Japanese esl
Japanese eslJapanese esl
Japanese esl
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
 
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
EXTRACTING LINGUISTIC SPEECH PATTERNS OF JAPANESE FICTIONAL CHARACTERS USING ...
 
Nonline japan
Nonline japanNonline japan
Nonline japan
 
Nonline japan
Nonline japanNonline japan
Nonline japan
 
Nonline japan
Nonline japanNonline japan
Nonline japan
 
Beginner Japanese
Beginner JapaneseBeginner Japanese
Beginner Japanese
 
Nonline japan
Nonline japanNonline japan
Nonline japan
 
Nonline japan
Nonline japanNonline japan
Nonline japan
 
A Comparison of the Use of Adjectives and Onomatopoeic Words for Describing E...
A Comparison of the Use of Adjectives and Onomatopoeic Words for Describing E...A Comparison of the Use of Adjectives and Onomatopoeic Words for Describing E...
A Comparison of the Use of Adjectives and Onomatopoeic Words for Describing E...
 
Mastery in Japanese Conjunctions among Indonesian Learners of Japanese
Mastery in Japanese Conjunctions among Indonesian Learners of JapaneseMastery in Japanese Conjunctions among Indonesian Learners of Japanese
Mastery in Japanese Conjunctions among Indonesian Learners of Japanese
 
Nonline japan
Nonline japanNonline japan
Nonline japan
 
Japanese Beginner
Japanese BeginnerJapanese Beginner
Japanese Beginner
 
Nonline japan
Nonline japanNonline japan
Nonline japan
 
Passive voice
Passive voicePassive voice
Passive voice
 
lesson 1 syntax 2021.pdf
lesson 1 syntax 2021.pdflesson 1 syntax 2021.pdf
lesson 1 syntax 2021.pdf
 
M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...
M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...
M ORPHOLOGICAL A NALYZER U SING THE B I - LSTM M ODEL O NLY FOR JAPANESE H IR...
 
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
MORPHOLOGICAL ANALYZER USING THE BILSTM MODEL ONLY FOR JAPANESE HIRAGANA SENT...
 
A Comparative Study of Light Verbs in Tamil and Tulu
A Comparative Study of Light Verbs in Tamil and TuluA Comparative Study of Light Verbs in Tamil and Tulu
A Comparative Study of Light Verbs in Tamil and Tulu
 

Plus de tughchi

Resume_Mehmud
Resume_MehmudResume_Mehmud
Resume_Mehmudtughchi
 
Prayer Profile - Uighur
Prayer Profile - UighurPrayer Profile - Uighur
Prayer Profile - Uighurtughchi
 
ug_ozumge_xitap
ug_ozumge_xitapug_ozumge_xitap
ug_ozumge_xitaptughchi
 
Microsoft Word - mso4F
Microsoft Word - mso4FMicrosoft Word - mso4F
Microsoft Word - mso4Ftughchi
 
hon-TohtiTunyaz
hon-TohtiTunyazhon-TohtiTunyaz
hon-TohtiTunyaztughchi
 
9- oghuz 60-62
9- oghuz 60-629- oghuz 60-62
9- oghuz 60-62tughchi
 
Uyghurs And Nowruz
Uyghurs And NowruzUyghurs And Nowruz
Uyghurs And Nowruztughchi
 
parchilar-ijadiyet
parchilar-ijadiyetparchilar-ijadiyet
parchilar-ijadiyettughchi
 
Ghalip_barat_ark eserliri_1
Ghalip_barat_ark eserliri_1Ghalip_barat_ark eserliri_1
Ghalip_barat_ark eserliri_1tughchi
 
_ghalib ark eserliri
_ghalib ark  eserliri_ghalib ark  eserliri
_ghalib ark eserliritughchi
 
adil tuniyaz dastanliri
adil tuniyaz dastanliriadil tuniyaz dastanliri
adil tuniyaz dastanliritughchi
 
15句让女生甜到晕的情话_doc
15句让女生甜到晕的情话_doc15句让女生甜到晕的情话_doc
15句让女生甜到晕的情话_doctughchi
 
ug_iman_ajizliqining_sawapliri
ug_iman_ajizliqining_sawapliriug_iman_ajizliqining_sawapliri
ug_iman_ajizliqining_sawapliritughchi
 
en_The_Prohibition_of_Music
en_The_Prohibition_of_Musicen_The_Prohibition_of_Music
en_The_Prohibition_of_Musictughchi
 
tb51pisk
tb51pisktb51pisk
tb51pisktughchi
 
ug_quran_sunnette_maxtalghan_ilimlar
ug_quran_sunnette_maxtalghan_ilimlarug_quran_sunnette_maxtalghan_ilimlar
ug_quran_sunnette_maxtalghan_ilimlartughchi
 
Bir_Oqung
Bir_OqungBir_Oqung
Bir_Oqungtughchi
 
men bir dixqan
men bir dixqanmen bir dixqan
men bir dixqantughchi
 

Plus de tughchi (20)

Resume_Mehmud
Resume_MehmudResume_Mehmud
Resume_Mehmud
 
rom1_ug
rom1_ugrom1_ug
rom1_ug
 
Prayer Profile - Uighur
Prayer Profile - UighurPrayer Profile - Uighur
Prayer Profile - Uighur
 
ug_ozumge_xitap
ug_ozumge_xitapug_ozumge_xitap
ug_ozumge_xitap
 
Microsoft Word - mso4F
Microsoft Word - mso4FMicrosoft Word - mso4F
Microsoft Word - mso4F
 
hon-TohtiTunyaz
hon-TohtiTunyazhon-TohtiTunyaz
hon-TohtiTunyaz
 
9- oghuz 60-62
9- oghuz 60-629- oghuz 60-62
9- oghuz 60-62
 
Uyghurs And Nowruz
Uyghurs And NowruzUyghurs And Nowruz
Uyghurs And Nowruz
 
parchilar-ijadiyet
parchilar-ijadiyetparchilar-ijadiyet
parchilar-ijadiyet
 
Ghalip_barat_ark eserliri_1
Ghalip_barat_ark eserliri_1Ghalip_barat_ark eserliri_1
Ghalip_barat_ark eserliri_1
 
_ghalib ark eserliri
_ghalib ark  eserliri_ghalib ark  eserliri
_ghalib ark eserliri
 
adil tuniyaz dastanliri
adil tuniyaz dastanliriadil tuniyaz dastanliri
adil tuniyaz dastanliri
 
15句让女生甜到晕的情话_doc
15句让女生甜到晕的情话_doc15句让女生甜到晕的情话_doc
15句让女生甜到晕的情话_doc
 
ug_iman_ajizliqining_sawapliri
ug_iman_ajizliqining_sawapliriug_iman_ajizliqining_sawapliri
ug_iman_ajizliqining_sawapliri
 
en_The_Prohibition_of_Music
en_The_Prohibition_of_Musicen_The_Prohibition_of_Music
en_The_Prohibition_of_Music
 
beller
bellerbeller
beller
 
tb51pisk
tb51pisktb51pisk
tb51pisk
 
ug_quran_sunnette_maxtalghan_ilimlar
ug_quran_sunnette_maxtalghan_ilimlarug_quran_sunnette_maxtalghan_ilimlar
ug_quran_sunnette_maxtalghan_ilimlar
 
Bir_Oqung
Bir_OqungBir_Oqung
Bir_Oqung
 
men bir dixqan
men bir dixqanmen bir dixqan
men bir dixqan
 

Dernier

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 

Dernier (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 

MTS-2001-Mahsut

  • 1. Utilizing Agglutinative Features in Japanese-Uighur Machine Translation Muhtar MAHSUT, Yasuhiro OGAWA, Kazue SUGINO, Yasuyoshi INAGAKI Graduate School of Engineering, Nagoya University Furo-cho, Chikusa-ku, Nagoya Japan {muhtar, yasuhiro}@inagaki.nuie.nagoya-u.ac.jp Abstract Japanese and Uighur languages are agglutinative languages and they have many syntactical and morphological similarities. And roughly speaking, we can translate Japanese into Uighur sequentially by replacing Japanese words with corresponding Uighur ones after morphological analysis. However, we should translate agglutinated suffixes carefully to make correct translation, because they play important roles on both languages. In this paper, we pay attention to them and propose a Japanese-Uighur machine translation utilizing the agglutinative features of both languages. To deal with the agglutinative features, we use the derivational grammar, which makes the similarities clearer between both languages. This makes our system proposed here simple and systematical. We have implemented the machine translation system and evaluated how effectively our system works. Keywords agglutinative languages, machine translation, derivational grammar, Japanese, Uighur similarities between Japanese and Uighur through using 1 Introduction the derivational grammar. Japanese and Uighur have many syntactical similarities, In this paper, we propose a Japanese-Uighur machine especially with respect to the word order in the sentence. translation system based on the derivational grammar. This observation suggests that we can translate Japanese First, we explain the similarities between them and show into Uighur in such a manner as word for word replacing some illustrative examples of word-for-word translation. after the morphological analysis of input sentences. In As a matter of course, the fact that Japanese and Uighur addition, both languages have morphological similarities, are different languages does not allow us to apply the which is a significant feature of agglutinative languages. word-for-word translation in a straightforward way. In However, there is a grammatical problem, that is, Uighur fact, a straightforward word-for-word Japanese-Uighur grammar says that Uighur has no conjugations, while translation sometimes yields unnatural Uighur sentences, traditional Japanese grammar, which is often called school which is caused by wrong combination of Uighur grammar, says that Japanese verbs do conjugate. morphemes. We introduce an idea of morpheme Therefore the word-for-word translation based on school replacement to solve this problem. When unnatural grammar needs to deal with verbal conjugations of Uighur morpheme sequence occurs, we replace an Japanese, in which the translation of verbal endings is inappropriate Uighur morpheme with an appropriate one very complicated to describe. determined by the information concerning morphemes On the other hand, it is said that agglutinative language appearing before and after it. This leads us to succeed in does not conjugate, and that verbal variants are derived generating more natural Uighur sentences. from invariable stems by mechanically appending affixes Thus we have implemented the translation system by to them. These two views contradict to each other. using a Japanese morphological analyser MAJO (Ogawa To resolve this problem, G. N. Kiyose proposed the et al., 1998) which we have developed by the derivational derivational grammar (Kiyose, 1995) based on the grammar approach. MAJO originally keeps the dictionary phonological approach originated by Bloch (1946). This information as the list of 3-tuples <Japanese morpheme, grammar claims that Japanese verbs do not conjugate and part-of-speech, meaning>. For translation, we replace the Japanese verbal variants are made up by appending 3-tuples with <Japanese morpheme, part-of-speech, suffixes to verbal stems. Because verbal conjugations Uighur morpheme >. As a result of it, outputs of the need not to be considered, the derivational grammar morphological analysis by using MAJO become word-for- makes morphological analysis quite simple. word translation from Japanese to Uighur. Then we apply Since the derivational grammar pays attention to morpheme replacement to Uighur morphemes of MAJO’s agglutinative features of languages, it is applicable to not outputs and acquire a translated Uighur sentence. only Japanese but also other agglutinative languages such Our approach is not limited to Japanese-Uighur translation. as Uighur. So applying the derivational grammar to the We can apply our approach to translations between other two languages, we can make not only syntactical but also agglutinative languages successively, since the morphological similarities between them clearer. This derivational grammar approach treats to agglutinative approach also makes the translation simp ler and more features of languages. systematic. This paper is organized as follows: Agglutinative Recently, there are many literatures that discuss similarities between Japanese and Uighur with examples translation between agglutinative languages such as of translations between them are shown in Section 2. Japanese and Korean (Kim et al., 1998; Lee et al., 1990). Section 3 points out some problems of word-for-word They use only syntactical similarities between them, but translation. In Section 4, we introduce morpheme they can not make use of morphological similarities. Our replacement rules to solve them. We describe our approach systematizes both syntactical and morphological implementation of Japanese-Uighur translation system
  • 2. karega tobirawo aketa tobirawo karega aketa kare ga tobira wo ake ta tobira wo kare ga ake ta u Ø ixik ni aq di ixik ni u Ø aq di u ixikni aqdi ixikni u aqdi Figure 1: Word-for-word translation from Japanese to Uighur based on the derivational grammar in Section 5. Finally, This causes a serious problem for word-for-word we evaluate our system in Section 6. translation. We overcome this problem by using the derivational 2 Agglutinative Similarities between grammar, which claims that Japanese verbs do not Japanese and Uighur conjugate, and that appending suffixes to verbal stems makes up Japanese verbal variants. We also apply the 2.1 Syntactical Similarities derivational grammar to Uighur. This makes their morphological similarities clear as well as the syntactical Both Japanese and Uighur languages are agglutinative and ones. they often refer to such the languages as free word order languages. In Japanese, for example, we can say “karega 2.2.1 Union consonant and union vowel tobirawo aketa” as well as “tobirawo karega aketa”. The invariable part of a verb is called a verbal stem. For Both sentences mean “he opened the door” in English. example, ‘tabe-’ in “tabe-ru” [eat] and ‘kak-’ in “kak- What does allow such the word order changes keeping the u” [write] are both verbal stems in Japanese. There are equivalent meaning? The answer should be found in the two sorts of verbal stems : the one ending with a vowel, function of case suffices. The dependency relation of a e.g. ‘tabe-’ and the one ending with a consonant, e.g. noun to other words, that is the role which plays in a ‘kak-’. We call the former a vowel stem and the latter a sentence, is indicated by case suffixes. The Japanese case consonant stem, respectively. suffix ‘-ga’ indicates the subjective noun of a sentence. A verbal phrase consists of a verbal stem and some verbal That is, ‘-ga’ makes the nominative case. The case suffix suffixes. The verbal suffixes of “tabe-ru” and “kak-u” ‘-wo’ makes the accusative case and it indicates that the are ‘-ru ’ and ‘-u’, respectively. The verbal suffix ‘-ru’ prepositive noun is the object of the verb in a sentence. So follows only a vowel stem and the suffix ‘-u’ follows only case suffixes make it possible to understand sentences a consonant stem. The derivational grammar says those even if the position of “karega” and “tobirawo” are suffixes are the environmental variants of the suffix exchanged. ‘-(r)u’. The consonant ‘r’ appears when it is appended to Uighur language has the same property. Uighur accusative a vowel stem and disappears when it is appended to a case marker is ‘-ni’ and it corresponds to Japanese case consonant stem. A consonant of this kind is called a union suffix ‘-wo’. But in Uighur language the nominative case consonant. is often indicated by the zero-form. We show it by ‘Ø’. Verbal phrases “tabe-na-i” [do not eat] and “kak-ana-i” This does not mean that Uighur language has no [do not write] represent negative actions. The negative nominative case suffixes. verbal suffixes of them are ‘-na-’ and ‘-ana-’, Similar by to Japanese, Uighur case suffixes make it respectively. Those suffixes are also the variants of the possible to exchange word order with no change of suffix ‘-(a)na-’. The vowel ‘a’ appears when it is meaning. Thus, we can translate both Japanese sentences appended to a consonant stem and disappears when it is “karega tobirawo aketa” and “tobirawo karega appended to a vowel stem. A vowel of this kind is called a aketa” into Uighur in the manner of word-for-word as union vowel. shown in Figure 1. Both of “u ixikni aqdi” and “ixikni u aqdi” are natural Uighur sentences that mean “he opened meaning Japanese Uighur the door”. This observation means that the case suffixes play the causative -(s)ase- -guz- essential roles in Japanese and Uighur, and should be passive -(r)are- -(i)l- treated carefully in Japanese-Uighur translation. potential -(r)e- -(y)ala- polite -(i)mas- - 2.2 Morphological Similarities negative -(a)na- -ma- It has been considered that there is a morphological desiderative -(i)ta- -gu- difference between Japanese and Uighur. That is, although Uighur has no verbal conjugations, the traditional Japanese grammar says that Japanese verbs do conjugate. Table 1: Derivational suffixes in Japanese and Uighur
  • 3. form meaning Japanese Uighur finite non-perfective -(r)u -[i]du form perfective -(i)ta -di participle non-perfective -(r)u -[i]digan form perfective -(i)ta -gan perfective copulative -(i)te -(i)p converb provisional conditional -(r)eba -sa form negative copulative -(a)zu -mastin simultaneous -(i)nagara -gaq imperative affirmative -e,-ro -gin form negative -(r)una -magin Table 2: Syntactical suffixes in Japanese and Uighur The derivational grammar summarizes these discussions In addition, there are some derivational suffixes other than as the following two rules: these in Uighur. For example, a Uighur suffix ‘-(i)x-’ indicates cooperative meaning. If ‘kutra-’ [congratulate] Connection rule 1: When the suffix beginning with a follows ‘-(i)x-’, it forms ‘kutrax-’ [congratulate each union consonant is appended to a consonant stem, other]. In Japanese similar meaning indicated by ‘-(i)aw-’, the union consonant disappears. which is not a single suffix but is combination of a Connection rule 2: When the suffix beginning with a syntactical suffix ‘-(i)’ and a verb ‘aw -’. However, for union vowel is appended to a vowel stem, the union translation, Japanese ‘-(i)aw-’ matches Uighur vowel disappears. derivational suffix ‘-(i)x-’, so we take ‘-(i)aw-’ as a derivational suffix. Similarly, we consider some 2.2.2 Derivational Suffixes and Syntactical Suffixes compound morphemes as derivational suffixes, such as As shown in the examples of the above section, the ‘-(i)teir-’ and ‘-(i)tutuar-’. Both suffixes correspond to negative verbal suffix ‘-(a)na-’ can be followed by other Uighur suffix ‘-(i)wat -’. verbal suffixes, for example, ‘-i’ in “kak-ana-i” [do not Here, we should note that Uighur has another connection write]. These examples show that the verbal suffix rule. This shows a change of a phoneme instead of appended to verbal stems derives a new stem, which is disappearing. For example, when the syntactical suffix called a secondary stem. A suffix that derives a new stem ‘-[i]digan’, which means non-perfective participle, is called a derivational suffix. For example, “kak-ana-” follows a consonant stem ‘yaz-’, a phoneme ‘i’ appears and “yazidigan” is formed. But if it follows a vowel stem is a secondary stem and ‘-(a)na-’ is a derivational suffix. ‘yasa-’, ‘i’ changes to ‘y’ and “yasaydigan” is formed. Other verbal suffixes that do not derive new stems are We named such a phoneme a union half vowel. It varies called syntactical suffixes. They make various verbal according to the following rule: forms, which are classified into 4 forms: fin ite, participle, converb and imperative. For example, the suffix ‘-(r)u’ in Connection rule 3: When a suffix beginning with a “tabe-ru” is a syntactical suffix and it forms a finite form. union half vowel is appended to a consonant stem, Japanese verbal stems can be followed by several suffixes, the union half vowel becomes ‘i’, otherwise it but syntactical suffixes are always appended last. Note becomes ‘y’. that Japanese has strict rules for order of verbal suffixes, although the word order in Japanese sentence is rather 3 Problems for Japanese-Uighur Translation flexible. There are m any similarities between Japanese and Uighur but there are also some linguistic differences between 2.2.3 Applying the derivational grammar to Uighur them. Those differences cause serious problems to the Uighur verbal phrases have the similar features to word-for-word translation. In this section, some examples Japanese. We can also apply the derivational grammar to illustrate the problems. Our solutions to these problems Uighur language. A Uighur passive derivational suffix is are descried in section 4. ‘-(i)l-’, where ‘(i)’ refers a union vowel. When ‘-(i)l-’ follows a vowel stem ‘yasa-’ [create], a union vowel ‘(i)’ 3.1 Problems of verbal suffixes disappears and ‘yasal-’ [be created] yields. Let's consider a Uighur verbal stem ‘ az-’ [write]. In y Table 2 shows the correspondences between Japanese and order to express a causative meaning, ‘yaz-’ is followed Uighur syntactical suffixes. As you see, Japanese by a causative suffix ‘-guz-’ and forms a new stem participle suffixes are the same form as finite suffixes ‘yazguz-’ [cause to write], that is, ‘-guz-’ is a while Uighur ones are not. For example, Japanese derivational suffix. In addition, ‘-di’ completes the verbal perfective syntactical suffix ‘-(i)ta’ is used for both finite phrase “yazguzdi” by appending to the new stem. Both and participle forms. In Uighur, however, they are Japanese and Uighur have several derivational suffixes different forms as shown in Figure 2. For this reason, to and the correspondence between them is summarized in translate ‘-(i)ta’, we have to decide which of ‘-di’ or Table 2. ‘-gan’ is the correct translation.
  • 4. Finite form Participle form karega tobirawo aketa. karega aketa tobira kare ga tobira wo ake ta . kare ga ake ta tobira u Ø ixik ni aq di . u Ø aq -gan ixik u ixikni aqdi. u aqgan ixik [he opened the door.] [the door that he opened] Figure 2: The difference about finite form and participle form “köwrük din ötidu”. Japanese case suffix ‘wo’ is 3.2 Problems of case suffixes translated into ‘din’. The suffix ‘w o’ usually indicates the We have already explained that for Japanese and Uighur object, but sometimes it indicates the place, while Uighur case suffixes specify the role of noun phrases in the case suffix ‘ni’ and ‘din’ indicates the object and the place, sentences . This fact allows the flexible word order in the respectively. So we have to choose ‘ni’ or ‘din’ according sentences of these languages. But there does not exist one- to the role of ‘ wo’ in the sentence. In a fact, the case to-one correspondence between case suffixes of Japanese suffix ‘wo’ that indicates a place often appears together and those of Uighur. For example, “gomiwo suteru” is with such the motional verbs like “tooru ” [pass] and translated into “éhlétni tökidu”, where Japanese case “wataru” [cross]. This suggests we can choose ‘ or ni’ suffix ‘wo’ is translated into Uighur case suffix ‘ni’. On ‘din’ according to the verbs which the noun phrases with the other hand, the translation of “hasiwo wataru” is ‘wo’ depends on. We have investigated the ambiguities of case suffixes by using IPA Lexicon of the Japanese Language for Japanese Computers (basic Japanese verbs) (IPAL, 1987), which suffix translated Uighur suffixes and its number total contains 861 Japanese verbs and 3473 example sentences. Ø ning gé We have translated the Japanese case suffixes that ga 3637 77 5 3719 appeared in the sentences into Uighur ones and counted 97.7% 2.0% 0.3% the number of them. Table 3 shows the result. ‘Ø/ni’ in ni Ø/ni din others fault the second row means that ‘wo’ in the sentence is wo 1566 320 118 57 47 2108 translated in ‘ni’ but it is usually omitted in the 74.3% 15.2% 5.6% 2.7% 2.2% corresponding Uighur sentences. Some of example gé dé din Ø fault sentences can not be translated directly and we classified ni 1183 294 81 16 35 1609 such cases as faults. 73.5% 18.3% 5.0% 1.0% 2.2% We do not deal with Japanese case suffix ‘no’, which is dé bilén usually translated Uighur case suffix ‘ning’, because ‘no’ de 638 14 652 is usually used in such a way “A no B” [B of A] and it is 97.8% 2.2% related little to verbs. We do not deal with Japanese case dép Ø/dép bilén suffix ‘ya’, too, because of the same reason. to 193 46 146 385 50.1% 12.0% 37.9% 4 Suffix Adjustment din fault Now let us proceed to our discussions on realization of kara 289 5 294 word-for-word translation from Japanese to Uighur. The 98.3% 1.7% facts we revealed so far show that the problems to be gé solved here is how to decide verbal or case suffix he 236 236 correspondences correctly. To overcome these problems, 100% we adopt a method to assign the default Uighur suffix to din each Japanese suffix and then to substitute a well fitted yori 14 14 suffix for an unnatural one under replacement rules. Since a verbal stem and a following verbal suffix affect each 100% other, we can choose an appropriate suffix by knowing the Table 3: Statistical correspondence between Japanese right and left words. On the other hand, the verb which the noun phrase with a case suffix depends on affects the and Uighur case suffixes suffix. So we need to decide the correct case suffix
  • 5. Japanese default prepositive postpositive new new parts suffix suffix word word suffix -of-speech * end of sentence -ydu finite suffix -(r)u -ydigan * punctuation mark -ydu finite suffix * sentence-final particle -ydu finite suffix * end of sentence -di finite suffix -(i)ta -gan * punctuation mark -di finite suffix * sentence-final particle -di finite suffix Table 4: Replacement table of verbal suffixes considering the verbs depended. Thus, we propose two ‘ni’ is more popular than the locative one. But the locative types of rules for suffix replacement. case ‘ni’ is less related to verbs than the dative one. So we translate ‘ni’ to ‘dé’ as default and it is replaced with ‘gé’ 4.1 Replacement Table of Verbal Suffixes if the verb needs an indirect object. For verbal suffixes, we make a replacement rule shown in In addition, we consider not ‘dép’ but ‘bilén’ as a default Table 4. A Japanese suffix in the left most columns is not Uighur suffix for Japanese suffix ‘to’, which has two used for the replacement but for the purpose of helping to meanings. One is to change the preceding sentence to a understand. A default suffix in the second column is a quotative clause and it corresponds to ‘dép’. Another is Uighur morpheme , which is translated directly from the called the comitative case, which expresses a co-operant corresponding Japanese suffix in the first column. A or accompanist, and it corresponds to ‘bilén’. Traditional prepositive word and a postpositive word represent a Japanese grammar says that both designations of ‘to’ are condition for replacement of the default suffix. If the case suffixes. But the derivational grammar treats the default suffix appears between the prepositive and former as a conjunctive particle and only the latter as a postpositive words, it should be replaced by a new suffix case suffix. So we consider it is not the problem of in the fifth column. The mark *(don't care) denotes that a translation but the one of morphological analysis to replacement rule does not require a prepositive or distinguish two type of ‘to’. We use the morphological postpositive word. Since words of a sentence are tagged analyzer that tags only the latter as a case suffix. So we parts of speech by the morphological analysis, we need a decide ‘bilén’ as a default for case suffix ‘to’. part of speech of the new suffix in the right most columns. 5 Machine translation from Japanese into 4.2 Replacement Rule for Case Suffixes Uighur There are often several phrases between a noun phrase We have implemented the Japanese-Uighur machine and a verbal phrase on which it depends. So we can not translation system. Our system consists of four modules: use a replacement rules similar to above and need another MAJO, the two replacement modules and the morpheme type of rules for case suffixes. We use dependency connection system. MAJO is a morphological analyser of relation and add the case pattern data to the Uighur verb, Japanese based on the derivational grammar, and its such as <öt-{wo/-din}>. The Uighur verb ‘öt-’ has a pair dictionary consists of 3-tuples <Japanese morpheme, part- of Japanese case suffix and Uighur one and it is a of-speech, meaning>. For the translation, we replaced the replacement rule for case suffixes. In translation, if a 3-tuples with <Japanese morpheme, part-of-speech, Uighur verb that has such a pair appears in the sentence, Uighur morpheme >. Therefore outputs of MAJO become the translation system searches the noun phrase depending word-for-word translations for input sentences. on the verb and having the Japanese suffix contained in Here, a Japanese input sentence is “tukutta hasiwo the pair. Then the old Uighur suffix is replaced by the new watatta.” [crossed the bridge that one constructed.] one that is in the pair. shown in figure 3. Firstly, MAJO divides it into Japanese For each Japanese case suffix, we assume the most words and yields a sequence of equivalent Uighur words. possible one determined by Table 3 as a default Uighur Secondly, replacement rules of verbal suffixes are applied suffix. But, exceptionally, we adopt ‘dé’ as a default to those Uighur suffixes if they match the conditions on Uighur suffix for a Japanese suffix ‘ni’. The reason is the the replacement table. In the example, there are two ‘-tta’ following. The Japanese case suffix ‘ni’ often indicates a in the input sentence and MAJO translates both into position of space or time and it is said to represent the ‘-gan’. At this point, last ‘-gan’ matches the replacement locative case. In such a case ‘ni’ should be translated Uighur suffix ‘dé’ which designates the locative case. On rule and it is replaced with ‘-di’. Thirdly, case suffixes are the other hand, ‘ni’ also denotes the dative case and replaced if the verbs that they depend have the corresponds to ‘gé’. Table 3 shows that the dative case replacement rules satisfied the condition. In the example, Japanese case suffix ga wo ni de to kara he yori default Uighur suffix Ø ni dé dé bilén din gé din Table 5: Default translation for case suffixes
  • 6. input sentence tukutta hasiwo watatta. Morphological analysis tukur- -ta hasi -wo watar- -ta . Word-for-word translation sal- -gan köwrük -ni öt- -gan . Replacement 1 sal- -gan köwrük -ni öt- -di . Replacement 2 sal- -gan köwrük -din öt- -di . Morpheme connection output sentence salgan köwrükdin ötdi. Figure 3: Translation system and an example of translation the first step translates ‘wo’ into ‘ ni’. But the verb - - develop a word selection method. The replacement table ‘watar’ on which the noun phrase “hasiwo” depends has of our system describe only suffixes and parts of speech. a replacement rule <watar, consonant-verb, öt-{wo/ If we expand it to word meanings, we could solve -din}>. So ‘-ni’ is replaced with ‘-din’. Finally, the ambiguities of word meaning. morpheme connection system connects Uighur We have discussed only case suffixes, but there are other morphemes according to the connection rules and suffixes in Japanese, like ‘mo’ and ‘sika’. We know generates a Uighur output sentence. some of them correspond to Uighur suffixes. For example, Uighur suffix ‘mu’ has similar role of Japanese ‘mo’. So 5 Experiments we should investigate their correspondence further. We used 136 sentences that include 306 verbal phrases We are now making bigger size of experiments of our (254 different patterns) to evaluate performance of our translation system and collecting m replacement rules. ore Japanese-Uighur machine translation system. We We are aiming to make our system fit for practical use. constructed a Japanese-Uighur dictionary that had about 13,000 words including 3,800 verbs. We compared Bibliographical References translation results between the system with/without the Bloch, B. (1946). Studies in Colloquial Japanese, Part I, replacement rules for verbal suffixes. As a result, the Inflection, Journals of the American Oriental Society system translated 119 verbal phrases correctly without the 66. replacement rules, while the system with the replacement IPA L (1987). IPA Lexicon of the Japanese Language for rules translated 212 verbal phrases correctly. Thus, the Computers (Basic Verbs), The Information-technology precisions of translation about verbal phrases improved Promotion Agency. from 38.9% to 69.3%. Kim, J. and Lee, J. and Lee, G. (1998). Generation of We also evaluate translation of 295 case suffixes appeared Korean Predicates Based on Modality-Feature Ordering the 136 sentences. The simple system translated 257 and Lexicalizing Table in Japanese-Korean Machine suffixes correctly, while our system translates 293 case Translation, Journal of Natural Language Processing suffixes correctly. The precisions of translation for case 5(2), pp.3-24. suffixes improved from 87.1% to 99.3%. Kiyose, G. N. (1995). Japanese grammar --A new approach--, Kyoto university press. 6 Conclusion Lee, H. and Nakajima, M. and Agui, T. (1990). A Japanese-Korean Predicate Translation System Using In this paper, we proposed a Japanese-Uighur translation the Semantic Correspondence between Their Auxiliary system. Our system is based on the derivational grammar Predicative Expressions, Transactions of Information and has succeeded in systematic word-for-word Processing Society of Japan 31(6), pp.801-809. translation. In addition, it can generate a natural Uighur Ogawa, Y. and Muhtar and Toyama, K. and Inagaki, Y. sentence by using replacement rules. (1998). Derivational Grammar Approach to Our system has two modules for replacements and now Morphological Analysis of Japanese Sentences, we are going to unify them. In addition, it does not take PRICAI'98, LNAI 1531, pp. 424-435. account of amb iguities of word meaning. For example, Japanese syntactical suffix ‘-(r)are-’ has three meanings, which are passive, potential and honorific, but we now consider it only as a passive suffix. Therefore, we need to