SlideShare une entreprise Scribd logo
1  sur  2
Integrated Intelligent Research (IIR) International Journal of Computing Algorithm
Volume: 05 Issue: 01 June 2016, Page No. 21- 22
ISSN: 2278-2397
21
Finite State Automata as Analyzers of certain
Grammar Class Rules in the Khasi Language
Aiusha V. Hujon
Department of Computer Science, St. Anthony’s College Shillong, Meghalaya, India
Email: avhujon@gmail.com
Abstract—In this paper the concept of Finite states is applied
to design Recognizers for certain morphemes in Khasi
language. Individual Grammar Class is implemented using
Finite State Automata. Then Finite state automata are
implemented to design a Morphological Analyzer for Nouns
and Verbs. These Finite state automata can be concatenated
one after the other to recognize Sentence Rules. Hence these
can be used to design a Morphological Analyzer for legitimate
sentences in the Khasi language.
Keywords— Morphological analyzer, Finite state automata,
khasi language, word analyzers, Austroasiatic languages, word
recognizer, Natrual language processing.
I. INTRODUCTION
The Khasi Language is an Austroasiatic language. According
to classifications of Austro-Asiatic Language Family,
Austroasiatic language is divided into Munda and Mon-khmer,
Mon-khmer is divided into languages in the North and East.
There are three languages in the North; the Khasi, the Khmuic
and the Paluangunic. Khasi is a verb medial language; its basic
word order pattern is Subject Verb Object (SVO)[1]. It is
spoken by 1.6 million native speakers (2001 census), residing
in the Northeastern region of India, particularly in the state of
Meghalaya. Many Morphological Analyzer and Generator have
been constructed for many languages in India. But till date a
working Morphological analyzer and generator for the Khasi
language has not been constructed, although some research is
still going on in this area.
II. APPLICATION OF FSA TO CONSTRUCT CERTAIN KHASI
WORDS ANALYZERS
A Finite state automaton (FSA) can be used to determine valid
strings in a particular language [3]. Natural languages have
their own valid words which are form using letters from their
respective character sets. We can use FSAs as a morphological
analyzer to determine whether an input string of letters makes
up a legitimate Khasi word or not. Consider a few words in
Khasi which are Nouns as given in Table I. A Finite State
Automaton for each word listed in Table I is constructed with
path sequence from the initial state to the final state. The Finite
State Automaton given in Fig. 1 can determine whether a string
of letters is a Noun in the Khasi language or not. Many more
automata can be constructed like this for all the Nouns in the
Khasi Language.
Similarly Verbs can be recognized by an FSA as shown the Fig.
2 as FSA ‘B’. There are certain Verbs in Khasi that can be
converted into a Noun by adding a morpheme “jing” as a
prefix. Such as the verb “Khang” (close) can become a Noun
when prefix with “jing”, i.e., “jingkhang” (door) which is a
Noun, “bam” (eat) can be come “jingbam” (food), “thiah”
(sleep) can become “jingthiah” (bed). For these kinds of Verbs
the FSA for recognizing these verbs can be concatenated with
the FSA for the morpheme “jing”. The diagram in Fig. 2 shows
how this can be done. Let the FSA ‘A’ recognize the prefix
word “jing” and FSA ‘B’ recognizes certain Verbs like “thiah”,
“khang”, “bam”.
TABLE I
A Few Khasi words which are Nouns
Khasi Meaning in English
Briew man
Mrad animal
Khiew pot
Shuki chair
Ing house
Ksew dog
Miaw cat
Dieng tree
Pliang plate
Pela Cup
Fig 1. Finite state automata for certain Nouns in Khasi
Fig 2. FSA recognizing certain Nouns that are prefix with
“jing” and certain Verbs in Khasi
Using the same concept, we can build an FSA for each
Morpheme in Khasi. There are many other words that are form
by using prefixes like “jing” in khasi, e.g., like “pyn” which
Integrated Intelligent Research (IIR) International Journal of Computing Algorithm
Volume: 05 Issue: 01 June 2016, Page No. 21- 22
ISSN: 2278-2397
22
can also concatenate with “bam” (eat) to form “pynbam” (to
feed/feeding). FSA for each class of words like Noun, Verb,
Adjective, Adverb, Determiner, etc., can be constructed as
shown in Fig. 2.
III. SENTENCE RULE AND FSAS
The word form in Khasi is mainly Subject-Verb-Object (SVO).
Most of the sentences in Khasi are of the following forms,
there are many others which are not possible to describe in this
paper. The following are just a few of these forms.
1) Noun Verb e.g., Nga rwai (I sing)
2) Noun Verb Noun e.g, Nga bam ja (I am eating food)
3) Determiner Noun Verb Adverb e.g., U John u long u briew
uba bha (John is a good man)
4) Determiner Noun Verb Adverb Preposition Verb Adverb
e.g., U jah jlang khlem iohi shuh (He disappear without
seeing him again)
5) Determiner Noun Adjective e.g., Ka kot jingkhein (the
arithmetic book)
The FSAs that are shown in Fig. 1 and Fig. 2 are sub-FSAs and
can be used to construct a full FSA for recognizing legitimate
sentences in the Khasi Language. For each of the sentence
forms above, a corresponding FSA can be constructed as in the
shown in Fig. 3. The labels for each edge are given the name of
each grammar class, like Noun or Verb, but instead practically
each of the corresponding FSAs for Noun, Verbs, Adjectives,
etc., can be inserted where ever the label appears in the FSA
for recognizing valid sentences in Khasi. As an example the
words “Nga” a Noun, “Bam” a Verb, and “Kwai” another
Noun can be formed into a sentence by ordering them using the
sentence rule Noun Verb Noun, i.e., “Nga bam kwai” which
means “I’m eating betelnut” in English. An FSA is shown in
Fig. 4 which accepts this sentence in Khasi.
Fig 3. Finite state automata for accepting Sentence Rules in
Khasi
Fig 4. Finite State Automaton To Accept The Sentence “Nga
Bam Kwai” (Noun Verb Noun) In Khasi
IV. CONCLUSION
Since there are no such Analyzers and generators for the Khasi
Language, this paper presents a very preliminary investigation
to design Morphological Analyzers and Generators particularly
for Khasi Language using certain legitimate words forms as
examples to construct Finite state automata to recognize it.
These Analyzers are very initial considerations and could be
used to design Morphological Analyzers to process the Khasi
language and could be applied to design Morphological
analyzer and Generator in certain areas of Machine translation
for the Khasi Language.
References
[1] Diffloth, 1974 cited in Encyclopedia Britannia and
Diffloth, 2005
[2] Daniel Jurafsky, James H. Martin, Speech and Language
Processing, 4th Impression, Pearson Education and
Dorling Kinderseley Publishing, 2008
[3] John E. Hopcroft and Jeffery D. Ullman, Introduction to
Automata Theory, Languages, and Computation, Narosa
Publishing House Pvt. Ltd. New Delhi, 2002
[4] H.W Sten, Ka Grammar Khasi, Gratus
Publication,Shillong,

Contenu connexe

Similaire à Finite State Automata as Analyzers of certain Grammar Class Rules in the Khasi Language

lesson 1 syntax 2021.pdf
lesson 1 syntax 2021.pdflesson 1 syntax 2021.pdf
lesson 1 syntax 2021.pdfMOHAMEDAADDI
 
Lexical Ambiguity and its types in Pashto Language
Lexical Ambiguity and its types in Pashto LanguageLexical Ambiguity and its types in Pashto Language
Lexical Ambiguity and its types in Pashto LanguageAkashSharma618775
 
Grammatical Aspects of Language (1).pptx
Grammatical Aspects of Language (1).pptxGrammatical Aspects of Language (1).pptx
Grammatical Aspects of Language (1).pptxEmelitaCoronel2
 
Paper id 25201466
Paper id 25201466Paper id 25201466
Paper id 25201466IJRAT
 
Linguistics Theories MPB 2014 Progressive-edu.com
Linguistics Theories MPB 2014  Progressive-edu.comLinguistics Theories MPB 2014  Progressive-edu.com
Linguistics Theories MPB 2014 Progressive-edu.comHono Joe
 
Linguistics Theories MPB 2014 Progressive-edu.com
Linguistics Theories MPB 2014  Progressive-edu.comLinguistics Theories MPB 2014  Progressive-edu.com
Linguistics Theories MPB 2014 Progressive-edu.comHono Joe
 
language construction
language constructionlanguage construction
language constructionRajendran
 
Morphological typology(group2)
Morphological typology(group2)Morphological typology(group2)
Morphological typology(group2)Tina Lestary
 
Morphological Typology(group2)
Morphological Typology(group2)Morphological Typology(group2)
Morphological Typology(group2)Ninuk Krismanti
 
George Yule_Phrases and sentences grammar.pdf
George Yule_Phrases and sentences  grammar.pdfGeorge Yule_Phrases and sentences  grammar.pdf
George Yule_Phrases and sentences grammar.pdfNajma Asyifa
 
SIMPLE SENTENCE IN A·WE
SIMPLE SENTENCE IN A·WESIMPLE SENTENCE IN A·WE
SIMPLE SENTENCE IN A·WEkevig
 
Simple Sentence in a·we
Simple Sentence in a·weSimple Sentence in a·we
Simple Sentence in a·wekevig
 
Assignment on morphology
Assignment on morphologyAssignment on morphology
Assignment on morphologyLinda Midy
 
Syntax presetation
Syntax presetationSyntax presetation
Syntax presetationqamaraftab6
 
Syntactic Features in Mother Tongue.pptx
Syntactic Features in Mother Tongue.pptxSyntactic Features in Mother Tongue.pptx
Syntactic Features in Mother Tongue.pptxJamelMirafuentes
 

Similaire à Finite State Automata as Analyzers of certain Grammar Class Rules in the Khasi Language (20)

lesson 1 syntax 2021.pdf
lesson 1 syntax 2021.pdflesson 1 syntax 2021.pdf
lesson 1 syntax 2021.pdf
 
Lexical Ambiguity and its types in Pashto Language
Lexical Ambiguity and its types in Pashto LanguageLexical Ambiguity and its types in Pashto Language
Lexical Ambiguity and its types in Pashto Language
 
Word grammar
Word grammarWord grammar
Word grammar
 
Grammatical Aspects of Language (1).pptx
Grammatical Aspects of Language (1).pptxGrammatical Aspects of Language (1).pptx
Grammatical Aspects of Language (1).pptx
 
Paper id 25201466
Paper id 25201466Paper id 25201466
Paper id 25201466
 
Linguistics Theories MPB 2014 Progressive-edu.com
Linguistics Theories MPB 2014  Progressive-edu.comLinguistics Theories MPB 2014  Progressive-edu.com
Linguistics Theories MPB 2014 Progressive-edu.com
 
Linguistics Theories MPB 2014 Progressive-edu.com
Linguistics Theories MPB 2014  Progressive-edu.comLinguistics Theories MPB 2014  Progressive-edu.com
Linguistics Theories MPB 2014 Progressive-edu.com
 
language construction
language constructionlanguage construction
language construction
 
Morphological typology(group2)
Morphological typology(group2)Morphological typology(group2)
Morphological typology(group2)
 
Morphological Typology(group2)
Morphological Typology(group2)Morphological Typology(group2)
Morphological Typology(group2)
 
George Yule_Phrases and sentences grammar.pdf
George Yule_Phrases and sentences  grammar.pdfGeorge Yule_Phrases and sentences  grammar.pdf
George Yule_Phrases and sentences grammar.pdf
 
Grammar Syntax(1).pptx
Grammar Syntax(1).pptxGrammar Syntax(1).pptx
Grammar Syntax(1).pptx
 
Analysis of ASL
Analysis of ASLAnalysis of ASL
Analysis of ASL
 
SIMPLE SENTENCE IN A·WE
SIMPLE SENTENCE IN A·WESIMPLE SENTENCE IN A·WE
SIMPLE SENTENCE IN A·WE
 
Simple Sentence in a·we
Simple Sentence in a·weSimple Sentence in a·we
Simple Sentence in a·we
 
Presentation (18)-2.pptx
Presentation (18)-2.pptxPresentation (18)-2.pptx
Presentation (18)-2.pptx
 
Ngu phap
Ngu phapNgu phap
Ngu phap
 
Assignment on morphology
Assignment on morphologyAssignment on morphology
Assignment on morphology
 
Syntax presetation
Syntax presetationSyntax presetation
Syntax presetation
 
Syntactic Features in Mother Tongue.pptx
Syntactic Features in Mother Tongue.pptxSyntactic Features in Mother Tongue.pptx
Syntactic Features in Mother Tongue.pptx
 

Plus de MangaiK4

Application of Ancient Indian Agricultural Practices in Cloud Computing Envir...
Application of Ancient Indian Agricultural Practices in Cloud Computing Envir...Application of Ancient Indian Agricultural Practices in Cloud Computing Envir...
Application of Ancient Indian Agricultural Practices in Cloud Computing Envir...MangaiK4
 
A Study on Sparse Representation and Optimal Algorithms in Intelligent Comput...
A Study on Sparse Representation and Optimal Algorithms in Intelligent Comput...A Study on Sparse Representation and Optimal Algorithms in Intelligent Comput...
A Study on Sparse Representation and Optimal Algorithms in Intelligent Comput...MangaiK4
 
High Speed Low-Power Viterbi Decoder Using Trellis Code Modulation
High Speed Low-Power Viterbi Decoder Using Trellis Code ModulationHigh Speed Low-Power Viterbi Decoder Using Trellis Code Modulation
High Speed Low-Power Viterbi Decoder Using Trellis Code ModulationMangaiK4
 
Relation Extraction using Hybrid Approach and an Ensemble Algorithm
Relation Extraction using Hybrid Approach and an Ensemble AlgorithmRelation Extraction using Hybrid Approach and an Ensemble Algorithm
Relation Extraction using Hybrid Approach and an Ensemble AlgorithmMangaiK4
 
Saturation Throughput and Delay Aware Asynchronous Noc Using Fuzzy Logic
Saturation Throughput and Delay Aware Asynchronous Noc Using Fuzzy LogicSaturation Throughput and Delay Aware Asynchronous Noc Using Fuzzy Logic
Saturation Throughput and Delay Aware Asynchronous Noc Using Fuzzy LogicMangaiK4
 
Image Based Password using RSA Algorithm
Image Based Password using RSA AlgorithmImage Based Password using RSA Algorithm
Image Based Password using RSA AlgorithmMangaiK4
 
A Framework for Desktop Virtual Reality Application for Education
A Framework for Desktop Virtual Reality Application for EducationA Framework for Desktop Virtual Reality Application for Education
A Framework for Desktop Virtual Reality Application for EducationMangaiK4
 
Smart Security For Data Sharing In Cloud Computing
Smart Security For Data Sharing In Cloud ComputingSmart Security For Data Sharing In Cloud Computing
Smart Security For Data Sharing In Cloud ComputingMangaiK4
 
BugLoc: Bug Localization in Multi Threaded Application via Graph Mining Approach
BugLoc: Bug Localization in Multi Threaded Application via Graph Mining ApproachBugLoc: Bug Localization in Multi Threaded Application via Graph Mining Approach
BugLoc: Bug Localization in Multi Threaded Application via Graph Mining ApproachMangaiK4
 
Motion Object Detection Using BGS Technique
Motion Object Detection Using BGS TechniqueMotion Object Detection Using BGS Technique
Motion Object Detection Using BGS TechniqueMangaiK4
 
Encroachment in Data Processing using Big Data Technology
Encroachment in Data Processing using Big Data TechnologyEncroachment in Data Processing using Big Data Technology
Encroachment in Data Processing using Big Data TechnologyMangaiK4
 
Performance Evaluation of Hybrid Method for Securing and Compressing Images
Performance Evaluation of Hybrid Method for Securing and Compressing ImagesPerformance Evaluation of Hybrid Method for Securing and Compressing Images
Performance Evaluation of Hybrid Method for Securing and Compressing ImagesMangaiK4
 
A Review on - Data Hiding using Cryptography and Steganography
A Review on - Data Hiding using Cryptography and SteganographyA Review on - Data Hiding using Cryptography and Steganography
A Review on - Data Hiding using Cryptography and SteganographyMangaiK4
 
Modeling Originators for Event Forecasting Multi-Task Learning in Mil Algorithm
Modeling Originators for Event Forecasting Multi-Task Learning in Mil AlgorithmModeling Originators for Event Forecasting Multi-Task Learning in Mil Algorithm
Modeling Originators for Event Forecasting Multi-Task Learning in Mil AlgorithmMangaiK4
 
quasi-uniform theta graph
quasi-uniform theta graphquasi-uniform theta graph
quasi-uniform theta graphMangaiK4
 
The Relationship of Mathematics to the Performance of SHCT it Students in Pro...
The Relationship of Mathematics to the Performance of SHCT it Students in Pro...The Relationship of Mathematics to the Performance of SHCT it Students in Pro...
The Relationship of Mathematics to the Performance of SHCT it Students in Pro...MangaiK4
 
A Tentative analysis of Liver Disorder using Data Mining Algorithms J48, Deci...
A Tentative analysis of Liver Disorder using Data Mining Algorithms J48, Deci...A Tentative analysis of Liver Disorder using Data Mining Algorithms J48, Deci...
A Tentative analysis of Liver Disorder using Data Mining Algorithms J48, Deci...MangaiK4
 
Marine Object Recognition using Blob Analysis
Marine Object Recognition using Blob AnalysisMarine Object Recognition using Blob Analysis
Marine Object Recognition using Blob AnalysisMangaiK4
 
Implementation of High Speed OFDM Transceiver using FPGA
Implementation of High Speed OFDM Transceiver using FPGAImplementation of High Speed OFDM Transceiver using FPGA
Implementation of High Speed OFDM Transceiver using FPGAMangaiK4
 
Renewable Energy Based on Current Fed Switched Inverter for Smart Grid Applic...
Renewable Energy Based on Current Fed Switched Inverter for Smart Grid Applic...Renewable Energy Based on Current Fed Switched Inverter for Smart Grid Applic...
Renewable Energy Based on Current Fed Switched Inverter for Smart Grid Applic...MangaiK4
 

Plus de MangaiK4 (20)

Application of Ancient Indian Agricultural Practices in Cloud Computing Envir...
Application of Ancient Indian Agricultural Practices in Cloud Computing Envir...Application of Ancient Indian Agricultural Practices in Cloud Computing Envir...
Application of Ancient Indian Agricultural Practices in Cloud Computing Envir...
 
A Study on Sparse Representation and Optimal Algorithms in Intelligent Comput...
A Study on Sparse Representation and Optimal Algorithms in Intelligent Comput...A Study on Sparse Representation and Optimal Algorithms in Intelligent Comput...
A Study on Sparse Representation and Optimal Algorithms in Intelligent Comput...
 
High Speed Low-Power Viterbi Decoder Using Trellis Code Modulation
High Speed Low-Power Viterbi Decoder Using Trellis Code ModulationHigh Speed Low-Power Viterbi Decoder Using Trellis Code Modulation
High Speed Low-Power Viterbi Decoder Using Trellis Code Modulation
 
Relation Extraction using Hybrid Approach and an Ensemble Algorithm
Relation Extraction using Hybrid Approach and an Ensemble AlgorithmRelation Extraction using Hybrid Approach and an Ensemble Algorithm
Relation Extraction using Hybrid Approach and an Ensemble Algorithm
 
Saturation Throughput and Delay Aware Asynchronous Noc Using Fuzzy Logic
Saturation Throughput and Delay Aware Asynchronous Noc Using Fuzzy LogicSaturation Throughput and Delay Aware Asynchronous Noc Using Fuzzy Logic
Saturation Throughput and Delay Aware Asynchronous Noc Using Fuzzy Logic
 
Image Based Password using RSA Algorithm
Image Based Password using RSA AlgorithmImage Based Password using RSA Algorithm
Image Based Password using RSA Algorithm
 
A Framework for Desktop Virtual Reality Application for Education
A Framework for Desktop Virtual Reality Application for EducationA Framework for Desktop Virtual Reality Application for Education
A Framework for Desktop Virtual Reality Application for Education
 
Smart Security For Data Sharing In Cloud Computing
Smart Security For Data Sharing In Cloud ComputingSmart Security For Data Sharing In Cloud Computing
Smart Security For Data Sharing In Cloud Computing
 
BugLoc: Bug Localization in Multi Threaded Application via Graph Mining Approach
BugLoc: Bug Localization in Multi Threaded Application via Graph Mining ApproachBugLoc: Bug Localization in Multi Threaded Application via Graph Mining Approach
BugLoc: Bug Localization in Multi Threaded Application via Graph Mining Approach
 
Motion Object Detection Using BGS Technique
Motion Object Detection Using BGS TechniqueMotion Object Detection Using BGS Technique
Motion Object Detection Using BGS Technique
 
Encroachment in Data Processing using Big Data Technology
Encroachment in Data Processing using Big Data TechnologyEncroachment in Data Processing using Big Data Technology
Encroachment in Data Processing using Big Data Technology
 
Performance Evaluation of Hybrid Method for Securing and Compressing Images
Performance Evaluation of Hybrid Method for Securing and Compressing ImagesPerformance Evaluation of Hybrid Method for Securing and Compressing Images
Performance Evaluation of Hybrid Method for Securing and Compressing Images
 
A Review on - Data Hiding using Cryptography and Steganography
A Review on - Data Hiding using Cryptography and SteganographyA Review on - Data Hiding using Cryptography and Steganography
A Review on - Data Hiding using Cryptography and Steganography
 
Modeling Originators for Event Forecasting Multi-Task Learning in Mil Algorithm
Modeling Originators for Event Forecasting Multi-Task Learning in Mil AlgorithmModeling Originators for Event Forecasting Multi-Task Learning in Mil Algorithm
Modeling Originators for Event Forecasting Multi-Task Learning in Mil Algorithm
 
quasi-uniform theta graph
quasi-uniform theta graphquasi-uniform theta graph
quasi-uniform theta graph
 
The Relationship of Mathematics to the Performance of SHCT it Students in Pro...
The Relationship of Mathematics to the Performance of SHCT it Students in Pro...The Relationship of Mathematics to the Performance of SHCT it Students in Pro...
The Relationship of Mathematics to the Performance of SHCT it Students in Pro...
 
A Tentative analysis of Liver Disorder using Data Mining Algorithms J48, Deci...
A Tentative analysis of Liver Disorder using Data Mining Algorithms J48, Deci...A Tentative analysis of Liver Disorder using Data Mining Algorithms J48, Deci...
A Tentative analysis of Liver Disorder using Data Mining Algorithms J48, Deci...
 
Marine Object Recognition using Blob Analysis
Marine Object Recognition using Blob AnalysisMarine Object Recognition using Blob Analysis
Marine Object Recognition using Blob Analysis
 
Implementation of High Speed OFDM Transceiver using FPGA
Implementation of High Speed OFDM Transceiver using FPGAImplementation of High Speed OFDM Transceiver using FPGA
Implementation of High Speed OFDM Transceiver using FPGA
 
Renewable Energy Based on Current Fed Switched Inverter for Smart Grid Applic...
Renewable Energy Based on Current Fed Switched Inverter for Smart Grid Applic...Renewable Energy Based on Current Fed Switched Inverter for Smart Grid Applic...
Renewable Energy Based on Current Fed Switched Inverter for Smart Grid Applic...
 

Dernier

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 

Dernier (20)

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 

Finite State Automata as Analyzers of certain Grammar Class Rules in the Khasi Language

  • 1. Integrated Intelligent Research (IIR) International Journal of Computing Algorithm Volume: 05 Issue: 01 June 2016, Page No. 21- 22 ISSN: 2278-2397 21 Finite State Automata as Analyzers of certain Grammar Class Rules in the Khasi Language Aiusha V. Hujon Department of Computer Science, St. Anthony’s College Shillong, Meghalaya, India Email: avhujon@gmail.com Abstract—In this paper the concept of Finite states is applied to design Recognizers for certain morphemes in Khasi language. Individual Grammar Class is implemented using Finite State Automata. Then Finite state automata are implemented to design a Morphological Analyzer for Nouns and Verbs. These Finite state automata can be concatenated one after the other to recognize Sentence Rules. Hence these can be used to design a Morphological Analyzer for legitimate sentences in the Khasi language. Keywords— Morphological analyzer, Finite state automata, khasi language, word analyzers, Austroasiatic languages, word recognizer, Natrual language processing. I. INTRODUCTION The Khasi Language is an Austroasiatic language. According to classifications of Austro-Asiatic Language Family, Austroasiatic language is divided into Munda and Mon-khmer, Mon-khmer is divided into languages in the North and East. There are three languages in the North; the Khasi, the Khmuic and the Paluangunic. Khasi is a verb medial language; its basic word order pattern is Subject Verb Object (SVO)[1]. It is spoken by 1.6 million native speakers (2001 census), residing in the Northeastern region of India, particularly in the state of Meghalaya. Many Morphological Analyzer and Generator have been constructed for many languages in India. But till date a working Morphological analyzer and generator for the Khasi language has not been constructed, although some research is still going on in this area. II. APPLICATION OF FSA TO CONSTRUCT CERTAIN KHASI WORDS ANALYZERS A Finite state automaton (FSA) can be used to determine valid strings in a particular language [3]. Natural languages have their own valid words which are form using letters from their respective character sets. We can use FSAs as a morphological analyzer to determine whether an input string of letters makes up a legitimate Khasi word or not. Consider a few words in Khasi which are Nouns as given in Table I. A Finite State Automaton for each word listed in Table I is constructed with path sequence from the initial state to the final state. The Finite State Automaton given in Fig. 1 can determine whether a string of letters is a Noun in the Khasi language or not. Many more automata can be constructed like this for all the Nouns in the Khasi Language. Similarly Verbs can be recognized by an FSA as shown the Fig. 2 as FSA ‘B’. There are certain Verbs in Khasi that can be converted into a Noun by adding a morpheme “jing” as a prefix. Such as the verb “Khang” (close) can become a Noun when prefix with “jing”, i.e., “jingkhang” (door) which is a Noun, “bam” (eat) can be come “jingbam” (food), “thiah” (sleep) can become “jingthiah” (bed). For these kinds of Verbs the FSA for recognizing these verbs can be concatenated with the FSA for the morpheme “jing”. The diagram in Fig. 2 shows how this can be done. Let the FSA ‘A’ recognize the prefix word “jing” and FSA ‘B’ recognizes certain Verbs like “thiah”, “khang”, “bam”. TABLE I A Few Khasi words which are Nouns Khasi Meaning in English Briew man Mrad animal Khiew pot Shuki chair Ing house Ksew dog Miaw cat Dieng tree Pliang plate Pela Cup Fig 1. Finite state automata for certain Nouns in Khasi Fig 2. FSA recognizing certain Nouns that are prefix with “jing” and certain Verbs in Khasi Using the same concept, we can build an FSA for each Morpheme in Khasi. There are many other words that are form by using prefixes like “jing” in khasi, e.g., like “pyn” which
  • 2. Integrated Intelligent Research (IIR) International Journal of Computing Algorithm Volume: 05 Issue: 01 June 2016, Page No. 21- 22 ISSN: 2278-2397 22 can also concatenate with “bam” (eat) to form “pynbam” (to feed/feeding). FSA for each class of words like Noun, Verb, Adjective, Adverb, Determiner, etc., can be constructed as shown in Fig. 2. III. SENTENCE RULE AND FSAS The word form in Khasi is mainly Subject-Verb-Object (SVO). Most of the sentences in Khasi are of the following forms, there are many others which are not possible to describe in this paper. The following are just a few of these forms. 1) Noun Verb e.g., Nga rwai (I sing) 2) Noun Verb Noun e.g, Nga bam ja (I am eating food) 3) Determiner Noun Verb Adverb e.g., U John u long u briew uba bha (John is a good man) 4) Determiner Noun Verb Adverb Preposition Verb Adverb e.g., U jah jlang khlem iohi shuh (He disappear without seeing him again) 5) Determiner Noun Adjective e.g., Ka kot jingkhein (the arithmetic book) The FSAs that are shown in Fig. 1 and Fig. 2 are sub-FSAs and can be used to construct a full FSA for recognizing legitimate sentences in the Khasi Language. For each of the sentence forms above, a corresponding FSA can be constructed as in the shown in Fig. 3. The labels for each edge are given the name of each grammar class, like Noun or Verb, but instead practically each of the corresponding FSAs for Noun, Verbs, Adjectives, etc., can be inserted where ever the label appears in the FSA for recognizing valid sentences in Khasi. As an example the words “Nga” a Noun, “Bam” a Verb, and “Kwai” another Noun can be formed into a sentence by ordering them using the sentence rule Noun Verb Noun, i.e., “Nga bam kwai” which means “I’m eating betelnut” in English. An FSA is shown in Fig. 4 which accepts this sentence in Khasi. Fig 3. Finite state automata for accepting Sentence Rules in Khasi Fig 4. Finite State Automaton To Accept The Sentence “Nga Bam Kwai” (Noun Verb Noun) In Khasi IV. CONCLUSION Since there are no such Analyzers and generators for the Khasi Language, this paper presents a very preliminary investigation to design Morphological Analyzers and Generators particularly for Khasi Language using certain legitimate words forms as examples to construct Finite state automata to recognize it. These Analyzers are very initial considerations and could be used to design Morphological Analyzers to process the Khasi language and could be applied to design Morphological analyzer and Generator in certain areas of Machine translation for the Khasi Language. References [1] Diffloth, 1974 cited in Encyclopedia Britannia and Diffloth, 2005 [2] Daniel Jurafsky, James H. Martin, Speech and Language Processing, 4th Impression, Pearson Education and Dorling Kinderseley Publishing, 2008 [3] John E. Hopcroft and Jeffery D. Ullman, Introduction to Automata Theory, Languages, and Computation, Narosa Publishing House Pvt. Ltd. New Delhi, 2002 [4] H.W Sten, Ka Grammar Khasi, Gratus Publication,Shillong,