SlideShare a Scribd company logo
1 of 39
Download to read offline
Contextualization of Morphological Inflection
Ekaterina Vylomova1 Ryan Cotterell2
Timothy Baldwin1 Trevor Cohn1 Jason Eisner2
1School of Computing and Information Systems
The University of Melbourne
2Department of Computer Science
Johns Hopkins University
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 1 / 39
Language Modelling
This is Marvin:
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 2 / 39
Language Modelling
OK, Marvin, which word comes next: Two cats are ___
Hmmm, let me guess ...
sitting 3.01 ∗ 10−4
play 2.87 ∗ 10−4
running 2.53 ∗ 10−4
nice 2.32 ∗ 10−4
lost 1.97 ∗ 10−4
playing 1.66 ∗ 10−4
sat 1.54 ∗ 10−4
plays 1.32 ∗ 10−4
...
...Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 3 / 39
Language Modelling
Let’s add a constraint by providing a lemma: Two cats are [PLAY]
That narrows things down a lot ...
sitting 3.01 ∗ 10−4
play 2.87 ∗ 10−4
running 2.53 ∗ 10−4
nice 2.32 ∗ 10−4
lost 1.97 ∗ 10−4
playing 1.66 ∗ 10−4
sat 1.54 ∗ 10−4
plays 1.32 ∗ 10−4
...
...Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 4 / 39
Language Modelling
Hey, this reminds me a bit of .... a wug ... and a second wug:
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 5 / 39
Morphological (Re-)Inflection
... as well as the SIGMORPHON morphological inflection task
SIGMORPHON Shared Task 2016–2019
PLAY + PRESENT PARTICIPLE → playing
played + PRESENT PARTICIPLE → playing
Lemma Tag Form
RUN PAST ran
RUN PRES;1SG run
RUN PRES;2SG run
RUN PRES;3SG runs
RUN PRES;PL run
RUN PART running
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 6 / 39
2018 :∼ 96% accuracy on avg.
in high-resource setting
Morphological (Re-)Inflection
Contextualization: But why choose PRESENT PARTICIPLE? Context!
SIGMORPHON Shared Task 2016–2019
PLAY + PRESENT PARTICIPLE → playing
played + PRESENT PARTICIPLE → playing
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 7 / 39
Morphological (Re-)Inflection
Contextualization: The tags must be inferred from the context!
SIGMORPHON Shared Task 2018 Task 2
SubTask 1
Two cats are ??? together
TWO/NUM CAT/N+PL BE/AUX+PRES+3PL PLAY TOGETHER/ADV
SubTask 2
Two cats are ??? together
PLAY
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 8 / 39
Morphological (Re-)Inflection
Contextualization: The tags must be inferred from the context!
SIGMORPHON Shared Task 2018 Task 2
SubTask 1
Two cats are playing together
TWO/NUM CAT/N+PL BE/AUX+PRES+3PL PLAY TOGETHER/ADV
SubTask 2
Two cats are playing together
PLAY
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 9 / 39
A Hybrid (Structured–unstructured) Model
Let’s predict both tags and forms!
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 10 / 39
Lemmatized
Sequence
Predicted Tag
Sequence
Predicted Form
Sequence
A Hybrid (Structured–unstructured) Model
... or, in other words, p(w, m | ) = ( n
i=1 p(wi | i , mi )) p(m | )
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 11 / 39
Lemmatized
Sequence
Predicted Tag
Sequence
Predicted Form
Sequence
A Hybrid (Structured–unstructured) Model
... or, in other words, p(w, m | ) = ( n
i=1 p(wi | i , mi )) p(m | )
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 12 / 39
Lemmatized
Sequence
p(m | ) Neural CRF
Lample et al., 2016
Predicted Tag
Sequence
Predicted Form
Sequence
Rastogi et al., 2016
Aharoni et al., 2017
p(wi | i , mi ) Word Emission
Languages and Grammar Categories
Let’s test the model on a wide variety of languages!
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 13 / 39
Languages and Grammar Categories
Languages differ in what is explicitly morphosyntactically marked, and how:
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 14 / 39
Number of values of an inflectional feature
Bulgarian (bg), Slavic
English (en), Germanic
Basque (eu), Isolate
Finnish (fi), Uralic
Gaelic (ga), Celtic
Hindi (hi), Indic
Italian (it), Romance
Latin (la), Romance
Polish (pl), Slavic
Swedish (sv), Germanic
Languages and Grammar Categories
Some languages use word order to express relations between words, while
others use morphosyntactic marking:
English:
Kim gives Sandy an interesting book
Polish:
Jenia daje Maszy ciekawą książkę
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 15 / 39
Languages and Grammar Categories
Some languages use word order to express relations between words, while
others use morphosyntactic marking:
English:
Kim gives Sandy an interesting book
Subject IObject DObject
Polish:
Jenia daje Maszy ciekawą książkę
Nom Dat Acc.Fem.Sg Acc.Sg
== Maszy daje Jenia ciekawą książkę
== ciekawą książkę daje Jenia Maszy
!= Jenie daje Masza ciekawą książkę
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 16 / 39
Experiments
How well can such categories and corresponding forms be predicted in each
language?
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 17 / 39
Experiments
How well can such categories and corresponding forms be predicted in each
language?
Do linguistic features enhance performance?
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 18 / 39
Experiments
How well can such categories and corresponding forms be predicted in each
language?
Do linguistic features enhance performance?
Does morphological complexity impact on empirical performance?
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 19 / 39
Experiments
Data: Universal Dependencies v1.2
Baselines: the baseline of the SIGMORPHON 2018 shared task as well as
the best performing system of that year
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 20 / 39
Nivre et al.,2016
Experiments
SM: biLSTM encoder–decoder with context window of size 2
input = concat (prefix+suffix, lemma, tags, char-level center lemma)
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 21 / 39
Cotterell et al.,2018
Experiments
CPH: biLSTM encoder–decoder with no context window size restrictions
input = concat (full context, lemma, tags, char-level center lemma)
also predicts target tags as an auxiliary task
Direct: more basic model that relies only on forms and lemmas
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 22 / 39
Kementchedjhieva et al.,2018
Experiments
Let’s condition only on contextual forms and lemmas (1-best accuracy for
form prediction):
0
25
50
75
100
BG EN EU FI GA HI IT LA PL SV
1.Direct
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 23 / 39
Experiments
Now also supply contextual tag information, still predicting forms only:
0
25
50
75
100
BG EN EU FI GA HI IT LA PL SV
1.Direct 2.SM
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 24 / 39
Experiments
Now use a wider context and predict tags as an auxiliary task:
0
25
50
75
100
BG EN EU FI GA HI IT LA PL SV
1.Direct 2.SM 3.CPH
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 25 / 39
Experiments
Finally, use neural CRF to predict tag sequence and hard monotonic
attention model for forms:
0
25
50
75
100
BG EN EU FI GA HI IT LA PL SV
1.Direct 2.SM 3.CPH 4.Joint
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 26 / 39
Experiments
How far are we from the results for forms predicted from gold tag
sequence?
0
25
50
75
100
BG EN EU FI GA HI IT LA PL SV
1.Direct 2.SM 3.CPH 4.Joint 5.Gold Tags
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 27 / 39
Discussion
Q1: Do linguistic features help?
Yes, they do!
Most systems that make use of morphological tags outperform the “Direct”
baseline on most languages
Joint prediction of tags and forms further improves the results
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 28 / 39
Discussion
Q2: Does morphological complexity impact empirical performance?
Yes, it does!
Performance drops in languages with rich case systems such as Slavic and
Uralic
The model needs to learn which grammatical categories should be in
agreement
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 29 / 39
Discussion
Q3: How well is agreement captured?
Adjective – Noun (AMod)
is captured quite well
Verb – Noun (Subject – Verb)
is more challenging, since agreement categories can vary depending on
tense, e.g. Ru: Person+Number in present vs. Gender in past, singular
General-purpose inference of agreement categories is still a challenging
task!
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 30 / 39
Discussion
Q4: Where does most uncertainty come from?
Inherent and Contextual Morphological Categories
Contextual categories participate in agreement: adjective number, case,
gender, verbal gender, etc.
Inherent express the speaker’s intentions: noun number, verbal tense
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 31 / 39
Discussion
Q4: Where does most uncertainty come from?
Inherent and Contextual Morphological Categories
Contextual categories participate in agreement: adjective number, case,
gender, verbal gender, etc.
Inherent express the speaker’s intentions: noun number, verbal tense
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 32 / 39
Most uncertainty comes from inherent categories!
Discussion
Q4: Where does most uncertainty come from?
Inherent and Contextual Morphological Categories
Contextual categories participate in agreement: adjective number, case,
gender, verbal gender, etc.
Inherent express the speaker’s intentions: noun number, verbal tense
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 33 / 39
Most uncertainty comes from inherent categories!
Often such categories must be inferred
Discussion
Q5: Which language is least affected by lemmatization?
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 34 / 39
Discussion
Q5: Which language is least affected by lemmatization?
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 35 / 39
Discussion
Q5: Which language is least affected by lemmatization?
Word Order vs. Morphology
Most information on roles and dependencies is expressed
non-morphologically, e.g. in word order or by prepositions:
EN: Kim gives Sandy an interesting book → KIM GIVE SANDY AN
INTERESTING BOOK
PL: Jenia daje Maszy ciekawą książkę → JENIA DAWAĆ’ MASZA
CIEKAWY KSIĄŻKA
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 36 / 39
Why English?
Discussion
Q5: Which language is least affected by lemmatization?
Word Order vs. Morphology
Most information on roles and dependencies is expressed
non-morphologically, e.g. in word order or by prepositions:
EN: Kim gives Sandy an interesting book → KIM GIVE SANDY AN
INTERESTING BOOK
PL: Jenia daje Maszy ciekawą książkę → JENIA DAWAĆ’ MASZA
CIEKAWY KSIĄŻKA
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 37 / 39
Why English?
SVO/Roles are still there
Flexible/Roles are partially lost
Future Directions
Evaluation of grammaticality
How well do neural models model grammaticality?
Data de-biasing (e.g., En–Ru )
smart student → umnyj.Nom.Masc.Sg student.Nom.Sg
augment with:
smart student → umnaja.Nom.Fem.Sg studentka.Nom.Fem.Sg
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 38 / 39
Thank you! Questions?
Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 39 / 39

More Related Content

Similar to Contextualization of Morphological Inflection

Teaching alphabetics and fluency in reading
Teaching alphabetics and fluency in readingTeaching alphabetics and fluency in reading
Teaching alphabetics and fluency in readingMarcia Luptak
 
Deverbal nouns and historical development
Deverbal nouns and historical developmentDeverbal nouns and historical development
Deverbal nouns and historical developmentØivin Andersen
 
Webelhuth et al: Idioms in LInguistics Theory, Part I
Webelhuth et al: Idioms in LInguistics Theory, Part IWebelhuth et al: Idioms in LInguistics Theory, Part I
Webelhuth et al: Idioms in LInguistics Theory, Part IManfred Sailer
 
Ctel Module1
Ctel Module1Ctel Module1
Ctel Module1mrounds5
 
Sigmorphon 2021. Keynote. UniMorph, Morphological inflection
Sigmorphon 2021. Keynote. UniMorph, Morphological inflectionSigmorphon 2021. Keynote. UniMorph, Morphological inflection
Sigmorphon 2021. Keynote. UniMorph, Morphological inflectionKaterina Vylomova
 
Teacher Feedback and Linguistic Profiles Can Help to Improve Learners' Writing
Teacher Feedback and Linguistic Profiles Can Help to Improve Learners' WritingTeacher Feedback and Linguistic Profiles Can Help to Improve Learners' Writing
Teacher Feedback and Linguistic Profiles Can Help to Improve Learners' WritingBenjamin L. Stewart, PhD
 
Investigating literacy teachers' linguistic knowledge
Investigating literacy teachers' linguistic knowledgeInvestigating literacy teachers' linguistic knowledge
Investigating literacy teachers' linguistic knowledgeDominik Lukes
 
Ctel module1 fall09
Ctel module1 fall09Ctel module1 fall09
Ctel module1 fall09jheil65
 
Ctel Module1 Domain 1 Fall07
Ctel Module1 Domain 1 Fall07Ctel Module1 Domain 1 Fall07
Ctel Module1 Domain 1 Fall07jheil65
 
Meaningful mistakes in language behaviour as a basic factor in its evolutiona...
Meaningful mistakes in language behaviour as a basic factor in its evolutiona...Meaningful mistakes in language behaviour as a basic factor in its evolutiona...
Meaningful mistakes in language behaviour as a basic factor in its evolutiona...arkadiusz
 
Effective Approach for Disambiguating Chinese Polyphonic Ambiguity
Effective Approach for Disambiguating Chinese Polyphonic AmbiguityEffective Approach for Disambiguating Chinese Polyphonic Ambiguity
Effective Approach for Disambiguating Chinese Polyphonic AmbiguityIDES Editor
 
Introduction to linguistics: A Quick Review
Introduction to linguistics: A Quick ReviewIntroduction to linguistics: A Quick Review
Introduction to linguistics: A Quick Reviewh4976
 
Conquer the Code
Conquer the CodeConquer the Code
Conquer the CodeUSAteacher
 

Similar to Contextualization of Morphological Inflection (15)

Teaching alphabetics and fluency in reading
Teaching alphabetics and fluency in readingTeaching alphabetics and fluency in reading
Teaching alphabetics and fluency in reading
 
Deverbal nouns and historical development
Deverbal nouns and historical developmentDeverbal nouns and historical development
Deverbal nouns and historical development
 
Webelhuth et al: Idioms in LInguistics Theory, Part I
Webelhuth et al: Idioms in LInguistics Theory, Part IWebelhuth et al: Idioms in LInguistics Theory, Part I
Webelhuth et al: Idioms in LInguistics Theory, Part I
 
Ctel Module1
Ctel Module1Ctel Module1
Ctel Module1
 
Sigmorphon 2021. Keynote. UniMorph, Morphological inflection
Sigmorphon 2021. Keynote. UniMorph, Morphological inflectionSigmorphon 2021. Keynote. UniMorph, Morphological inflection
Sigmorphon 2021. Keynote. UniMorph, Morphological inflection
 
Teacher Feedback and Linguistic Profiles Can Help to Improve Learners' Writing
Teacher Feedback and Linguistic Profiles Can Help to Improve Learners' WritingTeacher Feedback and Linguistic Profiles Can Help to Improve Learners' Writing
Teacher Feedback and Linguistic Profiles Can Help to Improve Learners' Writing
 
Investigating literacy teachers' linguistic knowledge
Investigating literacy teachers' linguistic knowledgeInvestigating literacy teachers' linguistic knowledge
Investigating literacy teachers' linguistic knowledge
 
Ctel module1 fall09
Ctel module1 fall09Ctel module1 fall09
Ctel module1 fall09
 
Pronunciation and syllables
Pronunciation and syllablesPronunciation and syllables
Pronunciation and syllables
 
Ctel Module1 Domain 1 Fall07
Ctel Module1 Domain 1 Fall07Ctel Module1 Domain 1 Fall07
Ctel Module1 Domain 1 Fall07
 
Meaningful mistakes in language behaviour as a basic factor in its evolutiona...
Meaningful mistakes in language behaviour as a basic factor in its evolutiona...Meaningful mistakes in language behaviour as a basic factor in its evolutiona...
Meaningful mistakes in language behaviour as a basic factor in its evolutiona...
 
Effective Approach for Disambiguating Chinese Polyphonic Ambiguity
Effective Approach for Disambiguating Chinese Polyphonic AmbiguityEffective Approach for Disambiguating Chinese Polyphonic Ambiguity
Effective Approach for Disambiguating Chinese Polyphonic Ambiguity
 
Introduction to linguistics: A Quick Review
Introduction to linguistics: A Quick ReviewIntroduction to linguistics: A Quick Review
Introduction to linguistics: A Quick Review
 
Phonics
PhonicsPhonics
Phonics
 
Conquer the Code
Conquer the CodeConquer the Code
Conquer the Code
 

More from Katerina Vylomova

Documenting and modeling inflectional paradigms in under-resourced languages
Documenting and modeling inflectional paradigms in under-resourced languages Documenting and modeling inflectional paradigms in under-resourced languages
Documenting and modeling inflectional paradigms in under-resourced languages Katerina Vylomova
 
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...Katerina Vylomova
 
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...Katerina Vylomova
 
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological InflectionSIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological InflectionKaterina Vylomova
 
Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p2Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p2Katerina Vylomova
 
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1Katerina Vylomova
 
Evaluation of Semantic Change of Harm-Related Concepts in Psychology
Evaluation of Semantic Change of Harm-Related Concepts in PsychologyEvaluation of Semantic Change of Harm-Related Concepts in Psychology
Evaluation of Semantic Change of Harm-Related Concepts in PsychologyKaterina Vylomova
 
Paradigm Completion for Derivational Morphology
Paradigm Completion for Derivational MorphologyParadigm Completion for Derivational Morphology
Paradigm Completion for Derivational MorphologyKaterina Vylomova
 
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingContemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingKaterina Vylomova
 
Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...
Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...
Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...Katerina Vylomova
 
Context-Aware Derivation Prediction // EACL 2017
Context-Aware Derivation Prediction // EACL 2017Context-Aware Derivation Prediction // EACL 2017
Context-Aware Derivation Prediction // EACL 2017Katerina Vylomova
 
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...Katerina Vylomova
 
Neural models for recognition of basic units of semiographic chants
Neural models for recognition of basic units of semiographic chantsNeural models for recognition of basic units of semiographic chants
Neural models for recognition of basic units of semiographic chantsKaterina Vylomova
 
Russia, Russians and Russian language
Russia, Russians and Russian languageRussia, Russians and Russian language
Russia, Russians and Russian languageKaterina Vylomova
 
Ekaterina Vylomova/Brown Bag seminar presentation
Ekaterina Vylomova/Brown Bag seminar presentationEkaterina Vylomova/Brown Bag seminar presentation
Ekaterina Vylomova/Brown Bag seminar presentationKaterina Vylomova
 

More from Katerina Vylomova (16)

Documenting and modeling inflectional paradigms in under-resourced languages
Documenting and modeling inflectional paradigms in under-resourced languages Documenting and modeling inflectional paradigms in under-resourced languages
Documenting and modeling inflectional paradigms in under-resourced languages
 
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
The UniMorph Project and Morphological Reinflection Task: Past, Present, and ...
 
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
The Secret Life of Words: Exploring Regularity and Systematicity (joint talk ...
 
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological InflectionSIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
 
Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p2Ekaterina vylomova-what-do-neural models-know-about-language-p2
Ekaterina vylomova-what-do-neural models-know-about-language-p2
 
Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1Ekaterina vylomova-what-do-neural models-know-about-language-p1
Ekaterina vylomova-what-do-neural models-know-about-language-p1
 
Evaluation of Semantic Change of Harm-Related Concepts in Psychology
Evaluation of Semantic Change of Harm-Related Concepts in PsychologyEvaluation of Semantic Change of Harm-Related Concepts in Psychology
Evaluation of Semantic Change of Harm-Related Concepts in Psychology
 
Paradigm Completion for Derivational Morphology
Paradigm Completion for Derivational MorphologyParadigm Completion for Derivational Morphology
Paradigm Completion for Derivational Morphology
 
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingContemporary Models of Natural Language Processing
Contemporary Models of Natural Language Processing
 
Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...
Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...
Men Are from Mars, Women Are from Venus: Evaluation and Modelling of Verbal A...
 
Context-Aware Derivation Prediction // EACL 2017
Context-Aware Derivation Prediction // EACL 2017Context-Aware Derivation Prediction // EACL 2017
Context-Aware Derivation Prediction // EACL 2017
 
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vec...
 
Working with text data
Working with text dataWorking with text data
Working with text data
 
Neural models for recognition of basic units of semiographic chants
Neural models for recognition of basic units of semiographic chantsNeural models for recognition of basic units of semiographic chants
Neural models for recognition of basic units of semiographic chants
 
Russia, Russians and Russian language
Russia, Russians and Russian languageRussia, Russians and Russian language
Russia, Russians and Russian language
 
Ekaterina Vylomova/Brown Bag seminar presentation
Ekaterina Vylomova/Brown Bag seminar presentationEkaterina Vylomova/Brown Bag seminar presentation
Ekaterina Vylomova/Brown Bag seminar presentation
 

Recently uploaded

FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptxAlMamun560346
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONrouseeyyy
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)AkefAfaneh2
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Mohammad Khajehpour
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 

Recently uploaded (20)

FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 

Contextualization of Morphological Inflection

  • 1. Contextualization of Morphological Inflection Ekaterina Vylomova1 Ryan Cotterell2 Timothy Baldwin1 Trevor Cohn1 Jason Eisner2 1School of Computing and Information Systems The University of Melbourne 2Department of Computer Science Johns Hopkins University Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 1 / 39
  • 2. Language Modelling This is Marvin: Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 2 / 39
  • 3. Language Modelling OK, Marvin, which word comes next: Two cats are ___ Hmmm, let me guess ... sitting 3.01 ∗ 10−4 play 2.87 ∗ 10−4 running 2.53 ∗ 10−4 nice 2.32 ∗ 10−4 lost 1.97 ∗ 10−4 playing 1.66 ∗ 10−4 sat 1.54 ∗ 10−4 plays 1.32 ∗ 10−4 ... ...Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 3 / 39
  • 4. Language Modelling Let’s add a constraint by providing a lemma: Two cats are [PLAY] That narrows things down a lot ... sitting 3.01 ∗ 10−4 play 2.87 ∗ 10−4 running 2.53 ∗ 10−4 nice 2.32 ∗ 10−4 lost 1.97 ∗ 10−4 playing 1.66 ∗ 10−4 sat 1.54 ∗ 10−4 plays 1.32 ∗ 10−4 ... ...Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 4 / 39
  • 5. Language Modelling Hey, this reminds me a bit of .... a wug ... and a second wug: Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 5 / 39
  • 6. Morphological (Re-)Inflection ... as well as the SIGMORPHON morphological inflection task SIGMORPHON Shared Task 2016–2019 PLAY + PRESENT PARTICIPLE → playing played + PRESENT PARTICIPLE → playing Lemma Tag Form RUN PAST ran RUN PRES;1SG run RUN PRES;2SG run RUN PRES;3SG runs RUN PRES;PL run RUN PART running Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 6 / 39 2018 :∼ 96% accuracy on avg. in high-resource setting
  • 7. Morphological (Re-)Inflection Contextualization: But why choose PRESENT PARTICIPLE? Context! SIGMORPHON Shared Task 2016–2019 PLAY + PRESENT PARTICIPLE → playing played + PRESENT PARTICIPLE → playing Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 7 / 39
  • 8. Morphological (Re-)Inflection Contextualization: The tags must be inferred from the context! SIGMORPHON Shared Task 2018 Task 2 SubTask 1 Two cats are ??? together TWO/NUM CAT/N+PL BE/AUX+PRES+3PL PLAY TOGETHER/ADV SubTask 2 Two cats are ??? together PLAY Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 8 / 39
  • 9. Morphological (Re-)Inflection Contextualization: The tags must be inferred from the context! SIGMORPHON Shared Task 2018 Task 2 SubTask 1 Two cats are playing together TWO/NUM CAT/N+PL BE/AUX+PRES+3PL PLAY TOGETHER/ADV SubTask 2 Two cats are playing together PLAY Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 9 / 39
  • 10. A Hybrid (Structured–unstructured) Model Let’s predict both tags and forms! Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 10 / 39 Lemmatized Sequence Predicted Tag Sequence Predicted Form Sequence
  • 11. A Hybrid (Structured–unstructured) Model ... or, in other words, p(w, m | ) = ( n i=1 p(wi | i , mi )) p(m | ) Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 11 / 39 Lemmatized Sequence Predicted Tag Sequence Predicted Form Sequence
  • 12. A Hybrid (Structured–unstructured) Model ... or, in other words, p(w, m | ) = ( n i=1 p(wi | i , mi )) p(m | ) Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 12 / 39 Lemmatized Sequence p(m | ) Neural CRF Lample et al., 2016 Predicted Tag Sequence Predicted Form Sequence Rastogi et al., 2016 Aharoni et al., 2017 p(wi | i , mi ) Word Emission
  • 13. Languages and Grammar Categories Let’s test the model on a wide variety of languages! Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 13 / 39
  • 14. Languages and Grammar Categories Languages differ in what is explicitly morphosyntactically marked, and how: Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 14 / 39 Number of values of an inflectional feature Bulgarian (bg), Slavic English (en), Germanic Basque (eu), Isolate Finnish (fi), Uralic Gaelic (ga), Celtic Hindi (hi), Indic Italian (it), Romance Latin (la), Romance Polish (pl), Slavic Swedish (sv), Germanic
  • 15. Languages and Grammar Categories Some languages use word order to express relations between words, while others use morphosyntactic marking: English: Kim gives Sandy an interesting book Polish: Jenia daje Maszy ciekawą książkę Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 15 / 39
  • 16. Languages and Grammar Categories Some languages use word order to express relations between words, while others use morphosyntactic marking: English: Kim gives Sandy an interesting book Subject IObject DObject Polish: Jenia daje Maszy ciekawą książkę Nom Dat Acc.Fem.Sg Acc.Sg == Maszy daje Jenia ciekawą książkę == ciekawą książkę daje Jenia Maszy != Jenie daje Masza ciekawą książkę Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 16 / 39
  • 17. Experiments How well can such categories and corresponding forms be predicted in each language? Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 17 / 39
  • 18. Experiments How well can such categories and corresponding forms be predicted in each language? Do linguistic features enhance performance? Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 18 / 39
  • 19. Experiments How well can such categories and corresponding forms be predicted in each language? Do linguistic features enhance performance? Does morphological complexity impact on empirical performance? Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 19 / 39
  • 20. Experiments Data: Universal Dependencies v1.2 Baselines: the baseline of the SIGMORPHON 2018 shared task as well as the best performing system of that year Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 20 / 39 Nivre et al.,2016
  • 21. Experiments SM: biLSTM encoder–decoder with context window of size 2 input = concat (prefix+suffix, lemma, tags, char-level center lemma) Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 21 / 39 Cotterell et al.,2018
  • 22. Experiments CPH: biLSTM encoder–decoder with no context window size restrictions input = concat (full context, lemma, tags, char-level center lemma) also predicts target tags as an auxiliary task Direct: more basic model that relies only on forms and lemmas Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 22 / 39 Kementchedjhieva et al.,2018
  • 23. Experiments Let’s condition only on contextual forms and lemmas (1-best accuracy for form prediction): 0 25 50 75 100 BG EN EU FI GA HI IT LA PL SV 1.Direct Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 23 / 39
  • 24. Experiments Now also supply contextual tag information, still predicting forms only: 0 25 50 75 100 BG EN EU FI GA HI IT LA PL SV 1.Direct 2.SM Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 24 / 39
  • 25. Experiments Now use a wider context and predict tags as an auxiliary task: 0 25 50 75 100 BG EN EU FI GA HI IT LA PL SV 1.Direct 2.SM 3.CPH Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 25 / 39
  • 26. Experiments Finally, use neural CRF to predict tag sequence and hard monotonic attention model for forms: 0 25 50 75 100 BG EN EU FI GA HI IT LA PL SV 1.Direct 2.SM 3.CPH 4.Joint Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 26 / 39
  • 27. Experiments How far are we from the results for forms predicted from gold tag sequence? 0 25 50 75 100 BG EN EU FI GA HI IT LA PL SV 1.Direct 2.SM 3.CPH 4.Joint 5.Gold Tags Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 27 / 39
  • 28. Discussion Q1: Do linguistic features help? Yes, they do! Most systems that make use of morphological tags outperform the “Direct” baseline on most languages Joint prediction of tags and forms further improves the results Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 28 / 39
  • 29. Discussion Q2: Does morphological complexity impact empirical performance? Yes, it does! Performance drops in languages with rich case systems such as Slavic and Uralic The model needs to learn which grammatical categories should be in agreement Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 29 / 39
  • 30. Discussion Q3: How well is agreement captured? Adjective – Noun (AMod) is captured quite well Verb – Noun (Subject – Verb) is more challenging, since agreement categories can vary depending on tense, e.g. Ru: Person+Number in present vs. Gender in past, singular General-purpose inference of agreement categories is still a challenging task! Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 30 / 39
  • 31. Discussion Q4: Where does most uncertainty come from? Inherent and Contextual Morphological Categories Contextual categories participate in agreement: adjective number, case, gender, verbal gender, etc. Inherent express the speaker’s intentions: noun number, verbal tense Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 31 / 39
  • 32. Discussion Q4: Where does most uncertainty come from? Inherent and Contextual Morphological Categories Contextual categories participate in agreement: adjective number, case, gender, verbal gender, etc. Inherent express the speaker’s intentions: noun number, verbal tense Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 32 / 39 Most uncertainty comes from inherent categories!
  • 33. Discussion Q4: Where does most uncertainty come from? Inherent and Contextual Morphological Categories Contextual categories participate in agreement: adjective number, case, gender, verbal gender, etc. Inherent express the speaker’s intentions: noun number, verbal tense Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 33 / 39 Most uncertainty comes from inherent categories! Often such categories must be inferred
  • 34. Discussion Q5: Which language is least affected by lemmatization? Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 34 / 39
  • 35. Discussion Q5: Which language is least affected by lemmatization? Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 35 / 39
  • 36. Discussion Q5: Which language is least affected by lemmatization? Word Order vs. Morphology Most information on roles and dependencies is expressed non-morphologically, e.g. in word order or by prepositions: EN: Kim gives Sandy an interesting book → KIM GIVE SANDY AN INTERESTING BOOK PL: Jenia daje Maszy ciekawą książkę → JENIA DAWAĆ’ MASZA CIEKAWY KSIĄŻKA Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 36 / 39 Why English?
  • 37. Discussion Q5: Which language is least affected by lemmatization? Word Order vs. Morphology Most information on roles and dependencies is expressed non-morphologically, e.g. in word order or by prepositions: EN: Kim gives Sandy an interesting book → KIM GIVE SANDY AN INTERESTING BOOK PL: Jenia daje Maszy ciekawą książkę → JENIA DAWAĆ’ MASZA CIEKAWY KSIĄŻKA Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 37 / 39 Why English? SVO/Roles are still there Flexible/Roles are partially lost
  • 38. Future Directions Evaluation of grammaticality How well do neural models model grammaticality? Data de-biasing (e.g., En–Ru ) smart student → umnyj.Nom.Masc.Sg student.Nom.Sg augment with: smart student → umnaja.Nom.Fem.Sg studentka.Nom.Fem.Sg Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 38 / 39
  • 39. Thank you! Questions? Vylomova, Cotterell, Baldwin, Cohn, Eisner Contextualization of Morphological Inflection 39 / 39