SlideShare a Scribd company logo
1 of 73
Download to read offline
Automated Writing Assistance:
             Grammar Checking and Beyond
              Topic 4: Handling ESL Errors
                         Robert Dale
              Centre for Language Technology
                   Macquarie University


SSLST 2011                                     1
From the Pasadena Star-News 26th Aug 2011




SSLST 2011                                  2
Outline

       •     Background
       •     Article Errors
       •     Preposition Errors
       •     Other ESL Problems
       •     Conclusions




SSLST 2011                        3
Terminology

       • ESL = English as a Second Language
          – Refers to non-native speakers living and speaking in a
            predominantly English-speaking environment
       • EFL = English as a Foreign Language
          – Refers to non-native speakers studying and learning English
            in a non-English speaking country
       • We’ll generally use the term ESL to refer to both
       • Apologies that this is mostly about ESL – there’s less work in
         other languages …

SSLST 2011                                                                4
The Problem

       • Lots of people want to speak English: it is the most commonly
         studied second language
       • Over 1 billion people speak English as a second or a foreign
         language
       • Existing grammar checking tools are not, so far, tailored to the
         needs of ESL learners




SSLST 2011                                                                  5
ESL Errors Are Different:
Bolt [1992]

       • Bolt tested seven grammar-checking programs of the time
         against 35 sentences containing ESL errors
       • Looked at from the perspective of a learner of English at a
         fairly low level of competence
       • Conclusions:
           – ‘all of these programs fail in terms of the criteria that have
             been used.’
           – Expectations are encouraged that cannot be fulfilled
           – Silence on the part of a program suggests everything is ok

SSLST 2011                                                                    6
ESL Errors Are Different:
Donahue [2001] vs Connors + Lundsford [1988]
        Error                                                    US      ESL
       No comma after introductory element                       1    negligible
       Vague pronoun reference                                   2    negligible
       No comma in compound sentence                             3       12
       Wrong word                                                4        2
         No comma in nonrestrictive element                       5   negligible
         Wrong or missing inflected ends                          6       6
         Wrong or missing preposition                             7       5
         Comma splice                                             8       1
         Possessive apostrophe error                              9   negligible
         Tense shift                                             10   negligible
         Unnecessary shift in person                             11      15
         Sentence fragment                                       12       7
         Wrong tense or verb form                                13       4
         Subject-verb agreement                                  14      11
         Lack of comma in a series                               15   negligible
         Pronoun agreement error                                 16   negligible
         Unnecessary commas with restrictive relative pronouns   17   negligible
         Run on, fused sentences                                 18       8
         Dangling, misplaced modifier                            19   negligible
SSLST 2011 it’s confusion
         Its,                                                    20   negligible   7
ESL Errors Are Different
             Error                                     US           ESL
        Missing words                               negligible       3
        Capitalization                              negligible       9
        Wrong pronoun                               negligible      16
        a, an confusion                             negligible      14
        Missing article                             negligible      17
        Wrong verb form                             negligible      10
        No comma before etc.                        negligible      13


       • Half of the ten most frequent error types made by native
         speakers are negligible in the writing of the ESL population




SSLST 2011                                                                8
Errors in the Cambridge Learners Corpus




SSLST 2011                                9
Common ESL Errors

       • The most difficult aspects of English for ESL learners are:
          – Definite and indefinite articles
          – Prepositions
       • Together these account for 2050% of grammar and usage
         errors
       • [The elephant in the room: spelling errors are much more
         common, and incorrect word choice is as problematic as article
         and preposition errors.]


SSLST 2011                                                                10
Article Errors in the CLC by L1
                    L1           Has Articles   Proportion
                    Russian            No            0.186
                    Korean             No            0.176
                    Japanese           No            0.159
                    Chinese            No            0.125
                    Greek              Yes           0.087
                    French             Yes           0.081
                    Spanish            Yes           0.070
                    German             Yes           0.053

             Proportion of sentences with one or more article errors

SSLST 2011                                                             11
Preposition Errors in the CLC by L1
                            L1           Proportion
                            Greek             0.149
                            Spanish           0.139
                            Korean            0.128
                            Chinese           0.122
                            French            0.121
                            Japanese          0.118
                            German            0.100
                            Russian           0.095

             Proportion of sentences with one or more preposition errors

SSLST 2011                                                                 12
The Impact of L1 on ESL Errors

       • Learning will be difficult if the L1 has no close equivalent for a
         feature:
           – Native speakers of Japanese and Russian will have particular
             difficulty mastering the use of articles.
       • Learning will be facilitated if the L1 has an equivalent feature:
           – Native speakers of French or German should find the English
             article system relatively easy to learn.




SSLST 2011                                                                    13
A Note on Data

       • The field has been hamstrung by the privately held nature of
         many learner corpora
       • Two welcome changes:
          – The NUS Corpus of Learner English
          – The Cambridge Learner Corpus FCE Dataset
       • Also the much smaller HOO dataset




SSLST 2011                                                              14
NUCLE: The NUS Corpus of Learner English

       • 1400 essays written by University students at the National
         University of Singapore
       • Over 1M words annotated with error tags and corrections
       • See http://nlp.comp.nus.edu.sg/corpora




SSLST 2011                                                            15
NUCLE: The NUS Corpus of Learner English

       Standoff annotation:

       <MISTAKE start_par="4" start_off="194" end_par="4" end_off="195">
         <TYPE>ArtOrDet</TYPE>
         <CORRECTION>an</CORRECTION>
       </MISTAKE>




SSLST 2011                                                             16
The CLC FCE Dataset

       • A set of 1,244 exam scripts written by candidates sitting the
         Cambridge ESOL First Certificate in English (FCE) examination
         in 2000 and 2001
       • Annotated with errors and corrections
       • A subset of the much larger 30M-word Cambridge Learner
         Corpus
       • See http://ilexir.co.uk/applications/clc-fce-dataset/




SSLST 2011                                                               17
The CLC FCE Dataset

       Inline annotation:
       • Because <NS type="UQ"><i>all</i></NS> students in <NS
          type="MD"><c>the</c></NS> English class are from all
          over the world …




SSLST 2011                                                       18
The HOO Dataset

       • HOO – Helping Our Own – aims to marshall NLP technology to
         help non-native speakers write ACL papers
       • Very small corpus (~36K words) annotated with errors and
         corrections
       • Evaluation software also freely available
       • See http://www.clt.mq.edu.au/research/projects/hoo/




SSLST 2011                                                            19
The HOO Dataset

       Stand-off and inline annotation both available:
       • In our experiments, pseudo-words are fed into <edit
         type="MD"><empty/><corrections><correction>the</correct
         ion></corrections></edit> PB-SMT pipeline.
       • <edit index="1005-0016" type="MD" start="871" end="871" >
         <original><empty/></original>
           <corrections>
              <correction>the </correction>
           </corrections>
         </edit>

SSLST 2011                                                           20
SSLST 2011   21
Outline

       •     Background
       •     Article Errors
       •     Preposition Errors
       •     Other ESL Problems
       •     Conclusions




SSLST 2011                        22
Article Errors

       •     The Problem
       •     Early Rule-based Approaches
       •     Knight and Chandler [1994]
       •     Han et al [2006]
       •     De Felice and Pulman [2008]




SSLST 2011                                 23
Why is Article Choice Hard?

       • Basic problem for speakers of languages that do not use
         articles:
          – choose between a/an, the, and the null determiner
       • The bottom line: it comes down to context
          – I was eating a cake.
          – I was eating the cake.
          – I was eating cake.



SSLST 2011                                                         24
Features Impacting Article Choice:
Countability

       • Count nouns take determiners:
          – I read the paper yesterday.
       • Mass nouns don’t take determiners:
          – We generally write on paper.
       • But the universal grinder and the universal packager [Pelletier
         1975] are always available:
          – There was dog all over the road.
          – Could we have just one rice please?

SSLST 2011                                                                 25
Features Impacting Article Choice:
Countability

       • Semi-idiomatic forms:
          – I looked him in the eye.
          – *I looked him in an eye.




SSLST 2011                             26
Features Impacting Article Choice:
Syntactic Context

       I have knowledge.
        I have a knowledge.
       I have knowledge of this.
        I have a knowledge of this.
       I have a knowledge of English.




SSLST 2011                               27
Features Impacting Article Choice:
Discourse Factors

       • Stereotypically, entities are introduced into a discourse using
         an indefinite determiner and subsequently referred to using a
         definite determiner
          – I saw a man at the bus stop. … The man was crying.
       • But not always:
          – A bus turned the corner. The driver was crying.
          – I went to the beach yesterday.



SSLST 2011                                                                 28
Features Impacting Article Choice:
World Knowledge

       • He bought a Honda.
       • He bought Honda.




SSLST 2011                           29
Article Errors

       •     The Problem
       •     Early Rule-based Approaches
       •     Knight and Chandler [1994]
       •     Han et al [2006]
       •     De Felice and Pulman [2008]




SSLST 2011                                 30
Early Work:
Article Insertion in Machine Translation

       • The Problem:
          – Machine translation of languages like Japanese or Russian
            into English is difficult because the source language doesn’t
            contain articles




SSLST 2011                                                                  31
Murata and Nagao [1993]:
Hand-Crafted Rules

       • When a noun is modified by a referential pronoun (KONO(this),
         SONO(its), …) then {indefinite(0, 0), definite(1, 2), generic(0, 0)}
       • When a noun is accompanied by a particle (WA), and the predicate
         has past tense, then {indefinite(1, 0), definite(1, 3), generic(1, 1)}
       • When a noun is accompanied by a particle (WA), and the predicate
         has present tense, then {indefinite(1, 0), definite(1, 2), generic(1,
         3)}
       • When a noun is accompanied by a particle HE(to), MADE(up to) or
         KARA(from), then {indefinite(1, 0), definite(1, 2), generic(1, 0)}
       • …

SSLST 2011                                                                        32
Article Errors

       •     The Problem
       •     Early Rule-based Approaches
       •     Knight and Chandler [1994]
       •     Han et al [2006]
       •     De Felice and Pulman [2008]




SSLST 2011                                 33
Knight and Chandler [1994]:
A Data-Driven Method for Post-Editing

       • General aim:
          – To build a post-editing tool that can fix errors made in a
            machine translation system
       • Specific task:
          – Article insertion: a, an or the




SSLST 2011                                                               34
Knight and Chandler [1994]:
Before and After
       Stelco Inc. said it plans to shut down      Stelco Inc. said it plans to shut down
       three Toronto-area plants, moving their     three Toronto-area plants, moving their
       fastener operations to leased facility in   fastener operations to a leased facility in
       Brantford, Ontario.                         Brantford, Ontario.
       Company said fastener business “has         The company said the fastener business
       been under severe cost pressure for         “has been under severe cost pressure
       some time.” Fasteners, nuts and bolts       for some time.” The fasteners, nuts and
       are sold to North American auto market.     bolts are sold to the North American
       Company spokesman declined to               auto market.
       estimate impact of closures on earnings.    A company spokesman declined to
       He said new facility will employ 500 of     estimate the impact of the closures on
       existing 600 employees. Steelmaker          earnings. He said the new facility will
       employs about 16,000 people.                employ 500 of the existing 600
                                                   employees. The steelmaker employs
                                                   about 16,000 people.
SSLST 2011                                                                                       35
Knight and Chandler [1994]:
The General Idea

       The steps:
       • Take newspaper-quality English text
       • Remove articles
       • Re-insert automatically
       • Compare results with the original text
       Assumptions:
       • NPs are marked as singular or plural
       • Locations of articles already marked so it’s a binary choice
         between the and a/an.
SSLST 2011                                                              36
Knight and Chandler [1994]:
Baseline

       • In 40Mb of Wall Street Journal text:
           a = 28.2%
           an = 4.6%
           the = 67.2%
       • So 67% is a good lower-bound
       • Upper-bound:
           – Human subjects performed with accuracy of 94%-96%


SSLST 2011                                                       37
Knight and Chandler [1994]:
Baselines


                                     Human    Machine
             Random                   50%      50%
             Always guess the         67%      67%
             Given core context NP   79-80%
             Given NP + 4 words      83-88%     ?
             Given full context      94-96%




SSLST 2011                                              38
Knight and Chandler [1994]:
Approach

       • Characterize NPs via sets of features then use a build decision
         tree to classify
       • Lexical features:
           – ‘word before the article is triple’
       • Abstract features:
           – ‘word after the head noun is a past tense verb’
       • 400k training examples and 30k features; features with less
         than 4 instances discarded


SSLST 2011                                                                 39
Knight and Chandler [1994]:
Performance

       • On 1600 trees for the 1600 most frequent head nouns
         (covering 77% of test instances):
          – 81% accuracy
       • Guess the for the remaining 23% of test instances
          – 78% accuracy overall




SSLST 2011                                                     40
Article Errors

       •     The Problem
       •     Early Rule-based Approaches
       •     Knight and Chandler [1994]
       •     Han et al [2006]
       •     De Felice and Pulman [2008]




SSLST 2011                                 41
Han et al [2006]:
A MaxEnt Approach to Article Selection

       • Basic Approach:
          – A maximum entropy classifier for selecting amongst a/an,
            the or the null determiner
          – Uses local context features such as words and PoS tags




SSLST 2011                                                             42
Han et al [2006]:
Contrasts with Earlier Work

       • More varied training corpus: a range of genres
          – 721 text files, 31.5M words
          – 10th thru 12th grade reading level
       • Much larger training corpus: 6 million NPs (15x larger)
          – Automatically PoS tagged + NP-chunked
       • The use of a maximum entropy classifier




SSLST 2011                                                         43
Han et al [2006]:
Training Results

       •     6M NPs in training set
       •     390k features
       •     Baseline = 71.84% (frequency of null determiner)
       •     Four-fold cross validation
              – performance range 87.59% to 88.29%
              – Average 87.99%




SSLST 2011                                                      44
Han et al [2006]:
Effectiveness of Individual Features
             Feature                          % Correct
             Word/PoS of all words in NP       80.41
             Word/PoS of w(NP-1) + Head/PoS    77.98
             Head/PoS                          77.30
             PoS of all words in NP            73.96
             Word/PoS of w(NP+1)               72.97
             Word/PoS of w(NP[1])              72.53
             PoS of w(NP[1])                   72.52
             Word/PoS of w(NP-1)               72.30
             PoS of Head                       71.98
             Head’s Countability               71.85
             Word/PoS of w(NP-2)               71.85
             Default to null determiner        71.84
SSLST 2011                                                45
Han et al [2006]:
Effectiveness of Individual Features

       • Best feature: Word/PoS of all words in NP
          – Ok if you have a large enough corpus!
       • Second best: W(NP-1) + Head
          – Eg ‘in summary’




SSLST 2011                                           46
Han et al [2006]:
Accuracy by Head Noun Type

             Syntactic Type of Head   % Correct
             Singular Noun             80.99
             Plural Noun               85.02
             Pronoun                   99.66
             Proper Noun, Singular     90.42
             Proper Noun, Plural       82.05
             Number                    92.71
             Demonstrative Pronoun     99.70
             Other                     97.81




SSLST 2011                                        47
Han et al [2006]:
Accuracy as a Function of Training Set Size

             #NPs in Training Set   % Correct
                         150000      83.49
                         300000      84.92
                         600000      85.75
                        1200000      86.59
                        2400000      87.27
                        4800000      87.92
                        6000000      87.99




SSLST 2011                                      48
Han et al [2006]:
Applying the Model to TOEFL Essays

       • Model retrained only on NPs with a common head noun
          – Baseline = frequency of null determiner = 54.40%
          – Training set kept at 6M instances by adding more data
          – Average accuracy = 83.00%
       • Model applied to 668 TOEFL essays w 29759 NPs
          – Subset of NPs classified by two annotators
          – Agreement on 98% of cases with kappa = 0.86
          – One article error every 8 NPs

SSLST 2011                                                          49
Han et al [2006]:
Some Examples

       Above all, I think it is good for students to share room with
       others.
       • Human: missing a or an
       • Classifier: 0.841 a/an; 0.143 the; 0.014 zero
       Those excellent hitters began practicing the baseball when they
       were children, and dedicated a lot of time to become highly
       qualified.
       • Human: superfluous determiner
       • Classifier: 0.103 a/an; 0.016 the; 0.879 zero

SSLST 2011                                                               50
Han et al [2006]:
Results on TOEFL Essays

       • 79% of errors in test set correctly detected
       • Many false positives, so precision only 44%
       • Decisions often borderline:
          – The books are assigned by professors.
          – Marked by annotators as correct, model predicts the (0.51)
            and null (0.49)




SSLST 2011                                                               51
Han et al [2006]:
Sources of Error

       • Model performs poorly on decision between a and the
          – Probably due to the need for discourse information
       • So, new feature: has the head noun appeared before, and if so,
         with what article?
          – No significant effect on performance
       • Error analysis suggests this is due to more complex discourse
         behaviour:
          – A student will not learn if she hates the teacher.
          – … the possibilities that a scholarship would afford …
SSLST 2011                                                                52
Article Errors

       •     The Problem
       •     Early Rule-based Approaches
       •     Knight and Chandler [1994]
       •     Han et al [2006]
       •     De Felice and Pulman [2008]




SSLST 2011                                 53
De Felice and Pulman [2008]:
Richer Syntactic and Semantic Features

       • Basic Approach:
          – As in Han et al [2006], a maximum entropy classifier for
            selecting amongst a/an, the or the null determiner
          – Use a richer set of syntactic and semantic features




SSLST 2011                                                             54
De Felice and Pulman [2008]:
Main Features
             Feature                       Value
             Head Noun                     ‘apple’
             Number                        Singular
             Noun Type                     Count
             Named Entity?                 No
             WordNet Category              Food, Plant
             Prepositional Modification?   Yes, ‘on’
             Object of Preposition?        No
             Adjectival Modification?      Yes, ‘juicy’
             Adjectival Grade              Superlative
             POS3                         VV, DT, JJS, IN, DT, NN

                 Example: Pick the juiciest apple on the tree.
SSLST 2011                                                           55
De Felice and Pulman [2008]:
Additional Features

       • Whether the noun is modified by a predeterminer, possessive,
         numeral and/or a relative clause
       • Whether it is part of a ‘there is …’ phrase




SSLST 2011                                                              56
De Felice and Pulman [2008]:
Performance

       • Trained on British National Corpus
          – 4,043,925 instances
       • Test set of 305,264 BNC instances
       • Baseline = 59.83% (choose null)
       • Accuracy = 92.15%




SSLST 2011                                    57
De Felice and Pulman [2008]:
Comparative Performance on L1 Data

             Author                      Accuracy
             Baseline                    59.83%
             Han et al 2006              83.00%
             Gamon et al 2008            86.07%
             Turner and Charniak 2007    86.74%
             De Felice and Pulman 2008   92.15%




SSLST 2011                                          58
De Felice and Pulman [2008]:
Results on Individual Determiners

                          % of Training Data   Precision    Recall
              a              9.61% (388,476)   70.52%       53.50%
              the         29.19% (1,180,435)   85.17%       91.51%
              null        61.20% (2,475,014)   98.63%       98.79%


       • The indefinite determiner is less frequent and harder to learn




SSLST 2011                                                                59
De Felice and Pulman [2008]:
Testing on L2 Text

       • 3200 instances extracted from the CLC
          – 2000 correct
          – 1200 incorrect
       • Accuracy on correct instances: 92.2%
       • Accuracy on incorrect instances: < 10%
       • Most frequent incorrect usage is a missing determiner
          – Model behaviour influenced by skew in training data
       • Also problems in extracting NLP features from L2 data

SSLST 2011                                                        60
Outline

       •     Background
       •     Article Errors
       •     Preposition Errors
       •     Other ESL Problems
       •     Conclusions




SSLST 2011                        61
The Prevalence of Preposition Errors
                         L1           Proportion
                         Greek             0.149
                         Spanish           0.139
                         Korean            0.128
                         Chinese           0.122
                         French            0.121
                         Japanese          0.118
                         German            0.100
                         Russian           0.095

       Proportion of sentences in the CLC with one or more preposition
       errors
SSLST 2011                                                               62
Prepositions Have Many Roles in English

       • They appear in adjuncts:
          – In total, I spent $64 million dollars.
       • They mark the arguments of verbs:
          – I’ll give ten cents to the next guy.
       • They figure in phrasal verbs:
          – I ran away when I was ten.
       • They play a part in idioms:
          – She talked down to him.

SSLST 2011                                           63
Negative Transfer

       • Many prepositions have a most typical or frequent translation
          – Eg: of in English to de in French
       • But for many prepositions there are multiple translational
         possibilities
          – ESL speakers can easily choose the wrong one
          – Eg: driving in a high speed




SSLST 2011                                                               64
Prepositions in English

       • English has over 100 prepositions, including some multiword
         prepositions and a small number of postpositions
       • The 10 most frequent account for 82% of the errors in the CLC




SSLST 2011                                                               65
Preposition Selection in Well-Formed Text




SSLST 2011                                  66
Preposition Error Detection on Learner Data




SSLST 2011                                    67
Outline

       •     Background
       •     Article Errors
       •     Preposition Errors
       •     Other ESL Problems
       •     Conclusions




SSLST 2011                        68
Collocations

       • Conventional combinations that are preferred over other equally
         syntactically and semantically valid combinations
          – Adj + Noun: stiff breeze vs rigid breeze
          – Verb + Noun: hold an election vs make an election
          – Noun + Noun: movie theatre vs film theatre
          – Adverb + Verb: thoroughly amuse vs completely amuse




SSLST 2011                                                                 69
Collocations

       • Computational approaches generally make use of distributional
         differences for detecting and correcting errors
       • Same general approach as in articles and prepositions:
          – Choose preferred form from a set of alternatives
          – But: the confusion set is potentially much larger
       • Solution:
          – Constrain the space by selecting alternatives with a similar
             meaning
       • See work on automatic thesaurus construction [eg Lin 1998]
SSLST 2011                                                                 70
Verb Form Errors


             Error Type               Example
             Subject-Verb Agreement   He have been living here since June.
             Auxiliary Agreement      He has been live here since June.
             Complementation          He wants live here.


       • See Lee and Seneff [2008] for a method based on detecting
         specific irregularities in parse trees.


SSLST 2011                                                                   71
Outline

       •     Background
       •     Article Errors
       •     Preposition Errors
       •     Other ESL Problems
       •     Conclusions




SSLST 2011                        72
Conclusions

       • The provision of assistance to ESL learners is clearly a
         significant market
       • Technology is at a very early stage, focussing on specific
         subproblems
       • Measurable progress has been hampered by the unavailability
         of shared data sets, but this is changing




SSLST 2011                                                             73

More Related Content

Recently uploaded

Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 

Recently uploaded (20)

Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 

Featured

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Tarragona Summer School/Automated Writing Assistance Topic 4

  • 1. Automated Writing Assistance: Grammar Checking and Beyond Topic 4: Handling ESL Errors Robert Dale Centre for Language Technology Macquarie University SSLST 2011 1
  • 2. From the Pasadena Star-News 26th Aug 2011 SSLST 2011 2
  • 3. Outline • Background • Article Errors • Preposition Errors • Other ESL Problems • Conclusions SSLST 2011 3
  • 4. Terminology • ESL = English as a Second Language – Refers to non-native speakers living and speaking in a predominantly English-speaking environment • EFL = English as a Foreign Language – Refers to non-native speakers studying and learning English in a non-English speaking country • We’ll generally use the term ESL to refer to both • Apologies that this is mostly about ESL – there’s less work in other languages … SSLST 2011 4
  • 5. The Problem • Lots of people want to speak English: it is the most commonly studied second language • Over 1 billion people speak English as a second or a foreign language • Existing grammar checking tools are not, so far, tailored to the needs of ESL learners SSLST 2011 5
  • 6. ESL Errors Are Different: Bolt [1992] • Bolt tested seven grammar-checking programs of the time against 35 sentences containing ESL errors • Looked at from the perspective of a learner of English at a fairly low level of competence • Conclusions: – ‘all of these programs fail in terms of the criteria that have been used.’ – Expectations are encouraged that cannot be fulfilled – Silence on the part of a program suggests everything is ok SSLST 2011 6
  • 7. ESL Errors Are Different: Donahue [2001] vs Connors + Lundsford [1988] Error US ESL No comma after introductory element 1 negligible Vague pronoun reference 2 negligible No comma in compound sentence 3 12 Wrong word 4 2 No comma in nonrestrictive element 5 negligible Wrong or missing inflected ends 6 6 Wrong or missing preposition 7 5 Comma splice 8 1 Possessive apostrophe error 9 negligible Tense shift 10 negligible Unnecessary shift in person 11 15 Sentence fragment 12 7 Wrong tense or verb form 13 4 Subject-verb agreement 14 11 Lack of comma in a series 15 negligible Pronoun agreement error 16 negligible Unnecessary commas with restrictive relative pronouns 17 negligible Run on, fused sentences 18 8 Dangling, misplaced modifier 19 negligible SSLST 2011 it’s confusion Its, 20 negligible 7
  • 8. ESL Errors Are Different Error US ESL Missing words negligible 3 Capitalization negligible 9 Wrong pronoun negligible 16 a, an confusion negligible 14 Missing article negligible 17 Wrong verb form negligible 10 No comma before etc. negligible 13 • Half of the ten most frequent error types made by native speakers are negligible in the writing of the ESL population SSLST 2011 8
  • 9. Errors in the Cambridge Learners Corpus SSLST 2011 9
  • 10. Common ESL Errors • The most difficult aspects of English for ESL learners are: – Definite and indefinite articles – Prepositions • Together these account for 2050% of grammar and usage errors • [The elephant in the room: spelling errors are much more common, and incorrect word choice is as problematic as article and preposition errors.] SSLST 2011 10
  • 11. Article Errors in the CLC by L1 L1 Has Articles Proportion Russian No 0.186 Korean No 0.176 Japanese No 0.159 Chinese No 0.125 Greek Yes 0.087 French Yes 0.081 Spanish Yes 0.070 German Yes 0.053 Proportion of sentences with one or more article errors SSLST 2011 11
  • 12. Preposition Errors in the CLC by L1 L1 Proportion Greek 0.149 Spanish 0.139 Korean 0.128 Chinese 0.122 French 0.121 Japanese 0.118 German 0.100 Russian 0.095 Proportion of sentences with one or more preposition errors SSLST 2011 12
  • 13. The Impact of L1 on ESL Errors • Learning will be difficult if the L1 has no close equivalent for a feature: – Native speakers of Japanese and Russian will have particular difficulty mastering the use of articles. • Learning will be facilitated if the L1 has an equivalent feature: – Native speakers of French or German should find the English article system relatively easy to learn. SSLST 2011 13
  • 14. A Note on Data • The field has been hamstrung by the privately held nature of many learner corpora • Two welcome changes: – The NUS Corpus of Learner English – The Cambridge Learner Corpus FCE Dataset • Also the much smaller HOO dataset SSLST 2011 14
  • 15. NUCLE: The NUS Corpus of Learner English • 1400 essays written by University students at the National University of Singapore • Over 1M words annotated with error tags and corrections • See http://nlp.comp.nus.edu.sg/corpora SSLST 2011 15
  • 16. NUCLE: The NUS Corpus of Learner English Standoff annotation: <MISTAKE start_par="4" start_off="194" end_par="4" end_off="195"> <TYPE>ArtOrDet</TYPE> <CORRECTION>an</CORRECTION> </MISTAKE> SSLST 2011 16
  • 17. The CLC FCE Dataset • A set of 1,244 exam scripts written by candidates sitting the Cambridge ESOL First Certificate in English (FCE) examination in 2000 and 2001 • Annotated with errors and corrections • A subset of the much larger 30M-word Cambridge Learner Corpus • See http://ilexir.co.uk/applications/clc-fce-dataset/ SSLST 2011 17
  • 18. The CLC FCE Dataset Inline annotation: • Because <NS type="UQ"><i>all</i></NS> students in <NS type="MD"><c>the</c></NS> English class are from all over the world … SSLST 2011 18
  • 19. The HOO Dataset • HOO – Helping Our Own – aims to marshall NLP technology to help non-native speakers write ACL papers • Very small corpus (~36K words) annotated with errors and corrections • Evaluation software also freely available • See http://www.clt.mq.edu.au/research/projects/hoo/ SSLST 2011 19
  • 20. The HOO Dataset Stand-off and inline annotation both available: • In our experiments, pseudo-words are fed into <edit type="MD"><empty/><corrections><correction>the</correct ion></corrections></edit> PB-SMT pipeline. • <edit index="1005-0016" type="MD" start="871" end="871" > <original><empty/></original> <corrections> <correction>the </correction> </corrections> </edit> SSLST 2011 20
  • 22. Outline • Background • Article Errors • Preposition Errors • Other ESL Problems • Conclusions SSLST 2011 22
  • 23. Article Errors • The Problem • Early Rule-based Approaches • Knight and Chandler [1994] • Han et al [2006] • De Felice and Pulman [2008] SSLST 2011 23
  • 24. Why is Article Choice Hard? • Basic problem for speakers of languages that do not use articles: – choose between a/an, the, and the null determiner • The bottom line: it comes down to context – I was eating a cake. – I was eating the cake. – I was eating cake. SSLST 2011 24
  • 25. Features Impacting Article Choice: Countability • Count nouns take determiners: – I read the paper yesterday. • Mass nouns don’t take determiners: – We generally write on paper. • But the universal grinder and the universal packager [Pelletier 1975] are always available: – There was dog all over the road. – Could we have just one rice please? SSLST 2011 25
  • 26. Features Impacting Article Choice: Countability • Semi-idiomatic forms: – I looked him in the eye. – *I looked him in an eye. SSLST 2011 26
  • 27. Features Impacting Article Choice: Syntactic Context I have knowledge.  I have a knowledge. I have knowledge of this.  I have a knowledge of this. I have a knowledge of English. SSLST 2011 27
  • 28. Features Impacting Article Choice: Discourse Factors • Stereotypically, entities are introduced into a discourse using an indefinite determiner and subsequently referred to using a definite determiner – I saw a man at the bus stop. … The man was crying. • But not always: – A bus turned the corner. The driver was crying. – I went to the beach yesterday. SSLST 2011 28
  • 29. Features Impacting Article Choice: World Knowledge • He bought a Honda. • He bought Honda. SSLST 2011 29
  • 30. Article Errors • The Problem • Early Rule-based Approaches • Knight and Chandler [1994] • Han et al [2006] • De Felice and Pulman [2008] SSLST 2011 30
  • 31. Early Work: Article Insertion in Machine Translation • The Problem: – Machine translation of languages like Japanese or Russian into English is difficult because the source language doesn’t contain articles SSLST 2011 31
  • 32. Murata and Nagao [1993]: Hand-Crafted Rules • When a noun is modified by a referential pronoun (KONO(this), SONO(its), …) then {indefinite(0, 0), definite(1, 2), generic(0, 0)} • When a noun is accompanied by a particle (WA), and the predicate has past tense, then {indefinite(1, 0), definite(1, 3), generic(1, 1)} • When a noun is accompanied by a particle (WA), and the predicate has present tense, then {indefinite(1, 0), definite(1, 2), generic(1, 3)} • When a noun is accompanied by a particle HE(to), MADE(up to) or KARA(from), then {indefinite(1, 0), definite(1, 2), generic(1, 0)} • … SSLST 2011 32
  • 33. Article Errors • The Problem • Early Rule-based Approaches • Knight and Chandler [1994] • Han et al [2006] • De Felice and Pulman [2008] SSLST 2011 33
  • 34. Knight and Chandler [1994]: A Data-Driven Method for Post-Editing • General aim: – To build a post-editing tool that can fix errors made in a machine translation system • Specific task: – Article insertion: a, an or the SSLST 2011 34
  • 35. Knight and Chandler [1994]: Before and After Stelco Inc. said it plans to shut down Stelco Inc. said it plans to shut down three Toronto-area plants, moving their three Toronto-area plants, moving their fastener operations to leased facility in fastener operations to a leased facility in Brantford, Ontario. Brantford, Ontario. Company said fastener business “has The company said the fastener business been under severe cost pressure for “has been under severe cost pressure some time.” Fasteners, nuts and bolts for some time.” The fasteners, nuts and are sold to North American auto market. bolts are sold to the North American Company spokesman declined to auto market. estimate impact of closures on earnings. A company spokesman declined to He said new facility will employ 500 of estimate the impact of the closures on existing 600 employees. Steelmaker earnings. He said the new facility will employs about 16,000 people. employ 500 of the existing 600 employees. The steelmaker employs about 16,000 people. SSLST 2011 35
  • 36. Knight and Chandler [1994]: The General Idea The steps: • Take newspaper-quality English text • Remove articles • Re-insert automatically • Compare results with the original text Assumptions: • NPs are marked as singular or plural • Locations of articles already marked so it’s a binary choice between the and a/an. SSLST 2011 36
  • 37. Knight and Chandler [1994]: Baseline • In 40Mb of Wall Street Journal text: a = 28.2% an = 4.6% the = 67.2% • So 67% is a good lower-bound • Upper-bound: – Human subjects performed with accuracy of 94%-96% SSLST 2011 37
  • 38. Knight and Chandler [1994]: Baselines Human Machine Random 50% 50% Always guess the 67% 67% Given core context NP 79-80% Given NP + 4 words 83-88% ? Given full context 94-96% SSLST 2011 38
  • 39. Knight and Chandler [1994]: Approach • Characterize NPs via sets of features then use a build decision tree to classify • Lexical features: – ‘word before the article is triple’ • Abstract features: – ‘word after the head noun is a past tense verb’ • 400k training examples and 30k features; features with less than 4 instances discarded SSLST 2011 39
  • 40. Knight and Chandler [1994]: Performance • On 1600 trees for the 1600 most frequent head nouns (covering 77% of test instances): – 81% accuracy • Guess the for the remaining 23% of test instances – 78% accuracy overall SSLST 2011 40
  • 41. Article Errors • The Problem • Early Rule-based Approaches • Knight and Chandler [1994] • Han et al [2006] • De Felice and Pulman [2008] SSLST 2011 41
  • 42. Han et al [2006]: A MaxEnt Approach to Article Selection • Basic Approach: – A maximum entropy classifier for selecting amongst a/an, the or the null determiner – Uses local context features such as words and PoS tags SSLST 2011 42
  • 43. Han et al [2006]: Contrasts with Earlier Work • More varied training corpus: a range of genres – 721 text files, 31.5M words – 10th thru 12th grade reading level • Much larger training corpus: 6 million NPs (15x larger) – Automatically PoS tagged + NP-chunked • The use of a maximum entropy classifier SSLST 2011 43
  • 44. Han et al [2006]: Training Results • 6M NPs in training set • 390k features • Baseline = 71.84% (frequency of null determiner) • Four-fold cross validation – performance range 87.59% to 88.29% – Average 87.99% SSLST 2011 44
  • 45. Han et al [2006]: Effectiveness of Individual Features Feature % Correct Word/PoS of all words in NP 80.41 Word/PoS of w(NP-1) + Head/PoS 77.98 Head/PoS 77.30 PoS of all words in NP 73.96 Word/PoS of w(NP+1) 72.97 Word/PoS of w(NP[1]) 72.53 PoS of w(NP[1]) 72.52 Word/PoS of w(NP-1) 72.30 PoS of Head 71.98 Head’s Countability 71.85 Word/PoS of w(NP-2) 71.85 Default to null determiner 71.84 SSLST 2011 45
  • 46. Han et al [2006]: Effectiveness of Individual Features • Best feature: Word/PoS of all words in NP – Ok if you have a large enough corpus! • Second best: W(NP-1) + Head – Eg ‘in summary’ SSLST 2011 46
  • 47. Han et al [2006]: Accuracy by Head Noun Type Syntactic Type of Head % Correct Singular Noun 80.99 Plural Noun 85.02 Pronoun 99.66 Proper Noun, Singular 90.42 Proper Noun, Plural 82.05 Number 92.71 Demonstrative Pronoun 99.70 Other 97.81 SSLST 2011 47
  • 48. Han et al [2006]: Accuracy as a Function of Training Set Size #NPs in Training Set % Correct 150000 83.49 300000 84.92 600000 85.75 1200000 86.59 2400000 87.27 4800000 87.92 6000000 87.99 SSLST 2011 48
  • 49. Han et al [2006]: Applying the Model to TOEFL Essays • Model retrained only on NPs with a common head noun – Baseline = frequency of null determiner = 54.40% – Training set kept at 6M instances by adding more data – Average accuracy = 83.00% • Model applied to 668 TOEFL essays w 29759 NPs – Subset of NPs classified by two annotators – Agreement on 98% of cases with kappa = 0.86 – One article error every 8 NPs SSLST 2011 49
  • 50. Han et al [2006]: Some Examples Above all, I think it is good for students to share room with others. • Human: missing a or an • Classifier: 0.841 a/an; 0.143 the; 0.014 zero Those excellent hitters began practicing the baseball when they were children, and dedicated a lot of time to become highly qualified. • Human: superfluous determiner • Classifier: 0.103 a/an; 0.016 the; 0.879 zero SSLST 2011 50
  • 51. Han et al [2006]: Results on TOEFL Essays • 79% of errors in test set correctly detected • Many false positives, so precision only 44% • Decisions often borderline: – The books are assigned by professors. – Marked by annotators as correct, model predicts the (0.51) and null (0.49) SSLST 2011 51
  • 52. Han et al [2006]: Sources of Error • Model performs poorly on decision between a and the – Probably due to the need for discourse information • So, new feature: has the head noun appeared before, and if so, with what article? – No significant effect on performance • Error analysis suggests this is due to more complex discourse behaviour: – A student will not learn if she hates the teacher. – … the possibilities that a scholarship would afford … SSLST 2011 52
  • 53. Article Errors • The Problem • Early Rule-based Approaches • Knight and Chandler [1994] • Han et al [2006] • De Felice and Pulman [2008] SSLST 2011 53
  • 54. De Felice and Pulman [2008]: Richer Syntactic and Semantic Features • Basic Approach: – As in Han et al [2006], a maximum entropy classifier for selecting amongst a/an, the or the null determiner – Use a richer set of syntactic and semantic features SSLST 2011 54
  • 55. De Felice and Pulman [2008]: Main Features Feature Value Head Noun ‘apple’ Number Singular Noun Type Count Named Entity? No WordNet Category Food, Plant Prepositional Modification? Yes, ‘on’ Object of Preposition? No Adjectival Modification? Yes, ‘juicy’ Adjectival Grade Superlative POS3 VV, DT, JJS, IN, DT, NN Example: Pick the juiciest apple on the tree. SSLST 2011 55
  • 56. De Felice and Pulman [2008]: Additional Features • Whether the noun is modified by a predeterminer, possessive, numeral and/or a relative clause • Whether it is part of a ‘there is …’ phrase SSLST 2011 56
  • 57. De Felice and Pulman [2008]: Performance • Trained on British National Corpus – 4,043,925 instances • Test set of 305,264 BNC instances • Baseline = 59.83% (choose null) • Accuracy = 92.15% SSLST 2011 57
  • 58. De Felice and Pulman [2008]: Comparative Performance on L1 Data Author Accuracy Baseline 59.83% Han et al 2006 83.00% Gamon et al 2008 86.07% Turner and Charniak 2007 86.74% De Felice and Pulman 2008 92.15% SSLST 2011 58
  • 59. De Felice and Pulman [2008]: Results on Individual Determiners % of Training Data Precision Recall a 9.61% (388,476) 70.52% 53.50% the 29.19% (1,180,435) 85.17% 91.51% null 61.20% (2,475,014) 98.63% 98.79% • The indefinite determiner is less frequent and harder to learn SSLST 2011 59
  • 60. De Felice and Pulman [2008]: Testing on L2 Text • 3200 instances extracted from the CLC – 2000 correct – 1200 incorrect • Accuracy on correct instances: 92.2% • Accuracy on incorrect instances: < 10% • Most frequent incorrect usage is a missing determiner – Model behaviour influenced by skew in training data • Also problems in extracting NLP features from L2 data SSLST 2011 60
  • 61. Outline • Background • Article Errors • Preposition Errors • Other ESL Problems • Conclusions SSLST 2011 61
  • 62. The Prevalence of Preposition Errors L1 Proportion Greek 0.149 Spanish 0.139 Korean 0.128 Chinese 0.122 French 0.121 Japanese 0.118 German 0.100 Russian 0.095 Proportion of sentences in the CLC with one or more preposition errors SSLST 2011 62
  • 63. Prepositions Have Many Roles in English • They appear in adjuncts: – In total, I spent $64 million dollars. • They mark the arguments of verbs: – I’ll give ten cents to the next guy. • They figure in phrasal verbs: – I ran away when I was ten. • They play a part in idioms: – She talked down to him. SSLST 2011 63
  • 64. Negative Transfer • Many prepositions have a most typical or frequent translation – Eg: of in English to de in French • But for many prepositions there are multiple translational possibilities – ESL speakers can easily choose the wrong one – Eg: driving in a high speed SSLST 2011 64
  • 65. Prepositions in English • English has over 100 prepositions, including some multiword prepositions and a small number of postpositions • The 10 most frequent account for 82% of the errors in the CLC SSLST 2011 65
  • 66. Preposition Selection in Well-Formed Text SSLST 2011 66
  • 67. Preposition Error Detection on Learner Data SSLST 2011 67
  • 68. Outline • Background • Article Errors • Preposition Errors • Other ESL Problems • Conclusions SSLST 2011 68
  • 69. Collocations • Conventional combinations that are preferred over other equally syntactically and semantically valid combinations – Adj + Noun: stiff breeze vs rigid breeze – Verb + Noun: hold an election vs make an election – Noun + Noun: movie theatre vs film theatre – Adverb + Verb: thoroughly amuse vs completely amuse SSLST 2011 69
  • 70. Collocations • Computational approaches generally make use of distributional differences for detecting and correcting errors • Same general approach as in articles and prepositions: – Choose preferred form from a set of alternatives – But: the confusion set is potentially much larger • Solution: – Constrain the space by selecting alternatives with a similar meaning • See work on automatic thesaurus construction [eg Lin 1998] SSLST 2011 70
  • 71. Verb Form Errors Error Type Example Subject-Verb Agreement He have been living here since June. Auxiliary Agreement He has been live here since June. Complementation He wants live here. • See Lee and Seneff [2008] for a method based on detecting specific irregularities in parse trees. SSLST 2011 71
  • 72. Outline • Background • Article Errors • Preposition Errors • Other ESL Problems • Conclusions SSLST 2011 72
  • 73. Conclusions • The provision of assistance to ESL learners is clearly a significant market • Technology is at a very early stage, focussing on specific subproblems • Measurable progress has been hampered by the unavailability of shared data sets, but this is changing SSLST 2011 73