SlideShare a Scribd company logo
1 of 15
Download to read offline
Machine Learning and NLP
                Research Interests




Machine Learning for Language Learning and
                Processing

   Sharon Goldwater, Frank Keller, Mirella Lapata,
          Victor Lavrenko, Mark Steedman

                       School of Informatics
                      University of Edinburgh
                      keller@inf.ed.ac.uk


                       October 15, 2008



                  Goldwater et al.   Machine Learning for Language   1
Machine Learning and NLP
                       Research Interests




1   Machine Learning and NLP
     Latent Variables
     Multi-class and Structured Variables
     Discrete, Sparse Data
     Other Problems

2   Research Interests
      Parsing
      Language Acquisition
      Language Generation
      Information Retrieval




                         Goldwater et al.   Machine Learning for Language   2
Latent Variables
               Machine Learning and NLP     Multi-class and Structured Variables
                       Research Interests   Discrete, Sparse Data
                                            Other Problems




1   Machine Learning and NLP
     Latent Variables
     Multi-class and Structured Variables
     Discrete, Sparse Data
     Other Problems

2   Research Interests
      Parsing
      Language Acquisition
      Language Generation
      Information Retrieval




                         Goldwater et al.   Machine Learning for Language          3
Latent Variables
                Machine Learning and NLP     Multi-class and Structured Variables
                        Research Interests   Discrete, Sparse Data
                                             Other Problems


Latent Variables

  Natural language processing (NLP) problems typically involve
  inferring latent (non-observed) variables.
      given a bilingual text, infer an alignment;
      given a string of words, infer a parse tree.




                          Goldwater et al.   Machine Learning for Language          4
Latent Variables
                 Machine Learning and NLP     Multi-class and Structured Variables
                         Research Interests   Discrete, Sparse Data
                                              Other Problems


Multi-class and Structured Variables

  The learning targets in NLP often are multi-class, e.g., in part of
  speech tagging:
      standard POS tag sets for English have around 60 classes;
      more elaborate ones around 150 (CLAWS6);
      morphological annotation often increases the size of the tag
      set (e.g., Bulgarian, around 680 tags).


   Everything in the sale will have been used in films .
      PNI    PRP AT0 NN1 VM0 VHI VBN VVN PRP NN2 PUN




                           Goldwater et al.   Machine Learning for Language          5
Latent Variables
                Machine Learning and NLP     Multi-class and Structured Variables
                        Research Interests   Discrete, Sparse Data
                                             Other Problems


Multi-class and Structured Variables
  NLP tasks are often sequencing tasks, rather than simple
  classification:

  PNI → PRP → AT0 → NN1 → VM0 → VHI → VBN → VVN . . .

  Many NLP tasks are not classification but often involve
  hierarchical structure, e.g., in parsing:




  Most structures in the Penn Treebank are unique.
                          Goldwater et al.   Machine Learning for Language          6
Latent Variables
                 Machine Learning and NLP     Multi-class and Structured Variables
                         Research Interests   Discrete, Sparse Data
                                              Other Problems


Discrete, Sparse Data
  Linguistic data is different from standard ML data (speech, vision):
      typically discrete (characters, words, texts);
      follows a Zipfian distribution.




                           Goldwater et al.   Machine Learning for Language          7
Latent Variables
                 Machine Learning and NLP     Multi-class and Structured Variables
                         Research Interests   Discrete, Sparse Data
                                              Other Problems


Discrete, Sparse Data

  The Zipfian distribution leads to ubiquitous data sparseness:
      standard maximum likelihood estimation doesn’t work well for
      linguistic data;
      a large number of smoothing techniques have been developed
      to deal with this problem;
      most of them are ad hoc; Bayesian methods are a principled
      alternative.


                                G ∼ PY(d, θ, G0 )




                           Goldwater et al.   Machine Learning for Language          8
Latent Variables
               Machine Learning and NLP     Multi-class and Structured Variables
                       Research Interests   Discrete, Sparse Data
                                            Other Problems


Other Problems

     NLP typically uses pipeline models; errors propagate;
     models often highly domain-dependent (models for broadcast
     news will not work well for biomedical text, etc.);
     there is no single error function to optimize; evaluation
     metrics differ from task to task (BLEU for MT, ROUGE for
     summarization, PARSEVAL for parsing).




                         Goldwater et al.   Machine Learning for Language          9
Parsing
               Machine Learning and NLP     Language Acquisition
                       Research Interests   Language Generation
                                            Information Retrieval




1   Machine Learning and NLP
     Latent Variables
     Multi-class and Structured Variables
     Discrete, Sparse Data
     Other Problems

2   Research Interests
      Parsing
      Language Acquisition
      Language Generation
      Information Retrieval




                         Goldwater et al.   Machine Learning for Language   10
Parsing
                       Machine Learning and NLP            Language Acquisition
                               Research Interests          Language Generation
                                                           Information Retrieval


Parsing (Keller, Steedman)
  Current focus of research in probabilistic parsing:
      models for more expressive syntactic representations (CCG,
      TAG, dependency grammar);
      semi-supervised induction of grammars and parsing models;
      cognitive modeling:
               incrementality;
               limited parallelism, limited memory;
               evaluation against behavioral data.
        The pilot embarrassed John and put himself in a very awkward situation.
         11 111111 1
         00 000000 0
         11
         00
                   111111 1
                   000000 0
                   1
                   0
                          11111 1111
                          00000 0000
                          1
                          0    1
                               0
                               1
                               0  11
                                  00
                                  11
                                  00                           1
                                                               0
                                                               1
                                                               0
           1             2          3        4        5        6

         First fixation time = 5        1 111111111 1111111111 1
                                        0 000000000 0000000000 0
                                                  11
                                                  00
                                        1
                                        0         11
                                                  00           1
                                                               0
         gaze duration = 5+6            7                      8               9
         Total time = 5+6+8+10
         Second pass time = 8+10                      11
                                                      00
                                                      11
                                                      00                             1
                                                                                     0
                                                                                     1
                                                                                     0
         Skipping rate: e.g. put                      11
                                                      00                             1
                                                                                     0
                                                          10                         11


                                   Goldwater et al.        Machine Learning for Language   11
Parsing
                Machine Learning and NLP     Language Acquisition
                        Research Interests   Language Generation
                                             Information Retrieval


Language Acquisition (Goldwater, Steedman)

  Research focuses on Bayesian models for improving unsupervised
  NLP and understanding of human language acquisition:
      What constraints/biases are needed for effective
      generalization?
      How can different sources of information be successfully
      combined?
  ML methods and problems:
      infinite models, esp. those for sequences/hierarchies;
      incremental, memory-limited inference methods;
      joint inference of different kinds of linguistic information
      (e.g., morphology and syntax).

                          Goldwater et al.   Machine Learning for Language   12
Parsing
                Machine Learning and NLP     Language Acquisition
                        Research Interests   Language Generation
                                             Information Retrieval


Language Generation (Lapata)

  Research focuses on data-driven models for language generation:
      fluent and coherent text that resembles human writing;
      general modeling framework for different input types (time
      series data, pictures, logical forms).
  ML methods and problems:
      mathematical programming for sentence compression and
      summarization;
      latent variable models for image caption generation;
      models have to integrate conflicting constraints and varying
      linguistic representations.


                          Goldwater et al.   Machine Learning for Language   13
Parsing
                  Machine Learning and NLP     Language Acquisition
                          Research Interests   Language Generation
                                               Information Retrieval


Language Generation (Lapata)

  Example: image captioning beyond keywords.




  troop, Israel, force, ceasefire, soldiers
  Thousands of Israeli troops are in Lebanon as the ceasefire begins.


                            Goldwater et al.   Machine Learning for Language   14
Parsing
                 Machine Learning and NLP     Language Acquisition
                         Research Interests   Language Generation
                                              Information Retrieval


Information Retrieval (Lavrenko)
  Universal search:
      learn to relate relevant text/images/products/DB records;
      data: high-dimensional, extremely sparse, but dimensionality
      reduction is a bad idea;
      targets: focused information needs, not broad categories;
      semi-supervised: lots of unlabeled data, few judgments.
  Learning to rank:
      partial preferences → ranking function;
      objective: non-smooth, can be very expensive to evaluate.
  Novelty detection:
      example: identify first reports of events in the news;
      supervised task, but hard to learn anything from labels;
      best approaches unsupervised, performance very low.
                           Goldwater et al.   Machine Learning for Language   15

More Related Content

What's hot

Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...Tadahiro Taniguchi
 
Deep learning for natural language embeddings
Deep learning for natural language embeddingsDeep learning for natural language embeddings
Deep learning for natural language embeddingsRoelof Pieters
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsRoelof Pieters
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Yuriy Guts
 
Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introductionananth
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyMarina Santini
 
Semantic Segmentation of Driving Behavior Data: Double Articulation Analyzer ...
Semantic Segmentation of Driving Behavior Data: Double Articulation Analyzer ...Semantic Segmentation of Driving Behavior Data: Double Articulation Analyzer ...
Semantic Segmentation of Driving Behavior Data: Double Articulation Analyzer ...Tadahiro Taniguchi
 
Natural Language Processing (NLP) & Text Mining Tutorial Using NLTK | NLP Tra...
Natural Language Processing (NLP) & Text Mining Tutorial Using NLTK | NLP Tra...Natural Language Processing (NLP) & Text Mining Tutorial Using NLTK | NLP Tra...
Natural Language Processing (NLP) & Text Mining Tutorial Using NLTK | NLP Tra...Edureka!
 
The Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalThe Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalTony Russell-Rose
 
Conversational Agents in Portuguese: A Study Using Deep Learning
Conversational Agents in Portuguese: A Study Using Deep LearningConversational Agents in Portuguese: A Study Using Deep Learning
Conversational Agents in Portuguese: A Study Using Deep LearningAndherson Maeda
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games ResearchJose Zagal
 
Natural language processing
Natural language processingNatural language processing
Natural language processingSaurav Aryal
 
Natural language processing
Natural language processingNatural language processing
Natural language processingBasha Chand
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4DigiGurukul
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingYasir Khan
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introductionRobert Lujo
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingTed Xiao
 
Natural language processing
Natural language processingNatural language processing
Natural language processingHansi Thenuwara
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingDavid Rostcheck
 
Lecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationLecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationMarina Santini
 

What's hot (20)

Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...
 
Deep learning for natural language embeddings
Deep learning for natural language embeddingsDeep learning for natural language embeddings
Deep learning for natural language embeddings
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word Embeddings
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introduction
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language Technology
 
Semantic Segmentation of Driving Behavior Data: Double Articulation Analyzer ...
Semantic Segmentation of Driving Behavior Data: Double Articulation Analyzer ...Semantic Segmentation of Driving Behavior Data: Double Articulation Analyzer ...
Semantic Segmentation of Driving Behavior Data: Double Articulation Analyzer ...
 
Natural Language Processing (NLP) & Text Mining Tutorial Using NLTK | NLP Tra...
Natural Language Processing (NLP) & Text Mining Tutorial Using NLTK | NLP Tra...Natural Language Processing (NLP) & Text Mining Tutorial Using NLTK | NLP Tra...
Natural Language Processing (NLP) & Text Mining Tutorial Using NLTK | NLP Tra...
 
The Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information RetrievalThe Role of Natural Language Processing in Information Retrieval
The Role of Natural Language Processing in Information Retrieval
 
Conversational Agents in Portuguese: A Study Using Deep Learning
Conversational Agents in Portuguese: A Study Using Deep LearningConversational Agents in Portuguese: A Study Using Deep Learning
Conversational Agents in Portuguese: A Study Using Deep Learning
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games Research
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural language processing (NLP) introduction
Natural language processing (NLP) introductionNatural language processing (NLP) introduction
Natural language processing (NLP) introduction
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language Processing
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Lecture: Word Sense Disambiguation
Lecture: Word Sense DisambiguationLecture: Word Sense Disambiguation
Lecture: Word Sense Disambiguation
 

Similar to Machine Learning for Language Learning and Processing

NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA DATASCIENCE
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processingpunedevscom
 
Machine Learning of Natural Language
Machine Learning of Natural LanguageMachine Learning of Natural Language
Machine Learning of Natural Languagebutest
 
NATURAL LANGUAGE PROCESSING.pptx
NATURAL LANGUAGE PROCESSING.pptxNATURAL LANGUAGE PROCESSING.pptx
NATURAL LANGUAGE PROCESSING.pptxFitsum36
 
CNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learningCNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learningKv Sagar
 
Pycon India 2018 Natural Language Processing Workshop
Pycon India 2018   Natural Language Processing WorkshopPycon India 2018   Natural Language Processing Workshop
Pycon India 2018 Natural Language Processing WorkshopLakshya Sivaramakrishnan
 
NLP in Practice - Part I
NLP in Practice - Part INLP in Practice - Part I
NLP in Practice - Part IDelip Rao
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageRoelof Pieters
 
Jarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingJarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingMustafa Jarrar
 
Introduction to nlp
Introduction to nlpIntroduction to nlp
Introduction to nlpAmaan Shaikh
 
The Power of Natural Language Processing (NLP) | Enterprise Wired
The Power of Natural Language Processing (NLP) | Enterprise WiredThe Power of Natural Language Processing (NLP) | Enterprise Wired
The Power of Natural Language Processing (NLP) | Enterprise WiredEnterprise Wired
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)Kuppusamy P
 
Warnikchow - SAIT - 0529
Warnikchow - SAIT - 0529Warnikchow - SAIT - 0529
Warnikchow - SAIT - 0529WarNik Chow
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingParrotAI
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingKevinSims18
 
Natural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptxNatural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptxSHIBDASDUTTA
 

Similar to Machine Learning for Language Learning and Processing (20)

NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2NOVA Data Science Meetup 1/19/2017 - Presentation 2
NOVA Data Science Meetup 1/19/2017 - Presentation 2
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Machine Learning of Natural Language
Machine Learning of Natural LanguageMachine Learning of Natural Language
Machine Learning of Natural Language
 
Lesson 40
Lesson 40Lesson 40
Lesson 40
 
AI Lesson 40
AI Lesson 40AI Lesson 40
AI Lesson 40
 
NATURAL LANGUAGE PROCESSING.pptx
NATURAL LANGUAGE PROCESSING.pptxNATURAL LANGUAGE PROCESSING.pptx
NATURAL LANGUAGE PROCESSING.pptx
 
CNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learningCNN for NLP using text analysis by using deep learning
CNN for NLP using text analysis by using deep learning
 
Pycon India 2018 Natural Language Processing Workshop
Pycon India 2018   Natural Language Processing WorkshopPycon India 2018   Natural Language Processing Workshop
Pycon India 2018 Natural Language Processing Workshop
 
NLP in Practice - Part I
NLP in Practice - Part INLP in Practice - Part I
NLP in Practice - Part I
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on Language
 
Jarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language ProcessingJarrar: Introduction to Natural Language Processing
Jarrar: Introduction to Natural Language Processing
 
Introduction to nlp
Introduction to nlpIntroduction to nlp
Introduction to nlp
 
L1 nlp intro
L1 nlp introL1 nlp intro
L1 nlp intro
 
The Power of Natural Language Processing (NLP) | Enterprise Wired
The Power of Natural Language Processing (NLP) | Enterprise WiredThe Power of Natural Language Processing (NLP) | Enterprise Wired
The Power of Natural Language Processing (NLP) | Enterprise Wired
 
Natural language processing (nlp)
Natural language processing (nlp)Natural language processing (nlp)
Natural language processing (nlp)
 
Warnikchow - SAIT - 0529
Warnikchow - SAIT - 0529Warnikchow - SAIT - 0529
Warnikchow - SAIT - 0529
 
Deep Learning for Natural Language Processing
Deep Learning for Natural Language ProcessingDeep Learning for Natural Language Processing
Deep Learning for Natural Language Processing
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Natural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptxNatural Language Processing (NLP).pptx
Natural Language Processing (NLP).pptx
 
Type 2 fuzzy ontology ahmadchan
Type 2 fuzzy ontology ahmadchanType 2 fuzzy ontology ahmadchan
Type 2 fuzzy ontology ahmadchan
 

More from butest

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEbutest
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jacksonbutest
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer IIbutest
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazzbutest
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.docbutest
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1butest
 
Facebook
Facebook Facebook
Facebook butest
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...butest
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...butest
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTbutest
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docbutest
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docbutest
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.docbutest
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!butest
 

More from butest (20)

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBE
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jackson
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer II
 
PPT
PPTPPT
PPT
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.doc
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1
 
Facebook
Facebook Facebook
Facebook
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENT
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.doc
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.doc
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.doc
 
hier
hierhier
hier
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!
 

Machine Learning for Language Learning and Processing

  • 1. Machine Learning and NLP Research Interests Machine Learning for Language Learning and Processing Sharon Goldwater, Frank Keller, Mirella Lapata, Victor Lavrenko, Mark Steedman School of Informatics University of Edinburgh keller@inf.ed.ac.uk October 15, 2008 Goldwater et al. Machine Learning for Language 1
  • 2. Machine Learning and NLP Research Interests 1 Machine Learning and NLP Latent Variables Multi-class and Structured Variables Discrete, Sparse Data Other Problems 2 Research Interests Parsing Language Acquisition Language Generation Information Retrieval Goldwater et al. Machine Learning for Language 2
  • 3. Latent Variables Machine Learning and NLP Multi-class and Structured Variables Research Interests Discrete, Sparse Data Other Problems 1 Machine Learning and NLP Latent Variables Multi-class and Structured Variables Discrete, Sparse Data Other Problems 2 Research Interests Parsing Language Acquisition Language Generation Information Retrieval Goldwater et al. Machine Learning for Language 3
  • 4. Latent Variables Machine Learning and NLP Multi-class and Structured Variables Research Interests Discrete, Sparse Data Other Problems Latent Variables Natural language processing (NLP) problems typically involve inferring latent (non-observed) variables. given a bilingual text, infer an alignment; given a string of words, infer a parse tree. Goldwater et al. Machine Learning for Language 4
  • 5. Latent Variables Machine Learning and NLP Multi-class and Structured Variables Research Interests Discrete, Sparse Data Other Problems Multi-class and Structured Variables The learning targets in NLP often are multi-class, e.g., in part of speech tagging: standard POS tag sets for English have around 60 classes; more elaborate ones around 150 (CLAWS6); morphological annotation often increases the size of the tag set (e.g., Bulgarian, around 680 tags). Everything in the sale will have been used in films . PNI PRP AT0 NN1 VM0 VHI VBN VVN PRP NN2 PUN Goldwater et al. Machine Learning for Language 5
  • 6. Latent Variables Machine Learning and NLP Multi-class and Structured Variables Research Interests Discrete, Sparse Data Other Problems Multi-class and Structured Variables NLP tasks are often sequencing tasks, rather than simple classification: PNI → PRP → AT0 → NN1 → VM0 → VHI → VBN → VVN . . . Many NLP tasks are not classification but often involve hierarchical structure, e.g., in parsing: Most structures in the Penn Treebank are unique. Goldwater et al. Machine Learning for Language 6
  • 7. Latent Variables Machine Learning and NLP Multi-class and Structured Variables Research Interests Discrete, Sparse Data Other Problems Discrete, Sparse Data Linguistic data is different from standard ML data (speech, vision): typically discrete (characters, words, texts); follows a Zipfian distribution. Goldwater et al. Machine Learning for Language 7
  • 8. Latent Variables Machine Learning and NLP Multi-class and Structured Variables Research Interests Discrete, Sparse Data Other Problems Discrete, Sparse Data The Zipfian distribution leads to ubiquitous data sparseness: standard maximum likelihood estimation doesn’t work well for linguistic data; a large number of smoothing techniques have been developed to deal with this problem; most of them are ad hoc; Bayesian methods are a principled alternative. G ∼ PY(d, θ, G0 ) Goldwater et al. Machine Learning for Language 8
  • 9. Latent Variables Machine Learning and NLP Multi-class and Structured Variables Research Interests Discrete, Sparse Data Other Problems Other Problems NLP typically uses pipeline models; errors propagate; models often highly domain-dependent (models for broadcast news will not work well for biomedical text, etc.); there is no single error function to optimize; evaluation metrics differ from task to task (BLEU for MT, ROUGE for summarization, PARSEVAL for parsing). Goldwater et al. Machine Learning for Language 9
  • 10. Parsing Machine Learning and NLP Language Acquisition Research Interests Language Generation Information Retrieval 1 Machine Learning and NLP Latent Variables Multi-class and Structured Variables Discrete, Sparse Data Other Problems 2 Research Interests Parsing Language Acquisition Language Generation Information Retrieval Goldwater et al. Machine Learning for Language 10
  • 11. Parsing Machine Learning and NLP Language Acquisition Research Interests Language Generation Information Retrieval Parsing (Keller, Steedman) Current focus of research in probabilistic parsing: models for more expressive syntactic representations (CCG, TAG, dependency grammar); semi-supervised induction of grammars and parsing models; cognitive modeling: incrementality; limited parallelism, limited memory; evaluation against behavioral data. The pilot embarrassed John and put himself in a very awkward situation. 11 111111 1 00 000000 0 11 00 111111 1 000000 0 1 0 11111 1111 00000 0000 1 0 1 0 1 0 11 00 11 00 1 0 1 0 1 2 3 4 5 6 First fixation time = 5 1 111111111 1111111111 1 0 000000000 0000000000 0 11 00 1 0 11 00 1 0 gaze duration = 5+6 7 8 9 Total time = 5+6+8+10 Second pass time = 8+10 11 00 11 00 1 0 1 0 Skipping rate: e.g. put 11 00 1 0 10 11 Goldwater et al. Machine Learning for Language 11
  • 12. Parsing Machine Learning and NLP Language Acquisition Research Interests Language Generation Information Retrieval Language Acquisition (Goldwater, Steedman) Research focuses on Bayesian models for improving unsupervised NLP and understanding of human language acquisition: What constraints/biases are needed for effective generalization? How can different sources of information be successfully combined? ML methods and problems: infinite models, esp. those for sequences/hierarchies; incremental, memory-limited inference methods; joint inference of different kinds of linguistic information (e.g., morphology and syntax). Goldwater et al. Machine Learning for Language 12
  • 13. Parsing Machine Learning and NLP Language Acquisition Research Interests Language Generation Information Retrieval Language Generation (Lapata) Research focuses on data-driven models for language generation: fluent and coherent text that resembles human writing; general modeling framework for different input types (time series data, pictures, logical forms). ML methods and problems: mathematical programming for sentence compression and summarization; latent variable models for image caption generation; models have to integrate conflicting constraints and varying linguistic representations. Goldwater et al. Machine Learning for Language 13
  • 14. Parsing Machine Learning and NLP Language Acquisition Research Interests Language Generation Information Retrieval Language Generation (Lapata) Example: image captioning beyond keywords. troop, Israel, force, ceasefire, soldiers Thousands of Israeli troops are in Lebanon as the ceasefire begins. Goldwater et al. Machine Learning for Language 14
  • 15. Parsing Machine Learning and NLP Language Acquisition Research Interests Language Generation Information Retrieval Information Retrieval (Lavrenko) Universal search: learn to relate relevant text/images/products/DB records; data: high-dimensional, extremely sparse, but dimensionality reduction is a bad idea; targets: focused information needs, not broad categories; semi-supervised: lots of unlabeled data, few judgments. Learning to rank: partial preferences → ranking function; objective: non-smooth, can be very expensive to evaluate. Novelty detection: example: identify first reports of events in the news; supervised task, but hard to learn anything from labels; best approaches unsupervised, performance very low. Goldwater et al. Machine Learning for Language 15