SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
Language
                           Modeling in
                         Turner&Charniak
                              (2007)

                            Kilian Evang


                         Language Models
                         N-gram LMs

 Language Modeling in    Charniak’s LM

                         Determiner

Turner&Charniak (2007)   Selection
                         Method
                         Results
                         Reasons for Success

                         References
      Kilian Evang


       2009-11-30
Language
Recap: Language Models                                           Modeling in
                                                               Turner&Charniak
                                                                    (2007)

                                                                  Kilian Evang


                                                               Language Models
                                                               N-gram LMs
                                                               Charniak’s LM
  ◮   LMs assign probabilities to sentences                    Determiner
                                                               Selection
  ◮   a sentence is a complex event                            Method
                                                               Results
  ◮   LMs break it up into a sequence of “atomic” events       Reasons for Success

                                                               References
  ◮   each “atomic” event conditioned on certain previous
      events
  ◮   conditional probabilities approximated by counting and
      smoothing
Language
N-gram LMs                                             Modeling in
                                                     Turner&Charniak
                                                          (2007)

                                                        Kilian Evang


                                                     Language Models
                       n-gram LMs    Charniak’s LM   N-gram LMs
                                                     Charniak’s LM
 sequence represents   sentence                      Determiner
                                                     Selection
 p(sent) =             p(seq)                        Method

 events are            words,                        Results
                                                     Reasons for Success

                       end symbols                   References




 conditioned on        the n − 1
                       previous
                       events
Language
A Sentence – a Sequence of Events                    Modeling in
                                                   Turner&Charniak
                                                        (2007)

                                                      Kilian Evang


                                                   Language Models
                                                   N-gram LMs

Sentence                                           Charniak’s LM

                                                   Determiner
                                                   Selection
                                                   Method
                                                   Results
                                                   Reasons for Success

   put       the      ball        in   the   box   References




Event sequence
put, the, ball, in, the, box, ∆
Language
A Sentence – a Sequence of Events                    Modeling in
                                                   Turner&Charniak
                                                        (2007)

                                                      Kilian Evang


                                                   Language Models
                                                   N-gram LMs

Sentence                                           Charniak’s LM

                                                   Determiner
                                                   Selection
                                                   Method
                                                   Results
                                                   Reasons for Success

   put      the       ball      in     the   box   References




Conditional probability
p(wi = the|wi −2 = ball, wi −1 = in)
Language
N-gram LMs vs. Charniak’s Parsing LM                       Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang


                       n-gram LMs    Charniak’s LM       Language Models
                                                         N-gram LMs
 sequence represents   sentence      parse tree          Charniak’s LM

 p(sent) =             p(seq)           p(seq)           Determiner
                                                         Selection
                                     seq                 Method
 events are            words,        pre-terminals,      Results
                                                         Reasons for Success
                       end symbols   terminals,          References
                                     constituents,
                                     end symbols
 conditioned on        the n − 1     certain previous
                       previous      events, depending
                       events        on type
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang
Parse tree
         vp                                              Language Models
                                                         N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                 pp                Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np          References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box
Language
A Parse Tree – a Sequence of Events                                  Modeling in
                                                                   Turner&Charniak
                                                                        (2007)

                                                                      Kilian Evang
Parse tree
           vp                                                      Language Models
                                                                   N-gram LMs
                                                                   Charniak’s LM

                                                                   Determiner
                      np                      pp                   Selection
                                                                   Method
                                                                   Results
                                                                   Reasons for Success
                                                    np             References



  verb          det        noun   prep        det        noun

   put          the        ball     in        the        box


Event sequence
verb, put, M, ∆, M, np, pp, ∆, noun, ball, M, det, ∆, M, ∆, the,
prep, in, M, ∆, M, np, ∆, noun, box, M, det, ∆, M, ∆, the
Language
Digression: Non-head Constituents                           Modeling in
                                                          Turner&Charniak
                                                               (2007)

                                                             Kilian Evang


                                                          Language Models
Tree fragment                                             N-gram LMs
                                                          Charniak’s LM

                                  l                       Determiner
                                                          Selection
                                                          Method
                                                          Results
            Lm     ...    L1      t      R1    ...   Rn   Reasons for Success

                                                          References


                                  h



Event sequence fragment
M, L1 , . . ., Lm , ∆, M, R1 , . . ., Rn , ∆
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)

                                                            Kilian Evang
Parse tree
                                                         Language Models
         vp                                              N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                 pp                Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np          References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box


Conditional probability for a head pre-terminal
p(t = noun|l = np, m = vp, u = verb, i = put)
Language
A Parse Tree – a Sequence of Events                         Modeling in
                                                          Turner&Charniak
                                                               (2007)

                                                             Kilian Evang
Parse tree
                                                          Language Models
         vp                                               N-gram LMs
                                                          Charniak’s LM

                                                          Determiner
                    np                  pp                Selection
                                                          Method
                                                          Results
                                                          Reasons for Success
                                              np          References



  verb        det        noun   prep    det        noun

   put        the        ball    in     the        box


Conditional probability for a head terminal
p(h = ball|t = noun, l = np, m = vp, u = verb, i = put)
Language
A Parse Tree – a Sequence of Events                        Modeling in
                                                         Turner&Charniak
                                                              (2007)


Parse tree                                                  Kilian Evang


         vp                                              Language Models
                                                         N-gram LMs
                                                         Charniak’s LM

                                                         Determiner
                    np                  pp               Selection
                                                         Method
                                                         Results
                                                         Reasons for Success
                                             np
                                                         References



  verb        det        noun   prep   det        noun

   put        the        ball    in    the        box


Conditional probability for a non-head constituent
p(Li = det|Li −1 = M, h = ball, t = noun, l = np, m =
vp, u = verb)
Language
Overview: Conditioning                                                        Modeling in
                                                                            Turner&Charniak
                                                                                 (2007)

                                                                               Kilian Evang
 event type                    conditioned on
                                                                            Language Models
 head pre-terminal t           constituent label l,
                                                                            N-gram LMs
                               mother constituent label m,                  Charniak’s LM

                               mother constituent head pre-terminal u       Determiner
                                                                            Selection
                               mother consitutent head terminal i
                                                                            Method
 head terminal h               head pre-terminal t,                         Results
                                                                            Reasons for Success
                               constituent label l,
                                                                            References
                               mother constituent label m,
                               mother constituent head pre-terminal u
                               mother consitutent head terminal i
 non-head                      (part of) L1...i −1 (L1...m , R1...i −1 ),
 constituent label Li (Ri ),   head terminal h,
 end symbol ∆                  head pre-terminal t,
                               constituent label l,
                               mother constituent label m,
                               mother constituent head pre-terminal u
Language
Determiner Selection                                               Modeling in
                                                                 Turner&Charniak
                                                                      (2007)

                                                                    Kilian Evang


                                                                 Language Models
                                                                 N-gram LMs
                                                                 Charniak’s LM

                                                                 Determiner
  ◮   for each NP,                                               Selection
                                                                 Method
        ◮   for each possible determiner (the, a/an, null),      Results
                                                                 Reasons for Success
              ◮   determine probability of NP
                                                                 References
        ◮   choose determiner resulting in highest probability
  ◮   note: sufficient to determine probabilities for events
      that differ
Language
Determiner Selection – Example                                         Modeling in
                                                                     Turner&Charniak
                                                                          (2007)

                                                                        Kilian Evang
  ◮   “put [NP the ball] in the box”
                                                                     Language Models
        ◮   p(L1 = det|m = vp, u = verb, l = np, t = noun, h =       N-gram LMs
                                                                     Charniak’s LM
            ball) × p(L2 = ∆|m = vp, u = verb, l = np, t =
                                                                     Determiner
            noun, h = ball, L1 = det) × p(det → the|m = vp, u =      Selection
            verb, l = np, t = noun, h = ball, L1 = det)              Method
                                                                     Results
                                                                     Reasons for Success
  ◮   “put [NP a/an ball] in the box”
                                                                     References
        ◮   p(L1 = det|m = vp, u = verb, l = np, t = noun, h =
            ball) × p(L2 = ∆|m = vp, u = verb, l = np, t =
            noun, h = ball, L1 = det) × (p(det → a|m = vp, u =
            verb, l = np, t = noun, h = ball, L1 = det) + p(det →
            an|m = vp, u = verb, l = np, t = noun, h = ball, L1 =
            det))
  ◮   “put [NP ball] in the box”
        ◮   p(L1 = ∆|m = vp, u = verb, l = np, t = noun, h = ball)
Language
Results     Modeling in
          Turner&Charniak
               (2007)

             Kilian Evang


          Language Models
          N-gram LMs
          Charniak’s LM

          Determiner
          Selection
          Method
          Results
          Reasons for Success

          References
Language
Reasons for Success                                                      Modeling in
                                                                       Turner&Charniak
                                                                            (2007)

                                                                          Kilian Evang


                                                                       Language Models
                                                                       N-gram LMs
                                                                       Charniak’s LM

                                                                       Determiner
  ◮   syntactic structure allows for long-distance conditioning,       Selection

      e.g.                                                             Method
                                                                       Results
                                                                       Reasons for Success
            ◮   he [VP gave [NP the sultan of Brunei] [NP a cactus]]
                                                                       References
  ◮   constituent head enforces selectional preferences,
      reflected in head-first strategy
  ◮   ...
Language
References                                                       Modeling in
                                                               Turner&Charniak
                                                                    (2007)

                                                                  Kilian Evang


                                                               Language Models
    Eugene Charniak (2000)                                     N-gram LMs

    A Maximum-Entropy-Inspired Parser                          Charniak’s LM

                                                               Determiner
    Proceedings of the First Meeting of the North American     Selection

    Chapter of the Association for Computational Linguistics   Method
                                                               Results
                                                               Reasons for Success

    Eugene Charniak (2001)                                     References

    Immediate-Head Parsing for Language Models
    Proceedings of the 39th Annual Meeting of the
    Association for Computational Linguistics
    Jenine Turner & Eugene Charniak (2007)
    Language Modeling for Determiner Selection
    Proceedings of NAACL HLT 2007, Companion Volume

Contenu connexe

En vedette

Smoky Mountain Secrets
Smoky Mountain SecretsSmoky Mountain Secrets
Smoky Mountain SecretsJeanMarlowe
 
2011 05 09_배달통
2011 05 09_배달통2011 05 09_배달통
2011 05 09_배달통Sanghoon Kim
 
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftJoshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftMorningMaestro
 
I Npd Mfei 5 10
I Npd Mfei 5 10I Npd Mfei 5 10
I Npd Mfei 5 10kbmcgourty
 
Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01imm
 
Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vesdo 1
 
앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람Sanghoon Kim
 
Documento compraventa terreno campeche
Documento compraventa terreno campecheDocumento compraventa terreno campeche
Documento compraventa terreno campecheErmilo De la Cruz
 
Cross Media for non-profits
Cross Media for non-profitsCross Media for non-profits
Cross Media for non-profitsRCAcrossmedia
 
Doha 2006 sukan asia
Doha 2006 sukan asiaDoha 2006 sukan asia
Doha 2006 sukan asiaOlimpikini
 
Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Bangladesh Food Security Investment Forum
 
elmuellocalmon
elmuellocalmonelmuellocalmon
elmuellocalmoncristian
 
Final project
Final project Final project
Final project imm
 
Funcionaris versus emprenedors
Funcionaris versus emprenedorsFuncionaris versus emprenedors
Funcionaris versus emprenedorsaleixclosmari
 
La nit del terror. Euclides
La nit del terror. Euclides La nit del terror. Euclides
La nit del terror. Euclides joan
 
Peru presentation
Peru presentationPeru presentation
Peru presentationanchefu
 
Presentatie Ssis Part I
Presentatie Ssis Part IPresentatie Ssis Part I
Presentatie Ssis Part Isecuserve1
 

En vedette (20)

Smoky Mountain Secrets
Smoky Mountain SecretsSmoky Mountain Secrets
Smoky Mountain Secrets
 
2011 05 09_배달통
2011 05 09_배달통2011 05 09_배달통
2011 05 09_배달통
 
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final DraftJoshua Brewer - Lesson on Formulating Hypotheses - Final Draft
Joshua Brewer - Lesson on Formulating Hypotheses - Final Draft
 
I Npd Mfei 5 10
I Npd Mfei 5 10I Npd Mfei 5 10
I Npd Mfei 5 10
 
Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01Pdf negro 19-mayo-2014-01
Pdf negro 19-mayo-2014-01
 
Dip.IFRS ACCA(UK)
Dip.IFRS ACCA(UK)Dip.IFRS ACCA(UK)
Dip.IFRS ACCA(UK)
 
Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)Vaya con dios1_(pp_tminimizer)
Vaya con dios1_(pp_tminimizer)
 
앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람앱리뷰 말하는웨더알람
앱리뷰 말하는웨더알람
 
Documento compraventa terreno campeche
Documento compraventa terreno campecheDocumento compraventa terreno campeche
Documento compraventa terreno campeche
 
Cross Media for non-profits
Cross Media for non-profitsCross Media for non-profits
Cross Media for non-profits
 
Doha 2006 sukan asia
Doha 2006 sukan asiaDoha 2006 sukan asia
Doha 2006 sukan asia
 
앱리뷰 Dayta
앱리뷰 Dayta앱리뷰 Dayta
앱리뷰 Dayta
 
Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...Increasing food security through strategies and investments to increase incom...
Increasing food security through strategies and investments to increase incom...
 
elmuellocalmon
elmuellocalmonelmuellocalmon
elmuellocalmon
 
Final project
Final project Final project
Final project
 
2011 05 30_curing
2011 05 30_curing2011 05 30_curing
2011 05 30_curing
 
Funcionaris versus emprenedors
Funcionaris versus emprenedorsFuncionaris versus emprenedors
Funcionaris versus emprenedors
 
La nit del terror. Euclides
La nit del terror. Euclides La nit del terror. Euclides
La nit del terror. Euclides
 
Peru presentation
Peru presentationPeru presentation
Peru presentation
 
Presentatie Ssis Part I
Presentatie Ssis Part IPresentatie Ssis Part I
Presentatie Ssis Part I
 

Dernier

MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 

Dernier (20)

MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 

Language Modeling in Turner&Charniak (2007)

  • 1. Language Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Language Modeling in Charniak’s LM Determiner Turner&Charniak (2007) Selection Method Results Reasons for Success References Kilian Evang 2009-11-30
  • 2. Language Recap: Language Models Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM ◮ LMs assign probabilities to sentences Determiner Selection ◮ a sentence is a complex event Method Results ◮ LMs break it up into a sequence of “atomic” events Reasons for Success References ◮ each “atomic” event conditioned on certain previous events ◮ conditional probabilities approximated by counting and smoothing
  • 3. Language N-gram LMs Modeling in Turner&Charniak (2007) Kilian Evang Language Models n-gram LMs Charniak’s LM N-gram LMs Charniak’s LM sequence represents sentence Determiner Selection p(sent) = p(seq) Method events are words, Results Reasons for Success end symbols References conditioned on the n − 1 previous events
  • 4. Language A Sentence – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Sentence Charniak’s LM Determiner Selection Method Results Reasons for Success put the ball in the box References Event sequence put, the, ball, in, the, box, ∆
  • 5. Language A Sentence – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Sentence Charniak’s LM Determiner Selection Method Results Reasons for Success put the ball in the box References Conditional probability p(wi = the|wi −2 = ball, wi −1 = in)
  • 6. Language N-gram LMs vs. Charniak’s Parsing LM Modeling in Turner&Charniak (2007) Kilian Evang n-gram LMs Charniak’s LM Language Models N-gram LMs sequence represents sentence parse tree Charniak’s LM p(sent) = p(seq) p(seq) Determiner Selection seq Method events are words, pre-terminals, Results Reasons for Success end symbols terminals, References constituents, end symbols conditioned on the n − 1 certain previous previous events, depending events on type
  • 7. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box
  • 8. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Event sequence verb, put, M, ∆, M, np, pp, ∆, noun, ball, M, det, ∆, M, ∆, the, prep, in, M, ∆, M, np, ∆, noun, box, M, det, ∆, M, ∆, the
  • 9. Language Digression: Non-head Constituents Modeling in Turner&Charniak (2007) Kilian Evang Language Models Tree fragment N-gram LMs Charniak’s LM l Determiner Selection Method Results Lm ... L1 t R1 ... Rn Reasons for Success References h Event sequence fragment M, L1 , . . ., Lm , ∆, M, R1 , . . ., Rn , ∆
  • 10. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree Language Models vp N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a head pre-terminal p(t = noun|l = np, m = vp, u = verb, i = put)
  • 11. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Kilian Evang Parse tree Language Models vp N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a head terminal p(h = ball|t = noun, l = np, m = vp, u = verb, i = put)
  • 12. Language A Parse Tree – a Sequence of Events Modeling in Turner&Charniak (2007) Parse tree Kilian Evang vp Language Models N-gram LMs Charniak’s LM Determiner np pp Selection Method Results Reasons for Success np References verb det noun prep det noun put the ball in the box Conditional probability for a non-head constituent p(Li = det|Li −1 = M, h = ball, t = noun, l = np, m = vp, u = verb)
  • 13. Language Overview: Conditioning Modeling in Turner&Charniak (2007) Kilian Evang event type conditioned on Language Models head pre-terminal t constituent label l, N-gram LMs mother constituent label m, Charniak’s LM mother constituent head pre-terminal u Determiner Selection mother consitutent head terminal i Method head terminal h head pre-terminal t, Results Reasons for Success constituent label l, References mother constituent label m, mother constituent head pre-terminal u mother consitutent head terminal i non-head (part of) L1...i −1 (L1...m , R1...i −1 ), constituent label Li (Ri ), head terminal h, end symbol ∆ head pre-terminal t, constituent label l, mother constituent label m, mother constituent head pre-terminal u
  • 14. Language Determiner Selection Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner ◮ for each NP, Selection Method ◮ for each possible determiner (the, a/an, null), Results Reasons for Success ◮ determine probability of NP References ◮ choose determiner resulting in highest probability ◮ note: sufficient to determine probabilities for events that differ
  • 15. Language Determiner Selection – Example Modeling in Turner&Charniak (2007) Kilian Evang ◮ “put [NP the ball] in the box” Language Models ◮ p(L1 = det|m = vp, u = verb, l = np, t = noun, h = N-gram LMs Charniak’s LM ball) × p(L2 = ∆|m = vp, u = verb, l = np, t = Determiner noun, h = ball, L1 = det) × p(det → the|m = vp, u = Selection verb, l = np, t = noun, h = ball, L1 = det) Method Results Reasons for Success ◮ “put [NP a/an ball] in the box” References ◮ p(L1 = det|m = vp, u = verb, l = np, t = noun, h = ball) × p(L2 = ∆|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det) × (p(det → a|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det) + p(det → an|m = vp, u = verb, l = np, t = noun, h = ball, L1 = det)) ◮ “put [NP ball] in the box” ◮ p(L1 = ∆|m = vp, u = verb, l = np, t = noun, h = ball)
  • 16. Language Results Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner Selection Method Results Reasons for Success References
  • 17. Language Reasons for Success Modeling in Turner&Charniak (2007) Kilian Evang Language Models N-gram LMs Charniak’s LM Determiner ◮ syntactic structure allows for long-distance conditioning, Selection e.g. Method Results Reasons for Success ◮ he [VP gave [NP the sultan of Brunei] [NP a cactus]] References ◮ constituent head enforces selectional preferences, reflected in head-first strategy ◮ ...
  • 18. Language References Modeling in Turner&Charniak (2007) Kilian Evang Language Models Eugene Charniak (2000) N-gram LMs A Maximum-Entropy-Inspired Parser Charniak’s LM Determiner Proceedings of the First Meeting of the North American Selection Chapter of the Association for Computational Linguistics Method Results Reasons for Success Eugene Charniak (2001) References Immediate-Head Parsing for Language Models Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics Jenine Turner & Eugene Charniak (2007) Language Modeling for Determiner Selection Proceedings of NAACL HLT 2007, Companion Volume