SlideShare une entreprise Scribd logo
1  sur  13
Language Modeling with the Maximum Likelihood Set (Karakos & Khudanpur, ISIT-2006) http://dx.doi.org/10.1109/ISIT.2006.261575 Yusuke Matsubara Tsujii lab. Meeting 2006-06-22
Necessity of smoothing ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
The Maximum Likelihood Set [ Jedynak & Khudanpur 2005 ] ,[object Object],[object Object],[object Object],p 1 +p 2 +p 3 =1 MLEs from possible counts 1 1 1 0 Sample size=3 #(word set) =3
The Maximum Likelihood Set (formal definition) k 2  linear inequality constraints
The Maximum Likelihood Set n = 3 (#samples) k = 3 (#word set) n = 10 (#samples) k = 3 (#word set) Larger samples Nearer to MLE
Choosing a pmf from a MLS ,[object Object],[object Object],reference MLS
Conditional pmf estimation ,[object Object],[object Object],[object Object],[object Object],[object Object]
Experimental results ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Bigram Trigram Witten-Bell Kneser-Ney Witten-Bell Kneser-Ney Reference 8.47 8.36 8.21 8.08 MLS 8.44 8.38 8.24 8.12
Conclusion ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
(0, 0, 1) (0, 1/3, 2/3) (0, 2/3, 1/3) (0, 1, 0) (0, 2/3, 1/3) (0, 1/3, 2/3) (0, 0, 1) (2/3, 0, 1/3) (1/3, 0, 2/3) (1/3, 1/3, 1/3)
 
 
 

Contenu connexe

Tendances

[Yang, Downey and Boyd-Graber 2015] Efficient Methods for Incorporating Knowl...
[Yang, Downey and Boyd-Graber 2015] Efficient Methods for Incorporating Knowl...[Yang, Downey and Boyd-Graber 2015] Efficient Methods for Incorporating Knowl...
[Yang, Downey and Boyd-Graber 2015] Efficient Methods for Incorporating Knowl...Shuyo Nakatani
 
Cs1123 9 strings
Cs1123 9 stringsCs1123 9 strings
Cs1123 9 stringsTAlha MAlik
 
Interactive Latent Dirichlet Allocation
Interactive Latent Dirichlet AllocationInteractive Latent Dirichlet Allocation
Interactive Latent Dirichlet AllocationQuentin Pleplé
 
Algorithm and complexity
Algorithm and complexityAlgorithm and complexity
Algorithm and complexityHABIB FIGA GUYE
 
Pigeonhole Principle
Pigeonhole  PrinciplePigeonhole  Principle
Pigeonhole Principlesheiblu
 
Introduction to Ultra-succinct representation of ordered trees with applications
Introduction to Ultra-succinct representation of ordered trees with applicationsIntroduction to Ultra-succinct representation of ordered trees with applications
Introduction to Ultra-succinct representation of ordered trees with applicationsYu Liu
 
Similarity Search in High Dimensions via Hashing
Similarity Search in High Dimensions via HashingSimilarity Search in High Dimensions via Hashing
Similarity Search in High Dimensions via HashingMaruf Aytekin
 
CPM2013-tabei201306
CPM2013-tabei201306CPM2013-tabei201306
CPM2013-tabei201306Yasuo Tabei
 
Fast and Accurate Preordering for SMT using Neural Networks
Fast and Accurate Preordering for SMT using Neural NetworksFast and Accurate Preordering for SMT using Neural Networks
Fast and Accurate Preordering for SMT using Neural NetworksSDL
 
A+Novel+Approach+Based+On+Prototypes+And+Rough+Sets+For+Document+And+Feature+...
A+Novel+Approach+Based+On+Prototypes+And+Rough+Sets+For+Document+And+Feature+...A+Novel+Approach+Based+On+Prototypes+And+Rough+Sets+For+Document+And+Feature+...
A+Novel+Approach+Based+On+Prototypes+And+Rough+Sets+For+Document+And+Feature+...marxliouville
 

Tendances (12)

[Yang, Downey and Boyd-Graber 2015] Efficient Methods for Incorporating Knowl...
[Yang, Downey and Boyd-Graber 2015] Efficient Methods for Incorporating Knowl...[Yang, Downey and Boyd-Graber 2015] Efficient Methods for Incorporating Knowl...
[Yang, Downey and Boyd-Graber 2015] Efficient Methods for Incorporating Knowl...
 
Incremental Evolving Grammar Fragments
Incremental Evolving Grammar FragmentsIncremental Evolving Grammar Fragments
Incremental Evolving Grammar Fragments
 
Cs1123 9 strings
Cs1123 9 stringsCs1123 9 strings
Cs1123 9 strings
 
Interactive Latent Dirichlet Allocation
Interactive Latent Dirichlet AllocationInteractive Latent Dirichlet Allocation
Interactive Latent Dirichlet Allocation
 
Algorithm and complexity
Algorithm and complexityAlgorithm and complexity
Algorithm and complexity
 
Pigeonhole Principle
Pigeonhole  PrinciplePigeonhole  Principle
Pigeonhole Principle
 
Introduction to Ultra-succinct representation of ordered trees with applications
Introduction to Ultra-succinct representation of ordered trees with applicationsIntroduction to Ultra-succinct representation of ordered trees with applications
Introduction to Ultra-succinct representation of ordered trees with applications
 
Chapter 03
Chapter 03Chapter 03
Chapter 03
 
Similarity Search in High Dimensions via Hashing
Similarity Search in High Dimensions via HashingSimilarity Search in High Dimensions via Hashing
Similarity Search in High Dimensions via Hashing
 
CPM2013-tabei201306
CPM2013-tabei201306CPM2013-tabei201306
CPM2013-tabei201306
 
Fast and Accurate Preordering for SMT using Neural Networks
Fast and Accurate Preordering for SMT using Neural NetworksFast and Accurate Preordering for SMT using Neural Networks
Fast and Accurate Preordering for SMT using Neural Networks
 
A+Novel+Approach+Based+On+Prototypes+And+Rough+Sets+For+Document+And+Feature+...
A+Novel+Approach+Based+On+Prototypes+And+Rough+Sets+For+Document+And+Feature+...A+Novel+Approach+Based+On+Prototypes+And+Rough+Sets+For+Document+And+Feature+...
A+Novel+Approach+Based+On+Prototypes+And+Rough+Sets+For+Document+And+Feature+...
 

En vedette

Mr F Stanley goes on a Trip
Mr F Stanley goes on a TripMr F Stanley goes on a Trip
Mr F Stanley goes on a Tripmrfstanley
 
Cheryl Soltis-Muth Resume 9-26-16 LI
Cheryl Soltis-Muth Resume 9-26-16 LICheryl Soltis-Muth Resume 9-26-16 LI
Cheryl Soltis-Muth Resume 9-26-16 LICheryl Soltis-Muth
 
Volantino Autotrend
Volantino AutotrendVolantino Autotrend
Volantino Autotrendtramutoauto
 
Marketing exchange notes @ bec doms
Marketing exchange notes @ bec domsMarketing exchange notes @ bec doms
Marketing exchange notes @ bec domsBabasab Patil
 
Aditya hridayam
Aditya hridayamAditya hridayam
Aditya hridayamshanaish8
 
Festival dell'ospitalita 2016: Destinazione Umana
Festival dell'ospitalita 2016: Destinazione UmanaFestival dell'ospitalita 2016: Destinazione Umana
Festival dell'ospitalita 2016: Destinazione UmanaEvermind
 
ALPINESTARS 2009 Silver Edition
ALPINESTARS 2009  Silver  EditionALPINESTARS 2009  Silver  Edition
ALPINESTARS 2009 Silver EditionAugusto Derosso
 
Rapid phenotyping of prawn biochemical attributes using hyperspectral imaging
Rapid phenotyping of prawn biochemical attributes using hyperspectral imagingRapid phenotyping of prawn biochemical attributes using hyperspectral imaging
Rapid phenotyping of prawn biochemical attributes using hyperspectral imagingStuart Hinchliff
 

En vedette (12)

Genre research (2)
Genre research  (2)Genre research  (2)
Genre research (2)
 
Budgeting
BudgetingBudgeting
Budgeting
 
Mr F Stanley goes on a Trip
Mr F Stanley goes on a TripMr F Stanley goes on a Trip
Mr F Stanley goes on a Trip
 
Cheryl Soltis-Muth Resume 9-26-16 LI
Cheryl Soltis-Muth Resume 9-26-16 LICheryl Soltis-Muth Resume 9-26-16 LI
Cheryl Soltis-Muth Resume 9-26-16 LI
 
Volantino Autotrend
Volantino AutotrendVolantino Autotrend
Volantino Autotrend
 
Numero2
Numero2 Numero2
Numero2
 
Marketing exchange notes @ bec doms
Marketing exchange notes @ bec domsMarketing exchange notes @ bec doms
Marketing exchange notes @ bec doms
 
Aditya hridayam
Aditya hridayamAditya hridayam
Aditya hridayam
 
Festival dell'ospitalita 2016: Destinazione Umana
Festival dell'ospitalita 2016: Destinazione UmanaFestival dell'ospitalita 2016: Destinazione Umana
Festival dell'ospitalita 2016: Destinazione Umana
 
ALPINESTARS 2009 Silver Edition
ALPINESTARS 2009  Silver  EditionALPINESTARS 2009  Silver  Edition
ALPINESTARS 2009 Silver Edition
 
Rapid phenotyping of prawn biochemical attributes using hyperspectral imaging
Rapid phenotyping of prawn biochemical attributes using hyperspectral imagingRapid phenotyping of prawn biochemical attributes using hyperspectral imaging
Rapid phenotyping of prawn biochemical attributes using hyperspectral imaging
 
Smartphones im Unterricht
Smartphones im UnterrichtSmartphones im Unterricht
Smartphones im Unterricht
 

Similaire à Maximum likelihood-set - introduction

Continuous Architecting of Stream-Based Systems
Continuous Architecting of Stream-Based SystemsContinuous Architecting of Stream-Based Systems
Continuous Architecting of Stream-Based SystemsCHOOSE
 
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...Pooyan Jamshidi
 
Adversarial_Examples_in_Audio_and_Text.pptx
Adversarial_Examples_in_Audio_and_Text.pptxAdversarial_Examples_in_Audio_and_Text.pptx
Adversarial_Examples_in_Audio_and_Text.pptxujjawalchaurasia1
 
IGARSSWellLog_Vancouver_07_29.pptx
IGARSSWellLog_Vancouver_07_29.pptxIGARSSWellLog_Vancouver_07_29.pptx
IGARSSWellLog_Vancouver_07_29.pptxgrssieee
 
2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.pptmilkesa13
 
Machine Learning meets DevOps
Machine Learning meets DevOpsMachine Learning meets DevOps
Machine Learning meets DevOpsPooyan Jamshidi
 
NIPS2007: structured prediction
NIPS2007: structured predictionNIPS2007: structured prediction
NIPS2007: structured predictionzukun
 
Language Technology Enhanced Learning
Language Technology Enhanced LearningLanguage Technology Enhanced Learning
Language Technology Enhanced Learningtelss09
 
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015rusbase
 
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding AlgorithmFixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding AlgorithmCSCJournals
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligencevini89
 
Automated Essay Scoring Using Efficient Transformer-Based Language Models
Automated Essay Scoring Using Efficient Transformer-Based Language ModelsAutomated Essay Scoring Using Efficient Transformer-Based Language Models
Automated Essay Scoring Using Efficient Transformer-Based Language ModelsNat Rice
 
SNLI_presentation_2
SNLI_presentation_2SNLI_presentation_2
SNLI_presentation_2Viral Gupta
 
Learning deep structured semantic models for web search
Learning deep structured semantic models for web searchLearning deep structured semantic models for web search
Learning deep structured semantic models for web searchhyunsung lee
 
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderIterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderCSCJournals
 
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderIterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderCSCJournals
 
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...iyo
 
multipleSeqAlignment.ppta4432455344534534
multipleSeqAlignment.ppta4432455344534534multipleSeqAlignment.ppta4432455344534534
multipleSeqAlignment.ppta4432455344534534alizain9604
 

Similaire à Maximum likelihood-set - introduction (20)

Continuous Architecting of Stream-Based Systems
Continuous Architecting of Stream-Based SystemsContinuous Architecting of Stream-Based Systems
Continuous Architecting of Stream-Based Systems
 
Theory of computing
Theory of computingTheory of computing
Theory of computing
 
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
 
Adversarial_Examples_in_Audio_and_Text.pptx
Adversarial_Examples_in_Audio_and_Text.pptxAdversarial_Examples_in_Audio_and_Text.pptx
Adversarial_Examples_in_Audio_and_Text.pptx
 
IGARSSWellLog_Vancouver_07_29.pptx
IGARSSWellLog_Vancouver_07_29.pptxIGARSSWellLog_Vancouver_07_29.pptx
IGARSSWellLog_Vancouver_07_29.pptx
 
2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt
 
Machine Learning meets DevOps
Machine Learning meets DevOpsMachine Learning meets DevOps
Machine Learning meets DevOps
 
NIPS2007: structured prediction
NIPS2007: structured predictionNIPS2007: structured prediction
NIPS2007: structured prediction
 
Language Technology Enhanced Learning
Language Technology Enhanced LearningLanguage Technology Enhanced Learning
Language Technology Enhanced Learning
 
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
Сергей Кольцов —НИУ ВШЭ —ICBDA 2015
 
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding AlgorithmFixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Automated Essay Scoring Using Efficient Transformer-Based Language Models
Automated Essay Scoring Using Efficient Transformer-Based Language ModelsAutomated Essay Scoring Using Efficient Transformer-Based Language Models
Automated Essay Scoring Using Efficient Transformer-Based Language Models
 
SNLI_presentation_2
SNLI_presentation_2SNLI_presentation_2
SNLI_presentation_2
 
defense
defensedefense
defense
 
Learning deep structured semantic models for web search
Learning deep structured semantic models for web searchLearning deep structured semantic models for web search
Learning deep structured semantic models for web search
 
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderIterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO Decoder
 
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO DecoderIterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO Decoder
 
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
 
multipleSeqAlignment.ppta4432455344534534
multipleSeqAlignment.ppta4432455344534534multipleSeqAlignment.ppta4432455344534534
multipleSeqAlignment.ppta4432455344534534
 

Maximum likelihood-set - introduction

  • 1. Language Modeling with the Maximum Likelihood Set (Karakos & Khudanpur, ISIT-2006) http://dx.doi.org/10.1109/ISIT.2006.261575 Yusuke Matsubara Tsujii lab. Meeting 2006-06-22
  • 2.
  • 3.
  • 4. The Maximum Likelihood Set (formal definition) k 2 linear inequality constraints
  • 5. The Maximum Likelihood Set n = 3 (#samples) k = 3 (#word set) n = 10 (#samples) k = 3 (#word set) Larger samples Nearer to MLE
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. (0, 0, 1) (0, 1/3, 2/3) (0, 2/3, 1/3) (0, 1, 0) (0, 2/3, 1/3) (0, 1/3, 2/3) (0, 0, 1) (2/3, 0, 1/3) (1/3, 0, 2/3) (1/3, 1/3, 1/3)
  • 11.  
  • 12.  
  • 13.