Jeffreys' and BDeu Priors for Model Selection

•Télécharger en tant que PPTX, PDF•

0 j'aime•829 vues

The Ninth Workshop on Information Theoretic Methods in Science and Engineering (WITMSE), Helsinki, Finland, on September 19–21, 2016.

Sciences

Jeffreys' and BDeu Priors for Model Selection
WITMSE 2016
Helsinki, Finland, September 20
Joe Suzuki
(prof-joe)
Joe Suzuki (Osaka Univ., Japan)

Goal and Contributions
[Goal]
Compare for model selection
• BDeu (Bayesian Dirichlet equivalent uniform)
• Jeffreys prior (T-K estimator)
[Contribution]
Mathematically Proves

Road Map
1. Bayesian Dirichlet Scores
2. BDeu and Jeffreys Scores
3. A Found Property and its Proof
4. Main Theorem
5. Regularity in Model Selection
6. Summary

Express a Prob. by the product of Cond. Probs.

Example 1 : Bayesian Network Structure Learning (BNSL)

Regularity in Model Selection
Fitness + Simplicity → optimal
(-1) x Likelihood + Penalty Term → min
Newton’s
Law of
Motion
Maxwell
Equations
If model A is better than model B w.r.t. fitness and simplicity,
model A should be chosen (regularity).
Information Criteria
LASSO

BDeu violates regularity in model selection
Z XZ X
Y
Y X

B&B for efficient BNSL (Depth First Search)

Those bounds utilize regularity
Campos and Ji 2011 figured out one (=nice)
but the bound is not efficient (experiments).
Designing Pruning rules for BDeu is HARDer.
because regularity cannot be assumed

Bayes Prior
Based on his/her Belief:
Nobody should reject it from a general point of view.
BDeu violates regularity
contradicts with Newton, Maxwell, Information Critreria, LASSO, etc.
People might notice that their beliefs have been
wrong, after knowing the new result in this paper.

Summary
The prior behind BDeu might have been based on a wrong belief
That contradicts regularity in model selection
Future Work: Consider NML and others in a similar way

Recommandé

Ssbse12a.pptPtidej Team

2014 9-26Joe Suzuki

連続変量を含む相互情報量の推定Joe Suzuki

相互情報量を用いた独立性の検定Joe Suzuki

Decision Support Analyss for Software Effort Estimation by AnalogyTim Menzies

Ordinal Common-sense InferenceNaoki Otani

Ssbse12a.pptYann-Gaël Guéhéneuc

Ensemble Learning Featuring the Netflix Prize Competition and ...butest

Recommandé

Ssbse12a.pptPtidej Team

2014 9-26Joe Suzuki

連続変量を含む相互情報量の推定Joe Suzuki

相互情報量を用いた独立性の検定Joe Suzuki

Decision Support Analyss for Software Effort Estimation by AnalogyTim Menzies

Ordinal Common-sense InferenceNaoki Otani

Ssbse12a.pptYann-Gaël Guéhéneuc

Ensemble Learning Featuring the Netflix Prize Competition and ...butest

RとPythonを比較するJoe Suzuki

R集会@統数研Joe Suzuki

E-learning Development of Statistics and in Duex: Practical Approaches and Th...Joe Suzuki

分枝限定法でモデル選択の計算量を低減するJoe Suzuki

連続変量を含む条件付相互情報量の推定Joe Suzuki

E-learning Design and Development for Data Science in Osaka UniversityJoe Suzuki

UAI 2017Joe Suzuki

AMBN2017 サテライトワークショップJoe Suzuki

CRAN Rパッケージ BNSLの概要Joe Suzuki

Forest Learning from DataJoe Suzuki

A Bayesian Approach to Data CompressionJoe Suzuki

A Conjecture on Strongly Consistent LearningJoe Suzuki

A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...Joe Suzuki

A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...Joe Suzuki

研究紹介(学生向け)Joe Suzuki

Bayesian Criteria based on Universal MeasuresJoe Suzuki

MDL/Bayesian Criteria based on Universal Coding/MeasureJoe Suzuki

The Universal Measure for General Sources and its Application to MDL/Bayesian...Joe Suzuki

Universal Prediction without assuming either Discrete or ContinuousJoe Suzuki

Bayesian network structure estimation based on the Bayesian/MDL criteria when...Joe Suzuki

Biological Classification BioHack (3).pdfmuntazimhurra

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani

Contenu connexe

Plus de Joe Suzuki

RとPythonを比較するJoe Suzuki

R集会@統数研Joe Suzuki

E-learning Development of Statistics and in Duex: Practical Approaches and Th...Joe Suzuki

分枝限定法でモデル選択の計算量を低減するJoe Suzuki

連続変量を含む条件付相互情報量の推定Joe Suzuki

E-learning Design and Development for Data Science in Osaka UniversityJoe Suzuki

UAI 2017Joe Suzuki

AMBN2017 サテライトワークショップJoe Suzuki

CRAN Rパッケージ BNSLの概要Joe Suzuki

Forest Learning from DataJoe Suzuki

A Bayesian Approach to Data CompressionJoe Suzuki

A Conjecture on Strongly Consistent LearningJoe Suzuki

A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...Joe Suzuki

A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...Joe Suzuki

研究紹介(学生向け)Joe Suzuki

Bayesian Criteria based on Universal MeasuresJoe Suzuki

MDL/Bayesian Criteria based on Universal Coding/MeasureJoe Suzuki

The Universal Measure for General Sources and its Application to MDL/Bayesian...Joe Suzuki

Universal Prediction without assuming either Discrete or ContinuousJoe Suzuki

Bayesian network structure estimation based on the Bayesian/MDL criteria when...Joe Suzuki

Plus de Joe Suzuki (20)

RとPythonを比較する

R集会@統数研

E-learning Development of Statistics and in Duex: Practical Approaches and Th...

分枝限定法でモデル選択の計算量を低減する

連続変量を含む条件付相互情報量の推定

E-learning Design and Development for Data Science in Osaka University

UAI 2017

AMBN2017 サテライトワークショップ

CRAN Rパッケージ BNSLの概要

Forest Learning from Data

A Bayesian Approach to Data Compression

A Conjecture on Strongly Consistent Learning

A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...

A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...

研究紹介(学生向け)

Bayesian Criteria based on Universal Measures

MDL/Bayesian Criteria based on Universal Coding/Measure

The Universal Measure for General Sources and its Application to MDL/Bayesian...

Universal Prediction without assuming either Discrete or Continuous

Bayesian network structure estimation based on the Bayesian/MDL criteria when...

Dernier

Biological Classification BioHack (3).pdfmuntazimhurra

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani

Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra

Zoology 4th semester series (krishna).pdfSumit Kumar yadav

The Philosophy of ScienceUniversity of Hertfordshire

Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823

Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009

Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani

Disentangling the origin of chemical differences using GHOSTSérgio Sacani

Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani

Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174

Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju

Nanoparticles synthesis and characterization kaibalyasahoo82800

Animal Communication- Auditory and Visual.pptxUmerFayaz5

Botany 4th semester series (krishna).pdfSumit Kumar yadav

Natural Polymer Based NanomaterialsAArockiyaNisha

Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1

Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385

Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav

Dernier (20)

Biological Classification BioHack (3).pdf

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...

Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis

Zoology 4th semester series (krishna).pdf

The Philosophy of Science

Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...

Presentation Vikram Lander by Vedansh Gupta.pptx

Hubble Asteroid Hunter III. Physical properties of newly found asteroids

Disentangling the origin of chemical differences using GHOST

Spermiogenesis or Spermateleosis or metamorphosis of spermatid

All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...

Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN

Pests of mustard_Identification_Management_Dr.UPR.pdf

Nanoparticles synthesis and characterization

Animal Communication- Auditory and Visual.pptx

Botany 4th semester series (krishna).pdf

Natural Polymer Based Nanomaterials

Recombinant DNA technology (Immunological screening)

Pulmonary drug delivery system M.pharm -2nd sem P'ceutics

Botany 4th semester file By Sumit Kumar yadav.pdf

Jeffreys' and BDeu Priors for Model Selection

1. Jeffreys' and BDeu Priors for Model Selection WITMSE 2016 Helsinki, Finland, September 20 Joe Suzuki (prof-joe) Joe Suzuki (Osaka Univ., Japan)

2. Goal and Contributions [Goal] Compare for model selection • BDeu (Bayesian Dirichlet equivalent uniform) • Jeffreys prior (T-K estimator) [Contribution] Mathematically Proves

3. Road Map 1. Bayesian Dirichlet Scores 2. BDeu and Jeffreys Scores 3. A Found Property and its Proof 4. Main Theorem 5. Regularity in Model Selection 6. Summary

4. Assign a Prob. to each Seq.

5. Express a Prob. by the product of Cond. Probs.

6. Simultaneous Probs.

7. Cond. Probs.

8. BDeu and Jeffreys’ Prior

10. Example 1 : Bayesian Network Structure Learning (BNSL)

11. Example 2: Independence Testing

12. A Motivating Example

13. A Found Property

14. Sketch of J(n)>0 for BDeu

15. Sketch of J(n)≦0 for Jeffreys’

16. An Intuitive Reasoning

17. Main Theorem

18. Examples more likely unlikely

19. Regularity in Model Selection Fitness + Simplicity → optimal (-1) x Likelihood + Penalty Term → min Newton’s Law of Motion Maxwell Equations If model A is better than model B w.r.t. fitness and simplicity, model A should be chosen (regularity). Information Criteria LASSO

20. BDeu violates regularity in model selection Z XZ X Y Y X

21. B&B for efficient BNSL (Depth First Search)

22. Those bounds utilize regularity Campos and Ji 2011 figured out one (=nice) but the bound is not efficient (experiments). Designing Pruning rules for BDeu is HARDer. because regularity cannot be assumed

23. Bayes Prior Based on his/her Belief: Nobody should reject it from a general point of view. BDeu violates regularity contradicts with Newton, Maxwell, Information Critreria, LASSO, etc. People might notice that their beliefs have been wrong, after knowing the new result in this paper.

24. Summary The prior behind BDeu might have been based on a wrong belief That contradicts regularity in model selection Future Work: Consider NML and others in a similar way