Minimum Description Length Principle for Maximum Entropy Model Selection

Gaurav Pandey; Ambedkar Dukkipati

arXiv:1204.6423·cs.IT·November 28, 2013

Minimum Description Length Principle for Maximum Entropy Model Selection

Gaurav Pandey, Ambedkar Dukkipati

PDF

TL;DR

This paper introduces an MDL-based approach for selecting maximum entropy models, deriving NML codelengths, and demonstrating its application to gene selection with promising simulation results.

Contribution

It formulates maximum entropy model selection as an MDL problem and derives the NML codelength for these models, connecting it to the minimax entropy principle.

Findings

01

Derived NML codelengths for maximum entropy models

02

Proved minimax entropy as a special case of model selection

03

Applied method successfully to gene selection problem

Abstract

Model selection is central to statistics, and many learning problems can be formulated as model selection problems. In this paper, we treat the problem of selecting a maximum entropy model given various feature subsets and their moments, as a model selection problem, and present a minimum description length (MDL) formulation to solve this problem. For this, we derive normalized maximum likelihood (NML) codelength for these models. Furthermore, we prove that the minimax entropy principle is a special case of maximum entropy model selection, where one assumes that complexity of all the models are equal. We apply our approach to gene selection problem and present simulation results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.