Occam's Ghost

Peter Kövesarki

arXiv:2006.09813·stat.ML·June 18, 2020

TL;DR

This paper introduces a novel regularization method based on minimal bit encoding for non-parametric models, extending Occam's Razor to include model parameters, leading to more efficient probability density estimators.

Contribution

It extends the concept of data encoding to model parameters, providing a true measure of model complexity and enabling automatic feature and parameter pruning.

Findings

01 Minimizes total bits for better regularization

02 Prunes irrelevant parameters effectively

03 Detects features with low probability

Abstract

This article applies the principle of Occam's Razor to non-parametric model building of statistical data by finding a model with the minimal number of bits, leading to an exceptionally effective regularization method for probability density estimators. The idea comes from the fact that likelihood maximization also minimizes the number of bits required to encode a dataset. However, traditional methods overlook that the optimization of model parameters may also inadvertently play a part in encoding data points. The article shows how to extend the bit counting to the model parameters as well, providing the first true measure of complexity for parametric models. Minimizing the total bit requirement of a model of a dataset favors smaller derivatives, smoother probability density function estimates and, most importantly, a phase space with fewer relevant parameters. In fact, it is able…
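The link between likelihood maximization and bit counting mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's method: it assumes a single one-dimensional Gaussian model and data recorded to a fixed precision, and it counts only the bits for the data points, not the bits for the model parameters that the paper adds. The function name `data_bits` and the `precision` parameter are hypothetical and chosen for illustration only.

```python
import math
import random


def data_bits(xs, mu, sigma, precision=0.01):
    """Approximate bits needed to encode xs under N(mu, sigma^2).

    Assumption: each value is stored to a bin of width `precision`, so its
    probability mass is roughly pdf(x) * precision and its ideal code
    length is -log2 of that mass.
    """
    total = 0.0
    for x in xs:
        log_pdf = (-0.5 * math.log(2.0 * math.pi * sigma ** 2)
                   - (x - mu) ** 2 / (2.0 * sigma ** 2))
        total += -(log_pdf + math.log(precision)) / math.log(2.0)
    return total


random.seed(0)
xs = [random.gauss(1.0, 2.0) for _ in range(500)]

# Maximum-likelihood estimates of the Gaussian parameters.
mu_mle = sum(xs) / len(xs)
sigma_mle = math.sqrt(sum((x - mu_mle) ** 2 for x in xs) / len(xs))

bits_mle = data_bits(xs, mu_mle, sigma_mle)
# Any other parameter choice has lower likelihood, hence more bits.
bits_off = data_bits(xs, mu_mle + 1.0, sigma_mle * 1.5)

print(f"bits at MLE:        {bits_mle:.1f}")
print(f"bits off the MLE:   {bits_off:.1f}")
```

Because the code length is the negative base-2 log-likelihood plus a constant fixed by the precision, the maximum-likelihood parameters give the shortest encoding of the data within this model family. The sketch deliberately leaves out the cost of storing `mu` and `sigma` themselves, which is the term the abstract says traditional methods overlook.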

Code & Models

Repositories

freemeson/gaussian-mixture (Official)

Taxonomy

Topics: Neural Networks and Applications · Bayesian Methods and Mixture Models · Data Analysis with R