Information-Theoretic Foundations for Machine Learning

Hong Jun Jeon; Benjamin Van Roy

arXiv:2407.12288·stat.ML·May 23, 2025·2 cites

Information-Theoretic Foundations for Machine Learning

Hong Jun Jeon, Benjamin Van Roy

PDF

Open Access

TL;DR

This paper introduces a rigorous information-theoretic framework based on Bayesian statistics and Shannon's theory to analyze and unify diverse machine learning phenomena, providing both theoretical insights and practical intuition.

Contribution

It presents a general, mathematically rigorous framework that unifies analysis across various learning paradigms and offers new insights into machine learning performance and challenges.

Findings

01

Framework applies to i.i.d., sequential, hierarchical, and misspecified data.

02

Provides accurate performance insights across diverse learning settings.

03

Guides future research in complex machine learning scenarios.

Abstract

The progress of machine learning over the past decade is undeniable. In retrospect, it is both remarkable and unsettling that this progress was achievable with little to no rigorous theory to guide experimentation. Despite this fact, practitioners have been able to guide their future experimentation via observations from previous large-scale empirical investigations. In this work, we propose a theoretical framework which attempts to provide rigor to existing practices in machine learning. To the theorist, we provide a framework which is mathematically rigorous and leaves open many interesting ideas for future exploration. To the practitioner, we provide a framework whose results are simple, and provide intuition to guide future investigations across a wide range of learning paradigms. Concretely, we provide a theoretical framework rooted in Bayesian statistics and Shannon's information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications