Information-Theoretic Foundations for Machine Learning
Hong Jun Jeon, Benjamin Van Roy

TL;DR
This paper introduces a rigorous information-theoretic framework based on Bayesian statistics and Shannon's theory to analyze and unify diverse machine learning phenomena, providing both theoretical insights and practical intuition.
Contribution
It presents a general, mathematically rigorous framework that unifies analysis across various learning paradigms and offers new insights into machine learning performance and challenges.
Findings
Framework applies to i.i.d., sequential, hierarchical, and misspecified data.
Provides accurate performance insights across diverse learning settings.
Guides future research in complex machine learning scenarios.
Abstract
The progress of machine learning over the past decade is undeniable. In retrospect, it is both remarkable and unsettling that this progress was achievable with little to no rigorous theory to guide experimentation. Despite this fact, practitioners have been able to guide their future experimentation via observations from previous large-scale empirical investigations. In this work, we propose a theoretical framework which attempts to provide rigor to existing practices in machine learning. To the theorist, we provide a framework which is mathematically rigorous and leaves open many interesting ideas for future exploration. To the practitioner, we provide a framework whose results are simple, and provide intuition to guide future investigations across a wide range of learning paradigms. Concretely, we provide a theoretical framework rooted in Bayesian statistics and Shannon's information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
