Information-Theoretic Analysis of Minimax Excess Risk
Hassan Hafez-Kolahi, Behrad Moniri, Shohreh Kasaei

TL;DR
This paper explores the information-theoretic properties of minimax excess risk in machine learning, establishing duality principles and applying Bayesian insights to derive bounds for classification and regression problems.
Contribution
It introduces a duality framework for minimax excess risk, linking it to Bayesian problems and enabling the use of information-theoretic bounds in this context.
Findings
Duality gap is zero under certain conditions when players choose randomly.
Order of play in the minimax game can be swapped, leading to a Bayesian formulation.
Provides bounds on minimax excess risk for classification and regularized least squares.
Abstract
Two main concepts studied in machine learning theory are generalization gap (difference between train and test error) and excess risk (difference between test error and the minimum possible error). While information-theoretic tools have been used extensively to study the generalization gap of learning algorithms, the information-theoretic nature of excess risk has not yet been fully investigated. In this paper, some steps are taken toward this goal. We consider the frequentist problem of minimax excess risk as a zero-sum game between the algorithm designer and the world. Then, we argue that it is desirable to modify this game in a way that the order of play can be swapped. We then prove that, under some regularity conditions, if the world and designer can play randomly the duality gap is zero and the order of play can be changed. In this case, a Bayesian problem surfaces in the dual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Machine Learning and Data Classification
