An Impossibility Result for High Dimensional Supervised Learning
Mohammad Hossein Rohban, Prakash Ishwar, Birant Orten, William C., Karl, Venkatesh Saligrama

TL;DR
This paper demonstrates that in high-dimensional supervised learning with Gaussian classes, the minimal achievable error remains high despite the Bayes error approaching zero, highlighting the importance of structural assumptions for effective learning.
Contribution
It establishes an impossibility result showing that without strong structural constraints, supervised learning cannot reliably outperform random guessing in high dimensions.
Findings
Minimax error remains high even when Bayes error approaches zero.
Supervised learning effectiveness depends on imposing structural constraints.
High-dimensional limits reveal fundamental limitations of generic Gaussian models.
Abstract
We study high-dimensional asymptotic performance limits of binary supervised classification problems where the class conditional densities are Gaussian with unknown means and covariances and the number of signal dimensions scales faster than the number of labeled training samples. We show that the Bayes error, namely the minimum attainable error probability with complete distributional knowledge and equally likely classes, can be arbitrarily close to zero and yet the limiting minimax error probability of every supervised learning algorithm is no better than a random coin toss. In contrast to related studies where the classification difficulty (Bayes error) is made to vanish, we hold it constant when taking high-dimensional limits. In contrast to VC-dimension based minimax lower bounds that consider the worst case error probability over all distributions that have a fixed Bayes error,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
