An Impossibility Result for High Dimensional Supervised Learning

Mohammad Hossein Rohban; Prakash Ishwar; Birant Orten; William C.; Karl; Venkatesh Saligrama

arXiv:1301.6915·stat.ML·November 17, 2016·ITW

An Impossibility Result for High Dimensional Supervised Learning

Mohammad Hossein Rohban, Prakash Ishwar, Birant Orten, William C., Karl, Venkatesh Saligrama

PDF

TL;DR

This paper demonstrates that in high-dimensional supervised learning with Gaussian classes, the minimal achievable error remains high despite the Bayes error approaching zero, highlighting the importance of structural assumptions for effective learning.

Contribution

It establishes an impossibility result showing that without strong structural constraints, supervised learning cannot reliably outperform random guessing in high dimensions.

Findings

01

Minimax error remains high even when Bayes error approaches zero.

02

Supervised learning effectiveness depends on imposing structural constraints.

03

High-dimensional limits reveal fundamental limitations of generic Gaussian models.

Abstract

We study high-dimensional asymptotic performance limits of binary supervised classification problems where the class conditional densities are Gaussian with unknown means and covariances and the number of signal dimensions scales faster than the number of labeled training samples. We show that the Bayes error, namely the minimum attainable error probability with complete distributional knowledge and equally likely classes, can be arbitrarily close to zero and yet the limiting minimax error probability of every supervised learning algorithm is no better than a random coin toss. In contrast to related studies where the classification difficulty (Bayes error) is made to vanish, we hold it constant when taking high-dimensional limits. In contrast to VC-dimension based minimax lower bounds that consider the worst case error probability over all distributions that have a fixed Bayes error,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.