Statistical optimality conditions for compressive ensembles
Henry W. J. Reeve, Ata Kaban

TL;DR
This paper develops a theoretical framework for analyzing the optimality of ensemble methods trained on compressed high-dimensional data, showing conditions under which they achieve near-minimax rates.
Contribution
It introduces a dimension-independent excess risk bound for compressive ensembles and applies it to classification and regression with Johnson-Lindenstrauss mappings, revealing geometric conditions for optimality.
Findings
Achieves minimax-optimal rates under geometric margin conditions for classification.
Exploits spectral decay for near-optimal compressive regression.
Provides a high-probability bound on empirical process deviations.
Abstract
We present a framework for the theoretical analysis of ensembles of low-complexity empirical risk minimisers trained on independent random compressions of high-dimensional data. First we introduce a general distribution-dependent upper-bound on the excess risk, framed in terms of a natural notion of compressibility. This bound is independent of the dimension of the original data representation, and explains the in-built regularisation effect of the compressive approach. We then instantiate this general bound to classification and regression tasks, considering Johnson-Lindenstrauss mappings as the compression scheme. For each of these tasks, our strategy is to develop a tight upper bound on the compressibility function, and by doing so we discover distributional conditions of geometric nature under which the compressive algorithm attains minimax-optimal rates up to at most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Machine Learning and Algorithms
