Statistical optimality conditions for compressive ensembles

Henry W. J. Reeve; Ata Kaban

arXiv:2106.01092·cs.LG·June 3, 2021·1 cites

Statistical optimality conditions for compressive ensembles

Henry W. J. Reeve, Ata Kaban

PDF

Open Access

TL;DR

This paper develops a theoretical framework for analyzing the optimality of ensemble methods trained on compressed high-dimensional data, showing conditions under which they achieve near-minimax rates.

Contribution

It introduces a dimension-independent excess risk bound for compressive ensembles and applies it to classification and regression with Johnson-Lindenstrauss mappings, revealing geometric conditions for optimality.

Findings

01

Achieves minimax-optimal rates under geometric margin conditions for classification.

02

Exploits spectral decay for near-optimal compressive regression.

03

Provides a high-probability bound on empirical process deviations.

Abstract

We present a framework for the theoretical analysis of ensembles of low-complexity empirical risk minimisers trained on independent random compressions of high-dimensional data. First we introduce a general distribution-dependent upper-bound on the excess risk, framed in terms of a natural notion of compressibility. This bound is independent of the dimension of the original data representation, and explains the in-built regularisation effect of the compressive approach. We then instantiate this general bound to classification and regression tasks, considering Johnson-Lindenstrauss mappings as the compression scheme. For each of these tasks, our strategy is to develop a tight upper bound on the compressibility function, and by doing so we discover distributional conditions of geometric nature under which the compressive algorithm attains minimax-optimal rates up to at most…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Machine Learning and Algorithms