Deviation optimal learning using greedy Q-aggregation
Dong Dai, Philippe Rigollet, Tong Zhang

TL;DR
This paper introduces Q-aggregation, a new approach for model selection that achieves optimal deviation bounds and sparsity, improving upon existing exponential weights methods in regression tasks.
Contribution
The paper proposes Q-aggregation, a novel formulation that provides sharp, minimax optimal deviation bounds and enables the design of greedy, sparse aggregation procedures.
Findings
Q-aggregation achieves sharp oracle inequalities.
Greedy procedures produce sparse models with optimal rates.
Compared methods show improved performance on simulations.
Abstract
Given a finite family of functions, the goal of model selection aggregation is to construct a procedure that mimics the function from this family that is the closest to an unknown regression function. More precisely, we consider a general regression model with fixed design and measure the distance between functions by the mean squared error at the design points. While procedures based on exponential weights are known to solve the problem of model selection aggregation in expectation, they are, surprisingly, sub-optimal in deviation. We propose a new formulation called Q-aggregation that addresses this limitation; namely, its solution leads to sharp oracle inequalities that are optimal in a minimax sense. Moreover, based on the new formulation, we design greedy Q-aggregation procedures that produce sparse aggregation models achieving the optimal rate. The convergence and performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
