Error Reduction from Stacked Regressions

Xin Chen; Jason M. Klusowski; Yan Shuo Tan

arXiv:2309.09880·stat.ML·October 10, 2024·2 cites

Error Reduction from Stacked Regressions

Xin Chen, Jason M. Klusowski, Yan Shuo Tan

PDF

Open Access

TL;DR

This paper introduces a novel approach to stacking regressions by learning combination weights through regularized empirical risk minimization, resulting in improved predictive accuracy especially in low signal-to-noise scenarios.

Contribution

It demonstrates that under certain conditions, the proposed stacking method achieves strictly lower population risk than any single estimator, with computational efficiency comparable to isotonic regression.

Findings

01

Stacked estimator outperforms individual estimators in risk reduction.

02

The method is particularly effective when the signal-to-noise ratio is low.

03

Computational complexity is similar to that of isotonic regression.

Abstract

Stacking regressions is an ensemble technique that forms linear combinations of different regression estimators to enhance predictive accuracy. The conventional approach uses cross-validation data to generate predictions from the constituent estimators, and least-squares with nonnegativity constraints to learn the combination weights. In this paper, we learn these weights analogously by minimizing a regularized version of the empirical risk subject to a nonnegativity constraint. When the constituent estimators are linear least-squares projections onto nested subspaces separated by at least three dimensions, we show that thanks to an adaptive shrinkage effect, the resulting stacked estimator has strictly smaller population risk than best single estimator among them, with more significant gains when the signal-to-noise ratio is small. Here "best" refers to an estimator that minimizes a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Fault Detection and Control Systems