Exact Finite-Sample Variance Decomposition of Subagging: A Spectral Filtering Perspective
Ye Su, Mingrui Ye, Yining Wang, Jipeng Guo, and Yong Liu

TL;DR
This paper provides an exact finite-sample variance decomposition for subagging, revealing it as a spectral filter that selectively attenuates high-order interaction variance, and introduces an adaptive subsampling algorithm based on these insights.
Contribution
It derives the first exact finite-sample variance decomposition for subagging applicable to any symmetric learner, connecting spectral filtering with regularization and proposing a complexity-guided adaptive subsampling method.
Findings
Subagging acts as a low-pass spectral filter attenuating high-order interactions.
Default resampling ratios often under-regularize high-capacity learners.
Adaptive calibration of resampling ratio improves generalization.
Abstract
Standard resampling ratios (e.g., ) are widely used as default baselines in ensemble learning for three decades. However, how these ratios interact with a base learner's intrinsic functional complexity in finite samples lacks a exact mathematical characterization. We leverage the Hoeffding-ANOVA decomposition to derive the first exact, finite-sample variance decomposition for subagging, applicable to any symmetric base learner without requiring asymptotic limits or smoothness assumptions. We establish that subagging operates as a deterministic low-pass spectral filter: it preserves low-order structural signals while attenuating -th order interaction variance by a geometric factor approaching . This decoupling reveals why default baselines often under-regularize high-capacity interpolators, which instead require smaller to exponentially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
