Exact Finite-Sample Variance Decomposition of Subagging: A Spectral Filtering Perspective

Ye Su; Mingrui Ye; Yining Wang; Jipeng Guo; and Yong Liu

arXiv:2604.10469·cs.LG·April 14, 2026

Exact Finite-Sample Variance Decomposition of Subagging: A Spectral Filtering Perspective

Ye Su, Mingrui Ye, Yining Wang, Jipeng Guo, and Yong Liu

PDF

TL;DR

This paper provides an exact finite-sample variance decomposition for subagging, revealing it as a spectral filter that selectively attenuates high-order interaction variance, and introduces an adaptive subsampling algorithm based on these insights.

Contribution

It derives the first exact finite-sample variance decomposition for subagging applicable to any symmetric learner, connecting spectral filtering with regularization and proposing a complexity-guided adaptive subsampling method.

Findings

01

Subagging acts as a low-pass spectral filter attenuating high-order interactions.

02

Default resampling ratios often under-regularize high-capacity learners.

03

Adaptive calibration of resampling ratio improves generalization.

Abstract

Standard resampling ratios (e.g., $α \approx 0.632$ ) are widely used as default baselines in ensemble learning for three decades. However, how these ratios interact with a base learner's intrinsic functional complexity in finite samples lacks a exact mathematical characterization. We leverage the Hoeffding-ANOVA decomposition to derive the first exact, finite-sample variance decomposition for subagging, applicable to any symmetric base learner without requiring asymptotic limits or smoothness assumptions. We establish that subagging operates as a deterministic low-pass spectral filter: it preserves low-order structural signals while attenuating $c$ -th order interaction variance by a geometric factor approaching $α^{c}$ . This decoupling reveals why default baselines often under-regularize high-capacity interpolators, which instead require smaller $α$ to exponentially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.