ITBoost: Information-Theoretic Trust for Robust Boosting
Ye Su, Longlong Zhao, Diego Garcia-Gil, Jipeng Guo, Gangchun Zhang, Jinxin Chen, Jinsong Chen

TL;DR
ITBoost introduces an information-theoretic approach to enhance the robustness of gradient boosting algorithms against noisy labels by evaluating residual evolution patterns.
Contribution
It proposes a novel residual trajectory complexity measure using MDL, improving robustness in noisy label scenarios compared to existing methods.
Findings
ITBoost outperforms leading boosting models on noisy tabular data.
It maintains high accuracy on clean data.
Provides a tighter generalization bound under label noise.
Abstract
Gradient boosting remains a strong and widely used method for tabular data learning, but its performance often degrades when training labels are noisy. This behavior is largely related to the way boosting algorithms emphasize samples with large gradients, without explicitly accounting for whether such errors originate from informative hard cases or from unreliable labels. We address this issue by reconsidering how sample reliability is evaluated during boosting. Instead of relying on instantaneous error, we examine the evolution of each sample's residuals across iterations. Based on this insight, we propose Information-Theoretic Trust Boosting (ITBoost), which uses the Minimum Description Length principle to measure the complexity of residual trajectories. Samples whose residual patterns fluctuate in an irregular manner are treated as less trustworthy and are down-weighted during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
