Sample-Near-Optimal Agnostic Boosting with Improved Running Time
Arthur da Cunha, Mikael M{\o}ller H{\o}gsgaard, Andrea Paudice

TL;DR
This paper introduces a new agnostic boosting algorithm that achieves near-optimal sample complexity and runs in polynomial time, improving the practicality of boosting in agnostic learning scenarios.
Contribution
It presents the first agnostic boosting algorithm with near-optimal sample complexity and polynomial running time, addressing a key computational challenge.
Findings
Achieves near-optimal sample complexity for agnostic boosting.
Runs in polynomial time with respect to sample size.
Advances the theoretical understanding of agnostic boosting algorithms.
Abstract
Boosting is a powerful method that turns weak learners, which perform only slightly better than random guessing, into strong learners with high accuracy. While boosting is well understood in the classic setting, it is less so in the agnostic case, where no assumptions are made about the data. Indeed, only recently was the sample complexity of agnostic boosting nearly settled arXiv:2503.09384, but the known algorithm achieving this bound has exponential running time. In this work, we propose the first agnostic boosting algorithm with near-optimal sample complexity, running in time polynomial in the sample size when considering the other parameters of the problem fixed.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques
