Sample-Near-Optimal Agnostic Boosting with Improved Running Time

Arthur da Cunha; Mikael M{\o}ller H{\o}gsgaard; Andrea Paudice

arXiv:2601.11265·cs.LG·February 4, 2026

Sample-Near-Optimal Agnostic Boosting with Improved Running Time

Arthur da Cunha, Mikael M{\o}ller H{\o}gsgaard, Andrea Paudice

PDF

Open Access

TL;DR

This paper introduces a new agnostic boosting algorithm that achieves near-optimal sample complexity and runs in polynomial time, improving the practicality of boosting in agnostic learning scenarios.

Contribution

It presents the first agnostic boosting algorithm with near-optimal sample complexity and polynomial running time, addressing a key computational challenge.

Findings

01

Achieves near-optimal sample complexity for agnostic boosting.

02

Runs in polynomial time with respect to sample size.

03

Advances the theoretical understanding of agnostic boosting algorithms.

Abstract

Boosting is a powerful method that turns weak learners, which perform only slightly better than random guessing, into strong learners with high accuracy. While boosting is well understood in the classic setting, it is less so in the agnostic case, where no assumptions are made about the data. Indeed, only recently was the sample complexity of agnostic boosting nearly settled arXiv:2503.09384, but the known algorithm achieving this bound has exponential running time. In this work, we propose the first agnostic boosting algorithm with near-optimal sample complexity, running in time polynomial in the sample size when considering the other parameters of the problem fixed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques