How to Boost Any Loss Function
Richard Nock, Yishay Mansour

TL;DR
This paper demonstrates that boosting can optimize any loss function, including non-convex, non-differentiable, and discontinuous ones, by leveraging tools from quantum calculus, expanding its applicability beyond traditional first-order methods.
Contribution
It provides a formal proof that boosting can optimize arbitrary loss functions without requiring first-order information, challenging the conventional limitations of boosting methods.
Findings
Boosting can optimize any loss function, regardless of convexity or differentiability.
Quantum calculus tools enable boosting to operate without first-order information.
Boosting surpasses classical $0^{th}$ order limitations in optimization.
Abstract
Boosting is a highly successful ML-born optimization setting in which one is required to computationally efficiently learn arbitrarily good models based on the access to a weak learner oracle, providing classifiers performing at least slightly differently from random guessing. A key difference with gradient-based optimization is that boosting's original model does not requires access to first order information about a loss, yet the decades long history of boosting has quickly evolved it into a first order optimization setting -- sometimes even wrongfully defining it as such. Owing to recent progress extending gradient-based optimization to use only a loss' zeroth () order information to learn, this begs the question: what loss functions can be efficiently optimized with boosting and what is the information really needed for boosting to meet the original boosting blueprint's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsProbability and Risk Models
