Theoretical and Empirical Analysis of a Parallel Boosting Algorithm
Uday Kamath, Carlotta Domeniconi, Kenneth De Jong

TL;DR
This paper introduces PSBML, a parallel boosting meta-algorithm that combines ensemble and boosting concepts to improve scalability in machine learning without losing accuracy, supported by theoretical and empirical evidence.
Contribution
It presents a novel parallel boosting algorithm, PSBML, with theoretical guarantees and extensive empirical validation demonstrating its scalability and robustness.
Findings
PSBML converges to a distribution centered around the margin.
It maintains accuracy while significantly improving scalability.
The algorithm is robust to noise and effective across various classifiers.
Abstract
Many real-world problems involve massive amounts of data. Under these circumstances learning algorithms often become prohibitively expensive, making scalability a pressing issue to be addressed. A common approach is to perform sampling to reduce the size of the dataset and enable efficient learning. Alternatively, one customizes learning algorithms to achieve scalability. In either case, the key challenge is to obtain algorithmic efficiency without compromising the quality of the results. In this paper we discuss a meta-learning algorithm (PSBML) which combines features of parallel algorithms with concepts from ensemble and boosting methodologies to achieve the desired scalability property. We present both theoretical and empirical analyses which show that PSBML preserves a critical property of boosting, specifically, convergence to a distribution centered around the margin. We then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Data Stream Mining Techniques · Evolutionary Algorithms and Applications
