Stabilizing Linear Passive-Aggressive Online Learning with Weighted   Reservoir Sampling

Skyler Wu; Fred Lu; Edward Raff; James Holt

arXiv:2410.23601·cs.LG·November 1, 2024

Stabilizing Linear Passive-Aggressive Online Learning with Weighted Reservoir Sampling

Skyler Wu, Fred Lu, Edward Raff, James Holt

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a weighted reservoir sampling method to stabilize online learning algorithms like PA classifiers by maintaining a high-quality ensemble of solutions, improving accuracy especially in the presence of outliers.

Contribution

The paper proposes a novel weighted reservoir sampling technique that enhances the stability and accuracy of online learning models without additional data passes or memory overhead.

Findings

01

WRS improves stability of online classifiers against outliers.

02

Ensemble approach outperforms standard online methods in experiments.

03

Risk of ensemble classifier is bounded by the regret of the base online method.

Abstract

Online learning methods, like the seminal Passive-Aggressive (PA) classifier, are still highly effective for high-dimensional streaming data, out-of-core processing, and other throughput-sensitive applications. Many such algorithms rely on fast adaptation to individual errors as a key to their convergence. While such algorithms enjoy low theoretical regret, in real-world deployment they can be sensitive to individual outliers that cause the algorithm to over-correct. When such outliers occur at the end of the data stream, this can cause the final solution to have unexpectedly low accuracy. We design a weighted reservoir sampling (WRS) approach to obtain a stable ensemble model from the sequence of solutions without requiring additional passes over the data, hold-out sets, or a growing amount of memory. Our key insight is that good solutions tend to be error-free for more iterations than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

FutureComputing4AI/Weighted-Reservoir-Sampling-Augmented-Training
noneOfficial

Videos

Stabilizing Linear Passive-Aggressive Online Learning with Weighted Reservoir Sampling· slideslive

Taxonomy

TopicsNeural Networks and Reservoir Computing · Data Stream Mining Techniques