Robust online active learning

Davide Cacciarelli; Murat Kulahci; John S{\o}lve Tyssedal

arXiv:2302.00422·stat.ML·July 19, 2023

Robust online active learning

Davide Cacciarelli, Murat Kulahci, John S{\o}lve Tyssedal

PDF

Open Access

TL;DR

This paper examines the vulnerability of online active learning methods to outliers in data streams and proposes a robust approach that enhances model performance by mitigating outlier effects.

Contribution

It introduces a novel robust online active learning method that bounds search areas and employs robust estimators to handle outliers effectively.

Findings

01

Improved predictive accuracy in contaminated data streams

02

Enhanced robustness of online active learning algorithms

03

Demonstrated effectiveness through numerical simulations

Abstract

In many industrial applications, obtaining labeled observations is not straightforward as it often requires the intervention of human experts or the use of expensive testing equipment. In these circumstances, active learning can be highly beneficial in suggesting the most informative data points to be used when fitting a model. Reducing the number of observations needed for model development alleviates both the computational burden required for training and the operational expenses related to labeling. Online active learning, in particular, is useful in high-volume production processes where the decision about the acquisition of the label for a data point needs to be taken within an extremely short time frame. However, despite the recent efforts to develop online active learning strategies, the behavior of these methods in the presence of outliers has not been thoroughly examined. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Advanced Statistical Process Monitoring · Fault Detection and Control Systems

MethodsLinear Regression