Robust online active learning
Davide Cacciarelli, Murat Kulahci, John S{\o}lve Tyssedal

TL;DR
This paper examines the vulnerability of online active learning methods to outliers in data streams and proposes a robust approach that enhances model performance by mitigating outlier effects.
Contribution
It introduces a novel robust online active learning method that bounds search areas and employs robust estimators to handle outliers effectively.
Findings
Improved predictive accuracy in contaminated data streams
Enhanced robustness of online active learning algorithms
Demonstrated effectiveness through numerical simulations
Abstract
In many industrial applications, obtaining labeled observations is not straightforward as it often requires the intervention of human experts or the use of expensive testing equipment. In these circumstances, active learning can be highly beneficial in suggesting the most informative data points to be used when fitting a model. Reducing the number of observations needed for model development alleviates both the computational burden required for training and the operational expenses related to labeling. Online active learning, in particular, is useful in high-volume production processes where the decision about the acquisition of the label for a data point needs to be taken within an extremely short time frame. However, despite the recent efforts to develop online active learning strategies, the behavior of these methods in the presence of outliers has not been thoroughly examined. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Statistical Process Monitoring · Fault Detection and Control Systems
MethodsLinear Regression
