Mitigating sampling bias in risk-based active learning via an EM algorithm
Aidan J. Hughes, Lawrence A. Bull, Paul Gardner, Nikolaos Dervilis,, Keith Worden

TL;DR
This paper presents a risk-based active learning method using a semi-supervised Gaussian mixture model with an EM algorithm to mitigate sampling bias, improving decision-making in structural health monitoring applications.
Contribution
It introduces a novel semi-supervised approach employing an EM algorithm to reduce sampling bias in risk-based active learning for SHM.
Findings
Effectively reduces sampling bias in active learning
Improves decision-making accuracy in SHM applications
Demonstrates robustness on a numerical example
Abstract
Risk-based active learning is an approach to developing statistical classifiers for online decision-support. In this approach, data-label querying is guided according to the expected value of perfect information for incipient data points. For SHM applications, the value of information is evaluated with respect to a maintenance decision process, and the data-label querying corresponds to the inspection of a structure to determine its health state. Sampling bias is a known issue within active-learning paradigms; this occurs when an active learning process over- or undersamples specific regions of a feature-space, thereby resulting in a training set that is not representative of the underlying distribution. This bias ultimately degrades decision-making performance, and as a consequence, results in unnecessary costs incurred. The current paper outlines a risk-based approach to active…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Process Monitoring · Mineral Processing and Grinding
