SUDS: A Strategy for Unsupervised Drift Sampling
Christofer Fellicious, Lorenz Wendlinger, Mario Gancarski, Jelena, Mitrovic, Michael Granitzer

TL;DR
SUDS is a new strategy that improves model retraining after concept drift by selecting homogeneous samples, combined with HADAM, a metric balancing classifier performance and labeled data effort, enhancing adaptability in dynamic environments.
Contribution
SUDS innovatively integrates drift detection with strategic sampling for better retraining, and HADAM offers a new metric to evaluate classifier performance relative to labeled data requirements.
Findings
SUDS improves model performance in drifting environments.
HADAM effectively balances classifier accuracy with labeling effort.
Empirical results show significant gains in resource-efficient adaptation.
Abstract
Supervised machine learning often encounters concept drift, where the data distribution changes over time, degrading model performance. Existing drift detection methods focus on identifying these shifts but often overlook the challenge of acquiring labeled data for model retraining after a shift occurs. We present the Strategy for Drift Sampling (SUDS), a novel method that selects homogeneous samples for retraining using existing drift detection algorithms, thereby enhancing model adaptability to evolving data. SUDS seamlessly integrates with current drift detection techniques. We also introduce the Harmonized Annotated Data Accuracy Metric (HADAM), a metric that evaluates classifier performance in relation to the quantity of annotated data required to achieve the stated performance, thereby taking into account the difficulty of acquiring labeled data. Our contributions are twofold:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Flow Measurement and Analysis · Anomaly Detection Techniques and Applications
MethodsFocus
