Online feature selection for rapid, low-overhead learning in networked systems
Xiaoxuan Wang (1), Forough Shahab Samani (1, 2), Rolf Stadler (1, and 2) ((1) KTH Royal Institute of Technology, Sweden (2) RISE Research, Institutes of Sweden)

TL;DR
This paper introduces OSFS, an online feature selection algorithm that reduces data sources significantly for rapid, low-overhead learning in networked systems, demonstrated through extensive experimental evaluation.
Contribution
The paper presents OSFS, a novel online feature selection method that efficiently reduces data sources while maintaining prediction accuracy, enabling faster and less resource-intensive learning.
Findings
OSFS reduces data sources by two orders of magnitude.
Models trained with OSFS require only hundreds of measurements.
Effective learning is achieved with acceptable prediction accuracy.
Abstract
Data-driven functions for operation and management often require measurements collected through monitoring for model training and prediction. The number of data sources can be very large, which requires a significant communication and computing overhead to continuously extract and collect this data, as well as to train and update the machine-learning models. We present an online algorithm, called OSFS, that selects a small feature set from a large number of available data sources, which allows for rapid, low-overhead, and effective learning and prediction. OSFS is instantiated with a feature ranking algorithm and applies the concept of a stable feature set, which we introduce in the paper. We perform extensive, experimental evaluation of our method on data from an in-house testbed. We find that OSFS requires several hundreds measurements to reduce the number of data sources by two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Algorithms · Anomaly Detection Techniques and Applications
