An Online Sparse Streaming Feature Selection Algorithm
Feilong Chen, Di Wu, Jie Yang, Yi He

TL;DR
This paper introduces OS2FSU, an online sparse streaming feature selection algorithm that effectively handles missing data and uncertainty, outperforming existing methods in high-dimensional streaming data scenarios.
Contribution
The paper proposes a novel OSFS algorithm incorporating latent factor analysis and fuzzy logic to address missing data and uncertainty in streaming feature selection.
Findings
OS2FSU outperforms five state-of-the-art algorithms on six real datasets.
The method effectively handles missing data in streaming feature selection.
Incorporating uncertainty modeling improves feature selection accuracy.
Abstract
Online streaming feature selection (OSFS), which conducts feature selection in an online manner, plays an important role in dealing with high-dimensional data. In many real applications such as intelligent healthcare platform, streaming feature always has some missing data, which raises a crucial challenge in conducting OSFS, i.e., how to establish the uncertain relationship between sparse streaming features and labels. Unfortunately, existing OSFS algorithms never consider such uncertain relationship. To fill this gap, we in this paper propose an online sparse streaming feature selection with uncertainty (OS2FSU) algorithm. OS2FSU consists of two main parts: 1) latent factor analysis is utilized to pre-estimate the missing data in sparse streaming features before con-ducting feature selection, and 2) fuzzy logic and neighborhood rough set are employed to alleviate the uncertainty…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications
MethodsFeature Selection
