Feature Incremental Clustering with Generalization Bounds
Jing Zhang, and Chenping Hou

TL;DR
This paper introduces four feature incremental clustering algorithms tailored for dynamic data streams with expanding feature spaces, providing theoretical generalization bounds and demonstrating effectiveness in activity recognition tasks.
Contribution
It proposes four novel feature incremental clustering algorithms with theoretical analysis of their generalization bounds, addressing a gap in clustering for expanding feature spaces.
Findings
Algorithms effectively cluster activity recognition data.
Generalization bounds depend on data, model complexity, and feature distribution.
Numerical experiments validate the algorithms' effectiveness.
Abstract
In many learning systems, such as activity recognition systems, as new data collection methods continue to emerge in various dynamic environmental applications, the attributes of instances accumulate incrementally, with data being stored in gradually expanding feature spaces. How to design theoretically guaranteed algorithms to effectively cluster this special type of data stream, commonly referred to as activity recognition, remains unexplored. Compared to traditional scenarios, we will face at least two fundamental questions in this feature incremental scenario. (i) How to design preliminary and effective algorithms to address the feature incremental clustering problem? (ii) How to analyze the generalization bounds for the proposed algorithms and under what conditions do these algorithms provide a strong generalization guarantee? To address these problems, by tailoring the most common…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Data Stream Mining Techniques · Time Series Analysis and Forecasting
