Data Stream Clustering: Challenges and Issues
Madjid Khalilian, Norwati Mustapha

TL;DR
This paper reviews the challenges and issues in data stream clustering, highlighting the difficulties in detecting evolving data patterns and discussing various approaches and solutions for clustering in continuous data streams.
Contribution
It provides a comprehensive survey of data stream clustering problems, difficulties, assumptions, heuristics, and solutions, clarifying the state of research in this field.
Findings
Identifies key challenges in evolving data detection and concept drift.
Summarizes different approaches and heuristics used in data stream clustering.
Highlights the importance of unsupervised methods for hidden pattern discovery.
Abstract
Very large databases are required to store massive amounts of data that are continuously inserted and queried. Analyzing huge data sets and extracting valuable pattern in many applications are interesting for researchers. We can identify two main groups of techniques for huge data bases mining. One group refers to streaming data and applies mining techniques whereas second group attempts to solve this problem directly with efficient algorithms. Recently many researchers have focused on data stream as an efficient strategy against huge data base mining instead of mining on entire data base. The main problem in data stream mining means evolving data is more difficult to detect in this techniques therefore unsupervised methods should be applied. However, clustering techniques can lead us to discover hidden information. In this survey, we try to clarify: first, the different problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Clustering Algorithms Research · Time Series Analysis and Forecasting
