AdaptiveMDL-GenClust: A Robust Clustering Framework Integrating Normalized Mutual Information and Evolutionary Algorithms
H. Jahani, F. Zamio

TL;DR
This paper presents a robust clustering framework that combines the MDL principle with genetic algorithms, improving clustering accuracy and stability across diverse datasets by reducing bias and dependency on initial conditions.
Contribution
The study introduces an adaptive clustering method integrating MDL and genetic algorithms, enhancing robustness and performance over traditional approaches.
Findings
Outperforms traditional clustering methods in accuracy and stability
Demonstrates effectiveness across diverse benchmark datasets
Reduces bias and dependency on initial clustering solutions
Abstract
Clustering algorithms are pivotal in data analysis, enabling the organization of data into meaningful groups. However, individual clustering methods often exhibit inherent limitations and biases, preventing the development of a universal solution applicable to diverse datasets. To address these challenges, we introduce a robust clustering framework that integrates the Minimum Description Length (MDL) principle with a genetic optimization algorithm. The framework begins with an ensemble clustering approach to generate an initial clustering solution, which is then refined using MDL-guided evaluation functions and optimized through a genetic algorithm. This integration allows the method to adapt to the dataset's intrinsic properties, minimizing dependency on the initial clustering input and ensuring a data-driven, robust clustering process. We evaluated the proposed method on thirteen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Text and Document Classification Technologies
MethodsMinimum Description Length · Ensemble Clustering
