Statistical Inference for Fuzzy Clustering
Qiuyi Wu, Zihan Zhu, Anru R. Zhang

TL;DR
This paper introduces a new weighted fuzzy c-means framework with statistical inference tools, enabling more accurate and interpretable clustering in biomedical data with uncertain and overlapping groups.
Contribution
It develops a novel weighted fuzzy c-means method with statistical inference, including likelihood ratio tests and confidence intervals, for improved fuzzy clustering analysis.
Findings
Method performs well in simulations.
Provides stable uncertainty quantification.
Applied successfully to biomedical datasets.
Abstract
Clustering is a central tool in biomedical research for discovering heterogeneous patient subpopulations, where group boundaries are often diffuse rather than sharply separated. Traditional methods produce hard partitions, whereas soft clustering methods such as fuzzy -means (FCM) allow mixed memberships and better capture uncertainty and gradual transitions. Despite the widespread use of FCM, principled statistical inference for fuzzy clustering remains limited. We develop a new framework for weighted fuzzy -means (WFCM) for settings with potential cluster size imbalance. Cluster-specific weights rebalance the classical FCM criterion so that smaller clusters are not overwhelmed by dominant groups, and the weighted objective induces a normalized density model with scale parameter and fuzziness parameter . Estimation is performed via a blockwise majorize--minimize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Statistical Methods and Inference
