MDL-GBG: A Non-parametric and Interpretable Granular-Ball Generation Method for Clustering
Zeqiang Xian, Caihui Liu, Yong Zhang, Wenjing Qiu, Duoqian Miao, Witold Pedrycz

TL;DR
This paper introduces MDL-GBG, a non-parametric, interpretable method for generating granular-balls in clustering based on the Minimum Description Length principle, improving transparency and effectiveness.
Contribution
It reformulates granular-ball generation as a local model selection problem using MDL, unifying ball operations within a coding-theoretic framework.
Findings
MDL-GBG produces stable granular-balls that enhance clustering performance.
MDL-GBG+AC achieves top ranks in ARI, ACC, and NMI on UCI datasets.
The method offers a principled, interpretable alternative to heuristic strategies.
Abstract
Existing granular-ball generation methods are still mainly driven by handcrafted quality measures and heuristic splitting or stopping criteria, which may weaken the transparency of local generation decisions in clustering. To address this issue, this paper proposes Minimum Description Length based Granular-Ball Generation (MDL-GBG), a non-parametric and interpretable granular-ball generation method for clustering. MDL-GBG reformulates granular-ball generation as a local model selection problem under the Minimum Description Length principle. For each granular ball, three candidate explanations are compared, namely a single-ball model, a two-ball model, and a core-ball-plus-residual model, and the model with the shortest description length is selected. In this way, ball retention, splitting, and residual peeling are unified within a common coding-theoretic framework. A residual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
