Flexible Bivariate Beta Mixture Model: A Probabilistic Approach for Clustering Complex Data Structures

Yung-Peng Hsu; Hung-Hsuan Chen

arXiv:2502.19938·cs.LG·February 28, 2025

TL;DR

The paper introduces the Flexible Bivariate Beta Mixture Model (FBBMM), a probabilistic clustering method designed for nonconvex, irregularly shaped clusters, where traditional algorithms such as k-means and GMM often fail.

Contribution

It proposes the FBBMM, which models clusters with bivariate beta distributions and estimates their parameters with the EM algorithm and the SLSQP optimizer, to improve clustering of nonconvex and irregular data shapes.

Findings

01. FBBMM outperforms traditional clustering methods on synthetic datasets.

02. FBBMM demonstrates superior accuracy on real-world complex data.

03. The method is validated with extensive experiments and available as open-source code.

Abstract

Clustering is essential in data analysis and machine learning, but traditional algorithms like $k$-means and Gaussian Mixture Models (GMM) often fail with nonconvex clusters. To address this challenge, we introduce the Flexible Bivariate Beta Mixture Model (FBBMM), which utilizes the flexibility of the bivariate beta distribution to handle diverse and irregular cluster shapes. Parameters are estimated with the Expectation Maximization (EM) algorithm and the Sequential Least Squares Programming (SLSQP) optimizer. We validate FBBMM on synthetic and real-world datasets, demonstrating its superior performance in clustering complex data structures and offering a robust solution for big data analytics across various domains. We release the experimental code at https://github.com/yung-peng/MBMM-and-FBBMM.
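
The estimation scheme named in the abstract (EM with an SLSQP-based M-step) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not give the FBBMM density, so the component density here is a stand-in (a product of two independent Beta marginals), and the function names, box bounds, initialization, and iteration count are all assumptions made for the example.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import beta

LO, HI = 1e-3, 1e3  # assumed box bounds on the Beta shape parameters


def component_logpdf(X, p):
    # Stand-in density: two independent Beta marginals (NOT the FBBMM density).
    a1, b1, a2, b2 = np.clip(p, LO, HI)
    return beta.logpdf(X[:, 0], a1, b1) + beta.logpdf(X[:, 1], a2, b2)


def e_step(X, weights, params):
    # Responsibilities, computed as a numerically stable softmax over components.
    log_r = np.stack(
        [np.log(w) + component_logpdf(X, p) for w, p in zip(weights, params)],
        axis=1,
    )
    log_r -= log_r.max(axis=1, keepdims=True)
    r = np.exp(log_r)
    return r / r.sum(axis=1, keepdims=True)


def m_step(X, resp, params):
    # Mixing weights have a closed form; shape parameters are fitted with SLSQP.
    weights = np.clip(resp.mean(axis=0), 1e-12, None)
    weights /= weights.sum()
    new_params = []
    for k, p0 in enumerate(params):
        def nll(p, k=k):
            # Responsibility-weighted negative log-likelihood of component k.
            return -np.sum(resp[:, k] * component_logpdf(X, p))

        res = minimize(nll, p0, method="SLSQP", bounds=[(LO, HI)] * 4)
        # Keep the previous estimate if the optimizer returns non-finite values.
        x = res.x if np.all(np.isfinite(res.x)) else p0
        new_params.append(np.clip(x, LO, HI))
    return weights, np.array(new_params)


def fit(X, n_components=2, n_iter=30, seed=0):
    # X: array of shape (n, 2) with every entry strictly inside (0, 1).
    rng = np.random.default_rng(seed)
    weights = np.full(n_components, 1.0 / n_components)
    params = rng.uniform(1.0, 5.0, size=(n_components, 4))  # assumed random init
    for _ in range(n_iter):
        resp = e_step(X, weights, params)
        weights, params = m_step(X, resp, params)
    return weights, params, e_step(X, weights, params)
```

Because Beta densities are defined on the unit interval, inputs to this sketch need to be rescaled into (0, 1) first, for example with min-max scaling followed by clipping slightly away from 0 and 1. Hard cluster labels can then be read off as `resp.argmax(axis=1)` from the returned responsibilities.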

Code & Models

Repositories

yung-peng/mbmm-and-fbbmm (PyTorch, official)

Taxonomy

Topics: Bayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Statistical Mechanics and Entropy