Estimation of the number of clusters on d-dimensional sphere

Kazuhisa Fujita

arXiv:2011.07530·cs.LG·May 14, 2021

TL;DR

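This paper introduces Spherical X-means (SX-means), a model-based method that estimates the number of clusters in spherical data by assuming the data is generated from a mixture of von Mises-Fisher distributions. Spherical data arises in fields such as meteorology, biology, and natural language processing.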

Contribution

The paper proposes SX-means, an algorithm designed specifically for spherical data that estimates the number of clusters on a d-dimensional sphere, a problem for which the author notes that analysis methods are not yet sufficiently developed.

Findings

1. SX-means accurately estimates the number of clusters in spherical data.
2. The method performs well across different dimensions and data distributions.
3. Experimental results demonstrate its effectiveness compared to existing approaches.

Abstract

Spherical data is data distributed on a sphere. Such data appears in various fields, such as meteorology, biology, and natural language processing. However, methods for analyzing spherical data are not yet sufficiently developed. One important issue is estimating the number of clusters in spherical data. To address this issue, I propose a new method called Spherical X-means (SX-means), which estimates the number of clusters on a d-dimensional sphere. SX-means is a model-based method that assumes the data is generated from a mixture of von Mises-Fisher distributions. This paper explains the proposed method and demonstrates its performance in estimating the number of clusters.
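
To make the modeling assumption concrete, the following is a minimal sketch of the log-likelihood of a von Mises-Fisher mixture, the kind of quantity a model-based method can use to score a candidate clustering. It is not the paper's implementation. The function names, the series-based Bessel evaluation, and the convention that the dimension is the length of the unit vector are all assumptions made for illustration. The sketch uses the standard density f(x; mu, kappa) = C(kappa) exp(kappa mu·x) with C(kappa) = kappa^(d/2-1) / ((2 pi)^(d/2) I_(d/2-1)(kappa)).

```python
import math


def log_bessel_i(order, x, terms=500):
    """Log of the modified Bessel function of the first kind I_order(x), x > 0.

    Evaluated from the power series in log space (illustrative helper;
    adequate for moderate x, not tuned for very large concentrations).
    """
    log_half_x = math.log(x / 2.0)
    log_terms = [
        (2 * m + order) * log_half_x - math.lgamma(m + 1) - math.lgamma(m + order + 1)
        for m in range(terms)
    ]
    peak = max(log_terms)
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))


def vmf_log_density(x, mu, kappa):
    """Log-density of a von Mises-Fisher distribution at unit vector x.

    Assumption: x and mu are unit vectors of the same length d, and kappa > 0.
    """
    d = len(x)
    order = d / 2.0 - 1.0
    log_norm = (
        order * math.log(kappa)
        - (d / 2.0) * math.log(2.0 * math.pi)
        - log_bessel_i(order, kappa)
    )
    return log_norm + kappa * sum(a * b for a, b in zip(x, mu))


def mixture_log_likelihood(points, weights, mus, kappas):
    """Total log-likelihood of points under a vMF mixture (log-sum-exp per point)."""
    total = 0.0
    for x in points:
        parts = [
            math.log(w) + vmf_log_density(x, mu, k)
            for w, mu, k in zip(weights, mus, kappas)
        ]
        peak = max(parts)
        total += peak + math.log(sum(math.exp(p - peak) for p in parts))
    return total
```

For example, `mixture_log_likelihood(points, [0.5, 0.5], [mu_a, mu_b], [10.0, 10.0])` scores a two-component fit, where `mu_a` and `mu_b` are hypothetical unit mean directions. Comparing such scores across different numbers of components, typically with a complexity penalty, is one common way to choose the number of clusters. Whether SX-means uses exactly this criterion is not stated in the abstract.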
