Hamiltonian formulations of centroid-based clustering
Myeonghwan Seong, Daniel K. Park

TL;DR
This paper introduces a Hamiltonian-based framework for clustering that enhances flexibility in defining objectives and constraints, leveraging quantum simulation techniques to potentially achieve quantum advantage in data grouping tasks.
Contribution
It formulates clustering as a Hamiltonian ground state search, enabling diverse objectives and constraints, and demonstrates applicability on current quantum hardware.
Findings
Hamiltonian formulations accommodate various clustering objectives.
Quantum annealing can be used to solve clustering problems.
Different Hamiltonians require different qubit connectivities.
Abstract
Clustering is a fundamental task in data science that aims to group data based on their similarities. However, defining similarity is often ambiguous, making it challenging to determine the most appropriate objective function for a given dataset. Traditional clustering methods, such as the -means algorithm and weighted maximum -cut, focus on specific objectives -- typically relying on average or pairwise characteristics of the data -- leading to performance that is highly data-dependent. Moreover, incorporating practical constraints into clustering objectives is not straightforward, and these problems are known to be NP-hard. In this study, we formulate the clustering problem as a search for the ground state of a Hamiltonian, providing greater flexibility in defining clustering objectives and incorporating constraints. This approach enables the application of various quantum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Clustering Algorithms Research · Topological and Geometric Data Analysis
