Quantum (Inspired) $D^2$-sampling with Applications
Poojan Shah, Ragesh Jaiswal

TL;DR
This paper introduces a quantum algorithm for $D^2$-sampling that enhances $k$-means++ clustering with improved runtime, and also presents a classical quantum-inspired implementation with promising experimental results.
Contribution
It develops a quantum $D^2$-sampling algorithm for $k$-means++, preserves approximation guarantees, and introduces a classical quantum-inspired method with practical efficiency.
Findings
Quantum $D^2$-sampling runs in $ ilde{O}( ext{aspect ratio}^2 k^2)$ time.
Quantum-inspired classical $k$-means++ runs in $O(Nd) + ilde{O}( ext{aspect ratio}^2 k^2 d)$ time.
Experimental results show promising performance on large datasets with bounded aspect ratio.
Abstract
-sampling is a fundamental component of sampling-based clustering algorithms such as -means++. Given a dataset with points and a center set , -sampling refers to picking a point from where the sampling probability of a point is proportional to its squared distance from the nearest center in . Starting with empty and iteratively -sampling and updating in rounds is precisely -means++ seeding that runs in time and gives -approximation in expectation for the -means problem. We give a quantum algorithm for (approximate) -sampling in the QRAM model that results in a quantum implementation of -means++ that runs in time . Here is the aspect ratio (i.e., largest to smallest interpoint distance), and hides polylogarithmic factors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Random Matrices and Applications · Statistical Mechanics and Entropy
