Coresets for $k$-median clustering under Fr\'{e}chet and Hausdorff distances
Abhinandan Nath

TL;DR
This paper introduces the first coreset construction algorithms for approximate $k$-median clustering of polygonal curves and point sets under Fréchet and Hausdorff distances, with sizes independent of input quantity.
Contribution
It presents novel coreset algorithms for curve and point set clustering under complex metrics, generalizing importance sampling techniques to these settings.
Findings
Coreset size is independent of the number of input objects.
Provides a formal condition for the restricted space of cluster centers.
Establishes lower bounds on coreset size for input subset constraints.
Abstract
We give algorithms for computing coresets for -approximate -median clustering of polygonal curves (under the discrete and continuous Fr\'{e}chet distance) and point sets (under the Hausdorff distance), when the cluster centers are restricted to be of low complexity. Ours is the first such result, where the size of the coreset is independent of the number of input curves/point sets to be clustered (although it still depends on the maximum complexity of each input object). Specifically, the size of the coreset is for any , where is the ambient dimension, is the maximum number of points in an input curve/point set, and is the maximum number of points allowed in a cluster center. We formally characterize a general condition on the restricted space of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Face and Expression Recognition · Advanced Image and Video Retrieval Techniques
