
TL;DR
This paper introduces a new model for soft power diagrams that enables efficient outlier detection and cluster similarity measurement in data analysis, applicable to fixed-site clusterings with linear programming solutions.
Contribution
It proposes a novel soft power diagram model with point counting features and provides algorithms for outlier detection and clustering comparison, applicable to non-convex site models.
Findings
Efficient algorithms for outlier detection using soft power diagrams.
Global optimal solutions obtained via linear programming.
Applicable to non-convex models of free sites.
Abstract
Many applications in data analysis begin with a set of points in a Euclidean space that is partitioned into clusters. Common tasks then are to devise a classifier deciding which of the clusters a new point is associated to, finding outliers with respect to the clusters, or identifying the type of clustering used for the partition. One of the common kinds of clusterings are (balanced) least-squares assignments with respect to a given set of sites. For these, there is a 'separating power diagram' for which each cluster lies in its own cell. In the present paper, we aim for efficient algorithms for outlier detection and the computation of thresholds that measure how similar a clustering is to a least-squares assignment for fixed sites. For this purpose, we devise a new model for the computation of a 'soft power diagram', which allows a soft separation of the clusters with 'point…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
