Shift of Pairwise Similarities for Data Clustering
Morteza Haghir Chehreghani

TL;DR
This paper introduces a novel approach to data clustering by adaptively shifting pairwise similarities, including negative values, and demonstrates its effectiveness through theoretical analysis and extensive experiments.
Contribution
It proposes a new regularization technique that shifts pairwise similarities in clustering, connecting it to Correlation Clustering and providing an efficient optimization algorithm.
Findings
The method improves clustering performance on various datasets.
The approach has a fast convergence rate.
It generalizes existing similarity shift techniques.
Abstract
Several clustering methods (e.g., Normalized Cut and Ratio Cut) divide the Min Cut cost function by a cluster dependent factor (e.g., the size or the degree of the clusters), in order to yield a more balanced partitioning. We, instead, investigate adding such regularizations to the original cost function. We first consider the case where the regularization term is the sum of the squared size of the clusters, and then generalize it to adaptive regularization of the pairwise similarities. This leads to shifting (adaptively) the pairwise similarities which might make some of them negative. We then study the connection of this method to Correlation Clustering and then propose an efficient local search optimization algorithm with fast theoretical convergence rate to solve the new clustering problem. In the following, we investigate the shift of pairwise similarities on some common clustering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Advanced Clustering Algorithms Research · Bayesian Methods and Mixture Models
