A Branch-and-price procedure for clustering data that are graph connected
Stefano Benati, Diego Ponce, Justo Puerto, and Antonio M., Rodr\'iguez-Ch\'ia

TL;DR
This paper introduces a branch-and-price algorithm for the Graph-Connected Clique-Partitioning Problem, enhancing the solution of larger instances and combining exact and heuristic methods for improved clustering quality.
Contribution
It presents a new ILP formulation and a branch-and-price algorithm with a novel combinatorial pricing problem, improving solution efficiency for GCCP.
Findings
Branch-and-price outperforms previous MILP algorithms in speed
The random shrink heuristic is faster and more accurate
The combined matheuristic effectively solves large instances
Abstract
This paper studies the Graph-Connected Clique-Partitioning Problem (GCCP), a clustering optimization model in which units are characterized by both individual and relational data. This problem, introduced by Benati et al. (2017) under the name of Connected Partitioning Problem, shows that the combination of the two data types improves the clustering quality in comparison with other methodologies. Nevertheless, the resulting optimization problem is difficult to solve; only small-sized instances can be solved exactly, large-sized instances require the application of heuristic algorithms. In this paper we improve the exact and the heuristic algorithms previously proposed. Here, we provide a new Integer Linear Programming (ILP) formulation, that solves larger instances, but at the cost of using an exponential number of variables. In order to limit the number of variables necessary to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
