Coloring for dispersion: A polynomial-time algorithm for cardinality-constrained 2-anticlustering
Nguyen Khoa Tran, Lin Mu, Martin Papenberg, Gunnar W. Klau

TL;DR
This paper presents a polynomial-time algorithm for the 2-maximum dispersion problem with cardinality constraints, solving an open problem and enabling efficient large-scale data processing.
Contribution
It introduces a polynomial-time solution for 2-MDCC by transforming it into a restricted subset sum problem, outperforming previous methods.
Findings
The algorithm solves large datasets in less than a second.
It outperforms previous integer linear programming solutions by several orders of magnitude.
The approach confirms polynomial solvability for the 2-anticlustering problem.
Abstract
The -Maximum Dispersion Problem with Cardinality Constraints (-MDCC) asks for a partition of a given item set with pairwise dissimilarities into cardinality-constrained groups such that the minimum pairwise intra-group dissimilarity, which is also known as the dispersion, is maximized. The problem arises in the context of anticlustering, where the goal is to create maximally heterogeneous groups of items with applications in psychological research, bioinformatics, and data science. It is known that -MDCC is NP-hard for but it has been an open question whether it can be solved in polynomial time for . We give a positive answer to this question by showing that -MDCC can be solved by a quadratic number of cardinality-constrained 2-coloring problem instances (-COLCC). We solve these instances by transforming them into a restricted class of subset sum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
