On Computing Centroids According to the $p$-Norms of Hamming Distance Vectors
Jiehua Chen, Danny Hermelin, Manuel Sorge

TL;DR
This paper investigates the computational complexity of the $p$-Norm Hamming Centroid problem, establishing NP-hardness for all fixed rational $p > 1$, and provides tight lower and upper bounds on algorithmic running times, along with approximation and fixed-parameter algorithms.
Contribution
It proves NP-hardness for all fixed rational $p > 1$, closing the complexity gap, and offers tight bounds, along with practical algorithms for the problem.
Findings
NP-hardness for all fixed rational p > 1
Tight lower bounds on running time based on complexity assumptions
A fixed-parameter and a factor-2 approximation algorithm
Abstract
In this paper we consider the -Norm Hamming Centroid problem which asks to determine whether some given binary strings have a centroid with a bound on the -norm of its Hamming distances to the strings. Specifically, given a set of strings and a real , we consider the problem of determining whether there exists a string with , where denotes the Hamming distance metric. This problem has important applications in data clustering, and is a generalization of the well-known polynomial-time solvable \textsc{Consensus String} problem, as well as the NP-hard \textsc{Closest String} problem. Our main result shows that the problem is NP-hard for all fixed rational , closing the gap for all rational values of between and . Under standard complexity assumptions the reduction also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
