Lower Bounds for the Happy Coloring Problems
Ivan Bliznets, Danil Sagunov

TL;DR
This paper investigates the computational complexity of the Maximum Happy Vertices and Edges problems, establishing lower bounds and presenting algorithms that match these bounds, thus clarifying their computational limits.
Contribution
It provides new lower bounds on the complexity of the problems and introduces algorithms with proven optimality under current complexity assumptions.
Findings
NP-hardness of guarantee parameterization
Kernelization lower bounds
Exponential lower bounds under Set Cover and ETH
Abstract
In this paper, we study the Maximum Happy Vertices and the Maximum Happy Edges problems (MHV and MHE for short). Very recently, the problems attracted a lot of attention and were studied in Agrawal '17, Aravind et al. '16, Choudhari and Reddy '18, Misra and Reddy '17. Main focus of our work is lower bounds on the computational complexity of these problems. Established lower bounds can be divided into the following groups: NP-hardness of the above guarantee parameterization, kernelization lower bounds (answering questions of Misra and Reddy '17), exponential lower bounds under the Set Cover Conjecture and the Exponential Time Hypothesis, and inapproximability results. Moreover, we present an randomized algorithm for MHV and an algorithm for MHE, where is the number of colors used and is the number of required happy vertices or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\NewEnviron
problemx \BODY
Input:
Question:
11institutetext: St. Petersburg Department of Steklov Institute of Mathematics of the Russian Academy of Sciences, Saint Petersburg, Russia
11email: [email protected], 11email: [email protected] 22institutetext: National Research University Higher School of Economics, Saint Petersburg, Russia
Lower Bounds for the Happy Coloring Problems††thanks: This research was supported by the Russian Science Foundation (project 16-11-10123)
Ivan Bliznets 1122
Danil Sagunov 11
Abstract
In this paper, we study the Maximum Happy Vertices and the Maximum Happy Edges problems (MHV and MHE for short). Very recently, the problems attracted a lot of attention and were studied in Agrawal ’17, Aravind et al. ’16, Choudhari and Reddy ’18, Misra and Reddy ’17. Main focus of our work is lower bounds on the computational complexity of these problems. Established lower bounds can be divided into the following groups: \NP-hardness of the above guarantee parameterization, kernelization lower bounds (answering questions of Misra and Reddy ’17), exponential lower bounds under the Set Cover Conjecture and the Exponential Time Hypothesis, and inapproximability results. Moreover, we present an randomized algorithm for MHV and an algorithm for MHE, where is the number of colors used and is the number of required happy vertices or edges. These algorithms cannot be improved to subexponential taking proved lower bounds into account.
1 Introduction
In this paper, we study Maximum Happy Vertices and Maximum Happy Edges. The problems are motivated by a study of algorithmic aspects of homophyly law in large networks and were introduced by Zhang and Li in 2015 [zhang2015algorithmic]. The law states that in social networks people are more likely to connect with people they like. Social network is represented by a graph, where each vertex corresponds to a person of the network, and an edge between two vertices denotes that the corresponding persons are connected within the network. Furthermore, we let vertices have a color assigned. The color of a vertex indicates type, character or affiliation of the corresponding person in the network. An edge is called happy if its endpoints are colored with the same color. A vertex is called happy if all its neighbours are colored with the same color as the vertex itself. Equivalently, a vertex is happy if all edges incident to it are happy. Formal definition of Maximum Happy Vertices and Maximum Happy Edges is the following.
{problemx}
{problemx}
Recently, MHV and MHE have attracted a lot of attention and were studied from parameterized [Agrawal2018, Aravind2016, Aravind2017, Choudhari2018, Misra2018] and approximation [zhang2015algorithmic, zhang2018improved, zhang2015improved, xu2016submodular] points of view as well as from experimental perspective [lewis2019finding].
NP-hardness of MHVand MHE was proved by Zhang and Li even in case when only three colors are used. Later, Misra and Reddy [Misra2018] proved \NP-hardness of both MHV and MHE on split and on bipartite graphs. However, MHV is polynomially time solvable on cographs and trees [Misra2018, Aravind2016]. Approximation results for MHV are presented in Zhang et al. [zhang2018improved]. They showed that MHV can be approximated within , where is the maximum degree of the input graph, and MHE can be approximated within , where . From parameterized point of view the following parameters were studied: pathwidth [Agrawal2018, Aravind2017], treewidth [Agrawal2018, Aravind2017], neighbourhood diversity [Aravind2017], vertex cover [Misra2018], distance to clique [Misra2018], distance to threshold graphs [Choudhari2018]. Kernelization questions were studied in works [Agrawal2018, gao2018kernelization]. Agrawal [Agrawal2018] provided a kernel for MHV where is the number of colors used and is the number of desired happy vertices. Independently, Gao and Gao [gao2018kernelization] present kernel for general case and in case of planar graphs.
Short summary of our results can be found below.
No polynomial kernels:
If then there are no polynomial kernels for MHV parameterized by vertex cover, and no polynomial kernels for MHE under the following parametrizations: number of uncolored vertices, number of happy edges, and distance to almost any reasonable graph class. Moreover, under , there is no and no bitsize kernel for MHV. Note that these results answer question from [Misra2018]: ”Do the Maximum Happy Vertices and Maximum Happy Edges problems admit polynomial kernels when parameterized by either the vertex cover or the distance to clique parameters?”
Above guarantee:
Above-greedy versions of MHV and MHE are NP-complete even for budget equal .
Exponential lower bounds:
Assuming the Set Cover Conjecture, MHV and MHE do not admit algorithms, where is the number of uncolored vertices in the input graph. Even with , there is no algorithm for MHV and MHE, unless ETH fails.
Innaproximability:
Unless , MHV does not admit approximation algorithm with factors , , , , for any .
Algorithms:
We present randomized algorithm for MHV and algorithm for MHE. Running time of this algorithms match with the corresponding lower bounds. We should note that an algorithm with the running time of for MHE was also presented by Aravind et al. in [Aravind2017].
2 Preliminaries
Basic notation. We denote the set of positive integer numbers by . For each positive integer , by we denote the set of all positive integers not exceeding , . We use for the disjoint union operator, i.e. equals , with an additional constraint that and are disjoint.
We use traditional -notation for asymptotical upper bounds. We additionally use -notation that hides polynomial factors. Many of our results concern the parameterized complexity of the problems, including fixed-parameter tractable algorithms, kernelization algorithms, and some hardness results for certain parameters. For detailed survey in parameterized algorithms we refer to the book of Cygan et al. [cygan2015parameterized].
Throughout the paper, we use standard graph notation and terminology, following the book of Diestel [diestel2018graph]. All graphs in our work are undirected simple graphs. We may refer to the distance to parameter, where is an arbitrary graph class. For a graph , we say that a vertex subset is a modulator of , if becomes a member of after deletion of , i.e. . Then, the distance to parameter of is defined as the size of its smallest modulator.
Graph colorings. When dealing with instances of Maximum Happy Vertices or Maximum Happy Edges, we use a notion of colorings. A coloring of a graph is a function that maps vertices of the graph to the set of colors. If this function is partial, we call such coloring partial. If not stated otherwise, we use for the number of distinct colors, and assume that colors are integers in . A partial coloring is always given as a part of the input for both problems, along with graph . We also call a precoloring of the graph , and use to denote the graph along with the precoloring. The goal of both problems is to extend this partial coloring to a specific coloring that maps each vertex to a color. We call a full coloring (or simply, a coloring) of that extends . We may also say that is a coloring of . For convenience, introduce the notion of potentially happy vertices, both for full and partial colorings.
Definition 1**.**
We call a vertex of potentially happy, if there exists a coloring of such that is happy with respect to . In other words, if and are precolored neighbours of , then . We denote the set of all potentially happy vertices in by .
By we denote the set of all potentially happy vertices in such that they are either precolored with color or have a neighbour precolored with color :
[TABLE]
In other words, if a vertex is happy with respect to some coloring of , then necessarily .
For a graph with precoloring , by we denote the number of potentially happy vertices in . Note that if is a full coloring of a graph , then is equal to the number of vertices in that are happy with respect to .
3 Polynomial kernels for structural graph parameters
In this section, we study existence of polynomial kernels for MHV or MHE under several parameterizations. We start with proving lower bounds for structural graph parameters. We provide reductions to both MHV and MHE from the following problem.
{problemx}
Theorem 1** ([Dom2014]).**
Bounded Rank Disjoint Sets* parameterized by does not admit a polynomial compression even if every set consists of exactly elements and , unless . *
The following two theorems answer open questions posed in [Misra2018].
Theorem 2**.**
Maximum Happy Vertices* parameterized by the vertex cover number does not admit a polynomial compression, unless . *
Proof*.*
We give a polynomial reduction from the Set Packing problem, such that the vertex cover number of the constructed instance of MHV is at most the size of the universe of the initial instance of Set Packing plus one. Since Bounded Rank Disjoint Sets is a special case of Set Packing, from Theorem 1 the theorem statement will then follow. The reduction is as follows.
Given an instance of Set Packing, construct an instance of MHV. For each , introduce vertex in and left it uncolored. For each set , introduce a vertex in and precolor it with color , i.e. . Thus, the set of colors used in precoloring is exactly . Then, for each and such that , introduce an edge between and in . Additionally, introduce two vertices and to and precolor them with colors and respectively. Then, introduce an edge to and for every and , introduce an edge in . Thus, vertices and never become happy and ensure that never become happy for any . Finally, set the number of required happy vertices to . Observe that forms a vertex cover of , hence the vertex cover number of is at most .
We now claim that is a yes-instance of Set Packing if and only if is a yes-instance of MHV. Let be the answer to , i.e. for every distinct . Since are disjoint, do not have any common neighbours in . Hence, we can extend coloring to coloring in a way that are happy with respect to ( is then, in fact, the index of the set containing , i.e. ). At least vertices become happy in , hence is a yes-instance of MHV.
In the other direction, let be a coloring of extending so that at least vertices in are happy with respect to . Only vertices that can be happy in are vertices of type , hence there are vertices that are happy in with respect to . Since these vertices are precolored with pairwise distinct colors and are simultaneously happy, they may have no common neighbours in . This implies that the corresponding sets of the initial instance are pairwise disjoint. Hence, they form an answer to the initial instance of Set Packing. This completes the proof.
Theorem 3**.**
Maximum Happy Edges* parameterized by the number of uncolored vertices or by the number of happy edges does not admit a polynomial compression, unless . *
Proof*.*
As in the proof of Theorem 2, we again provide a polynomial reduction from Bounded Rank Disjoint Sets and then use Theorem 1. In this proof though, we will use the restricted version Bounded Rank Disjoints Sets problem itself (and not the Set Packing problem), formulated in Theorem 1. That is, we will use the constraint that all sets in the given instance are of the same size , and the size of the universe is equal to . We note that the following reduction has very much in common with the reduction described in the proof of Theorem 2.
Given an instance of Bounded Rank Disjoint Sets with and for every , we construct an instance of MHE. We assume that each element of the universe is contained in at least one set, otherwise the given instance is a no-instance. Firstly, as in the proof of Theorem 2, for each element of the universe , introduce a corresponding vertex in . For each set , , introduce not just one, but corresponding vertices . Then again, similarly to the proof of Theorem 1, for each such that , introduce edges between and each vertex corresponding to the set , i.e. edges in total. To finish the construction of , introduce every possible edge in .
Thus, and . Then, precolor the vertices of in the usual way, i.e. set for every and , and leave each vertex uncolored. Finally, we set the number of required happy edges to . Construction of is done in polynomial time. Observe that the number of uncolored vertices in equals the size of the universe , and the number of required happy edges is polynomial of . Hence, existence of a polynomial kernel respectively to any of these two parameters for MHE contradicts the statement of Theorem 1. We argue that the initial instance is a yes-instance if and only if is a yes-instance of MHE.
We prove first that if is a yes-instance, then is a yes-instance. Let be a yes-instance of the restricted version of Bounded Rank Disjoint Sets, and let be the instance solution. As usual, extend to a coloring of by setting to the index of the set in the solution containing , i.e. for some and . Since are disjoint, and their total size equals the size of the universe, such coloring always exists uniquely for a fixed solution of . We claim that there are exactly happy edges in with respect to .
All edges in are either of type or of type . Consider edges of type for a fixed . Happy edges among them are those with . Since and , these edges are exacly . Hence, there are happy edges of this type for a fixed and happy edges of this type in total. It is left to count the number of happy edges of the clique, i.e. edges of type . Observe that each is colored with a color corresponding to a containing set of the answer. Since each set is of size , the vertices are split by color into groups of size . Each group contributes exactly happy edges, and no edge connecting vertices from different groups is happy. Thus, there are exactly happy edges of type in with respect to . We get that exactly edges of are happy with respect to , hence is a yes-instance of MHE.
In the other direction, let be a yes-instance of MHE, and let be an optimal coloring of extending . At least edges are happy in with respect to . Let us show that exactly edges are happy in with respect to .
Claim 1**.**
In any optimal coloring of extending , for each .
Proof of Claim 1*.*
Suppose it is not true, and is an optimal coloring of and for some . For each with , is adjacent to vertices , which are precolored with color . None of edges are happy with respect to , since . The only other edges incident to are edges of the clique. Thus, is incident to at most happy edges.
Choose arbitrary with , and put . becomes incident to at least happy edges. Happiness of edges not incident with has not changed. Thus, the change yields at least one more happy edge. A contradiction with the optimality of .
Claim 2**.**
In any optimal coloring of extending , there are at most happy edges of type in with respect to .
Proof of Claim 2*.*
The vertices are split into groups containing vertices of the same color by , so the happy edges of type are exactly the edges inside the groups. By Claim 1, each is colored with a color corresponding to a set containing in . Hence, each group contains vertices corresponding to elements of the same set, and thus contains at most vertices. So each is incident to at most happy edges of type , and in total there are at most such happy edges in with respect to .
From Claims 1 and 2 follows that at most edges are happy in with respect to . And as seen in the proof of Claim 2, the only way that yields exactly happy edges is when are split by color into disjoint groups of size , each containing vertices corresponding to a set of the initial instance. Hence, if yields happy edges in , is a solution to . Thus, is a yes-instance of Bounded Rank Disjoint Sets. This finishes the whole proof.
Definition 2**.**
We call a graph family uniformly polynomially instantiable, if there is an algorithm that, given positive integer as input, outputs a graph , such that and , in time.
Corollary 1**.**
*For any uniformly polynomially instantiable graph family , Maximum Happy Edges, parameterized by the distance to graphs in , does not admit a polynomial compression, unless . *
Proof*.*
Suppose it is not true and there is a uniformly polynomially instantiable graph family , such that MHE parameterized by the distance to graphs in admits a polynomial compression. We show how to reduce an instance of MHE with uncolored vertices to an instance of MHE with the distance to graph in being at most , and then get a contradiction with Theorem 3.
Let be an instance of MHE with uncolored vertices. Denote the set of all uncolored vertices in by and the set of all precolored vertices by , so . Assume that has no edge between vertices in , otherwise delete it and decrease by one if its endpoints are of the same color in . Construct an instance as follows. Use the algorithm that output instances of , with as input. The algorithm gives a graph , such that and . Take an arbitrary subset of size , and identify its vertices with vertices in . Construct by introducing new vertices to , that are identified with the vertices of . Denote the set of these vertices by . Then, add an edge between vertices in or between vertices in and if there is an edge between corresponding vertices in . Finally, construct by precoloring vertices in with the color of their corresponding vertices in , leave the vertices of uncolored, and precolor all remaining vertices arbitrarily. There may be some happy edges between precolored vertices in , let their number be . Set .
One may easily show that the constructed instance is a yes-instanse of MHE if and only if the initial instance is a yes-instance of MHE. Moreover, deletion of from yields . Hence, has the distance to graphs in being at most . We therefore obtain the required polynomial reduction that leads to the desired contradiction.
In the rest of the section we study kernel bitsize lower bounds for MHV, parameterized by either or , where is the number of potentially happy vertices. This relates to the result of Agrawal in [Agrawal2018], where the author showed that MHV admits a polynomial kernel with vertices. We show that, for any and any , there is no kernel of bitsize for MHV. Similarly, we show that there is no kernel of bitsize for MHV. To prove these lower bounds, we refer to the framework of weak cross-compositions, that originates from works of Dell and van Mekelbeek [DellMekelbeek2014], Dell and Marx [Dell2012] and Hermelin and Wu [Hermelin2012]. These results are finely summarized by Cygan et al. in the chapter on lower bounds for kernelization [Cygan2015]. We recall the notion of weak cross-compositions.
Definition 3** ([Cygan2015, Dell2012, Hermelin2012]).**
Let be a language and be a parameterized language. We say that weakly-cross-composes into if there exists a real constant , called the dimension, a polynomial equivalence relation , and an algorithm , called the weak cross-composition, satisfying the following conditions. The algorithm takes as input a sequence of that are equivalent with respect to , runs in time polynomial in , and outputs one instance such that:
- (a)
for every there exists a polynomial such that for every choice of and input strings it holds that , and 2. (b)
if and only if there exists at least one index i such that .
The framework of weak cross-compositions is used for proving conditional lower bounds on polynomial compression bitsize. This is formulated in the following theorem.
Theorem 4** ([Cygan2015, Dell2012, Hermelin2012]).**
*If an \NP-hard language admits a weak cross-composition of dimension into a parameterized language . Then for any , does not admit a polynomial compression with bitsize , unless . *
Dell and Marx [Dell2012] use this framework to show that the Vertex Cover problem parameterized by the solution size does not admit a kernel with subquadratic bitsize. Their result is the following.
Lemma 1** ([Dell2012, Cygan2015]).**
There exists a weak cross composition of dimension from an \NP-hard problem Multicolored Biclique into the Vertex Cover problem parameterized by the solution size. In fact, this weak cross-composition , given instances of Multicolored Biclique as input, outputs an instance of Vertex Cover satisfying
- •
, and
- •
**
*for some polynomials and . *
The bound for is given because one can look at an instance of Vertex Cover as at an instance of Independent Set. Then, the solution parameter of Independent Set is bounded with polynomial of the maximum input size, independently of the number of instances . We are ready to prove the theorem.
