Proportionally dense subgraph of maximum size: complexity and approximation
Cristina Bazgan, Janka Chleb\'ikov\'a, Cl\'ement Dallard, Thomas, Pontoizeau

TL;DR
This paper studies the computational complexity of finding proportionally dense subgraphs (PDS), proving hardness results, presenting approximation algorithms, and analyzing PDS sizes in specific graph classes like cubic graphs.
Contribution
It establishes hardness results for PDS problems, introduces a polynomial-time approximation algorithm, and characterizes PDS sizes in Hamiltonian cubic graphs.
Findings
Maximum size PDS is APX-hard on split graphs.
Deciding maximal PDS is co-NP-complete on bipartite graphs.
A 2-(Δ+1) approximation algorithm is provided.
Abstract
We define a proportionally dense subgraph (PDS) as an induced subgraph of a graph with the property that each vertex in the PDS is adjacent to proportionally as many vertices in the subgraph as in the graph. We prove that the problem of finding a PDS of maximum size is APX-hard on split graphs, and NP-hard on bipartite graphs. We also show that deciding if a PDS is inclusion-wise maximal is co-NP-complete on bipartite graphs. Nevertheless, we present a simple polynomial-time -approximation algorithm for the problem, where is the maximum degree of the graph. Finally, we show that all Hamiltonian cubic graphs with vertices (except two) have a PDS of size , which we prove to be an upper bound on the size of a PDS in cubic graphs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Proportionally dense subgraph of maximum size: complexity and approximation
Cristina Bazgan [email protected] Université Paris-Dauphine, Université PSL, CNRS, LAMSADE, 75016 Paris, France
Janka Chlebíková [email protected] School of Computing, University of Portsmouth, Portsmouth, United Kingdom
Clément Dallard [email protected] School of Computing, University of Portsmouth, Portsmouth, United Kingdom
Thomas Pontoizeau [email protected] Université Paris-Dauphine, Université PSL, CNRS, LAMSADE, 75016 Paris, France
Abstract
We define a proportionally dense subgraph (PDS) as an induced subgraph of a graph with the property that each vertex in the PDS is adjacent to proportionally as many vertices in the subgraph as in the graph. We prove that the problem of finding a PDS of maximum size is APX-hard on split graphs, and NP-hard on bipartite graphs. We also show that deciding if a PDS is inclusion-wise maximal is co-NP-complete on bipartite graphs. Nevertheless, we present a simple polynomial-time -approximation algorithm for the problem, where is the maximum degree of the graph. Finally, we show that all Hamiltonian cubic graphs with vertices (except two) have a PDS of size , which we prove to be an upper bound on the size of a PDS in cubic graphs.
Keywords: dense subgraph, approximation, complexity, Hamiltonian cubic graphs
1 Introduction
For a graph , the density of a subgraph on a vertex set is commonly defined as , where is the set of edges in the subgraph. The problem of finding a subgraph of maximum density can be solved in polynomial time using a max flow technique [8]. However, when the subgraph must contain exactly vertices, the problem becomes NP-hard [7, 3] and is known as the Densest -subgraph problem. Two variants of the problem have also been studied where the number of vertices in the subgraph must be either at least or at most . The former is known to be NP-hard [10], but there exists a polynomial-time -approximation algorithm to solve it [2]. It was showed that any -approximation for the at most variant would imply a -approximation for the densest -subgraph problem [1].
An induced subgraph on a vertex set is said to be proportionally dense if all of its vertices in have proportionally as many neighbors in the subgraph as in the graph, and hence the condition holds for each vertex in . In this paper, we study the problem of finding a proportionally dense subgraph (PDS) with a maximum number of vertices. A proportionally dense subgraph grants more importance to the vertices than the standard definition of a dense subgraph, as all the vertices in a PDS must be ‘satisfied’, i.e. respect the above condition. This can be compared with defensive alliances in graphs, where the vertices in the alliance must have at least as many neighbors inside the alliance than outside it [14, 11], without the notion of proportion of neighbors.
From a theoretical point of view, it is interesting to observe a problem that connects local and global properties of vertex subsets, interweaving the size of the subset and the number of neighbors. This interesting paradigm has rarely been seen in graph theory problems.
The notion of proportionality of neighbors is closely related to community detection problems. Olsen [12] defined a community structure as a partition of the vertices of a graph into parts such that each vertex has a greater proportion of neighbors in its part than in any other part, each part being called a community. In the same paper, it was proved that any graph that is not a star contains a community structure that can be found in polynomial time (if there is no restriction on the number of communities), but that it is NP-complete to decide if a given subset of vertices can belong to a same community of a community structure. The special case where the community structure contains exactly two communities, namely a -community structure, has been studied in several classes of graphs: a -community structure always exists and can be found in polynomial time in trees, graphs with maximum degree , minimum degree , and complements of bipartite graphs [5]. Recently, the notion of -community structure has been studied under the name of -PDS partition [4]. In this paper, the authors described an infinite family of graphs without a -PDS partition, and a second infinite family of graphs without a connected -PDS partition (but with a disconnected one). These results answer some open questions originally introduced in [5]. However, the complexity of finding a -PDS partition remains unknown in general graphs, and for larger (fixed) number of PDS’s. As there is equivalence between proportionally dense subgraph and community (with regard to the above definition), one may interpret the problem of finding a proportionally dense subgraph of maximum size as finding a community of maximum size. Hence, all the results presented in this paper can also be applied for community related problems.
Section 2 introduces the basic notations used in the paper. Section 3 presents various hardness results of the Max Proportionally Dense Subgraph problem. Section 4 gives positive results about the approximation of this problem. We prove that the the problem can be solved in linear time on Hamiltonian cubic graphs in Section 5. Conclusion and open problems are given in Section 6.
2 Preliminaries
Throughout the paper, we assume that all graphs are simple, undirected and connected. For a graph , we denote by the set of neighbors of and by the degree of , and thus . Also, denotes the maximum degree of (or simply when no confusion arises).
In addition, given a subset of vertices , we define and ; also, represents the induced subgraph of in .
A star is a complete bipartite graph for any . A split graph is a graph in which the vertices can be partitioned into an independent set and a clique.
The Maximum Proportionally Dense Subgraph problem
Definition 1**.**
Let be a graph and , such that . We say that the induced subgraph is a proportionally dense subgraph (PDS) if for each vertex ,
[TABLE]
We say a vertex is satisfied (in ) if it respects Eq. 1. The size of the proportionally dense subgraph corresponds to the cardinality of .
The proof of the above equivalence from Eq. 1 can be found in [5].
Max Proportionally Dense Subgraph (Max PDS)
Input: A graph .
Output: A proportionally dense subgraph in of maximum size.
A proportionally dense subgraph may be connected or not. We study both cases and talk about connected PDS in the former case. Notice that there exist graphs for which all proportionally dense subgraphs of maximum size are not connected, even if the graph is a cubic graph or a caterpillar. In the cubic graph illustrated in Figure 1, the gray vertices represent a PDS of size , which is not connected. In fact, any connected induced subgraph on the set with at least vertices contains at least one vertex of degree in , which is not satisfied since . It can be checked that the maximum size for a PDS is but only for a connected PDS. Similarly, in the caterpillar in Figure 1, any connected induced subgraph of size at least has one vertex unsatisfied. The maximum size for a PDS is , while only for a connected PDS.
3 Hardness results
In this section we prove several hardness results for Max PDS on split and bipartite graphs and further extend the results to prove that deciding if a PDS is inclusion-wise maximal is co-NP-complete.
We construct two polynomial-time reductions from Max Independent Set, which is known to be NP-hard [9].
Max Independent Set
Input: A graph .
Output: A subset of pairwise non-adjacent vertices in of maximum size.
3.1 Split graphs
We first describe a polynomial-time reduction, and then prove two intermediate results allowing us to easily prove the NP-hardness of Max PDS on split graphs.
Definition 2**.**
Let be a graph not isomorphic to a star. We define the construction transforming the graph into , where is defined as follows:
- •
, where , and , are two additional vertices;
- •
for each and each , the edge if and only if ;
- •
the set induces a clique in .
Obviously, the construction can be done in polynomial time. Notice that is a split graph, and is connected if and only if is not isomorphic to a star. See Figure 2 for an example.
Lemma 1**.**
Let be a graph not isomorphic to a star and let be such that . Let be a set of vertices such that . Then a vertex is satisfied in if and only if .
Proof.
A vertex has degree . Hence, if , then and is not satisfied in as it does not respect Eq. 1. However, if , then . Also, since is connected, , and hence and we have
[TABLE]
and thus is satisfied in . ∎
Lemma 2**.**
Let be a graph not isomorphic to a star and let be such that . Let such that is a PDS. Then, there exists such that is a PDS, and . Moreover, can be found in polynomial time.
Proof.
Firstly, we show that .
- •
if , since is an independent set, then any vertex has and ; hence does not satisfy Eq. 1 and is not a PDS;
- •
if , then is a subset of the clique ; it means any vertex has and , and thus
[TABLE]
so does not satisfy Eq. 1 and is not a PDS.
Now, let and .
Observe that for any , and . Thereby, we obtain , so is satisfied in . Also, if a vertex in is satisfied in , then according to Lemma 1 it is also satisfied in any , as long as .
If there exists which is not satisfied in , then following Lemma 1 it holds . Thus, there exists a vertex , non-adjacent to , which we can transfer from to . Obviously, at most transfers are needed to satisfy all the vertices in , and thus holds true. Since and , then .
Note that and that each vertex is satisfied in , since . Clearly, and are satisfied in . Thus, is a PDS, and it can be found in polynomial time. ∎
Notice that Lemma 2 implies that there exists a PDS of maximum size in that is connected. Hence, the following result also holds when looking for a connected PDS.
Theorem 1**.**
Max Proportionally Dense Subgraph* is NP-hard on split graphs.*
Proof.
Let be a graph not isomorphic to a star, be such that , and . Notice that since is connected and not isomorphic to a star, then there is no independent set of size in . We claim that there is an independent set of size at least in if and only if there is a PDS of size at least in .
Let be an independent set of of size at least . In , we define and . First, note that thus . The vertices in are obviously satisfied in as they only have neighbors in . Hence, if there exist unsatisfied vertices, then they must be from the set . Choose a vertex . Since is an independent set of , then for each edge at most one of the vertices and belongs to . Hence, the vertex is not adjacent to at most one vertex in , and thus . According to Lemma 1, the vertex is satisfied in . Thus, is a PDS of size at least .
Let be of size at least such that is a PDS. According to Lemma 2, there exists such that is a PDS, and . We claim that is an independent set of of size at least . Obviously . Moreover, Lemma 1 states that for all satisfied vertices , . Hence, for each vertex there is at most one vertex that is not adjacent to . Since the vertices and are not adjacent in , it implies that in , and therefore the edge has at most one endpoint in the graph . Thus, is an independent set of size at least . ∎
Proposition 1**.**
It is NP-hard to approximate Max Proportionally Dense Subgraph within on split graphs, and hence the problem is APX-hard (even on split graphs).
Proof.
Let be an instance of Max Independent Set on a cubic graph . It is known that it is NP-hard to decide whether or , for any , where [6].
We construct an instance of Max PDS defined on the graph such that . Note that is of size , that is since is cubic. From Theorem 1, we know that . Consequently, it is NP-hard to decide whether or . We obtain that it is NP-hard to approximate Max PDS within . ∎
3.2 Bipartite graphs
In the following, we modify the previous construction in order to prove the NP-hardness of Max PDS on bipartite graph. The reduction will also be used to show the NP-hardness of an “extension version” of the problem, implying the co-NP-completeness of deciding if a PDS is inclusion-wise maximal.
Definition 3**.**
Let be a graph not isomorphic to a star, and an integer such that . We define the construction transforming the graph into , where is defined as follows:
- •
, where , and contains additional vertices;
- •
for each and each , the edge if and only if ;
- •
for each and each , the edge .
Obviously, the construction can be done in polynomial time. Clearly, is connected if and only if the input graph is not isomorphic to a star. Also, notice that is a bipartite graph as there are edges only between and . See Figure 3 for an example.
We now prove intermediate results, which help concluding that Max PDS is NP-complete on bipartite graphs.
Lemma 3**.**
Let , and be integers such that and . Then .
Proof.
∎
Lemma 4**.**
Let be a graph not isomorphic to a star, an integer such that and be such that . Let be such that . Then a vertex is satisfied in if and only if .
Proof.
If , is obviously not satisfied. If , then notice that . Therefore, . Also, . Consequently, according to Lemma 3,
[TABLE]
∎
Lemma 5**.**
Let be a graph not isomorphic to a star, an integer, , and let be such that . Let such that is a PDS and . Then, there exists such that is a PDS, and . Moreover, can be found in polynomial time.
Proof.
First, we prove that . As , then . Take a vertex and notice that since , then . The vertex is satisfied in if and only if
[TABLE]
This implies that
[TABLE]
Thus, we have and conclude that .
Let and . As is satisfied in , according to Lemma 4, we have . Since is connected to all the vertices in , necessarily and remains satisfied in . Obviously, the vertices in are satisfied in since all their neighbours are in . This is also true for the vertices in . ∎
Notice that Lemma 5 implies that there exists a PDS of maximum size that is connected in . Hence, the following result also holds when looking for a connected PDS.
Theorem 2**.**
Max Proportionally Dense Subgraph* is NP-hard on bipartite graphs.*
Proof.
Let be a graph not isomorphic to a star, . Notice that since is connected and not isomorphic to a star, then there is no independent set of size in . Let such that . We claim that there is an independent set of size at least in if and only if there is a PDS of size at least in .
Let be an independent set of of size at least . In , we define and . First, note that thus . The vertices in are obviously satisfied in as all their neighbors are in . Hence, if there exists vertices not satisfied in , then they must belong to the set . Consider a vertex . Since is an independent set of , then for each edge at most one of the vertices and belongs to , and, therefore, at least one belong to . Therefore, the vertex is not adjacent to at least one vertex in , and thus . According to Lemma 4, is satisfied in . Thus, is a PDS of size at least .
Let be of size at least such that is a PDS. According to Lemma 5, there exists such that is a PDS, and . We claim that is an independent set of of size at least . Obviously . Lemma 4 states that for all satisfied vertices , . Therefore, as and , there is at most one vertex not adjacent to . From the construction , if there is no edge between the vertices and in , then in . Hence, the edge in has at most one vertex . Thus, is an independent set of size at least . ∎
Below, we prove that deciding if a subset of vertices can be extended into a larger subset which induces a PDS is NP-complete. We obtain as a corollary that deciding if a PDS is inclusion-wise maximal is co-NP-complete.
PDS Extension
Input: A graph , .
Question: Is there a vertex subset such that and is a proportionally dense subgraph?
To prove that PDS Extension is NP-complete, we use again the construction as defined in Definition 3.
Lemma 6**.**
Let be a graph not isomorphic to a star, an integer, , and be such that . Let be such that and is a PDS.
Then .
Proof.
Let , and notice that , so there exists a vertex in which is not connected to . Let be such a vertex. Note that and , as is not connected to .
Let . We claim that . Suppose by contradiction that . Then and . According to Lemma 3, we conclude that . Therefore,
[TABLE]
which contradicts that is satisfied, and thus that is a PDS. We conclude that . ∎
Theorem 3**.**
PDS Extension* is NP-complete on bipartite graphs.*
Proof.
Obviously, PDS Extension is in NP. Let be a graph not isomorphic to a star, . Notice that since is connected and not isomorphic to a star, then there is no independent set of size in . Let such that . We claim that there is an independent set of size at least in if and only if there is PDS of size of size at least in .
Assume there exists an independent set of size in . Then, there exists of size such that is a PDS, and (see proof of Theorem 2).
According to Lemma 6, if there exists such that is a PDS and , then . Therefore, there exists an independent set of size at least in (see proof of Theorem 2).
We conclude that deciding if there exists such that and is a PDS is NP-complete, and thus that PDS Extension is NP-complete on bipartite graphs. ∎
Notice that the set is connected, and thus if it can be extended into a PDS, then the PDS is connected. Hence, it is NP-complete to decide whether a vertex subset (inducing a connected subgraph) can be extended into a connected PDS. Furthermore, the set can induce a PDS or not, depending on the values of and . Indeed, is a PDS if and only if , which implies . Therefore, we conclude that deciding if a PDS is inclusion-wise maximal is co-NP-complete.
Corollary 1**.**
Let be a graph and such that a proportionally dense subgraph. Deciding if is inclusion-wise maximal is co-NP-complete on bipartite graphs.
4 Approximation
In this section we show that there exists a polynomial-time 2-approximation algorithm for Max Proportionally Dense Subgraph, which establishes the APX-completeness of the problem. When the maximum degree of the graph is bounded, the approximation ratio can be further improved to using a better upper bound on the size of a PDS.
Lemma 7**.**
Let be a graph and such that is not a proportional dense subgraph. If , then there exists such that . Moreover, if is even and , then there exists such that .
Proof.
Let be a subset such that is not a PDS. Then, there exists a vertex such that Eq. 1 is not satisfied in , and therefore .
- •
If , the inequality implies , and hence .
- •
If ( even), assume by contradiction that for each vertex it holds . In particular, the inequality implies , which is true if and only if . Thus, , a contradiction.
∎
Theorem 4**.**
For any graph , a proportionally dense subgraph of size or can be constructed in time.
Proof.
First, we show that Algorithm 1 terminates and returns a PDS of size or .
- •
Case 1: is odd. Notice that at the end of each loop, the set is modified without changing its size . If is not a PDS, then according to Lemma 7 there exists an unsatisfied vertex for which . Therefore, the vertex chosen within the loop has the property . Thus, the size of the cut between and decreases after each loop and the algorithm terminates.
- •
Case 2: is even. Notice that Algorithm 1 starts with . If is not a PDS, then due to Lemma 7, there exists a vertex such that . The selection of the vertex inside the loop ensures that the size of the cut between and strictly decreases at the end of the loop. Now, observe that after the first loop, . If is not a PDS, according to Lemma 7, there exists a vertex such that . Therefore, the vertex inside the loop has . Obviously, after the second loop, . Since after each loop alternates between and , the cut between and strictly decreases every two loops, and the algorithm terminates.
It is easy to see that the while-loop is called at most times. Now, we prove how one can obtain a running time by computing Algorithms 1, 1 and 1 in time.
Preprocessing
Once has been defined at Algorithm 1, compute and store the following properties for each vertex : , , and whether belongs to or . The computation of these properties for all the vertices can be done in time. While computing the properties, one can also choose a vertex that maximises (as in Algorithm 1).
Main loop
If , then is not a PDS. However, if , then is a PDS if and only if (so we decide Algorithm 1 in constant time). Therefore, if is not a , set (as in Algorithm 1), update the properties of all the vertices and select maximising (as in Algorithm 1) in . Then, repeat from the beginning of the main loop. ∎
Corollary 2**.**
Max Proportionally Dense Subgraph* is polynomial-time -approximable.*
Proof.
For any graph , Algorithm 1 yields a PDS of size at least and since any PDS has size at most , we obtain a -approximation algorithm. ∎
We proved the APX-hardness of Max PDS in Proposition 1, and hence we conclude the APX-completeness of the problem.
Corollary 3**.**
Max Proportionally Dense Subgraph* is APX-complete.*
In the following we show how the approximation ratio can be improved with regard to the maximum degree of the graph.
Lemma 8**.**
Let be a graph and such that is a proportionally dense subgraph. Then .
Proof.
Let be a vertex of with at least one neighbor in (such a vertex exists since is connected). Since is a PDS, fulfills the proportion condition, that is which implies that , and hence . ∎
Proposition 2**.**
Max Proportionally Dense Subgraph* is polynomial-time -approximable.*
Proof.
Let be a graph, be a set returned by Algorithm 1 and denote the size of a PDS of maximum size in . According to Lemma 8 we have . Therefore, since and , we obtain
[TABLE]
∎
Algorithm 1 shows that the decision version associated with Max PDS is in FPT when parameterized by its natural parameter (i.e. the size of a PDS). Indeed, if the parameter , then a PDS of size greater than can be found in polynomial time using Algorithm 1. On the other hand, if , then we have and an exhaustive search can be done in operations.
5 Hamiltonian cubic graphs
In this section we prove that all Hamiltonian cubic graphs of order , except two graphs (see Figure 4), have a proportionally dense subgraph of the maximum possible size (see Lemma 8 for an upper bound on a PDS size). Furthermore, we show that such a PDS can be found in linear time if a Hamiltonian cycle is given in the input. Note that almost all cubic graphs are Hamiltonian, as proved in [13].
We represent a Hamiltonian cubic graph of order as a cycle with the vertices labeled in such a way that is a Hamiltonian cycle and a set of edges between non-successive vertices in the Hamiltonian cycle. We always refer to this cycle when we say the Hamiltonian cycle of a graph. To avoid tedious notations, we use (with ) to refer to the vertex labeled by .
Definition 4**.**
Let be a Hamiltonian cubic graph, . Let be a set of successive vertices in the Hamiltonian cycle labeled with , , …, , with such that . The set is called a shift if the first and the last vertices of the sequence, and , are such that .
Notice that a shift contains vertices. Also, any vertex of has at least two neighbors in . Consequently, if , then , and the following holds for any :
[TABLE]
Thus, is a PDS. If , then is a PDS of the maximum possible size (see Lemma 8) and we call a good shift. On the other hand, if , then the size of is one vertex larger than the size of the maximum possible PDS, and thus is not a PDS. Such a shift is called an almost good shift.
In the following, we prove that either contains a good shift or we can find an almost good shift and a vertex such that is a proportionally dense subgraph of the maximum possible size .
Definition 5**.**
Let be a Hamiltonian cubic graph. For each , we denote by the non-successive neighbor of in the Hamiltonian cycle. Additionally, we define the subsets of vertices and in the following way for :
- •
;
- •
.
For a Hamiltonian cubic graph and , notice that if and only if , and symmetrically if and only if . This particularly implies that . Moreover, notice that for a vertex , the set cannot be a good shift, since . In the same way, if , the set cannot be a good shift, since . These observations are summed up in the following lemma.
Lemma 9**.**
Let be a Hamiltonian cubic graph, and . If and , then the set is a good shift. Symmetrically, if and , then the set is a good shift.
Proof.
The proof is straightforward. Since and , we have , where . The other case is similar. ∎
An important consequence of Lemma 9 is that if is a Hamiltonian cubic graph with no good shift, then we can define subsets of vertices that must be either in or in . To define such subsets we introduce the following notation.
Definition 6**.**
Let be a Hamiltonian cubic graph and . We define the vertex subset where .
Corollary 4**.**
Let be a Hamiltonian cubic graph with no good shift and :
- •
if then ,
- •
if , then ,
- •
.
Proof.
First, notice that for any integer , . Moreover, . Thus, we have .
Now, if , then, with our assumption that has no good shift and Lemma 9, we derive that . Symmetrically, if , then .
This implies that for any vertex , either or . Finally, since if and only if and if and only if , then it is obvious that . ∎
Let be a Hamiltonian cubic graph with no good shift and , where is the greatest common divisor of and . We show that can be partitioned into subsets of vertices , , …, . This partition will be useful to find an almost good shift and a vertex to remove from in order to obtain a PDS in . This result comes from a basic property of the cyclic group that we recall in the following lemma.
Lemma 10**.**
Let and be positive integers, and . If all integers are considered , then where and . Moreover, for any with , .
Proof.
First, we prove that for any , for some . Let . Then there exist two integers with , such that . Moreover, there exist two integers such that since . Then, . Thus, with . This proves that any integer is in a set for some , i.e. .
To prove the second part of the statement, we first show that for any . Let and be the smallest integer such that ). Notice that and let us show that . Let be two integers such that , and . We prove that by verifying that divides and divides . First, notice that . Thus, divides . On the other hand, recall that and notice that , then . This implies that divides , and thus divides . Since , divides . Now, notice that two sets , for some integers are either equal or disjoint. Since for any we have , then obviously all sets , are disjoints. ∎
In the following lemma we summarize the possible values of for some specific values of and .
Lemma 11**.**
Let be an even integer, . Then:
- •
if , then ,
- •
if , then ,
- •
if , then .
Proof.
Consider the case , then . As is even, then is odd and . The other cases can be proved using the same reasoning. ∎
Firstly, we show that if , then there is always a good shift in .
Corollary 5**.**
Let be a Hamiltonian cubic graph with vertices, . Then has a good shift.
Proof.
Suppose by contradiction that there is no good shift in . Notice that if , then . Let . From Lemma 11 we get . According to Corollary 4, . If , then (Lemma 10), and hence or , which is impossible. If , then (Lemma 10). According to Corollary 4, or for any , and thus , which is not possible. ∎
From Lemma 10 and Lemma 11, if a Hamiltonian cubic graph has no good shift, then can be written as (we may have and ). Hence, those graphs can be split into two categories:
- •
type RLRL: for any vertices with , we have and , or and . In this case, we always assume without loss of generality that and .
- •
type RRLL: there exist two vertices with such that or . In this case, we always assume without loss of generality that and .
Now, we show that if a Hamiltonian cubic graph has no good shift, then there exists an almost good shift in (Lemma 12) and a vertex such that is a PDS (Lemma 13 and Theorem 5).
Lemma 12**.**
Any Hamiltonian cubic graph with no good shift has an almost good shift.
Proof.
Let be a Hamiltonian cubic graph with no good shift, and . Since has no good shift, according to Lemma 11 and Corollary 5, and or . From Corollary 4, we know that each vertex in belongs to either or .
- •
Case 1: * is of type RLRL*. Let . Since is even, then is even. Therefore, since two vertices do not both belong to or , then the vertex belongs to . Then the set fulfills the requirements.
- •
Case 2: * is of type RRLL*. Consider the set . According to Lemma 11, since , . Hence, . Thus, and fulfills the requirements.
∎
Recall that the graphs and from Figure 4 have no proportionally dense subgraph of the maximum possible size. In Theorem 5, we show that these are the only cubic Hamiltonian graphs with this property.
Before proving the main theorem, we first deal with small graphs () that are particular cases that need to be treated independently.
Lemma 13**.**
Let be a Hamiltonian cubic graph not isomorphic to or with . Then there exists a PDS of size in .
Proof.
Let . Since is cubic, its number of vertices is even. From Lemma 11, . If , then there exists a good shift from Corollary 5. We then suppose that . The following cases remain:
- •
If , then is the complete graph , and any set of vertices induces a PDS of size .
- •
If , we claim that must have a good shift. By contradiction, suppose that has no good shift. If is of type RRLL then is isomorphic to , and if is of type RLRL then is isomorphic to , which is impossible since we assumed that is not isomorphic to or .
- •
If and has no good shift, since , is necessarily of type RLRL and , , , , . In this case, induces a PDS of size .
- •
If , if has no good shift, since , then is necessarily of type RLRL. Following Lemma 12, let be an almost good shift and:
- –
If , notice that (since ) and . Thus, is a PDS of size . If , the case is symmetrical.
- –
If and , notice that , and since . Thus, is a PDS of size .
- •
If , if has no good shift, since , is necessarily of type RLRL. Following Lemma 12, let be an almost good shift. Since , we have either or . In each case, the graph is completely determined due to the constraints. In the first case, induces a PDS of size . In the second case, induces a PDS of size .
In each case, if is not isomorphic to or , then either has a good shift which is a PDS of size , or we give a PDS of such size. ∎
Theorem 5**.**
Let be a Hamiltonian cubic graph not isomorphic to or . Then there exists a connected PDS of size in .
Proof.
If , then there is a PDS of size in from Lemma 13. Now we suppose that , which implies that .
From Lemma 11, . If , then there exists a good shift (Corollary 5).
We suppose that . If contains a good shift, then the proof is done. Notice that in such case, the PDS is obviously connected. Now, we assume that has no good shift. We prove that given an almost good shift , there exists a vertex such that is a PDS. Observe that such vertex exists if and only if , and either or .
- •
If is of type RLRL, then and . According to Lemma 12, the set is an almost good shift and . Since and , then and . If , then since . Thus, is a PDS of size . Symmetrically, if , then since . Thus, is a PDS of size . On the other hand, if and , then and . Moreover, since then . Therefore, or (since , and . Thus, is a PDS of size . Notice that the resulting PDS is connected. Indeed, let be the vertex we removed from the path . It is easy to see that, either , or since the graph is of type , and thus the PDS is connected.
- •
If is of type RRLL, then and . According to Lemma 12, the set is an almost good shift and . Since and , we necessarily have and . In this case, notice that since , . Moreover, , which implies . We show that either or . Suppose that . Then since , we have . Since , we have . Since and , then and . Thus, . Thus, either or . Now, if , then since , the set is a PDS of size . Else, and then since , the set is a PDS of size . Notice that the resulting PDS is connected. Indeed, let be the vertex we removed from the almost good path . Again, it is easy to verify that either , and then , or , and then since the graph is of type . Thus the PDS is connected.
∎
According to Lemma 8, a PDS in a cubic graph of order contains at most vertices. Thus, we obtain the following corollary.
Corollary 6**.**
Let be a Hamiltonian cubic graph with a given Hamiltonian cycle. Then a connected proportional dense subgraph of maximum size in can be found in linear time.
6 Conclusion and open problems
We prove that Max Proportionally Dense Subgraph is APX-hard even on split graphs, and NP-hard on bipartite graphs, whether the PDS is required to be connected or not. Furthermore, the problem is proved to be -approximable, where is the maximum degree of the graph. We also show that deciding if a PDS is inclusion-wise maximal is co-NP-complete, even on bipartite graphs. Nevertheless, Max PDS can be solved in linear time on Hamiltonian cubic graphs if a Hamiltonian cycle is given.
However, the complexity of finding a PDS of maximum size in cubic graphs remains unknown. More specifically, the question whether a PDS of size always exists in a cubic graph is still open (except for the two graphs given in Figure 4). Also, Algorithm 1 returns a PDS of size or (in linear time), but the PDS may not be connected. An interesting open question is whether there is always a connected PDS of size at least . Finally, the parameterized complexity of finding a PDS of size at least is unknown.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Andersen and Chellapilla [2009] R. Andersen and K. Chellapilla. Finding dense subgraphs with size bounds. In 6th International Workshop on Algorithms and Models for the Web-Graph (WAW) , volume 5427 of LNCS , pages 25–37. Springer, 2009. doi: 10.1007/978-3-540-95995-3_3 . · doi ↗
- 2Anderson [2007] R. Anderson. Finding large and small dense subgraphs. ar Xiv:cs/0702032, 2007.
- 3Asahiro et al. [2002] Y. Asahiro, R. Hassin, and K. Iwama. Complexity of finding dense subgraphs. Discrete Applied Mathematics , 121(1-3):15–26, 2002. doi: 10.1016/S 0166-218X(01)00243-8 . · doi ↗
- 4Bazgan et al. [2018 a] C. Bazgan, J. Chlebíková, and C. Dallard. Graphs without a partition into two proportionally dense subgraphs. ar Xiv:cs/1806.10963, 2018 a.
- 5Bazgan et al. [2018 b] C. Bazgan, J. Chlebíková, and T. Pontoizeau. Structural and algorithmic properties of 2 2 2 -community structures. Algorithmica , 80(6):1890–1908, 2018 b. doi: 10.1007/s 00453-017-0283-7 . · doi ↗
- 6Chlebík and Chlebíková [2006] M. Chlebík and J. Chlebíková. Complexity of approximating bounded variants of optimization problems. Theoretical Computer Science , 354:320–338, 2006. doi: 10.1016/j.tcs.2005.11.029 . · doi ↗
- 7Feige et al. [2001] U. Feige, D. Peleg, and G. Kortsarz. The dense k 𝑘 k -subgraph problem. Algorithmica , 29(3):410–421, 2001. doi: 10.1007/s 004530010050 . · doi ↗
- 8Goldberg [1984] A. V. Goldberg. Finding a maximum density subgraph. Technical Report UCB/CSD-84-171, EECS Department, University of California, Berkeley, 1984.
