Solving Vertex Cover in Polynomial Time on Hyperbolic Random Graphs

Thomas Bl\"asius; Philipp Fischbeck; Tobias Friedrich; Maximilian; Katzmann

arXiv:1904.12503·cs.DS·February 20, 2020

Solving Vertex Cover in Polynomial Time on Hyperbolic Random Graphs

Thomas Bl\"asius, Philipp Fischbeck, Tobias Friedrich, Maximilian, Katzmann

PDF

TL;DR

This paper demonstrates that the Vertex Cover problem can be solved in polynomial time on hyperbolic random graphs, providing insights into real-world network structures and improving approximation algorithms.

Contribution

It proves polynomial-time solvability of Vertex Cover on hyperbolic random graphs and links structural properties to practical algorithm performance.

Findings

01

Vertex Cover solvable in polynomial time on hyperbolic random graphs

02

Structural properties observed in real-world networks

03

Adaptive greedy algorithms outperform standard approaches

Abstract

The VertexCover problem is proven to be computationally hard in different ways: It is NP-complete to find an optimal solution and even NP-hard to find an approximation with reasonable factors. In contrast, recent experiments suggest that on many real-world networks the run time to solve VertexCover is way smaller than even the best known FPT-approaches can explain. Similarly, greedy algorithms deliver very good approximations to the optimal solution in practice. We link these observations to two properties that are observed in many real-world networks, namely a heterogeneous degree distribution and high clustering. To formalize these properties and explain the observed behavior, we analyze how a branch-and-reduce algorithm performs on hyperbolic random graphs, which have become increasingly popular for modeling real-world networks. In fact, we are able to show that the VertexCover…

Tables2

Table 1. Table 1 : The raw data of our experiments. The columns are: (network) the network’s name; (easy) whether or not we could compute an optimal solution; (dom) the percentage of dominant vertices among the high-degree vertices; (tw) an upper bound for the treewidth of the remaining graph after deleting dominant nodes; (greedy) the approximation ratio of greedy; (2-ad) the approximation ratio of 2-adaptive greedy; (4-ad) the approximation ratio of 4-adaptive greedy; (comp) the size of the largest component that remains after the greedy phase of 4-adaptive greedy.

network	easy	dom	tw	greedy	2-ad	4-ad	comp
advogato	✓	$051 %$	314	1.011	1.009	1.005	863
airlines	✓	$028 %$	23	1.000	1.000	1.000	75
as-22july06	✓	$100 %$	3	1.002	1.001	1.001	46
as-caida20071105	✓	$100 %$	3	1.002	1.001	1.000	35
as-skitter	✗	$047 %$	969794
as20000102	✓	$100 %$	2	1.003	1.001	1.001	18
bio-CE-HT	✓	$100 %$	3	1.015	1.009	1.000	225
bio-CE-LC	✓	$100 %$	2	1.003	1.003	1.003	39
bio-DM-HT	✓	$050 %$	13	1.017	1.014	1.004	319
bio-yeast-protein-inter	✓	$100 %$	4	1.013	1.006	1.002	147
bn-fly-drosophila-medulla-1	✓	$072 %$	38	1.018	1.013	1.009	142
bn-mouse-kasthuri-graph-v4	✓	$100 %$	1	1.006	1.000	1.000	12
ca-AstroPh	✓	$094 %$	6	1.003	1.002	1.001	123
ca-cit-HepPh	✓	$084 %$	151	1.003	1.003	1.002	533
ca-CondMat	✓	$099 %$	4	1.003	1.002	1.001	53
ca-GrQc	✓	$099 %$	2	1.004	1.002	1.001	44
ca-HepTh	✓	$095 %$	13	1.005	1.004	1.001	174
cfinder-google	✗	$066 %$	82
cit-HepTh	✗	$013 %$	19737
citeseer	✗	$046 %$	182372
com-amazon	✓	$093 %$	2756	1.011	1.006	1.002	16209
com-dblp	✓	$100 %$	7	1.002	1.001	1.000	69
cpan-authors	✓	$100 %$	2	1.009	1.009	1.009	17
digg-friends	✓	$058 %$	1649	1.008	1.006	1.004	179
ego-facebook	✓	$100 %$	-1	1.000	1.000	1.000	3
ego-gplus	✓	$100 %$	1	1.000	1.000	1.000	5
email-Enron	✓	$085 %$	41	1.003	1.002	1.001	141
EuroSiS	✓	$056 %$	34	1.020	1.018	1.010	274
facebook-wosn-links	✗	$027 %$	36694
flixster	✗	$073 %$	122
hyves	✓	$098 %$	1653	1.008	1.008	1.008	42
livemocha	✓	$004 %$	24380	1.017	1.013	1.006	25300
loc-brightkite-edges	✓	$076 %$	619	1.014	1.009	1.004	4658

Table 2. Table 1 : The raw data of our experiments. The columns are: (network) the network’s name; (easy) whether or not we could compute an optimal solution; (dom) the percentage of dominant vertices among the high-degree vertices; (tw) an upper bound for the treewidth of the remaining graph after deleting dominant nodes; (greedy) the approximation ratio of greedy; (2-ad) the approximation ratio of 2-adaptive greedy; (4-ad) the approximation ratio of 4-adaptive greedy; (comp) the size of the largest component that remains after the greedy phase of 4-adaptive greedy.

network	easy	dom	tw	greedy	2-ad	4-ad	comp
loc-gowalla-edges	✗	$064 %$	3991
moreno-names	✓	$094 %$	3	1.006	1.004	1.002	34
moreno-propro	✓	$100 %$	4	1.014	1.006	1.002	153
munmun-twitter-social	✓	$057 %$	12	1.000	1.000	1.000	5
OClinks	✓	$036 %$	202	1.017	1.015	1.005	498
p2p-Gnutella04	✓	$042 %$	1352	1.019	1.017	1.016	970
p2p-Gnutella05	✓	$040 %$	1075	1.014	1.013	1.013	447
p2p-Gnutella06	✓	$040 %$	1142	1.023	1.022	1.021	820
p2p-Gnutella08	✓	$047 %$	414	1.008	1.008	1.008	45
p2p-Gnutella09	✓	$047 %$	419	1.005	1.005	1.005	63
p2p-Gnutella24	✓	$081 %$	525	1.006	1.005	1.005	70
p2p-Gnutella25	✓	$079 %$	464	1.006	1.005	1.005	77
p2p-Gnutella30	✓	$079 %$	604	1.005	1.005	1.004	62
p2p-Gnutella31	✓	$080 %$	732	1.011	1.010	1.010	65
petster-carnivore	✓	$079 %$	149312	1.008	1.007	1.004	9238
petster-friendship-cat	✗	$012 %$	14929
petster-friendship-dog	✗	$015 %$	340634
petster-friendship-hamster	✗	$023 %$	135
soc-Epinions1	✓	$082 %$	238	1.006	1.003	1.001	228
US-Air	✓	$067 %$	4	1.013	1.000	1.000	23
web-Google	✗	$084 %$	103939
wiki-Vote	✓	$044 %$	384	1.054	1.052	1.050	726
wordnet-words	✓	$095 %$	28	1.004	1.003	1.002	59
YeastS	✓	$070 %$	39	1.013	1.012	1.005	244
youtube-links	✓	$086 %$	1239	1.008	1.004	1.001	570
youtube-u-growth	✗	$090 %$	59358

Equations64

dist (p, q) = acosh (cosh (r (p)) cosh (r (q)) - sinh (r (p)) sinh (r (q)) cos (Δ_{φ} (φ (p), φ (q)))),

dist (p, q) = acosh (cosh (r (p)) cosh (r (q)) - sinh (r (p)) sinh (r (q)) cos (Δ_{φ} (φ (p), φ (q)))),

f (r) = \frac{1}{2 π} \frac{α sinh ( α r )}{cosh ( α R ) - 1} = \frac{α}{2 π} e^{- α (R - r)} (1 + Θ (e^{- α R} - e^{- 2 α r})),

f (r) = \frac{1}{2 π} \frac{α sinh ( α r )}{cosh ( α R ) - 1} = \frac{α}{2 π} e^{- α (R - r)} (1 + Θ (e^{- α R} - e^{- 2 α r})),

θ (r (u), r (v))

θ (r (u), r (v))

= 2 e^{(R - r (u) - r (v)) /2} (1 + Θ (e^{R - r (u) - r (v)})) .

δ (r (u), r (v)) = 2 (e^{- r (u) /2} - e^{- r (v) /2}) + Θ (e^{- 3/2 r (u)}) - Θ (e^{- 3/2 r (v)}) .

δ (r (u), r (v)) = 2 (e^{- r (u) /2} - e^{- r (v) /2}) + Θ (e^{- 3/2 r (u)}) - Θ (e^{- 3/2 r (v)}) .

δ (r (u), r (v))

δ (r (u), r (v))

= 2 (e^{- r (u) /2} - e^{- r (v) /2}) + Θ (e^{- 3/2 r (u)}) - Θ (e^{- 3/2 r (v)}) .

μ (D (u))

μ (D (u))

μ (D (u))

μ (D (u))

= 2 \int_{r (u)}^{R} (2 (e^{- r (u) /2} - e^{- r /2}) + Θ (e^{- 3/2 r (u)}) - Θ (e^{- 3/2 r}))

= 2 \int_{r (u)}^{R} \cdot \frac{α}{2 π} e^{- α (R - r)} (1 + Θ (e^{- α R} - e^{- 2 α r})) d r

μ (D (u))

μ (D (u))

= \frac{2 α}{π} e^{- r (u) /2} e^{- α R} (1 + Θ (e^{- α R})) \int_{r (u)}^{R} e^{α r} d r

- \frac{2 α}{π} e^{- α R} (1 + Θ (e^{- α R})) \int_{r (u)}^{R} e^{(α - 1/2) r} d r \pm O (e^{- (3/4 + α) R} \int_{r (u)}^{R} e^{α r} d r) .

μ (D (u))

μ (D (u))

- \frac{2 α}{( α - 1/2 ) π} e^{- R /2} (1 + Θ (e^{- α R})) (1 - e^{- (α - 1/2) (R - r (u))})

\pm O (e^{- 3/4 R} (1 - e^{- α (R - r (u))})) .

μ (D (u))

μ (D (u))

= \frac{2}{π} e^{- R /2 + l o g l o g (n^{c})} (1 - Θ (e^{- 2 α l o g l o g (n^{c})})) \pm O (1/ n) .

Pr [{v \in D (u)} \neq = \emptyset] = 1 - (1 - μ (D (u)))^{n} \geq 1 - e^{- n μ (D (u))} = 1 - Θ (n^{- c \overset{κ}{ˉ} /4 (1 - o (1))}) .

Pr [{v \in D (u)} \neq = \emptyset] = 1 - (1 - μ (D (u)))^{n} \geq 1 - e^{- n μ (D (u))} = 1 - Θ (n^{- c \overset{κ}{ˉ} /4 (1 - o (1))}) .

μ (A_{r})

μ (A_{r})

μ (A_{r})

μ (A_{r})

= 2 \int_{r}^{R} 2 e^{(R - 2 x) /2} (1 \pm Θ (e^{R - 2 x})) \cdot \frac{α}{2 π} e^{- α (R - x)} (1 + Θ (e^{- α R} - e^{- 2 α x})) d x .

μ (A_{r})

μ (A_{r})

μ (A_{r})

μ (A_{r})

= \frac{2 α}{π} e^{- (α - 1/2) R} (1 + Θ (e^{- α R}))

\displaystyle\qquad\cdot\Bigg{(}\frac{1}{1-\alpha}e^{-(1-\alpha)r}(1-e^{-(1-\alpha)(R-r)})+\Theta\left(e^{R}e^{-(3-\alpha)r}(1-e^{-(3-\alpha)(R-r)})\right)\Bigg{)}.

μ (A_{r})

μ (A_{r})

\displaystyle\qquad\cdot\Bigg{(}(1-e^{-(1-\alpha)(R-r)})+\Theta\left(e^{R-2r}(1-e^{-(3-\alpha)(R-r)})\right)\Bigg{)}.

iw (G_{r}^{S})

iw (G_{r}^{S})

E [∣ {v \in A_{r}} ∣]

E [∣ {v \in A_{r}} ∣]

E [∣ {v \in A_{r}} ∣]

E [∣ {v \in A_{r}} ∣]

\displaystyle\qquad\cdot\Big{(}1+\Theta(e^{-\alpha R}+e^{-(2(R-1/(1-\alpha)\log\log(n^{c}))-R)}

\displaystyle\hphantom{\qquad\cdot\Big{(}}-e^{-(1-\alpha)(R-(R-1/(1-\alpha)\log\log(n^{c})))})\Big{)}

= \frac{2 α}{( 1 - α ) π} n e^{- R /2 + l o g l o g (n^{c}))}

\displaystyle\qquad\cdot\Big{(}1+\Theta(e^{-\alpha R}+e^{-(R-2/(1-\alpha)\log\log(n^{c}))}-e^{-\log\log(n^{c})})\Big{)}

E [∣ {v \in A_{r}} ∣]

E [∣ {v \in A_{r}} ∣]

E [de g (v) ∣ r (v) = r]

E [de g (v) ∣ r (v) = r]

de g (v) \geq \frac{α κ ˉ ε}{4 ( α - 1/2 )} d (1 \pm o (1)) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Hasso Plattner Institute, University of Potsdam

Potsdam, [email protected] Plattner Institute, University of Potsdam

Potsdam, [email protected]://orcid.org/0000-0003-0076-6308Hasso Plattner Institute, University of Potsdam

Potsdam, [email protected]\CopyrightThomas Bläsius, Philipp Fischbeck, Tobias Friedrich, Maximilian Katzmann\ccsdesc[500]Theory of computation Graph algorithms analysis \ccsdesc[500]Theory of computation Random network models \ccsdesc[500]Mathematics of computing Random graphs \fundingThis research was partially funded by the German Research Foundation (Deutsche

Forschungsgemeinschaft, DFG) – project number 390859508.\hideLIPIcs\EventEditorsChristophe Paul and Markus Bläser \EventNoEds2 \EventLongTitle37th International Symposium on Theoretical Aspects of Computer Science (STACS 2020) \EventShortTitleSTACS 2020 \EventAcronymSTACS \EventYear2020 \EventDateMarch 10–13, 2020 \EventLocationMontpellier, France \EventLogo \SeriesVolume154 \ArticleNo21

Solving Vertex Cover in Polynomial Time on Hyperbolic Random

Graphs

Thomas Bläsius

Philipp Fischbeck

Tobias Friedrich

Maximilian Katzmann

Abstract

The VertexCover problem is proven to be computationally hard in different ways: It is NP-complete to find an optimal solution and even NP-hard to find an approximation with reasonable factors. In contrast, recent experiments suggest that on many real-world networks the run time to solve VertexCover is way smaller than even the best known FPT-approaches can explain. Similarly, greedy algorithms deliver very good approximations to the optimal solution in practice.

We link these observations to two properties that are observed in many real-world networks, namely a heterogeneous degree distribution and high clustering. To formalize these properties and explain the observed behavior, we analyze how a branch-and-reduce algorithm performs on hyperbolic random graphs, which have become increasingly popular for modeling real-world networks. In fact, we are able to show that the VertexCover problem on hyperbolic random graphs can be solved in polynomial time, with high probability.

The proof relies on interesting structural properties of hyperbolic random graphs. Since these predictions of the model are interesting in their own right, we conducted experiments on real-world networks showing that these properties are also observed in practice. When utilizing the same structural properties in an adaptive greedy algorithm, further experiments suggest that, on real instances, this leads to better approximations than the standard greedy approach within reasonable time.

keywords:

vertex cover, random graphs, hyperbolic geometry, efficient algorithm

1 Introduction

VertexCover is a fundamental NP-complete graph problem. For a given undirected graph $G$ on $n$ vertices the goal is to find the smallest vertex subset $S$ , such that each edge in $G$ is incident to at least one vertex in $S$ . Since, by definition, there can be no edge between two vertices outside of $S$ , these remaining vertices form an independent set. Therefore, one can easily derive a maximal independent set from a minimal vertex cover and vice versa.

Due to its NP-completeness there is probably no polynomial time algorithm for solving VertexCover. The best known algorithm for IndependentSet runs in $1.1996^{n}\operatorname{poly}(n)$ [22]. To analyze the complexity of VertexCover on a finer scale, several parameterized solutions have been proposed. One can determine whether a graph $G$ has a vertex cover of size $k$ by applying a branch-and-reduce algorithm. The idea is to build a search tree by recursively considering two possible extensions of the current vertex cover (branching), until a vertex cover is found or the size of the current cover exceeds $k$ . Each branching step is followed by a reduce step in which reduction rules are applied to make the considered graph smaller. This branch-and-reduce technique yields a simple $\mathcal{O}(2^{k}\operatorname{poly}(n))$ algorithm, where the exponential portion comes from the branching. The best known FPT (fixed-parameter tractable) algorithm runs in $\mathcal{O}(1.2738^{k}+kn)$ time [7], and unless ETH (exponential time hypothesis) fails, there can be no $2^{o(k)}\operatorname{poly}(n)$ algorithm [6].

While these FPT approaches promise relatively small running times if the considered network has a small vertex cover, the cover is large for many real-world networks. Nevertheless, it was recently observed that applying a branch-and-reduce technique on real instances is very efficient [1]. Some of the considered networks had millions of vertices, yet an optimal solution (also containing millions of vertices) was computed within seconds. Most instances were solved so quickly since the expensive branching was not necessary at all. In fact, the application of the reduction rules alone already yielded an optimal solution. Most notably, applying the dominance reduction rule, which eliminates vertices whose neighborhood contains a vertex together with its neighborhood, reduces the graph to a very small remainder on which the branching, if necessary, can be done quickly. We trace the effectiveness of the dominance rule back to two properties that are often observed in real-world networks: a heterogeneous degree distribution (the network contains many vertices of small degree and few vertices of high degree) and high clustering (the neighbors of a vertex are likely to be neighbors themselves).

We formalize these key properties using hyperbolic random graphs to analyze the performance of the dominance rule. Introduced by Krioukov et al. [17], hyperbolic random graphs are obtained by randomly distributing nodes in the hyperbolic plane and connecting any two that are geometrically close. The resulting graphs feature a power-law degree distribution and high clustering [14, 17] (the two desired properties) which can be tuned using parameters of the model. Additionally, the generated networks have a small diameter [13]. All of these properties have been observed in many real-world networks such as the internet, social networks, as well as biological networks like protein-protein interaction networks. Furthermore, Boguná, Papadopoulos, and Krioukov showed that the internet can be embedded into the hyperbolic plane such that routing packages between network participants greedily works very well [5], indicating that this network naturally fits into the hyperbolic space.

By making use of the underlying geometry, we show that VertexCover can be solved in polynomial time on hyperbolic random graphs, with high probability. This is done by showing that even a single application of the dominance reduction rule reduces a hyperbolic random graph to a remainder with small pathwidth on which VertexCover can then be solved efficiently. Our analysis provides an explanation for why VertexCover can be solved efficiently on practical instances. We note that, while our analysis makes use of the underlying hyperbolic geometry, the algorithm itself is oblivious to it. Besides the running time the model predicts certain structural properties that also point us to an adapted greedy algorithm that is still very efficient and achieves better approximation ratios. We conducted experiments indicating that these predictions (concerning the structural properties and improved approximation) actually match the real world for a significant fraction of networks.

2 Preliminaries

Let $G=(V,E)$ be an undirected graph. We denote the number of vertices in $G$ with $n$ . The neighborhood of a vertex $v$ is defined as $N(v)=\{w\in V\mid\{v,w\}\in E\}$ and the size of the neighborhood, called the degree of $v$ , is denoted by $\deg(v)$ . For a subset $S\subseteq V$ , we use $G[S]$ to denote the induced subgraph of $G$ obtained by removing all vertices in $V\setminus S$ . Furthermore, we use the shorthand notation $G_{\leq d}$ to denote $G[\{v\in V\mid\deg(v)\leq d\}]$ .

The Hyperbolic Plane.

After choosing a designated origin $O$ in the two-dimensional hyperbolic plane, together with a reference ray starting at $O$ , a point $p$ is uniquely identified by its radius $r(p)$ , denoting the hyperbolic distance to $O$ , and its angle (or angular coordinate) $\varphi(p)$ , denoting the angular distance between the reference ray and the line through $p$ and $O$ . The hyperbolic distance between two points $p$ and $q$ is given by

[TABLE]

where $\cosh(x)=(e^{x}+e^{-x})/2$ , $\sinh(x)=(e^{x}-e^{-x})/2$ (both growing as $e^{x}/2\pm o(1)$ ), and $\Delta_{\varphi}(p,q)=\pi-|\pi-|\varphi(p)-\varphi(q)||$ denotes the angular distance between $p$ and $q$ . If not stated otherwise, we assume that computations on angles are performed modulo $2\pi$ .

We use $B_{p}(r)$ to denote a disk of radius $r$ centered at $p$ , i.e., the set of points with hyperbolic distance at most $r$ to $p$ . Such a disk has an area of $2\pi(\cosh(r)-1)$ and circumference $2\pi\sinh(r)$ . Thus, the area and the circumference of a disk in the hyperbolic plane grow exponentially with its radius. In contrast, this growth is polynomial in Euclidean space. Therefore, representing hyperbolic shapes in the Euclidean geometry results in a distortion. In the native representation, used in our figures, circles can appear teardrop-shaped (see Figure 2).

Hyperbolic Random Graphs.

Hyperbolic random graphs are obtained by distributing $n$ points uniformly at random within the disk $B_{O}(R)$ and connecting any two of them if and only if their hyperbolic distance is at most $R$ ; see Figure 1. The disk radius $R$ (which matches the connection threshold) is defined as $R=2\log(8n/(\pi\bar{\kappa}))$ , where $\bar{\kappa}$ is a constant describing the desired average degree of the generated network. The coordinates for the vertices are drawn as follows. For vertex $v$ the angular coordinate, denoted by $\varphi(v)$ , is drawn uniformly at random from $[0,2\pi]$ and the radius of $v$ , denoted by $r(v)$ , is sampled according to the probability density function $\alpha\sinh(\alpha r)/(\cosh(\alpha R)-1)$ for $r\in[0,R]$ and $\alpha\in(1/2,1)$ . Thus,

[TABLE]

is their joint distribution function for $r\in[0,R]$ . For $r>R$ , $f(r)=0$ . The constant $\alpha\in(1/2,1)$ is used to tune the power-law exponent $\beta=2\alpha+1$ of the degree distribution of the generated network. Note that we obtain power-law exponents $\beta\in(2,3)$ . Exponents outside of this range are atypical for hyperbolic random graphs. On the one hand, for $\beta<2$ the average degree of the generated networks is divergent. On the other hand, for $\beta>3$ hyperbolic random graphs degenerate: They decompose into smaller components, none having a size linear in $n$ . The obtained graphs have logarithmic tree width [4], meaning the VertexCover problem can be solved efficiently in that case.

The probability for a given vertex to lie in a certain area $A$ of the disk is given by its probability measure $\mu(A)=\int_{A}f(r)\mathrm{d}r$ . The hyperbolic distance between two vertices $u$ and $v$ increases with increasing angular distance between them. The maximum angular distance such that they are still connected by an edge is bounded by [14, Lemma 6]

[TABLE]

Interval Graphs and Circular Arc Graphs.

In an interval graph each vertex $v$ is identified with an interval on the real line and two vertices are adjacent if and only if their intervals intersect. The interval width of an interval graph $G$ , denoted by $\operatorname{iw}(G)$ , is its maximum clique size, i.e., the maximum number of intervals that intersect in one point. For any graph the interval width is defined as the minimum interval width over all of its interval supergraphs. Circular arc graphs are a superclass of interval graphs, where each vertex is identified with a subinterval of the circle called circular arc or simply arc. The interval width of a circular arc graph $G$ is at most twice the size of its maximum clique, since one obtains an interval supergraph of $G$ by mapping the circular arcs into the interval $[0,2\pi]$ on the real line and replacing all intervals that were split by this mapping with the whole interval $[0,2\pi]$ . Consequently, for any graph $G$ , if $k$ denotes the minimum over the maximum clique number of all circular arc supergraphs $G^{\prime}$ of $G$ , then the interval width of $G$ is at most $2k$ .

Treewidth and Pathwidth.

A tree decomposition of a graph $G$ is a tree $T$ where each tree node represents a subset of the vertices of $G$ called bag, and the following requirements have to be satisfied: Each vertex in $G$ is contained in at least one bag, all bags containing a given vertex in $G$ form a connected subtree of $T$ , and for each edge in $G$ , there exists a bag containing both endpoints. The width of a tree decomposition is the size of its largest bag minus one. The treewidth of $G$ is the minimum width over all tree decompositions of $G$ . The path decomposition of a graph is defined analogously to the tree decomposition, with the constraint that the tree has to be a path. Additionally, as for the treewidth, the pathwidth of a graph $G$ , denoted by $\operatorname{pw}(G)$ , is the minimum width over all path decompositions of $G$ . Clearly the pathwidth is an upper bound on the treewidth. It is known that for any graph $G$ and any $k\geq 0$ , the interval width of $G$ is at most $k+1$ if and only if its pathwidth is at most $k$ [8, Theorem 7.14]. Consequently, if $k^{\prime}$ is the maximum clique size of a circular arc supergraph of $G$ , then $2k^{\prime}-1$ is an upper bound on the pathwidth of $G$ .

Probabilities.

Since we are analyzing a random graph model, our results are of probabilistic nature. To obtain meaningful statements, we show that they hold with high probability (for short whp.), i.e., with probability $1-\mathcal{O}(n^{-1})$ . The following Chernoff bound is a useful tool for showing that certain events occur with high probability.

Theorem 2.1 (Chernoff Bound [11, A.1]).

Let $X_{1},\dots,X_{n}$ be independent random variables with $X_{i}\in\{0,1\}$ and let $X$ be their sum. Let $f(n)=\Omega(\log(n))$ . If $f(n)$ is an upper bound for $\mathbb{E}[X]$ , then for each constant $c$ there exists a constant $c^{\prime}$ such that $X\leq c^{\prime}f(n)$ holds with probability $1-\mathcal{O}(n^{-c})$ .

3 Vertex Cover on Hyperbolic Random Graphs

Reduction rules are often applied as a preprocessing step, before using a brute force search or branching in a search tree. They simplify the input by removing parts that are easy to solve. For example, an isolated vertex does not cover any edges and can thus never be part of a minimum vertex cover. Consequently, in a preprocessing step all isolated vertices can be removed, which leads to a reduced input size without impeding the search for a minimum.

The dominance reduction rule was previously defined for the IndependentSet problem [12], and later used for VertexCover in the experiments by Akiba and Iwata [1]. Formally, vertex $u$ dominates a neighbor $v\in N(u)$ if $(N(v)\setminus\{u\})\subseteq N(u)$ , i.e., all neighbors of $v$ are also neighbors of $u$ . We say $u$ is dominant if it dominates at least one vertex. The dominance rule states that $u$ can be added to the vertex cover (and afterwards removed from the graph), without impeding the search for a minimum vertex cover. To see that this is correct, assume that $u$ dominates $v$ and let $S$ be a minimum vertex cover that does not contain $u$ . Since $S$ has to cover all edges, it contains all neighbors of $u$ . These neighbors include $v$ and all of $v$ ’s neighbors, since $u$ dominates $v$ . Therefore, removing $v$ from $S$ leaves only the edge $\{u,v\}$ uncovered which can be fixed by adding $u$ instead. The resulting vertex cover has the same size as $S$ . When searching for a minimum vertex cover of $G$ , it is thus safe to assume that $u$ is part of the solution and to reduce the search to $G[V\setminus\{u\}]$ .

In the remainder of this section, we study the effectiveness of the dominance reduction rule on hyperbolic random graphs and conclude that VertexCover can be solved efficiently on these graphs. Our results are summarized in the following main theorem.

Theorem 3.1.

Let $G$ be a hyperbolic random graph on $n$ vertices. Then the VertexCover problem on $G$ can be solved in $\operatorname{poly}(n)$ time, with high probability.

The proof of Theorem 3.1 consists of two parts that make use of the underlying hyperbolic geometry. In the first part, we show that applying the dominance reduction rule once removes all vertices in the inner part of the hyperbolic disk with high probability, as depicted in Figure 1. We note that this is independent of the order in which the reduction rule is applied, as dominant vertices remain dominant after removing other dominant vertices. In the second part, we consider the induced subgraph containing the remaining vertices near the boundary of the disk (black vertices in Figure 1). We prove that this subgraph has a small pathwidth, by showing that there is a circular arc supergraph with a small interval width. Consequently, a tree decomposition of this subgraph can be computed efficiently. Finally, we obtain a polynomial time algorithm for VertexCover by first applying the reduction rules and afterwards solving VertexCover on the remaining subgraph using dynamic programming on the tree decomposition of small width.

3.1 Dominance on Hyperbolic Random Graphs

Recall that a hyperbolic random graph is obtained by distributing $n$ vertices in a hyperbolic disk $B_{O}(R)$ and that any two are connected if their distance is at most $R$ . Consequently, one can imagine the neighborhood of a vertex $u$ as another disk $B_{u}(R)$ . Vertex $u$ dominates another vertex $v$ if its neighborhood disk completely contains that of $v$ (both constrained to $B_{O}(R)$ ), as depicted in Figure 2 left. We define the dominance area $D(u)$ of $u$ to be the area containing all such vertices $v$ . That is, $D(u)=\{p\in B_{O}(R)\mid B_{p}(R)\cap B_{O}(R)\subseteq B_{u}(R)\cap B_{O}(R)\}$ . The result is illustrated in Figure 2 right. We note that it is sufficient for a vertex $v$ to lie in $D(u)$ in order to be dominated by $u$ , however, it is not necessary.

Given the radius $r(u)$ of vertex $u$ we can now compute a lower bound on the probability that $u$ dominates another vertex, i.e., the probability that at least one vertex lies in $D(u)$ , by determining the measure $\mu(D(u))$ . To this end, we first define $\delta(r(u),r(v))$ to be the maximum angular distance between two nodes $u$ and $v$ such that $v$ lies in $D(u)$ .

Lemma 3.2.

Let $u,v$ be vertices with $r(u)\leq r(v)$ . Then, $v\in D(u)$ if $\Delta_{\varphi}(u,v)$ is at most

[TABLE]

Proof 3.3.

Without loss of generality we assume that $\varphi(u)=0$ . For now assume that $\varphi(v)=\varphi(u)$ . Since $r(v)\geq r(u)$ we know that the intersections of the boundaries of $B_{v}(R)$ with $B_{O}(R)$ lie between those of $B_{u}(R)$ with $B_{O}(R)$ , as is depicted in Figure 3. Now let $i_{u}$ denote one of these intersections for $B_{u}(R)$ and $B_{O}(R)$ , and let $i_{v}$ denote the intersection for $B_{v}(R)$ and $B_{O}(R)$ that is on the same side of the ray through $O$ and $u$ as $i_{u}$ . It is easy to see that the maximum angular distance between $u$ and $v$ such that $B_{v}(R)\cap B_{O}(R)$ is contained within $B_{u}(R)\cap B_{O}(R)$ is given by the angular distance between $i_{u}$ and $i_{v}$ . Therefore, $v$ lies in the dominance area of $u$ if $\Delta_{\varphi}(u,v)\leq\Delta_{\varphi}(i_{u},i_{v})$ .

Recall that $\theta(r(p),r(q))$ denotes the maximum angular distance such that $\operatorname{dist}(p,q)\leq R$ , as defined in Equation (2). Since $i_{u}$ and $i_{v}$ have radius $R$ and hyperbolic distance $R$ to $u$ and $v$ , respectively, we know that their angular coordinates are $\theta(r(u),R)$ and $\theta(r(v),R)$ , respectively. Consequently, the angular distance between $i_{u}$ and $i_{v}$ is given by

[TABLE]

Using Lemma 3.2 we can now compute the probability for a given vertex to lie in the dominance area of $u$ . We note that this probability grows roughly like $2/\pi e^{-r(u)/2}$ , which is a constant fraction of the measure of the neighborhood disk of $u$ which grows as $\alpha/(\alpha-1/2)\cdot 2/\pi e^{-r(u)/2}$ [14, Lemma 3.2]. Consequently, the expected number of nodes that $u$ dominates is a constant fraction of the expected number of its neighbors.

Lemma 3.4.

Let $u$ be a node with radius $r(u)\geq R/2$ . The probability for a given node to lie in $D(u)$ is given by

[TABLE]

Proof 3.5.

The probability for a given vertex $v$ to lie in $D(u)$ is obtained by integrating the probability density (given by Equation (1)) over $D(u)$ .

[TABLE]

Since $r(u)\geq R/2$ and $r\in[r(u),R]$ we have $\Theta(e^{-3/2r(u)})-\Theta(e^{-3/2r})=\pm\mathcal{O}(e^{-3/4R})$ and $(1+\Theta(e^{-\alpha R}-e^{-2\alpha r}))=(1+\Theta(e^{-\alpha R}))$ . Due to the linearity of integration, constant factors within the integrand can be moved out of the integral, which yields

[TABLE]

The remaining integrals can be computed easily and we obtain

[TABLE]

As $e^{-R/2}=\Theta(n^{-1})$ and $e^{-3/4R}=\Theta(n^{-3/2})$ , simplifying the error terms yields the claim.

The following lemma shows that, with high probability, all vertices that are not too close to the boundary of the disk dominate at least one vertex.

Lemma 3.6.

Let $G$ be a hyperbolic random graph with average degree $\bar{\kappa}$ . Then there is a constant $c>4/\bar{\kappa}$ , such that all vertices $u$ with $r(u)\leq\rho=R-2\log\log(n^{c})$ are dominant, with high probability.

Proof 3.7.

Vertex $u$ is dominant if at least one vertex lies in $D(u)$ . To show this for any $u$ with $r(u)\leq\rho$ , it suffices to show it for $r(u)=\rho$ , since $D(u)$ increases with decreasing radius. To determine the probability that at least one vertex lies in $D(u)$ , we use Lemma 3.4 and obtain

[TABLE]

By substituting $R=2\log(8n/(\pi\bar{\kappa}))$ , we obtain $\mu(D(u))=\bar{\kappa}/(4n)(c\log(n)(1-o(1))\pm\mathcal{O}(1))$ . The probability of at least one node falling into $D(u)$ is now given by

[TABLE]

Consequently, for large enough $n$ we can choose $c>4/\bar{\kappa}$ such that the probability of a vertex at radius $\rho$ being dominant is at least $1-\Theta(n^{-2})$ , allowing us to apply union bound.

Corollary 3.8.

Let $G$ be a hyperbolic random graph and $c>4/\bar{\kappa}$ . With high probability, all vertices with radius at most $\rho=R-2\log\log(n^{c})$ are removed by the dominance rule.

By Corollary 3.8 the dominance rule removes all vertices of radius at most $\rho$ . Consequently, all remaining vertices have radius at least $\rho$ . We refer to this part of the disk as outer band. More precisely, the outer band is defined as $B_{O}(R)\setminus B_{O}(\rho)$ . It remains to show that the pathwidth of the subgraph induced by the vertices in the outer band is small.

3.2 Pathwidth in the Outer Band

In the following, we use $G_{r}=G[\{v\in V\}\mid r(v)\geq r]$ to denote the induced subgraph of $G$ that contains all vertices with radius at least $r$ . To show that the pathwidth of $G_{\rho}$ (the induced subgraph in the outer band) is small, we first show that there is a circular arc supergraph $G_{\rho}^{S}$ of $G_{\rho}$ with a small maximum clique. We use $G^{S}$ to denote a circular arc supergraph of a hyperbolic random graph $G$ , which is obtained by assigning each vertex $v$ an angular interval $I_{v}$ on the circle, such that the intervals of two adjacent vertices intersect. More precisely, for a vertex $v$ , we set $I_{v}=[\varphi(v)-\theta(r(v),r(v)),\varphi(v)+\theta(r(v),r(v))]$ . Intuitively, this means that the interval of a vertex contains a superset of all its neighbors that have a larger radius, as can be seen in Figure 4 left. The following lemma shows that $G^{S}$ is actually a supergraph of $G$ .

Lemma 3.9.

Let $G=(V,E)$ be a hyperbolic random graph. Then $G^{S}$ is a supergraph of $G$ .

Proof 3.10.

Let $\{u,v\}\in E$ be any edge in $G$ . To show that $G^{S}$ is a supergraph of $G$ we need to show that $u$ and $v$ are also adjacent in $G^{S}$ , i.e., $I_{u}\cap I_{v}\neq\emptyset$ . Without loss of generality assume $r(u)\leq r(v)$ . Since $u$ and $v$ are adjacent in $G$ , the hyperbolic distance between them is at most $R$ . It follows, that their angular distance $\Delta_{\varphi}(u,v)$ is bounded by $\theta(r(u),r(v))$ . Since $\theta(r(u),r(v))\leq\theta(r(u),r(u))$ for $r(u)\leq r(v)$ , we have $\Delta_{\varphi}(u,v)\leq\theta(r(u),r(u))$ . As $I_{u}$ extends by $\theta(r(u),r(u))$ from $\varphi(u)$ in both directions, it follows that $\varphi(v)\in I_{u}$ .

It is easy to see that, after removing a vertex from $G$ and $G^{S}$ , $G^{S}$ is still a supergraph of $G$ . Consequently, $G_{\rho}^{S}$ is a supergraph of $G_{\rho}$ . It remains to show that $G_{\rho}^{S}$ has a small maximum clique number, which is given by the maximum number of arcs that intersect at any angle. To this end, we first compute the number of arcs that intersect a given angle which we set to [math] without loss of generality. Let $A_{r}$ denote the area of the disk containing all vertices $v$ with radius $r(v)\geq r$ whose interval $I_{v}$ intersects [math], as illustrated in Figure 4 right. The following lemma describes the probability for a given vertex to lie in $A_{r}$ .

Lemma 3.11.

Let $G$ be a hyperbolic random graph and let $r\geq R/2$ . The probability for a given vertex to lie in $A_{r}$ is bounded by

[TABLE]

Proof 3.12.

We obtain the measure of $A_{r}$ by integrating the probability density function over $A_{r}$ . Due to the definition of $I_{v}$ we can conclude that $A_{r}$ includes all vertices $v$ with radius $r(v)\geq r$ whose angular distance to [math] is at most $\theta(r(v),r(v))$ , defined in Equation (2). We obtain,

[TABLE]

As before, we can conclude that $(1+\Theta(e^{-\alpha R}-e^{-2\alpha r}))=(1+\Theta(e^{-\alpha R}))$ , since $r\geq R/2$ . By moving constant factors out of the integral, the expression can be simplified to

[TABLE]

We split the sum in the integral and deal with the two resulting integrals separately.

[TABLE]

By placing $1/(1-\alpha)e^{-(1-\alpha)r}$ outside of the brackets we obtain

[TABLE]

Simplifying the remaining error terms then yields the claim.

We can now bound the maximum clique number in $G_{\rho}^{S}$ and thus its interval width $\operatorname{iw}(G_{\rho}^{S})$ .

Theorem 3.13.

Let $G$ be a hyperbolic random graph and $r\geq R/2$ . Then there exists a constant $c$ such that, whp., $\operatorname{iw}(G_{r}^{S})=\mathcal{O}(\log(n))$ if $r\geq R-\frac{1}{(1-\alpha)}\log\log(n^{c})$ , and otherwise

[TABLE]

Proof 3.14.

We start by determining the expected number of arcs that intersect at a given angle, which can be done by computing the expected number of vertices in $A_{r}$ , using Lemma 3.11:

[TABLE]

It remains to show that this bound holds with high probability at every angle. To this end, we make use of a Chernoff bound (Theorem 2.1), by first showing that the bound on $\mathbb{E}[|\{v\in A_{r}\}|]$ is $\Omega(\log(n))$ . We start with the case where $r<R-\frac{1}{1-\alpha}\log\log(n^{c})$ .

[TABLE]

Substituting $R=2\log(8n/(\pi\bar{\kappa}))$ we obtain

[TABLE]

Thus, for all radii smaller than $R-\frac{1}{(1-\alpha)}\log\log(n^{c})$ , the resulting upper bound is lower bounded by $\Omega(\log(n))$ , which lets us apply Theorem 2.1. Moreover, as $\mathbb{E}[|\{v\in A_{r}\}|]$ decreases with increasing $r$ , $\mathcal{O}(\log(n))$ is a pessimistic but valid upper bound for the case $r\geq R-\frac{1}{(1-\alpha)}\log\log(n^{c})$ . Thus, we can also apply Theorem 2.1 to this case, using the $\mathcal{O}(\log(n))$ bound.

By Theorem 2.1, we can choose $c$ such that in both cases the bound holds with probability $1-\mathcal{O}(n^{-c^{\prime}})$ for any $c^{\prime}$ at a given angle. In order to see that it holds at every angle, note that it suffices to show that it holds at all arc endings as the number of intersecting arcs does not change in between arc endings. Since there are exactly $2n$ arc endings, we can apply union bound and obtain that the bound holds with probability $1-\mathcal{O}(n^{-c^{\prime}+1})$ for any $c^{\prime}$ at every angle. Since our bound on $\mathbb{E}[|\{v\in A_{r}\}|]$ is an upper bound on the maximum clique size of $G_{r}^{S}$ , the interval width of $G_{r}^{S}$ is at most twice as large, as argued in Section 2.

Since the interval width of a circular arc supergraph of $G$ is an upper bound on the pathwidth of $G$ [8, Theorem 7.14] and since $\rho\geq R-1/(1-\alpha)\log\log(n^{c})$ for $\alpha\in(1/2,1)$ , we immediately obtain the following corollary.

Corollary 3.15.

Let G be a hyperbolic random graph and let $G_{\rho}$ be the subgraph obtained by removing all vertices with radius at most $\rho=R-2\log\log(n^{c})$ . Then, $\operatorname{pw}(G_{\rho})=\mathcal{O}(\log(n))$ .

We are now ready to prove our main theorem, which we restate for the sake of readability.

Theorem 3.1.

Let $G$ be a hyperbolic random graph on $n$ vertices. Then the VertexCover problem in $G$ can be solved in $\operatorname{poly}(n)$ time, with high probability.

Proof 3.2.

Consider the following algorithm that finds the minimum vertex cover of $G$ . We start with an empty vertex cover $S$ . Initially, all dominant vertices are added to $S$ , which is correct due to the dominance rule. By Lemma 3.6, this includes all vertices of radius at most $\rho=R-2\log\log(n^{c})$ , for some constant $c$ , with high probability. Obviously, finding all vertices that are dominant can be done in $\operatorname{poly}(n)$ time. It remains to determine a vertex cover of $G_{\rho}$ . By Corollary 3.15, the pathwidth of $G_{\rho}$ is $\mathcal{O}(\log(n))$ , with high probability. Since the pathwidth is an upper bound on the treewidth, we can find a tree decomposition of $G_{\rho}$ and solve the VertexCover problem in $G_{\rho}$ in $\operatorname{poly}(n)$ time [8, Theorems 7.18 and 7.9].

Moreover, linking the radius of a vertex in Theorem 3.13 with its expected degree leads to the following corollary, which is interesting in its own right. It links the pathwidth to the degree $d$ in the graph $G_{\leq d}$ . Recall that $G_{\leq d}$ denotes the subgraph of $G$ induced by the vertices of degree at most $d$ .

Corollary 3.16.

Let $G$ be a hyperbolic random graph and let $d\leq\sqrt{n}$ . Then, with high probability, $\operatorname{pw}(G_{\leq d})=\mathcal{O}(d^{2-2\alpha}+\log(n))$ .

Proof 3.17.

Consider the radius $r=R-2\log(\varepsilon d)$ for some constant $\varepsilon>0$ , and the graph $G_{r}$ which is obtained by removing all vertices of radius at most $r$ . By substituting $R=2\log(8n/(\pi\bar{\kappa}))$ and using [14, Lemma 3.2] we can compute the expected degree of a vertex with radius $r$ as

[TABLE]

First assume that $d\geq\log(n)^{1/(2-2\alpha)}$ . We handle the other case later. Since $d\in\Omega(\log(n))$ we can choose $\varepsilon$ large enough to apply Theorem 2.1 and conclude that this holds with high probability. Furthermore, since a smaller radius implies a larger degree, we know that, with high probability, all nodes $v$ with radius at most $r$ , have

[TABLE]

For large enough $n$ we can choose $\varepsilon$ such that, with high probability, $G_{r}$ is a supergraph of $G_{\leq d}$ . To prove the claim, it remains to bound the pathwidth of $G_{r}$ . If $r>R-1/(1-\alpha)\log\log(n^{c})$ , we can apply the first part of Theorem 3.13 to obtain $\operatorname{iw}(G_{r}^{S})=\mathcal{O}(\log(n))$ . Otherwise, we use part two to conclude that the interval width of $G_{r}$ is at most

[TABLE]

As argued in Section 2 the interval width of a graph is an upper bound on the pathwidth.

For $d<\log(n)^{1/(2-2\alpha)}$ (which we excluded above), consider $G_{\leq d^{\prime}}$ for $d^{\prime}=\log(n)^{1/(2-2\alpha)}>d$ . As we already proved the corollary for $d^{\prime}$ , we obtain $\operatorname{pw}(G_{\leq d^{\prime}})=\mathcal{O}(d^{\prime 2-2\alpha}+\log(n))=\mathcal{O}(\log(n))$ . As $G_{\leq d}$ is a subgraph of $G_{\leq d^{\prime}}$ , the same bound holds for $G_{\leq d}$ .

4 Discussion

Our results show that a heterogeneous degree distribution as well as high clustering make the dominance rule very effective. This matches the behavior for real-world networks, which typically exhibit these two properties. However, our analysis actually makes more specific predictions: (I) vertices with sufficiently high degree usually have at least one neighbor they dominate and can thus safely be included in the vertex cover; and (II) the graph remaining after deleting the high degree vertices has simple structure, i.e., small pathwidth.

To see whether this matches the real world, we run experiments on $59$ networks from several network datasets [2, 3, 18, 19, 20]. Although the focus of this paper is the theoretical analysis on hyperbolic random graphs, we briefly report on our experimental results; see Table LABEL:tab:data in Appendix 5. Out of the $59$ instances, we can solve VertexCover for $47$ networks in reasonable time. We refer to these as easy, while the remaining $12$ are called hard. Note that our theoretical analysis aims at explaining why the easy instances are easy.

Recall from Lemma 3.6 that all vertices with radius at most $R-2\log\log(n^{4/\bar{\kappa}})$ probably dominate, which corresponds to an expected degree of $\alpha/(\alpha-1/2)\cdot\log n$ . For more than half of the $59$ networks, more than $78\text{\,}\mathrm{\char 37\relax}$ of the vertices above this degree were in fact dominant. For more than a quarter of the networks, more than $96\text{\,}\mathrm{\char 37\relax}$ were dominant. Restricted to the $47$ easy instances, these number increase to $82\text{\,}\mathrm{\char 37\relax}$ and $99\text{\,}\mathrm{\char 37\relax}$ , respectively.

Experiments concerning the pathwidth of the resulting graph are much more difficult, due to the lack of efficient tools. Therefore, we used the tool by Tamaki et al. [21] to heuristically compute upper bounds on the treewidth instead. As in our analysis, we only removed vertices that dominate in the original graph instead of applying the reduction rule exhaustively. On the resulting subgraphs, the treewidth heuristic ran with a $15\text{\,}\mathrm{m}\mathrm{i}\mathrm{n}$ timeout. The resulting treewidth is at most $50$ for $44\text{\,}\mathrm{\char 37\relax}$ of the networks, at most $15$ for $34\text{\,}\mathrm{\char 37\relax}$ , and at most $5$ for $25\text{\,}\mathrm{\char 37\relax}$ . Restricted to easy instances, the values increase to $55\text{\,}\mathrm{\char 37\relax}$ , $43\text{\,}\mathrm{\char 37\relax}$ , and $32\text{\,}\mathrm{\char 37\relax}$ , respectively.

Hyperbolic random graphs are of course an idealized representation of real-world networks. However, these experiments indicate that the predictions derived from the model match the real world, at least for a significant fraction of networks.

Approximation.

Concerning approximation algorithms for VertexCover, there is a similar theory-practice gap as for exact solutions. In theory, there is a simple 2-approximation and the best known polynomial time approximation reduces the factor to $2-\Theta(\log(n)^{-1/2})$ [15]. However, it is NP-hard to approximate VertexCover within a factor of $1.3606$ [10], and presumably it is even NP-hard to approximate within a factor of $2-\varepsilon$ for all $\varepsilon>0$ [16]. Moreover, the greedy strategy that iteratively adds the vertex with maximum degree to the vertex cover and deletes it, is only a $\log n$ approximation. However, on scale-free networks this strategy performs exceptionally well with approximation ratios very close to 1 [9].

Our results for hyperbolic random graphs at least partially explain this good approximation ratio. Lemma 3.6 states that, with high probability, we do not make any mistake by taking all vertices below a certain radius $\rho$ , which corresponds to vertices of at least logarithmic degree. The same computation for larger values of $\rho$ does no longer give such strong guarantees. However, it still gives bounds on the probability for making a mistake. In fact, this error probability is sub-constant as long as the corresponding expected degree is super-constant.

Although this is not a formal argument, it still explains to a degree why greedy works so well on networks with a heterogeneous degree distribution and high clustering. Moreover, it indicates how the greedy algorithm should be adapted to obtain better approximation ratios: As the probability to make a mistake grows with growing radius and thus with shrinking vertex degree, the majority of mistakes are done when all vertices have already low degree. However, for hyperbolic random graphs, the subgraphs induced by vertices below a certain constant degree decompose into small components for $n\to\infty$ . It thus seems to be a good idea to run the greedy algorithm only until all remaining vertices have low degree, say $k$ . The remaining small connected components of maximum-degree $k$ can then be solved with brute force in reasonable time. In the following we call the resulting algorithm $k$ -adaptive greedy.

We ran experiments on the $47$ easy real networks mentioned above (for the hard instances, we cannot measure approximation ratios). For these networks, we compare the normal greedy algorithm with 2- and 4-adaptive greedy. Note that 2-adaptive greedy is special, as VertexCover can be solved efficiently on graphs with maximum degree 2 (no brute-forcing is necessary). For 4-adaptive greedy, the size of the largest connected component is relevant.

The median approximation ratio for greedy over all $47$ networks is $1.008$ . This goes down to $1.005$ for 2-adaptive and to $1.002$ for 4-adaptive greedy. Thus, the number of too many selected vertices goes down by a factor of $1.6$ and $4$ , respectively. As mentioned above, the size of the largest connected component is relevant for 4-adaptive greedy. For $49\text{\,}\mathrm{\char 37\relax}$ of the networks, this was below $100$ (which is still a reasonable size for a brute-force algorithm). Restricted to these networks, normal greedy has a median approximation ratio of $1.004$ , while 4-adaptive again improves by a factor of 4 to $1.001$ . Moreover, the number of networks for which we actually obtain the optimal solution increases from $4$ to $7$ .

5 Experimental Data

Table LABEL:tab:data (continuing on the next page) shows the raw data of our experiments for which we reported aggregate values in the discussion in Section 4. The percentage of dominant vertices among those with high degree (over $\alpha/(\alpha-1/2)\cdot\log n$ ) is rounded to whole percentages. The approximation ratios are rounded to three decimal digits. Treewidth $-1$ indicates that remaining graph after removing all dominant vertices contained no edge.

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Takuya Akiba and Yoichi Iwata. Branch-and-reduce exponential/FPT algorithms in practice: A case study of vertex cover. Theor. Comput. Sci. , 609:211 – 225, 2016. doi:10.1016/j.tcs.2015.09.023 . · doi ↗
2[2] Alexandre Arenas, Albert-László Barabási, Vladimir Batagelj, Andrej Mrvar, Mark Newman, and Tore Opsahl. Gephi datasets. https://github.com/gephi/gephi/wiki/Datasets .
3[3] Vladimir Batagelj and Andrej Mrvar. Pajek datasets. http://vlado.fmf.uni-lj.si/pub/networks/data/ , 2006.
4[4] Thomas Bläsius, Tobias Friedrich, and Anton Krohmer. Hyperbolic Random Graphs: Separators and Treewidth. In 24th Annual European Symposium on Algorithms (ESA 2016) , pages 15:1 – 15:16, 2016. doi:10.4230/LIP Ics.ESA.2016.15 . · doi ↗
5[5] Marián Boguná, Fragkiskos Papadopoulos, and Dmitri Krioukov. Sustaining the internet with hyperbolic mapping. Nat. Commun. , 1:62, 2010. doi:10.1038/ncomms 1063 . · doi ↗
6[6] Liming Cai and David Juedes. On the existence of subexponential parameterized algorithms. J. Comput. Syst. Sci. , 67:789 – 807, 2003. doi:10.1016/S 0022-0000(03)00074-6 . · doi ↗
7[7] Jianer Chen, Iyad A. Kanj, and Ge Xia. Improved upper bounds for vertex cover. Theor. Comput. Sci. , 411(40):3736 – 3756, 2010. doi:10.1016/j.tcs.2010.06.026 . · doi ↗
8[8] Marek Cygan, Fedor V. Fomin, Łukasz Kowalik, Daniel Lokshtanov, Dániel Marx, Marcin Pilipczuk, Michał Pilipczuk, and Saket Saurabh. Parameterized Algorithms . Springer, 2015.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Solving Vertex Cover in Polynomial Time on Hyperbolic Random

Abstract

keywords:

1 Introduction

2 Preliminaries

The Hyperbolic Plane.

Hyperbolic Random Graphs.

Interval Graphs and Circular Arc Graphs.

Treewidth and Pathwidth.

Probabilities.

Theorem 2.1** (Chernoff Bound [11, A.1]).**

3 Vertex Cover on Hyperbolic Random Graphs

Theorem 3.1**.**

3.1 Dominance on Hyperbolic Random Graphs

Lemma 3.2**.**

Proof 3.3**.**

Lemma 3.4**.**

Proof 3.5**.**

Lemma 3.6**.**

Proof 3.7**.**

Corollary 3.8**.**

3.2 Pathwidth in the Outer Band

Lemma 3.9**.**

Proof 3.10**.**

Lemma 3.11**.**

Proof 3.12**.**

Theorem 3.13**.**

Proof 3.14**.**

Corollary 3.15**.**

Theorem 3.1**.**

Proof 3.2**.**

Corollary 3.16**.**

Proof 3.17**.**

4 Discussion

Approximation.

5 Experimental Data

Theorem 2.1 (Chernoff Bound [11, A.1]).

Theorem 3.1.

Lemma 3.2.

Proof 3.3.

Lemma 3.4.

Proof 3.5.

Lemma 3.6.

Proof 3.7.

Corollary 3.8.

Lemma 3.9.

Proof 3.10.

Lemma 3.11.

Proof 3.12.

Theorem 3.13.

Proof 3.14.

Corollary 3.15.

Theorem 3.1.

Proof 3.2.

Corollary 3.16.

Proof 3.17.