Distance-preserving graph contractions

Aaron Bernstein; Karl D\"aubel; Yann Disser; Max Klimm; Torsten; M\"utze; Frieder Smolny

arXiv:1705.04544·cs.DS·February 14, 2019

Distance-preserving graph contractions

Aaron Bernstein, Karl D\"aubel, Yann Disser, Max Klimm, Torsten, M\"utze, Frieder Smolny

PDF

TL;DR

This paper introduces a new framework for graph contraction that preserves pairwise distances within a specified tolerance, providing algorithms for trees and complexity results for other graph classes.

Contribution

It formalizes the graph contraction problem with distance preservation, analyzes its complexity, and offers algorithms for specific graph classes and approximate solutions.

Findings

01

Polynomial-time algorithms for trees

02

Hardness results for certain graph classes

03

Efficient algorithms for approximate contractions

Abstract

Compression and sparsification algorithms are frequently applied in a preprocessing step before analyzing or optimizing large networks/graphs. In this paper we propose and study a new framework contracting edges of a graph (merging vertices into super-vertices) with the goal of preserving pairwise distances as accurately as possible. Formally, given an edge-weighted graph, the contraction should guarantee that for any two vertices at distance $d$ , the corresponding super-vertices remain at distance at least $φ (d)$ in the contracted graph, where $φ$ is a tolerance function bounding the permitted distance distortion. We present a comprehensive picture of the algorithmic complexity of the contraction problem for affine tolerance functions $φ (x) = x / α - β$ , where $α \geq 1$ and $β \geq 0$ are arbitrary real-valued parameters. Specifically, we present…

Tables2

Table 1. Table 1. Overview of algorithmic and hardness results presented in this paper.

Problem	Graph classes
	Path	Tree	Cycle	General
Contraction
addit. ( $α = 1$ ), unit lg.		$𝒪 (n)$ [Th. 4]		$m^{\frac{1}{2} - ε}$ -inapx.⁶⁶6even for bipartite graphs and $β = 1$ [Th. 10]

affine ( $α, β$ ), unit lg.	$𝒪 (n)$ [Th. 2]		$𝒪 (n)$ [Th. 3]

addit. ( $α = 1$ )			NP-hard [Th. 7]	$n^{1 - ε}$ -inapx. [Th. 9]
affine ( $α, β$ )	$𝒪 (n^{3})$ [Th. 5]
Weak Contraction
additive ( $α = 1$ )			NP-hard⁷⁷7also NP-hard for planar graphs with arb. large girth, $(α, β) = (2, 0)$ , and unit lg. ( $ℓ = 1$ ) [Th. 11]. [Th. 7]

affine ( $α, β$ )	$𝒪 (n^{5})$ [Th. 6]			$n^{1 - ε}$ -inapx.⁸⁸8even if $(α, β) = (3 / 2, 0)$ . [Th. 12]

Table 2. Table 2. Overview of asympotic bounds presented in this paper.

Contraction with unit lg. ( $ℓ = 1$ )	# of edges in $G / C$	Time	Reference
$(α, β) = (2 k - 1, 1)$	$n^{1 + 1 / k}$	$𝒪 (m)$	[Th. 13]

$(α, β) = (2 \log_{2} n - 1, 1)$	$2 n$	$𝒪 (m)$	[Cor. 14]

$(α, β) = (k - 1, 1)$	$Ω (n^{1 + 1 / k})$	—	[Th. 15]

$(α, β) = (1, k)$	$m - k m / (2 n)$	$𝒪 (m)$	[Th. 16 (i)]

$(α, β) = (1, k)$	$𝒪 (n^{2} / k)$	$𝒪 (m)$	[Th. 16 (ii)]

$(α, β) = (1, 𝒪 (1))$	$Ω (n^{4 / 3 - o (1)})$	—	[AB16]
Contraction with unit lg. ( $ℓ = 1$ )
and min. degree $D$	# of vertices in $G / C$	Time	Reference
$(α, β) = (5, 1)$	$n / D$	$𝒪 (m)$	[Th. 17]

$(α, β) = (k, 1)$	$n / ((k + 1) D)$	—	[Th. 18]

Equations126

dist_{ℓ_{C}} (u, v) \geq φ (dist_{ℓ} (u, v))

dist_{ℓ_{C}} (u, v) \geq φ (dist_{ℓ} (u, v))

Φ (C) := ∣ C ∣ + Δ (C) = m (G) - m (G / C)

Φ (C) := ∣ C ∣ + Δ (C) = m (G) - m (G / C)

∣ E (P^{'}) \cap C ∣ \leq (1 - 1/ α) ∣ E (P^{'}) ∣ + β .

∣ E (P^{'}) \cap C ∣ \leq (1 - 1/ α) ∣ E (P^{'}) ∣ + β .

∣ E (P_{i, j}) \cap C ∣

∣ E (P_{i, j}) \cap C ∣

= ⌊(1 - 1/ α) j + β ⌋ - ⌊(1 - 1/ α) (i - 1) + β ⌋

\leq (1 - 1/ α) j + β - ((1 - 1/ α) (i - 1) + β - 1)

= (1 - 1/ α) (j - i + 1) + 1

\leq (1 - 1/ α) ∣ E (P_{i, j}) ∣ + β,

∣ E (P) \cap C ∣ \leq ⌊ d - min {d, n - d} / α + β ⌋ .

∣ E (P) \cap C ∣ \leq ⌊ d - min {d, n - d} / α + β ⌋ .

λ^{'}

λ^{'}

λ

∣ C ∣ = \frac{1}{d} i = 1 \sum n ∣ E (P_{i}) \cap C ∣ \leq \ e q re f e q : cy c l e - co n d i = 1 \sum n \frac{⌊ d - min { d , n - d } / α + β ⌋}{d} = \ e q re f e q : u ni f or m - so l u t i o n λn .

∣ C ∣ = \frac{1}{d} i = 1 \sum n ∣ E (P_{i}) \cap C ∣ \leq \ e q re f e q : cy c l e - co n d i = 1 \sum n \frac{⌊ d - min { d , n - d } / α + β ⌋}{d} = \ e q re f e q : u ni f or m - so l u t i o n λn .

⌊ x ⌋ + ⌊ y ⌋

⌊ x ⌋ + ⌊ y ⌋

⌊ x ⌋ - ⌊ y ⌋

∣ E (P) \cap C ∣ = i = k \sum k + d - 1 (⌊ λi ⌋ - ⌊ λ (i - 1)⌋) = ⌊ λ (k + d - 1)⌋ - ⌊ λ (k - 1)⌋ \leq \ e q re f e q : f l oor - d i f f ⌈ λ d ⌉ .

∣ E (P) \cap C ∣ = i = k \sum k + d - 1 (⌊ λi ⌋ - ⌊ λ (i - 1)⌋) = ⌊ λ (k + d - 1)⌋ - ⌊ λ (k - 1)⌋ \leq \ e q re f e q : f l oor - d i f f ⌈ λ d ⌉ .

∣ E (P) \cap C ∣

∣ E (P) \cap C ∣

= ⌊ λn ⌋ - ⌊ λ (k - 1)⌋ + ⌊ λ (d - n + k - 1)⌋

\leq \ e q re f e q : f l oor - s u m ⌊ λ (d + k - 1)⌋ - ⌊ λ (k - 1)⌋ \leq \ e q re f e q : f l oor - d i f f ⌈ λ d ⌉ .

load_{C, α} (u, v) := dist_{ℓ} (u, v) / α - dist_{ℓ_{C}} (u, v)

load_{C, α} (u, v) := dist_{ℓ} (u, v) / α - dist_{ℓ_{C}} (u, v)

load_{C, α} (T, v) := max {load_{C, α} (u, v) : u \in V (T)} .

load_{C \cup {{v, u_{i}}}, α} (T_{v, i}, v)

load_{C \cup {{v, u_{i}}}, α} (T_{v, i}, v)

load_{C, α} (T_{v, i}, v)

load_{C, α} (T_{v, i}^{+}, v) = max {load_{C, α} (T_{v, i - 1}^{+}, v), load_{C, α} (T_{v, i}, v)} .

load_{C, α} (T_{v, i}^{+}, v) = max {load_{C, α} (T_{v, i - 1}^{+}, v), load_{C, α} (T_{v, i}, v)} .

L (v, i, s) := min {load_{C, α} (T_{v, i}, v) : C is a feasible solution of (T_{v, i}, ℓ, φ) of size ∣ C ∣ = s} .

L (v, i, s) := min {load_{C, α} (T_{v, i}, v) : C is a feasible solution of (T_{v, i}, ℓ, φ) of size ∣ C ∣ = s} .

L (v, i, 0)

L (v, i, 0)

L (v, 0, s)

L (v, i, s)

L^{+} (v, i, s)

L^{+} (v, i, s)

\displaystyle\hskip 85.35826ptL^{+}(v,i-1,t)+L(v,i,s-t)\leq\beta\big{\}},

wload_{C, α} (T, v) := max {load_{C, α} (u, v) : u \in V (T) and dist_{ℓ_{C}} (u, v) > 0} .

wload_{C, α} (T, v) := max {load_{C, α} (u, v) : u \in V (T) and dist_{ℓ_{C}} (u, v) > 0} .

load_{C, α} (T_{1}, v) + wload_{C, α} (T_{2}, v) \leq β and wload_{C, α} (T_{1}, v) + load_{C, α} (T_{2}, v) \leq β .

load_{C, α} (T_{1}, v) + wload_{C, α} (T_{2}, v) \leq β and wload_{C, α} (T_{1}, v) + load_{C, α} (T_{2}, v) \leq β .

λ^{*} (T_{v}, v, s)

λ^{*} (T_{v}, v, s)

Λ^{*} (T_{v}, v, s)

wload_{C \cup {{v, u_{i}}}, α} (T_{v, i}, v)

wload_{C \cup {{v, u_{i}}}, α} (T_{v, i}, v)

wload_{C, α} (T_{v, i}, v)

wload_{C, α} (T_{v, i}^{+}, v) = max {wload_{C, α} (T_{v, i - 1}^{+}, v), wload_{C, α} (T_{v, i}, v)} .

wload_{C, α} (T_{v, i}^{+}, v) = max {wload_{C, α} (T_{v, i - 1}^{+}, v), wload_{C, α} (T_{v, i}, v)} .

W (v, i, s, λ)

W (v, i, s, λ)

L (v, i, s, λ)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Distance-preserving graph contractions

***An extended abstract of this work has appeared in the Proceedings of the 9th Innovations in Theoretical Computer Science Conference (ITCS) 2018 [BDD*+*18].

Aaron Bernstein1†††E-Mail: [email protected], Karl Däubel1‡‡‡E-Mail: {daeubel,muetze,smolny}@math.tu-berlin.de, Yann Disser2§§§Supported by the ‘Excellence Initiative’ of the German Federal and State Governments and the Graduate School CE at TU Darmstadt. E-Mail: [email protected],

Max Klimm3¶¶¶E-Mail: [email protected], Torsten Mütze1‡ and Frieder Smolny1‡

1Institut für Mathematik, TU Berlin

2Department of Mathematics, Graduate School CE, TU Darmstadt

3Wirtschaftswissenschaftliche Fakultät, HU Berlin

Abstract.

Compression and sparsification algorithms are frequently applied in a preprocessing step before analyzing or optimizing large networks/graphs. In this paper we propose and study a new framework contracting edges of a graph (merging vertices into super-vertices) with the goal of preserving pairwise distances as accurately as possible. Formally, given an edge-weighted graph, the contraction should guarantee that for any two vertices at distance $d$ , the corresponding super-vertices remain at distance at least $\varphi(d)$ in the contracted graph, where $\varphi$ is a tolerance function bounding the permitted distance distortion. We present a comprehensive picture of the algorithmic complexity of the contraction problem for affine tolerance functions $\varphi(x)=x/\alpha-\beta$ , where $\alpha\geq 1$ and $\beta\geq 0$ are arbitrary real-valued parameters. Specifically, we present polynomial-time algorithms for trees as well as hardness and inapproximability results for different graph classes, precisely separating easy and hard cases. Further we analyze the asymptotic behavior of contractions, and find efficient algorithms to compute (non-optimal) contractions despite our hardness results.

1. Introduction

When dealing with large networks, it is often beneficial to compress or sparsify the data to manageable size before analyzing or optimizing the network directly. To be useful, a meaningful compression should represent salient features of the original network with good approximation, while being much smaller in size. In this paper, we focus on a compression of undirected edge-weighted graphs that approximately maintains all distances between vertices in the graph.

In this context, an extensively studied concept are spanners (e.g. [PS89, ADD*+*93, BKMP05, AB16]). Given an undirected graph $G=(V,E)$ and real numbers $\alpha\geq 1$ and $\beta\geq 0$ , a subgraph $H=(V,E^{\prime})$ , $E^{\prime}\subseteq E$ , is an $(\alpha,\beta)$ -spanner of $G$ if $\operatorname{dist}_{H}(u,v)\leq\alpha\cdot\operatorname{dist}_{G}(u,v)+\beta$ holds for all $u,v\in V$ . While the number of edges in a spanner may be much smaller than that of the original graph, the number of vertices is the same for both, leaving further potential for compression untapped. For illustration, consider the road network of Europe with about 50 million vertices [BMSW13], any spanner of which must again have about 50 million vertices and edges. However, to approximately represent distances in Europe’s road network one may also merge nearby vertices into super-vertices, thus achieving a much better compression of the network. This is akin to the visual process of zooming out of a graphical representation of the map, where neighbored vertices fade into each other and edges between merged vertices vanish. At a large enough zoom level, the entire network merges into a single vertex.

In this paper we propose and study a new framework for contracting networks that formalizes this intuitive idea and makes it applicable to general graphs. Specifically, we study a contraction problem on graphs where a subset of edges $C\subseteq E$ is contracted. We denote by $G/C$ the resulting simple graph obtained from $G$ by contracting the edges in $C$ and by deleting resulting loops and multiple edges, keeping only the minimum length edge between any two vertices. For any two vertices in $G$ , we compare their distance in $G$ with the distance of the corresponding super-vertices in $G/C$ .

It is interesting to contrast this concept with graph spanners. When constructing a spanner, the length of the removed edges is implicitly set to $\infty$ , resulting in an overall increase of distances. On the other hand, a contraction implicitly sets the length of the contracted edges to zero, leading to an overall decrease of distances. For both problems, the ultimate goal is to reduce the complexity of the network while maintaining an approximation guarantee on the distances.

The following example shows that contractions may be better suited than spanners to achieve this goal. In a subgraph with small radius, a spanner can at best result in a spanning tree of the same order, while a contraction can reduce the whole subgraph to a single vertex, while entailing a multiplicative distance distortion of similar magnitude. In addition, the contraction may also merge many edges entering the contracted subgraph. Clearly, the objective here is to maximize the total number of contracted and deleted edges, as this minimizes the memory required to represent the resulting network in a computer (using e.g. adjacency lists).

Given the results presented in this paper and the known results for spanners (discussed in detail below), we further believe that the combination of spanners and contractions is very powerful, promising and flexible. As the former only increases and the latter only decreases the distances, the respective distortion guarantees provably also hold for the overall distortion. In fact, both effects may even compensate each other. This is true regardless of the order in which both compression operations are applied, even when they are applied repeatedly.

In order to measure the distance distortion of the contraction, we assume a non-decreasing tolerance function $\varphi\colon\mathds{R}\to\mathds{R}$ , similar to the corresponding function for spanners, see e.g. [BKMP05]. We are interested in computing contractions that preserve distances in the following sense: For any two vertices $u$ and $v$ at distance $d$ in $G$ , the distance of the corresponding vertices in the contracted graph $G/C$ must be at least $\varphi(d)$ . If this condition is satisfied, we call $C$ a $\varphi$ -distance preserving contraction, or $\varphi$ -contraction for short. Formally, the algorithmic problem Contraction considered in this paper is to compute for a given graph $G=(V,E)$ with edge lengths $\ell\colon E\to\mathds{R}_{>0}$ and a given tolerance function $\varphi$ , a $\varphi$ -contraction $C\subseteq E$ such that the number of contracted and deleted edges is maximized. We are specifically interested in the case where the tolerance function $\varphi$ is an affine function $\varphi(x)=x/\alpha-\beta$ for real-valued parameters $\alpha\geq 1$ and $\beta\geq 0$ . We then simply write $(\alpha,\beta)$ -contraction instead of $\varphi$ -contraction. See Figure 1 for some example instances of the problem Contraction.

When considering the case of a purely multiplicative error ( $\beta=0$ ), a slight subtlety has to be taken into account. Specifically, for a graph with positive edge lengths it is not feasible to contract a single edge. Therefore, we propose a slight modification of our original model: We say that a set $C\subseteq E$ of edges of $G$ is a weak $\varphi$ -distance preserving contraction, or weak $\varphi$ -contraction for short, if it does not contract the entire graph and, for any two vertices $u$ and $v$ at distance $d$ in $G$ , the distance of the corresponding vertices in $G/C$ is either zero or at least $\varphi(d)$ . We will refer to the corresponding algorithmic problem as Weak Contraction. Put differently, in a weak contraction, the distances between different super-vertices satisfy the given distortion guarantee, but for vertices belonging to the same super-vertex, no guarantee is given.

1.1. Our results

In this paper, we present a comprehensive picture of the algorithmic complexity of the described contraction problems. Recall that we are given an input graph with edge lengths and tolerance function $\varphi$ , and our goal is to compute a (weak) contraction that maximizes the total number of contracted and deleted edges. Our main results concern affine tolerance functions $\varphi(x)=x/\alpha-\beta$ with parameters $\alpha\geq 1$ and $\beta\geq 0$ . For the reader’s convenience, our results are summarized in Tables 1 and 2. Within the tables and throughout this paper, $n$ and $m$ denote the number of vertices and edges, respectively, of the input graph under consideration.

Algorithmic results

We develop linear time greedy algorithms for Contraction with unit lengths on paths and cycles for general $\alpha$ and $\beta$ , as well as on trees with $\alpha=1$ (Theorems 2, 3 and 4). The first two algorithms are inspired by LP rounding techniques, the latter algorithm relies on a structural characterization of optimal solutions.

We present dynamic programming algorithms solving Contraction and Weak Contraction on trees in time $\mathcal{O}(n^{3})$ or $\mathcal{O}(n^{5})$ , respectively (Theorems 5 and 6). These dynamic programs compute optimal solutions on subtrees, in the latter case combining several Pareto optimal solutions in a two-dimensional parameter space (hence the larger running time).

Note that instead of maximizing the number of contracted and deleted edges, we could optimize for $\alpha$ or $\beta$ while fixing the other parameters. The resulting problems are polynomially equivalent to our setting, via binary search over one of the parameters.

Hardness results

We complement these algorithms by several hardness results. First we consider the purely additive case where $\alpha=1$ . We show that here both Contraction and Weak Contraction are NP-hard on cycles for any fixed $\beta>0$ , by a reduction of a variant of Partition (Theorem 7). As mentioned before, both problems can be solved efficiently on graphs without cycles, and there is a linear time algorithm for Contraction on cycles with unit lengths. By reductions from Clique we show that both the general as well as the unit lengths case of Contraction with $\alpha=1$ are hard to approximate within factors of $n^{1-\varepsilon}$ or $m^{1/2-\varepsilon}$ , respectively (Theorem 9 and Theorem 10).

Further we consider the purely multiplicative case where $\beta=0$ (here Contraction is trivial). We show that in this case Weak Contraction is NP-hard on planar graphs with arbitrarily large girth and unit length edges by a reduction from a special case of Planar 3SAT (Theorem 11). Since these graphs are locally tree-like, this result constitutes another rather sharp separation from the polynomially solvable tree case. Furthermore, we show that the problem is hard to approximate within a factor of $n^{1-\varepsilon}$ by a reduction from Independent Set (Theorem 12).

Asymptotic bounds

We now discuss our asymptotic bounds for contractions. In this setting, we are interested in (non-optimal) contractions for graphs with unit lengths that can be computed efficiently despite the above-mentioned hardness results. We prove that for any $k\geq 1$ any graph $G$ has a $(2k-1,1)$ -contraction $C$ such that $G/C$ has at most $n^{1+1/k}$ edges, and such a contraction can be computed in time $\mathcal{O}(m)$ (Theorem 13) by successively growing clusters around center vertices. Assuming Erdős’ girth conjecture, we show a corresponding (not tight) lower bound (Theorem 15).

For a purely additive error, we observe two simple $(1,k)$ -contractions that can be computed in $\mathcal{O}(m)$ time (Theorem 16). We show that for any even integer $0\leq k\leq n$ , the edges incident to the $k/2$ vertices of highest degrees form a $(1,k)$ -contraction with objective value at least $km/(2n)$ , which is asymptotically best possible for paths. Another $(1,k)$ -contraction $C$ is implicitly used by Bernstein and Chechik in their faster deterministic algorithm for dynamic shortest paths in dense graphs [BC16]. For any number $0<k\leq n$ , it consists of the edges incident to two vertices of degree at least $n/k$ , and $G/C$ has $\mathcal{O}(n^{2}/k)$ edges. Both of these contractions can be computed in $\mathcal{O}(m)$ time. Further we note that the main result in [AB16] implies that for all $\varepsilon>0$ , any contraction $C$ such that $G/C$ has $\mathcal{O}(n^{4/3-\varepsilon})$ edges does not admit a constant additive error.

One possible advantage of contraction compared to spanners is the potentially significant reduction of vertices as well as edges, e.g. reducing the complexity of performing algorithmic tasks in the smaller graph. To ground this intuition, we exhibit a contraction that significantly reduces the number of vertices in any graph with minimum degree $D$ to $\mathcal{O}(n/D)$ (Theorem 17). We also present a lower bound (Theorem 18) showing that we cannot guarantee $o(n/D)$ vertices, even if we allow larger approximation error.

1.2. Comparison with previous results

There are several models aiming to compress graphs while preserving distances. They differ by their choice of compression operation, such as replacing the graph by a subgraph or minor, and by whether the aim is to preserve all or only certain distances.

As discussed before, graph spanners are a concept closely related to contractions, where the length of removed edges is set to $\infty$ rather than to [math]. Our results highlight further intrinsic similarities of the two models. Like contractions, spanners are NP-hard to compute optimally (see [PS89, LS93]). While the spanner literature considers the problem of minimizing the number of remaining edges, we analyze the objective of maximizing the number of contracted edges, prohibiting a direct comparison of the respective inapproximability results. We note however that approximation algorithms for spanner problems have been studied extensively, even though strong lower bounds are known. For instance, computing $(2,0)$ -spanners in unweighted graphs is $\Theta(\log n)$ -hard to approximate ([KP94, Kor01]); for further references see e.g. [CDKL17].

Despite these negative results, it is still possible to obtain powerful asymptotic guarantees in both models. In particular, our $(2k-1,1)$ -contraction with $\mathcal{O}(n^{1+1/k})$ edges for unweighted graphs has a clear analogy to the classic $(2k-1,0)$ -spanner with the same number of edges [ADD*+*93] (note that the additive error of 1 in our result is strictly necessary, as discussed above). There is, however, a major difference between the two results: whereas the $(2k-1,0)$ -spanner can trivially be shown to be optimal assuming Erdős’ girth conjecture, applying this conjecture to the contraction model only yields a lower bound of $n^{1+1/(2k)}$ edges for a $(2k-1,1)$ -contraction. Closing this gap thus remains as an interesting open problem in the contraction model, whose solution would likely yield further insight into the relationship to spanners.

Halperin and Zwick showed how an optimal $(2k-1,0)$ -spanner can be constructed in linear time (see [BS03]). We achieve the same running time for our $(2k-1,1)$ -contraction. It is interesting to note that the clustering yielding our $(2k-1,1)$ -contraction was previously used in [PS89] to obtain a $(4k+1,0)$ -spanner of the same asymptotic density.

There are also spanner results that significantly sparsify unweighted graphs at the cost of a purely additive error, as a (1,2)-spanner with $\mathcal{O}(n^{3/2})$ edges [ACIM99], or a (1,6)-spanner with $\mathcal{O}(n^{4/3})$ edges [BKMP05]. We do not know if analogous results are possible in the contraction model. The incompressibility result in [AB16] mentioned above implies the same lower bound for spanners as for contractions and every other distance oracle with additive error: For every $\varepsilon>0$ any spanner of size $\mathcal{O}(n^{4/3-\varepsilon})$ does not admit a constant additive error. Finally, for spanners there are results that combine multiplicative and additive error, such as the $(k,k-1)$ -spanner of [BKMP05].

Gupta [Gup01] considered the problem of approximating a tree metric on a subset of the vertices by another tree, and gave a linear time algorithm computing an $8$ -approximation. As Chan et al. [CXKR06] observed later, on complete binary trees a solution of minimum distortion is always achieved by a minor (with possibly different edge lengths) of the input tree, so this seems to be the first investigation of contractions that approximate graph distances. Krauthgamer et al. [KNZ14] considered an extension to general graphs, studying the size of minors preserving all distances between a given terminal set of fixed size. Cheung et al. [CGH16] introduced a multiplicative distortion to this model. As here no two terminals may be merged, these approaches cannot compress a graph at all if every vertex is a terminal.

The pairwise preservers due to Coppersmith et al. [CE06] combine spanners with the aim of preserving only terminal distances. Given a graph $G$ and a set of $k$ terminal pairs, a pairwise preserver is a spanning subgraph inducing exactly the same terminal distances as $G$ . Coppersmith et al. [CE06] proved that for every undirected weighted graph there exists a pairwise preserver of size $\mathcal{O}(n+n^{1/2}k)$ . Furthermore, they showed that every directed weighted graph has a pairwise preserver of size $\mathcal{O}(nk^{1/2})$ . For the special case of undirected unweighted graphs, Bodwin et al. [BVW16] showed the existence of a pairwise preserver with $\mathcal{O}(n^{2/3}k^{2/3}+nk^{1/3})$ edges. Recently, Bodwin [Bod17] proved that any directed weighted graph has a pairwise preserver of size $\mathcal{O}(n+n^{2/3}k)$ .

1.3. Further related work

The preservation of graph properties other than distances has been studied as well. Biedl et al. [BBV00] considered contractions in capacitated networks with the goal of maintaining the maximum flow in the network. Here an edge $e$ is called useless, if for every capacity function there is a maximum flow not using $e$ . Biedl et al. showed that finding all useless edges is NP-complete, but solvable in $\mathcal{O}(n^{2})$ time on certain planar graphs. For undirected networks, Misiołek et al. [MC05] gave an algorithm finding all useless edges in $\mathcal{O}(n+m)$ time. Toivonen et al. [ZMT10] considered a more general model aiming to maintain the quality of paths with respect to any given function, e.g., distance or capacity. They investigated strategies of removing edges, without decreasing the quality of the best path between any pair of vertices.

Graph simplification problems have also been studied in several other contexts, and we conclude this section by mentioning two such examples: Hübler et al. [HKBG08] studied a problem related to graph mining, examining how to choose an induced subgraph with a given number of vertices and with similar topological properties as the input graph. Numerous papers investigate, directly or as a tool, sparsifiers that preserve the effective resistance between certain or all pairs of vertices, see e.g. [DB13, DKW15, KS16, DKP*+*17, CGP*+*18].

1.4. Outline of this paper

In Section 2 we introduce important definitions and notations that will be used throughout this paper. In Section 3 we present our three greedy algorithms for solving Contraction with unit lengths on paths, cycles and trees (the latter result requires $\alpha=1$ ). In Section 4 we discuss efficient dynamic programming algorithms for Contraction and Weak Contraction on trees. Sections 5 and 6 are devoted to our hardness results, focussing on the cases of purely additive and multiplicative error, respectively. In Section 7 we present our asymptotic results on contractions.

2. Preliminaries

Throughout this paper we consider simple undirected graphs $G$ (without parallel edges or loops). We let $V(G)$ and $E(G)$ denote the vertex and edge set of $G$ , respectively, and we define $n(G):=|V(G)|$ and $m(G):=|E(G)|$ . If the context is clear, we simply write $V$ , $E$ , $n$ and $m$ . We also use the notation $[n]:=\{1,2,\ldots,n\}$ . We assume that $G$ is connected, otherwise the contraction problem can be solved independently for each connected component. Edge lengths are given by a function $\ell\colon E\to\mathds{R}_{>0}$ . The distance $\operatorname{dist}_{\ell}(u,v)$ between two vertices $u$ and $v$ is the length of a shortest path between $u$ and $v$ in $G$ with respect to $\ell$ .

Given a subset of edges $C\subseteq E$ , we denote the resulting simple graph obtained from $G$ by contracting the edges in $C$ , deleting resulting loops and keeping only the minimum length edge between any two vertices by $G/C$ . We denote the number of deleted loops and multi-edges by $\Delta(C)$ (thus $m(G/C)=m(G)-|C|-\Delta(C)$ ). Instead of contracting a set $C\subseteq E$ of edges in $G$ , setting their edge lengths to zero has the same effect on the distances in the resulting graph. This is somewhat cleaner conceptually, so we will often adopt this viewpoint. Specifically, we let $\ell_{C}$ be the new length function that assigns 0 to every edge in $C$ , and that is equal to the original edge lengths $\ell$ on the edges $E\setminus C$ .

A tolerance function is a non-decreasing function $\varphi\colon\mathds{R}\to\mathds{R}$ . Roughly speaking, this function describes by how much the distance between two vertices may drop when contracting edges (i.e., setting edge lengths to zero). Formally, given a graph $G$ with edge lengths $\ell$ and a tolerance function $\varphi$ , we say that a subset of edges $C\subseteq E$ is a $\varphi$ -distance preserving contraction or $\varphi$ -contraction for short, if

[TABLE]

holds for any two vertices $u$ and $v$ in $G$ . Similarly, we say that $C$ is a weak $\varphi$ -distance preserving contraction or weak $\varphi$ -contraction for short, if any two vertices $u$ and $v$ satisfy relation (1) or the relation $\operatorname{dist}_{\ell_{C}}(u,v)=0$ , and if the graph $(V,C)$ is disconnected (equivalently, if $G/C$ is not a single vertex). The last condition prevents solutions $C\subseteq E$ for which the graph is contracted to a single vertex. If $\varphi(x)=x/\alpha-\beta$ , then we simply write (weak) $(\alpha,\beta)$ -contraction instead of (weak) $\varphi$ -contraction.

An instance of the problem Contraction or Weak Contraction is a triple $(G,\ell,\varphi)$ , where $G$ is the underlying graph, $\ell$ the length function and $\varphi$ the tolerance function, and the objective is to find a (weak) $\varphi$ -distance preserving contraction $C\subseteq E$ , such that

[TABLE]

is maximized. This quantity equals the number of edges we save when going from $G$ to $G/C$ . Note that on trees we have $\Phi(C)=|C|$ for any (weak) contraction $C$ , whereas on general graphs we have $\Phi(C)\geq|C|$ .

[TABLE]

In this context we sometimes refer to a set of edges that forms a (weak) contraction as a feasible solution, and to a (weak) contraction of maximum value $\Phi(C)$ as an optimal solution.

We begin by proving that our contraction model behaves nicely when contracting edges in phases, i.e., the total error is simply the error accumulated over the contraction phases (but not more). To state this result we denote the composition of tolerance functions $\varphi$ and $\psi$ as $(\psi\circ\varphi)(x):=\psi(\varphi(x))$ .

Theorem 1.

Let $C$ be a (weak) $\varphi$ -contraction for $G$ , and let $C^{\prime}$ be a (weak) $\psi$ -contraction for $G/C$ . Then $C\cup C^{\prime}$ is a (weak) $(\psi\circ\varphi)$ -contraction for $G$ .

Proof.

We only prove the statement for contractions $\varphi$ and $\psi$ . The proof for weak contractions works analogously. Let $\ell$ denote the edge lengths of $G$ and consider a pair of vertices $u,v\in V(G)$ . Then we have $\operatorname{dist}_{\ell_{C\cup C^{\prime}}}(u,v)\geq\psi(\operatorname{dist}_{\ell_{C}}(u,v))$ by the definition of $C^{\prime}$ and $\operatorname{dist}_{\ell_{C}}(u,v)\geq\varphi(\operatorname{dist}_{\ell}(u,v))$ by the definition of $C$ . Combining these inequalities and using that $\psi$ is non-decreasing we obtain $\operatorname{dist}_{\ell_{C\cup C^{\prime}}}(u,v)\geq\psi(\varphi(\operatorname{dist}_{\ell}(u,v)))$ , as desired. ∎

Note that Theorem 1 only concerns the feasibility of repeated contractions, but not about their optimality when searching for contractions of maximum cardinality. With respect to solution quality, contracting in phases may be arbitrarily bad: Consider a star with $k$ unit length edges and additive tolerance functions $\varphi(x)=\psi(x)=x-1$ . An optimum $(\psi\circ\varphi)$ -contraction contains all $k$ edges, whereas finding an optimal $\varphi$ -contraction $C$ and then an optimal $\psi$ -contraction of $G/C$ allows contracting only one edge in each phase, leading to a $(\psi\circ\varphi)$ -contraction of value 2.

3. Greedy algorithms

In this section we consider three special cases of the problem Contraction with affine tolerance function $\varphi(x)=x/\alpha-\beta$ . We obtain simple greedy algorithms computing maximum size $\varphi$ -contractions in $\mathcal{O}(n)$ time on paths and cycles with unit lengths, and on trees with unit lengths and $\alpha=1$ .

3.1. Paths with unit length edges

In this section we consider the special case of contracting a path $P_{n}$ with $n-1$ unit length edges $\ell=1$ and the tolerance function $\varphi(x)=x/\alpha-\beta$ . In this case optimal solutions have a very special structure, which leads to a straightforward greedy algorithm running in linear time. Recall that as a path is a tree, our objective functions satisfies $\Phi(C)=|C|$ for any contraction $C$ .

Observe that a solution $C\subseteq E(P_{n})$ for the instance $(P_{n},\ell,\varphi)$ of the problem Contraction is feasible, if and only if every subpath $P^{\prime}\subseteq P_{n}$ satisfies the condition

[TABLE]

This observation leads to the following natural greedy algorithm $\operatorname{\textsc{Greedy}}(P_{n},\alpha,\beta)$ : The algorithm considers the edges $e_{1},e_{2},\ldots,e_{n-1}$ of $P_{n}$ as they are encountered when starting from one of the two end vertices of $P_{n}$ . It iteratively constructs a solution $C$ for the subpath on the first $i$ edges $e_{1},e_{2},\ldots,e_{i}$ for $i=1,2,\ldots,n-1$ , by initializing $C:=\emptyset$ , and by adding the edge $e_{i}$ to $C$ if and only if the condition $|C|+1\leq(1-1/\alpha)i+\beta$ is satisfied (so after adding $e_{i}$ to $C$ , (3) is still satisfied).

Theorem 2.

Let $P_{n}$ be a path with unit length edges $\ell=1$ and consider the tolerance function $\varphi(x)=x/\alpha-\beta$ , $\alpha,\beta\geq 1$ . The set of edges computed by the algorithm $\operatorname{\textsc{Greedy}}(P_{n},\alpha,\beta)$ is an optimal solution for the instance $(P_{n},\ell,\varphi)$ of the problem Contraction, and it is computed in time $\mathcal{O}(n)$ .

Proof.

Let $C\subseteq E(P_{n})$ be the set of edges computed by the algorithm $\operatorname{\textsc{Greedy}}(P_{n},\alpha,\beta)$ . Clearly, we have $|C|=\lfloor(1-1/\alpha)|E(P_{n})|+\beta\rfloor$ , and this is optimal according to (3). However, it remains to show that $C$ is feasible. For $1\leq i\leq j\leq n-1=|E(P_{n})|$ we let $P_{i,j}$ denote the subpath of $P_{n}$ formed by the edges $e_{i},e_{i+1},\ldots,e_{j}$ . By the definition of our algorithm we know that $|E(P_{1,i})\cap C|=\lfloor(1-1/\alpha)i+\beta\rfloor$ , from which we obtain that

[TABLE]

where we used the assumption $\beta\geq 1$ in the last step. Using (3) it thus follows that $C$ is feasible. ∎

3.2. Cycles with unit length edges

In this section we consider the special case of contracting a cycle $C_{n}$ with $n$ vertices and unit length edges $\ell=1$ and the tolerance function $\varphi(x)=x/\alpha-\beta$ , $\alpha\geq 1$ , $\beta\geq 0$ . For this case we present a greedy algorithm running in linear time. The main purpose of this result is to clearly separate the polynomially solvable cases of Contraction from the NP-hard cases, and the case of a cycle with unit length edges precisely forms this boundary on the polynomially solvable side. Recall in this context that we can solve Contraction in polynomial time on any tree (this will be proved in Section 4.1 below), and that Contraction is NP-hard already on a cycle for $\alpha=1$ (with arbitrary edge lengths; we will show this in Section 5.1 below).

We first argue that on a cycle it is equivalent to maximize the number of contracted edges $|C|$ or to maximize our objective function $\Phi(C)$ defined in (2). This is because the set of pairs $(|C|,\Phi(C))$ for all feasible contractions $C$ in a cycle $G=C_{n}$ is given by $\{(1,1),(2,2),\ldots,(n-3,n-3),(n-2,n-1),(n-1,n),(n,n)\}$ , so it forms a monotone function, implying that maximizing either one of the two quantities is equivalent. Based on this argument, for the rest of this section we consider maximizing the number $|C|$ of contracted edges.

Observe that a solution $C\subseteq E(C_{n})$ ( $C_{n}$ is the cycle we want to contract, and $C$ is the set of edges to be contracted) for the instance $(C_{n},\ell,\varphi)$ of the problem Contraction is feasible, if and only if every subpath $P\subseteq C_{n}$ of length $d:=|E(P)|\in\{1,2,\ldots,n-1\}$ satisfies the condition

[TABLE]

Rounding down on the right-hand side of (4) is justified because $|E(P)\cap C|$ is always an integer.

Defining

[TABLE]

we obtain from (4) that $\lambda\in[0,1]$ is the maximal amount by which we can contract each edge in a uniform fractional solution. Inspired by the rounding technique from [BOR80], we turn this fractional solution into an integer optimal solution, yielding the following greedy algorithm $\operatorname{\textsc{Greedy}}(C_{n},\alpha,\beta)$ : The algorithm considers the edges $e_{1},e_{2},\ldots,e_{n}$ of $C_{n}$ as they are encountered when walking around the cycle. It iteratively constructs a solution $C$ by initializing $C:=\emptyset$ and by adding the edge $e_{i}$ to $C$ if and only if $\lfloor\lambda i\rfloor-\lfloor\lambda(i-1)\rfloor=1$ for all $i=1,2,\ldots,n$ (since $\lambda\in[0,1]$ , this difference is always either 0 or 1). Note that we contract all edges of $C_{n}$ if and only if $\lambda=1$ .

Theorem 3.

Let $C_{n}$ be a cycle with unit length edges $\ell=1$ and consider the tolerance function $\varphi(x)=x/\alpha-\beta$ , $\alpha\geq 1$ , $\beta\geq 0$ . The set of edges computed by the algorithm $\operatorname{\textsc{Greedy}}(C_{n},\alpha,\beta)$ is an optimal solution for the instance $(C_{n},\ell,\varphi)$ of the problem Contraction, and it is computed in time $\mathcal{O}(n)$ .

The next lemma shows that the contraction computed by our algorithm has the maximum size.

Lemma 3.1.

For any feasible solution $C\subseteq E(C_{n})$ we have $|C|\leq\lfloor\lambda n\rfloor$ with $\lambda$ defined in (5).

Proof.

If $\lambda=1$ this inequality is trivial. So let us assume that $\lambda=\lambda^{\prime}<1$ and that the minimum in (5a) is attained for some $d\in\{1,2,\ldots,n-1\}$ . Starting at some vertex $u$ of the cycle, we walk along the cycle and cover it with $n$ consecutive paths $P_{1},P_{2},\ldots,P_{n}$ of length $d$ each ( $P_{i+1}$ starts where $P_{i}$ ends). The sum of the lengths of the paths is $nd$ , so this process ends at the starting vertex $u$ , and each edge of the cycle and each edge of $C$ is covered exactly $d$ times. We therefore obtain

[TABLE]

As $|C|$ must be integral this inequality yields the desired bound $|C|\leq\lfloor\lambda n\rfloor$ . ∎

With Lemma 3.1 in hand, we are now ready to prove Theorem 3.

Proof of Theorem 3.

In this proof we will use that for any two real numbers $x$ and $y$ we have

[TABLE]

Let $C\subseteq E(C_{n})$ be the set of edges computed by the algorithm $\operatorname{\textsc{Greedy}}(C_{n},\alpha,\beta)$ . Clearly, we have $|C|=\sum_{i=1}^{n}(\lfloor\lambda i\rfloor-\lfloor\lambda(i-1)\rfloor)=\lfloor\lambda n\rfloor$ , which is optimal by Lemma 3.1. However, it remains to show that $C$ is feasible. We consider a path $P$ of length $d:=|E(P)|\in\{1,2,\ldots,n-1\}$ on the edges $e_{k},e_{k+1},\ldots,e_{k+d-1}$ (indices are considered cyclically modulo $n$ , so $e_{n+i}=e_{i}$ ). We distinguish two cases: If $k+d-1\leq n$ , we have

[TABLE]

If $k+d-1>n$ , we obtain

[TABLE]

Applying (5) and using that $\lceil\lfloor x\rfloor\rceil=\lfloor x\rfloor$ shows that the right-hand sides of (7) and (8) can both be bounded from above by $\lfloor d-\min\{d,n-d\}/\alpha+\beta\rfloor$ , proving that $C$ is indeed feasible by (4). ∎

3.3. Trees with unit length edges and additive error

In this section we consider the special case of contracting a tree $T$ with unit length edges $\ell=1$ and the tolerance function $\varphi(x)=x-\beta$ (purely additive error; we can assume w.l.o.g. that $\beta$ is an integer). Note that in this setting the objective function defined in (2) satisfies $\Phi(C)=|C|$ for any contraction $C$ . It turns out that in this case, optimal solutions have a very special structure that can be exploited to compute them in linear time. Specifically, an optimal solution is obtained by taking all edges of $T$ which have the property that only short paths start from one of its end vertices. Formally, for the tree $T$ and $d\in\mathds{N}_{\geq 0}$ , we let $L(T,d)$ denote the set of all edges $e$ of $T$ which have one end vertex $v$ such that all paths that start at $v$ and do not contain $e$ have length at most $d-1$ (together with $e$ these paths have length at most $d$ ). E.g., we have $L(T,0)=\emptyset$ , and the set $L(T,1)$ are all the edges incident to a leaf (see Figure 2).

Clearly, the set $L(T,d)$ can be computed in linear time by repeatedly removing all leaves of $T$ in $d$ rounds. This is a variant of the well-known linear time algorithm to compute the so-called center of a tree (see [Ski08, Section 15.11]).

Theorem 4.

Let $T$ be a tree with unit length edges $\ell=1$ and consider the tolerance function $\varphi(x)=x-\beta$ , $\beta\in\mathds{N}_{\geq 0}$ . If $\beta$ is even, the set of edges $L(T,d)$ with $d:=\lfloor\beta/2\rfloor$ is an optimal solution for the instance $(T,\ell,\varphi)$ of the problem Contraction. If $\beta$ is odd, $L(T,d)\cup\{e\}$ , $e\in E\setminus L(T,d)$ , is an optimal solution. These solutions can be computed in time $\mathcal{O}(n)$ .

Proof.

We define $C:=L(T,d)$ if $\beta$ is even and $C:=L(T,d)\cup\{e\}$ , for some $e\in E\setminus L(T,d)$ , if $\beta$ is odd. We first argue that $C$ is a feasible solution. To see this note that for the given tolerance function we only need to verify that the path $P$ between any two leaves $u,v$ of $T$ contains at most $\beta$ edges. Consider all the edges of $P$ for which both end vertices have distance at least $d$ from both $u$ and $v$ . None of those edges is in $L(T,d)$ by its definition. It follows that $|P\cap L(T,d)|\leq 2d=2\lfloor\beta/2\rfloor$ and therefore $|P\cap C|\leq\beta$ .

To prove that $C$ is a solution of maximum size we argue by induction over $\beta$ . The claim is trivially true for $\beta=0$ and $\beta=1$ (in these cases $|C|=0$ and $|C|=1$ , respectively). So let $D$ be an arbitrary feasible solution of the instance $(T,\ell,\beta)$ of the problem Contraction for some $\beta\geq 2$ . We need to show that $|C|\geq|D|$ . To this end we let $V^{*}\subseteq V(T)$ denote the set of leaves of $T$ and we define $E^{*}:=L(T,1)$ . Moreover, we define $T^{*}:=T\setminus V^{*}$ and $C^{*}:=C\setminus E^{*}$ . By induction, $C^{*}=L(T^{*},d-1)$ is an optimal solution for the instance $(T^{*},\ell,\beta-2)$ .

We first consider the case that $E^{*}\setminus D=\emptyset$ or $D\setminus E^{*}=\emptyset$ (this is equivalent to $E^{*}\subseteq D$ or $D\subseteq E^{*}$ ). In this case we define $D^{*}:=D\setminus E^{*}$ , and observe that $D^{*}$ is a feasible solution for the instance $(T^{*},\ell,\beta-2)$ . It follows that $|C^{*}|\geq|D^{*}|$ , implying that $|C|=|C^{*}|+|E^{*}|\geq|D^{*}|+|E^{*}|\geq|D|$ , as claimed.

It remains to consider the case that both sets $E^{*}\setminus D$ and $D\setminus E^{*}$ are nonempty, so there is an edge $e^{\prime}\in E^{*}\setminus D$ and an edge $f\in D\setminus E^{*}$ . We denote the leaf incident to $e^{\prime}$ by $v$ . We will now remove an edge $e\in D\setminus E^{*}$ from $D$ and add $e^{\prime}$ instead to obtain another feasible solution $D^{\prime}$ satisfying $|D|=|D^{\prime}|$ . Repeating this exchange argument and applying the reasoning from the first case then proves the lemma. The edge $e\in D\setminus E^{*}$ to be removed from $D$ is obtained by considering the path that connects $v$ and $f$ in $T$ and that contains $f$ , and by choosing the first edge from $D$ (or equivalently, from $D\setminus E^{*}$ ) that is encountered when following this path from $v$ to $f$ . It may happen that $e=f$ is the first such edge we encounter. To complete the proof of the lemma it remains to show that $D^{\prime}=D\setminus\{e\}\cup\{e^{\prime}\}$ is feasible. To prove this we only need to check paths which start in $v$ and contain $e^{\prime}$ but not $e$ . Let $P^{\prime}$ be such a path, let $Q$ be any path that also starts in $v$ but does contain $e$ , and consider the path $P:=(P^{\prime}\setminus Q)\cup(Q\setminus P^{\prime})$ (see Figure 3). Here and in the following we slightly abuse notation and interpret these set unions/differences/intersections in terms of the edge sets of the graphs. As $D$ is feasible and as $P\cap Q$ contains $e$ , the number of edges in $D$ or $D^{\prime}$ on $P^{\prime}\setminus Q=P\setminus Q$ is at most $\beta-1$ . By the choice of $e$ , the number of edges of $D^{\prime}$ on $P^{\prime}\cap Q$ is 1 (the only edge of $D^{\prime}$ on this path is $e^{\prime}$ ). As $P^{\prime}=(P^{\prime}\setminus Q)\cup(P^{\prime}\cap Q)$ , we obtain that the number of edges from $D^{\prime}$ on $P^{\prime}$ is at most $\beta-1+1=\beta$ , as desired. This completes the proof. ∎

4. Dynamic programs for general trees

In this section we describe dynamic programming algorithms for the problems Contraction and Weak Contraction on trees with general edge lengths and affine tolerance functions. Recall that on trees our objective function satisfies $\Phi(C)=|C|$ for any contraction $C$ .

4.1. Contraction on trees

In this section we describe a dynamic programming algorithm for the problem of computing an optimal contraction of a tree $T$ with arbitrary edge lengths $\ell\colon E\to\mathds{R}_{>0}$ and an affine tolerance function $\varphi(x)=x/\alpha-\beta$ , $\alpha\geq 1$ , $\beta\geq 0$ , generalizing the solution for the special case presented at the beginning of the previous section. The goal is to prove the following result.

Theorem 5.

Let $T$ be a tree with edge lengths $\ell\colon E\to\mathds{R}_{>0}$ and consider the tolerance function $\varphi(x)=x/\alpha-\beta$ , $\alpha\geq 1$ , $\beta\geq 0$ . An optimal solution for the instance $(T,\ell,\varphi)$ of the problem Contraction can be computed by dynamic programming in time $\mathcal{O}(n^{3})$ .

Observe that a solution $C\subseteq E$ is feasible if and only if for any two vertices $u$ and $v$ of $T$ we have $\operatorname{load}_{C,\alpha}(u,v)\leq\beta$ , where the load between $u$ and $v$ is defined as

[TABLE]

Note that $\operatorname{load}_{C,\alpha}(T,v)\geq 0$ , as we have $\operatorname{load}_{C,\alpha}(v,v)=0$ . The next lemma states a criterion when feasible solutions of subtrees can be combined to a feasible solution of the entire tree. The definitions (9a), (9b) and the lemma are illustrated in Figure 4.

Lemma 4.1.

Consider a partition of $T$ into two subtrees $T_{1}$ and $T_{2}$ that only have a vertex $v\in V$ in common. Then $C\subseteq E$ is a feasible solution for the instance $(T,\ell,\varphi)$ of the problem Contraction if and only if the following two conditions hold: $C\cap E(T_{1})$ and $C\cap E(T_{2})$ are feasible solutions for the instances $(T_{1},\ell,\varphi)$ and $(T_{2},\ell,\varphi)$ respectively; and we have $\operatorname{load}_{C,\alpha}(T_{1},v)+\operatorname{load}_{C,\alpha}(T_{2},v)\leq\beta$ .

Proof.

Observe that the path between two vertices $u\in T_{1}$ and $w\in T_{2}$ contains the vertex $v$ , so we obtain $\operatorname{load}_{C,\alpha}(u,w)=\operatorname{load}_{C,\alpha}(u,v)+\operatorname{load}_{C,\alpha}(v,w)$ from (9a). Using (9b) it follows that the condition $\operatorname{load}_{C,\alpha}(u,w)\leq\beta$ holding for all such pairs of vertices $u,w$ is equivalent to $\operatorname{load}_{C,\alpha}(T_{1},v)+\operatorname{load}_{C,\alpha}(T_{2},v)\leq\beta$ . ∎

We will use this lemma to formulate our dynamic programming algorithm. The idea is to compute optimal solution for subtrees and combining them to an optimal solution for the entire tree.

To describe the algorithm we introduce a few definitions. An ordered rooted tree is a rooted tree with a specified left-to-right ordering for the children of each vertex. Given the tree $T$ , we can pick an arbitrary vertex as the root, and for each descendant of the root an arbitrary left-to-right ordering of its children, yielding an ordered rooted tree (different roots and orderings yield different ordered rooted trees, but any one of them is good for our purposes). We slightly abuse notation in the following and use $T$ to denote this ordered rooted tree. All trees considered in the rest of this section are ordered and rooted. For any vertex $v$ of $T$ , we let $T_{v}$ denote the subtree of $T$ rooted at $v$ , and we use $c(v)$ to denote the number of children of $v$ . If $u_{1},u_{2},\ldots,u_{c(v)}$ are the children of $v$ (in the specified ordering), we write $T_{v,i}$ , $i\in\{1,\ldots,c(v)\}$ , for the subtree of $T$ that contains $v$ , $u_{i}$ and all the descendants of $u_{i}$ . We also define $T_{v,0}:=\{v\}$ . Furthermore, we define $T_{v,i}^{+}:=\bigcup_{0\leq j\leq i}T_{v,i}$ , so we have $T_{v}=T_{v,c(v)}^{+}$ . These definitions are illustrated in Figure 4.

Using these definitions it follows straightforwardly from (9a) and (9b) that for any set of edges $C\subseteq E(T_{u_{i}})$ we have

[TABLE]

Note that the load increases if the edge $\{v,u_{i}\}$ is added to $C$ (see (10a)), and it decreases otherwise (see (10b)). Moreover, for any set of edges $C\subseteq T_{v,i}^{+}$ and any $i=1,2,\ldots,c(v)$ we obtain from those definitions that

[TABLE]

These rules allow us to compute the load of all subtrees of $T$ in a bottom-up fashion. Our dynamic program maintains the minimum load of all subtrees of $T$ in three-dimensional matrices $L$ and $L^{+}$ . We begin defining these matrices in an abstract way, and then establish several recursive relations which directly translate into a dynamic program. Specifically, for $v\in V$ , $i\in\{0,1,\ldots,c(v)\}$ and $s\in\{0,1,\ldots,m\}$ (recall that $m=|E|$ ) we define

[TABLE]

If there is no feasible solution of the required size, we have $L(v,i,s)=\infty$ . The entries of $L^{+}(v,i,s)$ are defined analogously to (12) by considering the load of $T_{v,i}^{+}$ instead of $T_{v,i}$ . In words, the entries $L(v,i,s)$ and $L^{+}(v,i,s)$ describe feasible solutions $C$ of size $s$ of the instances $(T_{v,i},\ell,\varphi)$ or $(T_{v,i}^{+},\ell,\varphi)$ , respectively, of the problem Contraction for which the load at the vertex $v$ is as small as possible (the matrices contain the minimum achievable load, not the corresponding set of edges).

Lemma 4.2.

Let $v$ be a vertex of $T$ and let $u_{1},u_{2},\ldots,u_{c(v)}$ be the children of $v$ . Then the matrices $L$ and $L^{+}$ defined in and directly after (12) satisfy the relations

[TABLE]

Moreover, we have

[TABLE]

for all $i\in\{1,2,\ldots,c(v)\}$ and $s\in\{1,2,\ldots,m\}$ .

The most interesting of these recursive relations are of course (13c) and (13d). The relation (13c) captures the two possibilities of either adding the edge $\{v,u_{i}\}$ or not adding it to a partial solution in the tree $T_{u_{i},c(u_{i})}^{+}=T_{u_{i}}$ to obtain a solution for the tree $T_{v,i}$ (recall (10)). The relation (13d), on the other hand, describes how to distribute $s$ contraction edges in $T_{v,i}^{+}$ among the two subtrees $T_{v,i-1}^{+}$ and $T_{v,i}$ ( $t$ is the number of edges contracted in the first tree, and $s-t$ the number of edges in the second tree, respectively).

Proof.

The relations (13a) and (13b) follow immediately from the definitions of the trees $T_{v,i}$ and $T_{v,i}^{+}$ and from (12). The relation (13c) follows from (10) and (12). The relation (13d) follows from (11) and (12) with the help of Lemma 4.1. ∎

We are now ready to prove Theorem 5.

Proof of Theorem 5.

Given the instance $(T,\ell,\varphi)$ , we fix an arbitrary root $r$ of $T$ and an arbitrary ordering of the children of each vertex, making $T$ an ordered rooted tree. We then compute the entries of the matrices $L$ and $L^{+}$ using Lemma 4.2. We first initialize various entries using (13a) and (13b), and compute the remaining entries in a bottom-up fashion moving upwards from the leaves to the root. Specifically, at a vertex $v$ with children $u_{1},u_{2},\ldots,u_{c(v)}$ for which all the entries of $L$ and $L^{+}$ have already been computed, we first compute $L(v,i,s)$ for all $i\in\{1,2,\ldots,c(v)\}$ and $s\in\{1,2,\ldots,m\}$ using (13c), and then $L^{+}(v,i,s)$ for all $i\in\{1,2,\ldots,c(v)\}$ and $s\in\{1,2,\ldots,m\}$ using (13d).

Let $s^{*}$ be the largest $s$ such that $L^{+}(r,c(r),s)\leq\beta$ . From (12) we obtain that $s^{*}$ is the size of an optimal solution of the instance $(T,\ell,\varphi)$ . The corresponding set of edges $C\subseteq E$ can be obtained by keeping track of the arguments for which the minima and maxima in (13c) and (13d) are attained in each step.

Clearly, $L$ and $L^{+}$ both have $\mathcal{O}(n^{2})$ entries, and computing each entry takes time $\mathcal{O}(n)$ , so the running time of our dynamic program is $\mathcal{O}(n^{3})$ . ∎

4.2. Weak Contraction on trees

In this section we consider the problem of computing weak contractions for a tree $T$ with affine tolerance function $\varphi(x)=x/\alpha-\beta$ . Here, our main result is a dynamic programming algorithm that builds on the algorithmic ideas presented in Section 4.1.

Theorem 6.

Let $T$ be a tree with edge lengths $\ell\colon E\to\mathds{R}_{>0}$ and consider the tolerance function $\varphi(x)=x/\alpha-\beta$ , $\alpha\geq 1$ , $\beta\geq 0$ . An optimal solution for the instance $(T,\ell,\varphi)$ of the problem Weak Contraction can be computed by dynamic programming in time $\mathcal{O}(n^{5})$ .

In this setting we need to specifically keep track of pairs of vertices whose distance remains positive when contracting a set of edges $C\subseteq E$ (i.e., not all edges in between these vertices are contracted). To this end we extend the definitions (9) as follows: For any vertex $v$ of $T$ we define the weak load of $T$ at $v$ as

[TABLE]

Note that in the maximization we have to consider all vertices $u$ such that at least one edge on the path from $u$ to $v$ is not in $C$ . This definition together with (9b) yields $\operatorname{wload}_{C,\alpha}(T,v)\leq\operatorname{load}_{C,\alpha}(T,v)$ . In contrast to the load, the weak load may be negative. In particular, $\operatorname{wload}_{C,\alpha}(T,v)=-\infty$ if and only if $C=E$ .

The following lemma is the counterpart to Lemma 4.1 for weak contractions. It describes how to combine feasible solutions on subtrees to a feasible solution of the entire tree. There is one important subtlety here: While the notion of a weak contraction forbids contracting all edges of $T$ , we clearly have to allow this for partial solutions on subtrees of $T$ (as long as some other edge not in the subtree is is not contracted, this might still yield a feasible solution).

Lemma 4.3.

Consider a partition of $T$ into two subtrees $T_{1}$ and $T_{2}$ that only have a vertex $v\in V$ in common. Then $C\subsetneq E$ is a feasible solution for the instance $(T,\ell,\varphi)$ of the problem Weak Contraction if and only if the following two conditions hold: For $i=1,2$ , either $C$ contains every edge of $T_{i}$ or $C\cap E(T_{i})$ is a feasible solution for the instance $(T_{i},\ell,\varphi)$ of Weak Contraction; and we have

[TABLE]

Proof.

Let $C\subsetneq E$ . For the rest of the proof we omit the subscripts $C$ and $\alpha$ and simply write $\operatorname{load}_{C,\alpha}=\operatorname{load}$ and $\operatorname{wload}_{C,\alpha}=\operatorname{wload}$ .

We first assume that $C$ is a feasible solution for the instance $(T,\ell,\varphi)$ of the problem Weak Contraction. I.e., any two vertices $u,w$ of $T$ with $\operatorname{dist}_{\ell_{C}}(u,w)>0$ satisfy the condition $\operatorname{load}(u,w)\leq\beta$ . This is true in particular for all pairs of vertices $u,w\in T_{i}$ , $i=1,2$ , implying that either $C\supseteq T_{i}$ or $C\cap T_{i}\subsetneq T_{i}$ is a feasible solution for the instance $(T_{i},\ell,\varphi)$ . If $\operatorname{wload}(T_{2},v)=-\infty$ , the claimed inequality $\operatorname{load}(T_{1},v)+\operatorname{wload}(T_{2},v)\leq\beta$ is trivially satisfied. So suppose that $\operatorname{wload}(T_{2},v)$ is a finite number, and let $u\in T_{1}$ and $w\in T_{2}$ be such that $\operatorname{load}(u,v)=\operatorname{load}(T_{1},v)$ , and $\operatorname{dist}_{\ell_{C}}(v,w)>0$ as well as $\operatorname{load}(v,w)=\operatorname{wload}(T_{2},v)$ . Then we also have $\operatorname{dist}_{\ell_{C}}(u,w)>0$ , so we know that $\operatorname{load}(u,w)\leq\beta$ by the assumption that $C$ is feasible for $(T,\ell,\varphi)$ . Combining this last inequality with the relation $\operatorname{load}(u,w)=\operatorname{load}(u,v)+\operatorname{load}(v,w)=\operatorname{load}(T_{1},v)+\operatorname{wload}(T_{2},v)$ proves that the right hand side of the equation is at most $\beta$ , as claimed. The proof of the second inequality $\operatorname{wload}(T_{1},v)+\operatorname{load}(T_{2},v)\leq\beta$ works symmetrically. This proves one direction of the equivalence.

To prove the reverse direction, we now assume that either $C\supseteq T_{i}$ or $C\cap T_{i}\subsetneq T_{i}$ is a feasible solution for the instance $(T_{i},\ell,\varphi)$ for $i=1,2$ , and that $\operatorname{load}(T_{1},v)+\operatorname{wload}(T_{2},v)\leq\beta$ and $\operatorname{wload}(T_{1},v)+\operatorname{load}(T_{2},v)\leq\beta$ . To show that $C$ is a feasible solution for the instance $(T,\ell,\varphi)$ , let $u\in T_{1}$ and $w\in T_{2}$ be such that $\operatorname{dist}_{\ell_{C}}(u,w)>0$ . It follows that $\operatorname{dist}_{\ell_{C}}(u,v)>0$ or $\operatorname{dist}_{\ell_{C}}(v,w)>0$ . We first consider the case that $\operatorname{dist}_{\ell_{C}}(u,v)>0$ . By the definitions (9) and (14) we have $\operatorname{load}(u,v)\leq\operatorname{wload}(T_{1},v)$ , and also $\operatorname{load}(v,w)\leq\operatorname{load}(T_{2},v)$ , yielding $\operatorname{load}(u,w)=\operatorname{load}(u,v)+\operatorname{load}(v,w)\leq\operatorname{wload}(T_{1},v)+\operatorname{load}(T_{2},v)\leq\beta$ (the last inequality holds by assumption). This proves that $\operatorname{load}(u,w)\leq\beta$ , as desired. The proof of the other case $\operatorname{dist}_{\ell_{C}}(v,w)>0$ works symmetrically. This completes the proof of the lemma. ∎

As in Section 4.1, we view $T$ as an ordered rooted tree, and consider its subtrees $T_{v}$ , $T_{v,i}$ and $T_{v,i}^{+}$ for all $v\in V$ and $i\in\{0,1,\ldots,c(v)\}$ (recall the definitions given after Lemma 4.1). Let us briefly highlight the differences between Lemmas 4.1 and 4.3. The dynamic programming algorithm presented in Section 4.1 exploits the fact that the optimal way to contract exactly $|C|=s$ edges in a subtree $T_{v}$ of $T$ rooted at a particular vertex $v$ is to contract a set of edges that minimizes $\operatorname{load}_{C,\alpha}(T_{v},v)$ . This is possible as the optimality condition in Lemma 4.1 only depends on this parameter. Here the situation is more complicated, as Lemma 4.3 also considers $\operatorname{wload}_{C,\alpha}(T_{v},v)$ . Figure 5 illustrates that it is not sufficient to minimize only one of these parameters.

Consequently, we keep track of an entire Pareto front of non-dominated partial solutions (see Figure 6). Formally, we define the set $F(T_{v},s)$ of feasible partial solutions of size $s$ as the family of all sets $C\subseteq E(T_{v})$ with $|C|=s$ such that either $C=E(T_{v})$ or $C$ is a feasible solution for the instance $(T_{v},\ell,\varphi)$ of Weak Contraction. For two sets $C,C^{\prime}\in F(T_{v},s)$ we say that $C$ dominates $C^{\prime}$ at $v$ if $\operatorname{load}_{C,\alpha}(T_{v},v)\leq\operatorname{load}_{C^{\prime},\alpha}(T_{v},v)$ and $\operatorname{wload}_{C,\alpha}(T_{v},v)\leq\operatorname{wload}_{C^{\prime},\alpha}(T_{v},v)$ , and we define the Pareto front $P(T_{v},v,s)$ as a minimal family of sets $C\in F(T_{v},s)$ such that no set $C^{\prime}\in F(T_{v},s)$ dominates $C$ at $v$ . Note that the domination relation is reflexive, so there may be several different such minimal families, all with the same pairs of load and weak load values, and any choice among them is equally good for us. This definition is illustrated in Figure 6.

The following crucial lemma asserts that the number of points on the Pareto front, i.e., the size of the family $P(T_{v},v,s)$ is at most $n+1$ . This property is essential for our dynamic programming approach, and it does not follow immediately from the definition of $P(T_{v},v,s)$ , as the set of feasible solutions $F(T_{v},s)$ is typically of exponential size.

Lemma 4.4.

For any $C\subseteq E(T_{v})$ , we have $\operatorname{load}_{C,\alpha}(T_{v},v)\in\Lambda(T_{v},v):=\{\operatorname{dist}_{\ell}(u,v)/\alpha:u\in V(T_{v})\}$ or $\operatorname{load}_{C,\alpha}(T_{v},v)=\operatorname{wload}_{C,\alpha}(T_{v},v)$ . Consequently, the Pareto front $P(T_{v},v,s)$ has size at most $n+1$ .

Proof.

By the definitions (9) and (14) we have $\operatorname{wload}_{C,\alpha}(T_{v},v)\leq\operatorname{load}_{C,\alpha}(T_{v},v)$ for all $C\subseteq E(T_{v})$ . Now let $C\subseteq E(T_{v})$ be such that $\operatorname{wload}_{C,\alpha}(T_{v},v)<\operatorname{load}_{C,\alpha}(T_{v},v)$ . Again by the previously mentioned definitions this implies that $\operatorname{load}_{C,\alpha}(T_{v},v)=\operatorname{load}_{C,\alpha}(u,v)=\operatorname{dist}_{\ell}(u,v)/\alpha$ for some $u\in V(T_{v})$ , which is indeed an element of the set $\Lambda(T_{v},v)$ . Consequently, the Pareto front $P(T_{v},v,s)$ consists of at most one set $C\in F(T_{v},s)$ with $\operatorname{wload}_{C,\alpha}(T_{v},v)=\operatorname{load}_{C,\alpha}(T_{v},v)$ and at most one set $C\in F(T_{v},s)$ with $\operatorname{wload}_{C,\alpha}(T_{v},v)<\operatorname{load}_{C,\alpha}(T_{v},v)$ for each number in $\Lambda(T_{v},v)$ . Using that $|\Lambda(T_{v},v)|\leq|V(T_{v})|\leq n$ it follows that $|P(T_{v},v,s)|\leq n+1$ . ∎

By Lemma 4.4 the load values of all points on the Pareto front with $\operatorname{wload}()<\operatorname{load}()$ are in the set $\Lambda(T_{v},v)$ . There might also be one point with $\operatorname{wload}()=\operatorname{load}()$ on the Pareto front (as in the example shown in Figure 6), and this load value might not be an element of $\Lambda(T_{v},v)$ . We extend the set $\Lambda(T_{v},v)$ accordingly by defining for $s\in\{0,1,\ldots,m\}$ (recall that $m=|E|$ )

[TABLE]

If the set $F(T_{v},s)$ is empty, we have $\lambda^{*}(T_{v},v,s)=\infty$ .

We now describe recursive relations for the weak load that are analogous to (10) and (11) for the load. It follows straightforwardly from (9) and (14) that for any vertex $v$ of $T$ and its children $u_{i}$ , $i=1,2,\ldots,c(v)$ , and for any set of edges $C\subseteq E(T_{u_{i}})$ we have

[TABLE]

Note that the weak load increases if the edge $\{v,u_{i}\}$ is added (see (17a)). On the other hand, if the edge $\{v,u_{i}\}$ is not added, it may decrease or increase (the right hand side of (17b) refers to the load, not to the weak load). Moreover, for any set of edges $C\subseteq T_{v,i}^{+}$ and any $i=1,2,\ldots,c(v)$ the definition (14) readily implies

[TABLE]

These rules together with the corresponding relations (10) and (11) allow us to compute the weak load and the load of all Pareto optimal partial solutions in a bottom-up fashion, similar to the approach taken in Section 4.1. Before it was sufficient to compute one optimal partial solution for every subtree $T_{v,i}$ and $T_{v,i}^{+}$ , $i\in\{1,2,\ldots,c(v)\}$ , and every possible size $s$ of the contracted set of edges, but now our dynamic program keeps track of the entire Pareto fronts $P(T_{v,i},v,s)$ and $P(T_{v,i}^{+},v,s)$ . We store the corresponding pairs of load and weak load values on the Pareto front in separate four-dimensional matrices $W$ , $W^{+}$ , $L$ and $L^{+}$ (the entries of $W$ and $W^{+}$ are certain weak load values, and the entries of $L$ and $L^{+}$ are the corresponding load values). We begin defining these matrices in an abstract way, and then establish several recursive relations which directly translate into a dynamic programming algorithm. Specifically, for $v\in V$ , $i\in\{0,1,\ldots,c(v)\}$ , $s\in\{0,1,\ldots,m\}$ and $\lambda\in\Lambda(T_{v,i},v)$ with $\Lambda(T_{v,i},v)$ as in Lemma 4.4 we define

[TABLE]

If there is no set $C$ satisfying these requirements, we have $W(v,i,s,\lambda)=L(v,i,s,\lambda)=\infty$ . The entries of $W^{+}(v,i,s,\lambda)$ and $L^{+}(v,i,s,\lambda)$ are defined analogously to (19) by considering the tree $T_{v,i}^{+}$ instead of $T_{v,i}$ (in particular, in this case we have $\lambda\in\Lambda(T_{v,i}^{+},v)$ ).

The definitions of $W(v,i,s,\lambda)$ and $L(v,i,s,\lambda)$ given in (19) extend straightforwardly to the value $\lambda=\lambda^{*}(T_{v,i},v,s)$ defined in (16a). Similarly, the definitions of $W^{+}(v,i,s,\lambda)$ and $L^{+}(v,i,s,\lambda)$ from before extend to the value $\lambda=\lambda^{*}(T_{v,i}^{+},v,s)$ . It is easy to see that we have in fact

[TABLE]

(an analogous relation holds for the entries of $L^{+}$ ).

The recursive relations satisfied by the matrices $W$ , $L$ , $W^{+}$ and $L^{+}$ defined before are captured by the following two lemmas. The initialization steps and the recursive computation of $W$ and $L$ are treated in Lemma 4.5. The recursive computation of $W^{+}$ and $L^{+}$ is somewhat more technical, and is treated separately in Lemma 4.6.

Lemma 4.5.

Let $v$ be a vertex of $T$ and let $u_{1},u_{2},\ldots,u_{c(v)}$ be the children of $v$ . Then the matrices $W$ , $W^{+}$ , $L$ and $L^{+}$ defined in and directly after (19) satisfy the relations

[TABLE]

Finally, we have

[TABLE]

where $\lambda\in\Lambda^{*}(T_{u_{i}},u_{i},s-1)$ is minimal such that $\rho:=W^{+}(u_{i},c(u_{i}),s-1,\lambda)+\ell(v,u_{i})/\alpha\leq\beta$ , if such a value $\lambda$ exists, and $\lambda:=\rho:=\infty$ otherwise, for all $i\in\{1,2,\ldots,c(v)\}$ , $s\in\{1,2,\ldots,m\}$ and $\lambda^{*}=\lambda^{*}(T_{v,i},v,s)$ .

Note that the relations (21a)–(21f) are the initialization steps, and the relations (21g)–(21j) capture the two possibilities of either adding or not adding the edge $\{v,u_{i}\}$ to a partial solution in the tree $T_{u_{i},c(u_{i})}^{+}=T_{u_{i}}$ to obtain a solution for the tree $T_{v,i}$ (recall (10) and (17)).

We only refer to well-defined entries of $W^{+}$ and $L^{+}$ in (21h) and in the definition of $\nu$ , as $\lambda-\ell(v,u_{i})/\alpha\in\Lambda(T_{u_{i}},u_{i})$ holds for every $\lambda\in\Lambda(T_{v,i},v)\setminus\{0\}$ . Note that we either have $\nu\leq\lambda$ or $\nu=\infty$ , while $\mu$ may also take a value in the open interval $(\lambda,\infty)$ .

Proof.

The relations (21a)–(21f) follow immediately from the definitions of the trees $T_{v,i}$ and $T_{v,i}^{+}$ and the definitions of the respective matrices given in (19) and afterwards. The relations (21g) and (21i) follow from (17) and the definitions of $W$ and $L$ , respectively: Consider a partial solution $C\in F(T_{v_{i}},s)$ . If $\operatorname{load}_{C,\alpha}(T_{v,i},v)=0$ , then $C$ does not contain the edge $\{v,u_{i}\}$ , so we have $W(v,i,s,0)=\mu$ . The other cases of (21g) as well as (21i) are implied by the following observation: If $\operatorname{wload}_{C,\alpha}(T_{v,i},v)\leq\lambda$ and $\{v,u_{i}\}\notin C$ , then by (17) we have $\mu\leq\beta$ and $\mu\leq\lambda$ .

The relation (21h) is closely related to (21g). If $\mu\neq\nu$ , then (21h) follows immediately from (21g) and the definitions of $W$ and $L$ . If $\mu=\nu\leq\min\{\beta,\lambda\}$ , then both a partial solution containing the edge $\{v,u_{i}\}$ as well as one missing this edge minimize the weak load. As the weak load is bounded from above by the load, we get $L(v,i,s,\lambda)=W(v,i,s,\lambda)=\mu$ in this case. This implies (21h). An analogous argument yields (21j). ∎

The following lemma describes the recursive relations satisfied by the entries of $W^{+}$ and $L^{+}$ . Specifically, the lemma describes how to distribute $s$ contraction edges in $T_{v,i}^{+}$ among the two subtrees $T_{v,i-1}^{+}$ and $T_{v,i}$ ( $t$ is the number of edges contracted in the first tree, and $s-t$ the number of edges in the second tree, respectively). To compute a single point on the Pareto front $P(T_{v,i}^{+},v,s)$ , we need to consider all points on the Pareto fronts $P(T_{v,i-1}^{+},v,t)$ and $P(T_{v,i},v,s-t)$ .

Lemma 4.6.

*Let $v$ be a vertex of $T$ , and let $s\in\{0,1,\dots,m\}$ and $i\in\{1,2,\dots,c(v)\}$ be fixed throughout this lemma. For $t\in\{0,1,\ldots,s\}$ we let $\Pi(t)$ denote the set of all pairs $(\lambda_{1},\lambda_{2})$ with $\lambda_{1}\in\Lambda^{*}(T_{v,i-1}^{+},v,t)$ and $\lambda_{2}\in\Lambda^{*}(T_{v,i},v,s-t)$ such that $W^{+}(v,i-1,t,\lambda_{1})+L(v,i,s-t,\lambda_{2})\leq\beta$ and $L^{+}(v,i-1,t,\lambda_{1})+W(v,i,s-t,\lambda_{2})\leq\beta$ . For $t\in\{0,1,\ldots,s\}$ and $\lambda\in\Lambda(T_{v,i}^{+},v)$ we let $\Pi(t,\lambda)\subseteq\Pi(t)$ denote the set of all pairs $(\lambda_{1},\lambda_{2})\in\Pi(t)$ satisfying $\max\{L^{+}(v,i-1,t,\lambda_{1}),L(v,i,s-t,\lambda_{2})\}\leq\lambda$ .

For all $\lambda\in\Lambda(T_{v,i}^{+},v)$ , defining*

[TABLE]

Proof.

The relation (22c) follows by combining the definitions (19a) and (22a) with the relations (11), (18) and the condition (15) from Lemma 4.3. The argument for (22d) is analogous, using the definitions (19b) and (22b) instead of (19a) and (22a).

The relation (22g) follows by combining the definitions (16a) and (22e) (recall also (20)) with the relations (11), (18) and the condition (15) from Lemma 4.3. The argument for (22h) is analogous, using the definitions (19a) and (22f) instead of (16a) and (22e). ∎

We can trivially compute the quantities $W(t)$ , $L(t)$ , $W^{*}(t)$ and $L^{*}(t)$ as defined in Lemma 4.6 in time $\mathcal{O}(n^{2})$ (using that $|\Pi(t)|=\mathcal{O}(n^{2})$ and $|\Pi(t,\lambda)|=\mathcal{O}(n^{2})$ by Lemma 4.4). The following lemma shows how to do the same computation in time $\mathcal{O}(n)$ , so that the entries $W^{+}(v,i,s,\lambda)$ and $L^{+}(v,i,s,\lambda)$ can be computed via (22c), (22d), (22g) and (22h) in time $\mathcal{O}(n^{2})$ (instead of the trivial bound $\mathcal{O}(n^{3})$ ).

Lemma 4.7.

If the numbers in the sets $\Lambda^{*}(T_{v,i-1}^{+},v,t)$ and $\Lambda^{*}(T_{v,i},v,s-t)$ are sorted increasingly, the quantities $W(t)$ , $L(t)$ , $W^{*}(t)$ and $L^{*}(t)$ defined in Lemma 4.6 can be computed in time $\mathcal{O}(n)$ . Consequently, $W^{+}(v,i,s,\lambda)$ and $L^{+}(v,i,s,\lambda)$ can be computed for all $s\in\{0,1,\ldots,m\}$ and all $\lambda\in\Lambda^{*}(T_{v,i}^{+},v,s)$ in time $\mathcal{O}(n^{2})$ .

Proof.

We define the sequence $P_{1}$ of all pairs of finite numbers $(L^{+}(v,i-1,t,\lambda),W^{+}(v,i-1,t,\lambda))$ for all $\lambda\in\Lambda^{*}(T_{v,i-1}^{+},v,t)$ in increasing order of $\lambda$ -values. Similarly, we define the sequence $P_{2}$ of all pairs of finite numbers $(L(v,i,s-t,\lambda),W(v,i,s-t,\lambda))$ for all $\lambda\in\Lambda^{*}(T_{v,i},v,s-t)$ in increasing order of $\lambda$ -values. By Lemma 4.4 each of these lists has size $\mathcal{O}(n)$ . Note that these sequences correspond to the Pareto fronts $P(T_{v,i-1}^{+},v,t)$ and $P(T_{v,i},v,s-t)$ , respectively. Some pairs of points may appear multiple times consecutively in $P_{1}$ and $P_{2}$ , and in a preprocessing step we eliminate these duplicates in time $\mathcal{O}(n)$ . We know that after this preprocessing step, the first entries in the simplified lists $P_{1}$ and $P_{2}$ are strictly increasing, and the second entries are strictly decreasing (recall Figure 6).

We first argue how to compute $W(t)$ and $L(t)$ . We begin discarding all pairs from each list whose first entry ( $L^{+}$ or $L$ , respectively) is strictly greater than $\lambda$ in time $\mathcal{O}(n)$ . We then process the remaining lists $P_{1}$ and $P_{2}$ beginning at the last entries $(L_{j}^{+},W_{j}^{+})$ and $(L_{k},W_{k})$ (with smallest $W^{+}$ or $W$ -values, respectively) in two phases.

In the first phase we compute $W(t)$ as follows: If $L_{j}^{+}+W_{k}>\beta$ , we discard the last element of $P_{1}$ by decreasing $j$ by 1 (by our sorting of the lists we know that $L_{j}^{+}+W_{k^{\prime}}>\beta$ for all $k^{\prime}\leq k$ ). If $W_{j}^{+}+L_{k}>\beta$ , we discard the last element of $P_{2}$ by decreasing $k$ by 1 (by our sorting of the lists we know that $W_{j^{\prime}}^{+}+L_{k}>\beta$ for all $j^{\prime}\leq j$ ). Once $L_{j}^{+}+W_{k}\leq\beta$ and $W_{j}^{+}+L_{k}\leq\beta$ for the first time, we have found $W(t)=\max\{W_{j}^{+},W_{k}\}$ . If this never happens we know that $W(t)=\infty$ . This computation is correct by the definition of $\Pi(t,\lambda)$ in Lemma 4.6 and by (22a), and it takes time $\mathcal{O}(n)$ .

In the second phase we compute $L(t)$ as follows: If $W(t)=\infty$ , we know that $L(t)=\infty$ , too. Otherwise we distinguish two cases: If $W_{j}^{+}\geq W_{k}$ , we decrease $k$ further as long as both inequalities $W_{j}^{+}\geq W_{k}$ and $L_{j}^{+}+W_{k}\leq\beta$ are still satisfied (so that they still hold for the final $k$ ). If $W_{j}^{+}\leq W_{k}$ , we decrease $j$ further as long as both inequalities $W_{j}^{+}\leq W_{k}$ and $W_{j}^{+}+L_{k}\leq\beta$ are still satisfied (so that they still hold for the final $j$ ). In the end we set $L(t)=\max\{L_{j}^{+},L_{k}\}$ . Note that in the first case, the third constraint $W_{j}^{+}+L_{k}\leq\beta$ remains valid by the monotonicity $L_{k^{\prime}}\leq L_{k}$ for all $k^{\prime}\leq k$ , and in the second case, the third constraint $L_{j}^{+}+W_{k}\leq\beta$ remains valid by the monotonicity $L_{j^{\prime}}^{+}\leq L_{j}^{+}$ for all $j^{\prime}\leq j$ . Therefore, the correctness of the computation of $L(t)$ follows from (22b).

The procedure to compute $W^{*}(t)$ and $L^{*}(t)$ processes $P_{1}$ and $P_{2}$ (as obtained from the preprocessing step explained in the beginning) starting at the first entries $(L_{j}^{+},W_{j}^{+})$ , $j=1$ , and $(L_{k},W_{k})$ , $k=1$ , in two phases very similarly to before. We omit the details here. ∎

We are now ready to prove Theorem 6.

Proof of Theorem 6.

Given the instance $(T,\ell,\varphi)$ , we fix an arbitrary root $r$ of $T$ and an arbitrary ordering of the children of each vertex, making $T$ an ordered rooted tree.

We begin precomputing and sorting all of the sets $\Lambda(T_{v,i},v)$ and $\Lambda(T_{v,i}^{+},v)$ , $v\in V$ , $i\in\{0,1,\ldots,c(v)\}$ , and we maintain them as sorted lists throughout the algorithm. This takes time $\mathcal{O}(n^{2}\log n)$ in total (recall Lemma 4.4).

We then compute the entries of the matrices $W$ , $L$ , $W^{+}$ and $L^{+}$ using Lemmas 4.5 and 4.7. We first initialize various entries using (21a)–(21f), and compute the remaining entries in a bottom-up fashion moving upwards from the leaves to the root. Specifically, at a vertex $v$ with children $u_{1},u_{2},\ldots,u_{c(v)}$ for which all the entries of $W$ , $L$ , $W^{+}$ and $L^{+}$ have already been computed, we first compute $W(v,i,s,\lambda)$ and then $L(v,i,s,\lambda)$ for all $i\in\{1,2,\ldots,c(v)\}$ , $s\in\{1,2,\ldots,m\}$ and $\lambda\in\Lambda(T_{v,i},v)$ using (21g) and (21h), then we compute $L(v,i,s,\lambda^{*}(T_{v,i},v,s))=\lambda^{*}(T_{v,i},v,s)$ and $W(v,i,s,\lambda^{*}(T_{v,i},v,s))$ for all $i\in\{1,2,\ldots,c(v)\}$ and $s\in\{1,2,\ldots,m\}$ using (21i) and (21j). We obtain sorted lists containing the numbers in $\Lambda^{*}(T_{v,i},v,s)$ by inserting $\lambda^{*}(T_{v,i},v,s)$ at the correct position into the precomputed list $\Lambda(T_{v,i},v)$ . Next, we compute $W^{+}(v,i,s,\lambda)$ and then $L^{+}(v,i,s,\lambda)$ for all $i\in\{1,2,\ldots,c(v)\}$ , $s\in\{1,2,\ldots,m\}$ and $\lambda\in\Lambda(T_{v,i}^{+},v)$ using (22c) and (22d), and then we compute $L^{+}(v,i,s,\lambda^{*}(T_{v,i}^{+},v,s))=\lambda^{*}(T_{v,i}^{+},v,s)$ and $W^{+}(v,i,s,\lambda^{*}(T_{v,i}^{+},v,s))$ for all $i\in\{1,2,\ldots,c(v)\}$ and $s\in\{1,2,\ldots,m\}$ using (22g) and (22h). We obtain sorted lists containing the numbers in $\Lambda^{*}(T_{v,i}^{+},v,s)$ by inserting $\lambda^{*}(T_{v,i}^{+},v,s)$ at the correct position into the precomputed list $\Lambda(T_{v,i}^{+},v)$ .

Let $s^{*}$ be the largest $s$ such that $W^{+}(r,c(r),s,\lambda^{*}(T,r,s))$ is finite. From (19) we obtain that $s^{*}$ is the size of an optimal solution of the instance $(T,\ell,\varphi)$ . The corresponding set of edges $C\subseteq E$ can be obtained by keeping track of the arguments for which the minima and maxima in (21g)–(21j) and (22a)–(22h) are attained in each step.

Each of the matrices $W$ , $L$ , $W^{+}$ and $L^{+}$ has $\mathcal{O}(n^{3})$ entries (recall Lemma 4.4). Computing an entry of $W$ or $L$ takes $\mathcal{O}(n)$ time by Lemma 4.5, while computing an entry of $W^{+}$ or $L^{+}$ can be achieved in time $\mathcal{O}(n^{2})$ by Lemma 4.7, so the runnning time of our dynamic program is $\mathcal{O}(n^{5})$ . ∎

5. Hardness for additive tolerance functions

In this section we prove that the problems Contraction and Weak Contraction for the tolerance function $\varphi(x)=x-\beta$ (purely additive error) are hard already on cycles (Section 5.1 below). We then prove that Contraction with the same tolerance function is hard to approximate for general graphs and for bipartite graphs (Section 5.2).

5.1. Hardness of Contraction and Weak Contraction

Recall that we can compute optimal (weak) $(\alpha,\beta)$ -contractions in polynomial time on trees (this was shown in Section 4.1), and have a linear time algorithm for Contraction on cycles with unit length edges (this was shown in Section 3.2). We now show that the problem with $\alpha=1$ is NP-hard on cycles with arbitrary edge lengths.

Theorem 7.

For any fixed $\beta>0$ , the problems Contraction and Weak Contraction with tolerance function $\varphi(x)=x-\beta$ , $\beta\geq 0$ , are NP-hard on cycles.

Theorem 7 (where $\beta$ is not part of the input) follows immediately from Theorem 8 below (where $\beta$ is part of the input). The reason is that an instance with $\alpha=1$ does not change when multiplying all edge lengths and $\beta$ by some constant.

Theorem 8.

The problems Contraction and Weak Contraction with tolerance function $\varphi(x)=x-\beta$ , $\beta\geq 0$ , are NP-hard on cycles.

The rest of this section is devoted to proving Theorem 8.

For our proof we will use the following variant of the well-known problem Partition, referred to as Close-to-1 Partition. To state the problem we say that a set of positive rational numbers $\{a_{1},a_{2},\ldots,a_{n}\}$ is close to 1, if $\sum_{i=1}^{n}a_{i}=n$ and $\varepsilon:=\sum_{i=1}^{n}|a_{i}-1|<1/5$ .

[TABLE]

Note that for a ‘Yes’-instance of this problem, the solution $I\subseteq[n]$ must have size $n/2$ , so $|I|=|[n]\setminus I|=\sum_{i\in I}a_{i}=\sum_{i\in[n]\setminus I}a_{i}=n/2$ . In particular, this implies that $n$ is even.

In the classical problem Partition, the input set is not constrained to be close to 1. Partition was shown to be NP-complete already in Karp’s seminal paper [Kar72]. The fact that Close-to-1 Partition is also NP-complete follows from a straightforward rescaling argument.

Lemma 5.1.

Close-to-1 Partition* is NP-complete.*

Proof.

Given an instance $\{a_{1},a_{2},\ldots,a_{n}\}$ of Partition, we first add $n$ additional zeroes $a_{n+1}=a_{n+2}=\cdots=a_{2n}=0$ to the instance (by this we ensure that a partition with equal sums is transformed into one where both partition classes have the same number $n$ of summands). We then linearly transform all the $a_{i}$ according to $a_{i}^{\prime}:=(a_{i}+C)/D$ , where $C$ and $D$ are sufficiently large constants so that the transformed values $a_{i}^{\prime}$ are close to 1. The transformed set of numbers has even cardinality $2n$ , is close to 1, and it admits a partition into two sets of size $n$ with equal sum if and only if the original instance allows a partition into two sets with equal sum. ∎

Proof of Theorem 8.

We first focus on the problem Contraction. We reduce Close-to-1 Partition, which is NP-complete by Lemma 5.1, to the problem Contraction on a cycle with tolerance function $\varphi(x)=x-\beta$ , $\beta\geq 0$ .

Let $\mathcal{I}=\{a_{1},a_{2},\ldots,a_{n}\}$ be an instance of Close-to-1 Partition such that $a_{1}\geq a_{2}\geq\cdots\geq a_{n}$ . This ensures that all $a_{i}$ that are bigger than 1 appear before all $a_{i}$ that are smaller than 1, which is the only property of the ordering that we exploit in the proof later on. The instance of Contraction we construct is on the cycle $C_{2n+4}$ with $2n+4$ edges. We label the vertices of the cycle by walking around the cycle as follows: The first $n+1$ vertices are labelled $u_{0},u_{1},\ldots,u_{n}$ , then there are two special vertices $v_{1}$ , $v_{2}$ , and the remaining $n+1$ vertices are labelled $w_{0},w_{1},\ldots,w_{n}$ , see Figure 7. We denote the subpath $(u_{0},\ldots,u_{n})$ as $P_{u}$ , and the subpath $(w_{0},\ldots,w_{n})$ by $P_{w}$ .

We now define $\varepsilon:=\sum_{i=1}^{n}|1-a_{i}|<1/5$ , $\beta:=n/2+2\varepsilon$ and $\beta^{\prime}:=\beta+1>\beta$ , and the length function $\ell$ on the cycle edges by setting $\ell(u_{i-1},u_{i}):=a_{i}$ and $\ell(w_{i-1},w_{i})=2-a_{i}$ for all $i\in[n]$ , and by $\ell(u_{n},v_{1})=\ell(v_{2},w_{1}):=\varepsilon$ , $\ell(v_{1},v_{2}):=\beta^{\prime}$ , and $\ell(w_{n},u_{0}):=\beta^{\prime}+2\varepsilon$ (see Figure 7).

Now consider the instance $\mathcal{J}:=(C_{2n+4},\ell,\varphi)$ with $\varphi(x)=x-\beta$ of the problem Contraction. Observe that no $\varphi$ -contraction may contain an edge $\{u,v\}$ of length greater than $\beta$ (in particular, no feasible solution may contain one of the edges of length $\beta^{\prime}$ or $\beta^{\prime}+2\varepsilon$ ). Furthermore any (weak) $\varphi$ -contraction $C$ on this graph satisfies $\Phi(C)=|C|$ .

We will show that $\mathcal{J}$ has an optimal solution of cardinality (and thus of value) $n+2$ if and only if $\mathcal{I}$ is a ‘Yes’-instance. In particular, we will see that any feasible solution of $\mathcal{J}$ of size $n+2$ contains the two edges of length $\varepsilon$ and exactly $n/2$ edges with length $a_{i}$ , $i\in I$ , from $P_{u}$ and the corresponding edges with length $2-a_{i}$ , $i\in I$ , from $P_{w}$ . Such solutions correspond to subsets of $[n]$ in the following natural way: For any subset $I\subseteq[n]$ of size $n/2$ we let $C(I)$ be the subset of edges of the cycle $C_{2n+4}$ consisting of the two edges of length $\varepsilon$ and of all edges $\{u_{i-1},u_{i}\}$ and $\{w_{i-1},w_{i}\}$ (of length $a_{i}$ or $2-a_{i}$ , respectively) for all $i\in I$ . Thus we will show that $C(I)$ is an optimal solution of the instance $\mathcal{J}$ of Contraction if and only if $\sum_{i\in I}a_{i}=\sum_{i\in[n]\setminus I}a_{i}=n/2$ , i.e., $\mathcal{I}$ is a ‘Yes’-instance of Close-to-1 Partition.

Both directions of this equivalence are captured and proved as Claim 2 and 4 below. Claims 1 and 3 are auxiliary statements used in the proofs of these two main claims.

For any path $P$ on the cycle we let $\ell(P)$ denote the sum of $\ell(e)$ over all edges $e$ of $P$ . For all $i\in[n]$ we denote by $P_{i}^{\sqsupset}$ and $P_{i}^{\sqsubset}$ the path on the cycle between the vertices $u_{i}$ and $w_{i}$ that contains and that does not contain the edge $\{v_{1},v_{2}\}$ , respectively (in Figure 7, these are the right and left segment of the cycle).

Claim 1: For all $i\in[n]$ , the number $\ell(P_{i}^{\sqsupset})$ lies in the interval $[n+\beta^{\prime}+\varepsilon,n+\beta^{\prime}+2\varepsilon]$ and the number $\ell(P_{i}^{\sqsubset})$ lies in the interval $[n+\beta^{\prime}+2\varepsilon,n+\beta^{\prime}+3\varepsilon]$ . In particular, we have $\operatorname{dist}_{\ell}(u_{i},w_{i})=\min\{\ell(P_{i}^{\sqsupset}),\ell(P_{i}^{\sqsubset})\}=\ell(P_{i}^{\sqsupset})$ and the difference $\ell(P_{i}^{\sqsubset})-\ell(P_{i}^{\sqsupset})$ lies in the interval $[0,2\varepsilon]$ .

Proof of Claim 1: Note that the condition $\sum_{i=1}^{n}a_{i}=n$ implies that

[TABLE]

By our assumption $a_{1}\geq a_{2}\geq\cdots\geq a_{n}$ , the numbers $\ell(P_{i}^{\sqsupset})$ form a unimodal sequence for $i=0,1,\ldots,n$ that is maximized for $i=0$ and $i=n$ , proving that $\ell(P_{i}^{\sqsupset})\leq n+\beta^{\prime}+2\varepsilon$ (note that $\ell(P_{u})=\ell(P_{w})=n$ ). By (23) the minimum of this unimodal sequence is at most $\varepsilon$ smaller than the maximum. This proves the first part of the claim. As $\ell(P_{i}^{\sqsupset})+\ell(P_{i}^{\sqsubset})=2(n+\beta^{\prime}+2\varepsilon)$ , we obtain the second part of the claim. The last part of the claim is an immediate consequence of the first two. $\square$

Claim 2: If $I\subseteq[n]$ is a solution of the instance $\mathcal{I}$ of Close-to-1 Partition such that $\sum_{i\in I}a_{i}=\sum_{i\in[n]\setminus I}a_{i}=n/2$ , then $C(I)$ is a $(1,\beta)$ -contraction.

Proof of Claim 2: It suffices to prove that there is no pair of vertices whose distance decreases by more than $\beta$ when contracting the edges in $C(I)$ .

We start by verifying this for the pairs $u_{i},w_{i}$ for $i\in[n]$ . We first consider the path $P_{i}^{\sqsupset}$ between $u_{i}$ and $w_{i}$ . Observe that $\sum_{e\in C(I)\cap P_{i}^{\sqsupset}}\ell(e)$ lies in the interval $[n/2+\varepsilon,n/2+2\varepsilon]=[\beta-\varepsilon,\beta]$ . Similarly to before, this follows from the observation that by the assumption $a_{1}\geq a_{2}\geq\cdots\geq a_{n}$ those sums form a unimodal sequence for $i=0,1,\ldots,n$ that is maximized for $i=0$ and $i=n$ , and by using (23) (recall also that $|I|=n/2$ ). Consequently, we have

[TABLE]

Since $\sum_{e\in C(I)}\ell(e)=n+2\varepsilon=2\beta-2\varepsilon$ , we obtain that $\sum_{e\in C(I)\cap P_{i}^{\sqsubset}}\ell(e)$ lies in the interval $[\beta-2\varepsilon,\beta-\varepsilon]$ , yielding

[TABLE]

Combining (24) and (25) proves that

[TABLE]

Now consider two vertices $u_{i}$ and $w_{j}$ , $j<i$ (the case $j>i$ can be treated analogously). Let $P_{i,j}^{\sqsupset}$ and $P_{i,j}^{\sqsubset}$ be the path on the cycle between the vertices $u_{i}$ and $w_{j}$ that contains and that does not contain the edge $\{v_{1},v_{2}\}$ , respectively. Using that $P_{i,j}^{\sqsupset}\subseteq P_{i}^{\sqsupset}$ we obtain

[TABLE]

from (24).

We know that $a_{i}\leq 1+1/5\leq 8/5$ and consequently

[TABLE]

by the assumption that the input $\{a_{1},a_{2},\ldots,a_{n}\}$ of the instance $\mathcal{I}$ is close to 1 (there is plenty of leeway in all those inequalities). Furthermore, we have

[TABLE]

where the second-to-last inequality follows from Claim 1.

Combining those observations yields

[TABLE]

Combining (27) and (30) proves that

[TABLE]

From (27) and (30) we can derive analogous relations for the remaining cases where we need to consider the distance between a vertex $u_{i}$ , $i\in[n]$ , and a vertex $w\in\{v_{1},v_{2},u_{0},u_{1},\allowbreak\ldots,u_{i-1},u_{i+1},\ldots,u_{n}\}$ , between a vertex $w_{i}$ , $i\in[n]$ , and a vertex $u\in\{v_{1},v_{2},w_{0},w_{1},\ldots,\allowbreak w_{i-1},w_{i+1},\ldots,w_{n}\}$ , and between the vertices $v_{1}$ and $v_{2}$ . This completes the proof of Claim 2. $\square$

Claim 3: Every $(1,\beta)$ -contraction $C$ contains at most $n/2$ edges in $(P_{u}\cup P_{w})\cap P_{i}^{\sqsupset}$ for all $i\in[n]$ and at most $n/2$ edges in $(P_{u}\cup P_{w})\cap P_{i}^{\sqsubset}$ for all $i\in[n]$ .

Proof of Claim 3: Note that for any $I\subseteq[n]$ and $k\in\{0,1,\ldots,n\}$ we have $\sum_{i\in I:i>k}a_{i}+\sum_{i\in I:i\leq k}(2-a_{i})\geq|I|-\varepsilon$ by the definition of $\varepsilon$ . Consequently, assuming for the sake of contradiction that $C$ contains strictly more than $n/2$ edges in $(P_{u}\cup P_{w})\cap P_{i}^{\sqsupset}$ , we have $\ell_{C}(P_{i}^{\sqsupset})-\ell(P_{i}^{\sqsupset})\geq n/2+1-\varepsilon$ . Similarly, assuming that $C$ contains strictly more than $n/2$ edges in $(P_{u}\cup P_{w})\cap P_{i}^{\sqsubset}$ yields $\ell_{C}(P_{i}^{\sqsubset})-\ell(P_{i}^{\sqsubset})\geq n/2+1-\varepsilon$ . By Claim 1 the difference $\ell(P_{i}^{\sqsubset})-\ell(P_{i}^{\sqsupset})$ lies in the interval $[0,2\varepsilon]$ , so in both cases we obtain

[TABLE]

where we used that $\varepsilon<1/5$ in the second-to-last step. This contradicts the fact that $C$ is a $(1,\beta)$ -contraction, proving Claim 3. $\square$

Claim 4: Let $C$ be a feasible solution of the instance $\mathcal{J}$ of Contraction. Then we have $|C|\leq n+2$ , and if $|C|=n+2$ , we have $C=C(I)$ for some set $I\subseteq[n]$ with $\sum_{i\in I}a_{i}=\sum_{i\in[n]\setminus I}a_{i}=n/2$ .

Proof of Claim 4: As $C$ does not contain any of the edges of length $\beta^{\prime}$ or $\beta^{\prime}+2\varepsilon$ , we have $|C|\leq n+2$ by Claim 3 (the +2 comes from the two edges of length $\varepsilon$ that may be contained in $C$ ). Suppose now that $|C|=n+2$ . Applying Claim 3 again shows that $C$ must contain both edges of length $\varepsilon$ , and that it contains the edge $\{u_{i-1},u_{i}\}$ if and only if it contains the edge $\{w_{i-1},w_{i}\}$ , for all $i\in[n]$ . Defining $I:=\{i\in[n]:\{u_{i-1},u_{i}\}\in C\}$ we have $|I|=n/2$ and $C=C(I)$ .

By Claim 1 we have $\operatorname{dist}_{\ell}(u_{0},w_{0})=\ell(P_{0}^{\sqsupset})$ and $\operatorname{dist}_{\ell}(u_{n},w_{n})=\ell(P_{n}^{\sqsupset})$ . As $C$ is a $(1,\beta)$ -contraction containing the two edges of length $\varepsilon$ we thus obtain $\sum_{i\in I}a_{i}=\sum_{e\in C\cap P_{u}}\ell(e)\leq\beta-2\varepsilon=n/2$ . Similarly, we have $\sum_{i\in[n]\setminus I}a_{i}=\sum_{i\in I}(2-a_{i})\allowbreak=\sum_{e\in C\cap P_{w}}\ell(e)\allowbreak\leq\beta-2\varepsilon=n/2$ . As $\sum_{i\in[n]}a_{i}=n$ , these two inequalities must be tight, yielding $\sum_{i\in I}a_{i}=\sum_{i\in[n]\setminus I}a_{i}=n/2$ . $\square$

Combining Claims 2 and 4 proves the statement of the theorem for the problem Contraction.

We now focus on the problem Weak Contraction. The hardness result follows immediately from the following claim.

Claim 5: For $n\geq 5$ , any feasible weak $(1,\beta)$ -contraction $C$ on the instance $\mathcal{J}$ is also a feasible $(1,\beta)$ -contraction.

Proof of Claim 5: Suppose for the sake of contradiction that $C$ is not a feasible $(1,\beta)$ -contraction. This means there are vertices $a,b$ such that $\operatorname{dist}_{\ell_{C}}(a,b)=0$ and $\operatorname{dist}_{\ell}(a,b)>\beta$ , i.e., $a$ and $b$ lie on a (maximal) subpath $Q$ formed by edges from $C$ on the cycle. Let $u$ be one end vertex of $Q$ , and let $x$ be the neighbour of $u$ not on $Q$ . Let $v$ be the last vertex on $Q$ when traversed starting at $u$ , such that the length of the $x$ - $v$ -path $P$ containing $u$ is at most $\beta+\ell(x,u)$ , and let $y$ be the next vertex on $Q$ when traversed starting at $u$ . Such a vertex $y$ exists as $\ell(Q)>\beta$ , and the $x$ - $y$ -path $P^{\prime}$ containing $u$ has length strictly greater than $\beta+\ell(x,u)$ .

We have $\operatorname{dist}_{\ell_{C}}(x,y)>0$ , as $C$ does not contract the entire cycle. By (1), we have $\operatorname{dist}_{\ell_{C}}(x,y)\geq\operatorname{dist}_{\ell}(x,y)-\beta$ . As $\operatorname{dist}_{\ell_{C}}(x,y)\leq\ell(x,u)$ , we get $\operatorname{dist}_{\ell}(x,y)\leq\beta+\ell(x,u)$ . As we saw before, the $x$ - $y$ -path $P^{\prime}$ has length strictly greater than $\beta+\ell(x,u)$ , thus the $x$ - $y$ -path $P^{\prime\prime}$ not containing $u$ must have length at most $\beta+\ell(x,u)$ . As the entire cycle has length $2n+2\beta^{\prime}+4\varepsilon=3n+2+8\varepsilon$ and can be partitioned into $P,P^{\prime\prime}$ and the edge $\{v,y\}$ , we get

[TABLE]

where the second inequality holds as the two longest edges of the cycle have length $\beta^{\prime}+2\varepsilon=\beta+1+2\varepsilon$ and $\beta^{\prime}=\beta+1$ , respectively. From this chain of inequalities we obtain $n\leq 2+12\varepsilon<4+2/5$ , contradicting the assumption $n\geq 5$ . ∎

The reader might be tempted to ‘simplify’ the previous reduction proof by omitting the four special edges of length $\varepsilon$ , $\beta^{\prime}$ and $\beta^{\prime}+2\varepsilon$ and by setting $\beta:=n/2$ instead. However, this would invalidate Claim 2 (specifically, the estimate (25) would not always hold).

5.2. Inapproximability of Contraction

We are able to extend the before-mentioned hardness result for Contraction as follows:

Theorem 9.

For any fixed $\beta>0$ and $\varepsilon>0$ , it is NP-hard to approximate the problem Contraction with tolerance function $\varphi(x)=x-\beta$ , $\beta\geq 0$ , to within a factor of $n^{1-\varepsilon}$ .

For the following theorem the additive error is fixed to $\beta=1$ .

Theorem 10.

For any $\varepsilon>0$ , it is NP-hard to approximate the problem Contraction with tolerance function $\varphi(x)=x-1$ on bipartite graphs with unit length edges $\ell=1$ to within a factor of $m^{1/2-\varepsilon}$ .

Our reductions are based on the inapproximability of the well-known Clique problem. Recall that a clique in a graph $G$ is a complete subgraph of $G$ .

[TABLE]

It was shown in [Zuc07] that for any $\varepsilon>0$ , it is NP-hard to approximate Clique to within a factor of $n^{1-\varepsilon}$ .

The following lemma will be used in our proofs. It shows that for $(1,\beta)$ -contractions the feasibility condition (1) needs not be checked for all pairs of vertices $u$ and $v$ , but only for those satisfying certain extra conditions.

Lemma 5.2.

A set of edges $C\subseteq E$ is a $(1,\beta)$ -contraction if and only if all pairs of vertices $u,v\in V$ with the property that every shortest path with respect to $\ell_{C}$ between $u$ and $v$ starts and ends with an edge from $C$ satisfy condition (1).

Proof.

Suppose for the sake of contradiction that all pairs of vertices $u,v\in V$ as in the lemma satisfy condition (1) and that $C$ is not a $(1,\beta)$ -contraction. Then there is a pair of vertices $u,v\in V$ violating (1) and a shortest path $P$ with respect to $\ell_{C}$ between $u$ and $v$ that does not start or end with an edge from $C$ . We choose $u$ and $v$ such that $\operatorname{dist}_{\ell_{C}}(u,v)$ is minimal, and we may assume that the first edge $\{u,w\}$ of $P$ is not contained in $C$ , so $\operatorname{dist}_{\ell}(u,v)-\operatorname{dist}_{\ell_{C}}(u,v)=\operatorname{dist}_{\ell}(w,v)-\operatorname{dist}_{\ell_{C}}(w,v)$ . By our choice of $u$ and $v$ , the vertices $w$ and $v$ satisfy (1), i.e., the right-hand side of this equation is bounded by $\beta$ , a contradiction. ∎

Proof of Theorem 9.

Let $\beta,\varepsilon>0$ be fixed and let $G=(V,E)$ be an instance of Clique.

We define a graph $H=H(G)$ as follows, see Figure 8: The vertex set of $H$ is given by $(V\times\{1,2\})\cup\{s\}$ , i.e., we create two copies of each original vertex and add a special vertex $s$ . The edge set of $H$ is given by $\{\{(u,1),(v,1)\}:\{u,v\}\in E\}$ plus the edges $\{(v,1),(v,2)\}$ and $\{s,(v,2)\}$ for all $v\in V$ . The first set of edges are simply the original edges of $G$ on the first copies of the vertices, the second set is a perfect matching between the two copies of the vertex set, and the third set of edges connects the special vertex $s$ to all vertices of the second copy of the vertex set. The length function $\ell$ on the edges of $H$ is set to $2\beta+2$ , $\beta$ or $\beta+1$ for those three sets of edges, respectively.

Now consider the instance $\mathcal{I}:=(H,\ell,\varphi)$ of the problem Contraction with the tolerance function $\varphi(x)=x-\beta$ . Clearly, any $(1,\beta)$ -contraction $C$ in $H$ can contain only edges of the form $\{(u,1),(u,2)\}$ for some $u\in V$ . As $H$ does not contain two edges between two different connected components of $(V,C)$ , our objective function defined in (2) satisfies $\Phi(C)=|C|$ for any feasible solution $C$ of $\mathcal{I}$ . We will show that it allows a feasible solution with $k$ edges (and thus of value $k$ ) if and only if $G$ has a clique with $k$ vertices. Formally, for $U\subseteq V$ we define $C(U):=\{\{(u,1),(u,2)\}:u\in U\}$ (see Figure 8). We proceed to show that $U$ induces a clique in $G$ if and only if $C(U)$ is a $(1,\beta)$ -contraction in $H=H(G)$ .

Note that for any two vertices $u,v\in U$ we have

[TABLE]

These relations together with Lemma 5.2 show that $C(U)$ is a $(1,\beta)$ -contraction in $H$ if and only if $U$ is a clique in $G$ .

As $n(H)$ differs from $n(G)$ only by a constant factor, an $n^{1-\varepsilon}$ -approximation algorithm for Contraction would yield an $n^{1-\varepsilon^{\prime}}$ -approximation algorithm for Clique via this reduction. Together with the before-mentioned inapproximability of Clique [Zuc07] this proves the theorem. ∎

The rest of this section is devoted to proving Theorem 10, so we now focus on $(1,1)$ -contractions in bipartite graphs with unit length edges $\ell=1$ . The next lemma characterizes the structure of contractions in this setting.

Lemma 5.3.

Let $G=(V,E)$ be a bipartite graph with unit edge lengths $\ell=1$ and let $C\subseteq E$ be a set of edges.

(i)

If $C$ is a $(1,1)$ -contraction, then $C$ is a matching. 2. (ii)

If $C=\{e,f\}$ with edges $e=\{u_{1},u_{2}\},f=\{v_{1},v_{2}\}\in E$ , then $C$ is a $(1,1)$ -contraction if and only if $\operatorname{dist}_{\ell}(u_{1},v_{1})=\operatorname{dist}_{\ell}(u_{2},v_{2})$ and $\operatorname{dist}_{\ell}(u_{1},v_{2})=\operatorname{dist}_{\ell}(u_{2},v_{1})$ . 3. (iii)

$C$ * is a $(1,1)$ -contraction if and only if all two-element subsets of $C$ are.*

Proof.

(i)

Suppose for the sake of contradiction that $C$ contains a path $(u,v,w)$ on two edges. As $G$ is bipartite, it has no triangles, so $\operatorname{dist}_{\ell}(u,w)=2$ and $\operatorname{dist}_{\ell_{C}}(u,w)=0$ , a contradiction to the assumption that $C$ is a $(1,1)$ -contraction. 2. (ii)

For the edges $e=\{u_{1},u_{2}\}$ and $f=\{v_{1},v_{2}\}$ we define $d_{i,j}:=\operatorname{dist}_{\ell}(u_{i},v_{j})$ for $i,j\in\{1,2\}$ .

Let $C=\{e,f\}$ be a $(1,1)$ -contraction. Both $d_{1,1}$ and $d_{2,2}$ must have the same parity (as $G$ is bipartite), so if $d_{1,1}<d_{2,2}$ , the difference between them is exactly 2. However, this would mean that $\operatorname{dist}_{\ell_{C}}(u_{2},v_{2})=d_{1,1}=d_{2,2}-2=\operatorname{dist}_{\ell}(u_{2},v_{2})-2$ , a contradiction to the assumption that $C$ is a $(1,1)$ -contraction. Repeating the same argument with $d_{1,1}$ and $d_{2,2}$ interchanged shows that $d_{1,1}=d_{2,2}$ . An analogous argument shows that $d_{1,2}=d_{2,1}$ .

Now suppose that $d_{1,1}=d_{2,2}$ and $d_{1,2}=d_{2,1}$ . From these conditions it follows that for all $i,j\in\{1,2\}$ every path between $u_{i}$ and $v_{j}$ that contains both edges $e$ and $f$ has length at least $d_{i,j}+2$ with respect to $\ell$ . Consequently, we have $\operatorname{dist}_{\ell_{C}}(u_{i},v_{j})\geq\operatorname{dist}_{\ell}(u_{i},v_{j})-1$ for $C=\{e,f\}$ . By Lemma 5.2, $C$ is a $(1,1)$ -contraction. 3. (iii)

One direction of the equivalence is obvious, so we only need to prove the other direction. So we assume that all two-element subsets of $C$ are $(1,1)$ -contractions, and we need to prove that $C$ is a $(1,1)$ -contraction. The argument is a straightforward generalization of the argument for (ii) from before. Let $P$ be a path that contains exactly $k$ edges from $C$ , and that starts and ends with an edge from $C$ . Let $e_{1},e_{2},\ldots,e_{k}$ be those edges and $u_{1,1},u_{1,2},u_{2,1},u_{2,2},\ldots,u_{k,1},u_{k,2}$ their end vertices as they are encountered when traversing $P$ (so $u_{1,1}$ and $u_{k,2}$ are the end vertices of $P$ ). For all $i=1,2,\ldots,\lfloor k/2\rfloor$ the pair of edges $e_{2i-1}$ and $e_{2i}$ and their end vertices satisfy the distance conditions from (ii). From these conditions it follows that the subpath of $P$ between $u_{2i-1,1}$ and $u_{2i,2}$ has length at least $\operatorname{dist}_{\ell}(u_{2i-1,1},u_{2i,2})+2$ . So overall the length of $P$ is at least $\operatorname{dist}_{\ell}(u_{1,1},u_{k,2})+2\lfloor k/2\rfloor\geq\operatorname{dist}_{\ell}(u_{1,1},u_{k,2})+(k-1)$ . Consequently, we have $\operatorname{dist}_{\ell_{C}}(u_{1,1},u_{k,2})\geq\operatorname{dist}_{\ell}(u_{1,1},u_{k,2})-1$ . By Lemma 5.2, $C$ is a $(1,1)$ -contraction.

∎

With Lemma 5.3 in hand, we are now ready to prove Theorem 10.

Proof of Theorem 10.

Let $\varepsilon>0$ be fixed and let $G=(V,E)$ be an instance of Clique. We construct a bipartite graph $H=H(G)$ as follows, see Figure 9: For every vertex $v\in V$ , the graph $H$ contains two vertices $(v,1)$ and $(v,2)$ and the edge $f_{v}:=\{(v,1),(v,2)\}$ . For every edge $e=\{u,v\}\in E$ , we add a vertex $x_{e}$ and the edges $f_{e,u}:=\{x_{e},(u,1)\}$ and $f_{e,v}:=\{x_{e},(v,1)\}$ to $H$ . Furthermore, we add a new special vertex $s$ to $H$ and all the edges $\{s,(v,2)\}$ , $v\in V$ , and $\{s,x_{e}\}$ , $e\in E$ . It is easy to check that the graph $H$ defined in this way is bipartite.

All edges of $H$ receive unit lengths ( $\ell=1$ ) and we consider the instance $\mathcal{I}=(H,\ell,\varphi)$ of the problem Contraction with the tolerance function $\varphi(x)=x-1$ .

For any set of vertices $U\subseteq V$ we define $C(U):=\{f_{u}:u\in U\}$ (see Figure 9).

Claim 1: If $U\subseteq V$ is a clique in $G$ , then $C(U)$ is a $(1,1)$ -contraction in $H$ and $\Phi(C(U))=|U|$ .

Proof of Claim 1: Let $U$ be a a set of vertices in $G$ that form a clique, and let $u,v\in U$ be two vertices from this clique. Then we have $\operatorname{dist}_{\ell}((u,1),(v,1))=\operatorname{dist}_{\ell}((u,2),(v,2))=2$ and $\operatorname{dist}_{\ell}((u,1),(v,2))=\operatorname{dist}_{\ell}((u,2),(v,1))=3$ , so Lemma 5.3 (ii) implies that $C(\{u,v\})$ is a $(1,1)$ -contraction in $H$ . Repeating this argument for every pair of vertices from $U$ and applying Lemma 5.3 (iii) yields that $C(U)$ is a $(1,1)$ -contraction in $H$ . As there are never two edges in $H$ between any two connected components of the graph $(V,C(U))$ , we have $\Phi(C(U))=|C(U)|=|U|$ . $\square$

For any set of edges $C\subseteq E(H)$ , we let $U(C)$ be the set of vertices $v\in V$ for which $(v,1)$ is incident to an edge in $C$ .

Claim 2: If $C\subseteq E(H)$ is a $(1,1)$ -contraction, then $C$ is a matching in $H$ and $U(C)$ is a clique in $G$ of size at least $\Phi(C)-3$ .

Proof of Claim 2: $C$ is a matching by Lemma 5.3 (i).

Let $u,v\in U(C)$ . We will show that $e=\{u,v\}\in E$ by applying Lemma 5.3 (ii) to the two edges in $C$ incident to $(u,1)$ and $(v,1)$ . To prove that $e\in E$ it suffices to show that $\operatorname{dist}_{\ell}((u,1),(v,1))=2$ .

Let us first consider the case that $f_{u},f_{v}\in C$ . As $\operatorname{dist}_{\ell}((u,2),(v,2))=2$ (the shortest path between those vertices goes via $s$ ), Lemma 5.3 (ii) implies that $\operatorname{dist}_{\ell}((u,1),(v,1))=2$ . We now consider the case that there is an edge $e^{\prime}\in E\setminus\{e\}$ with $f_{u},f_{e^{\prime},v}\in C$ . We then have $\operatorname{dist}_{\ell}((u,2),x_{e^{\prime}})=2$ (via $s$ ), so Lemma 5.3 (ii) yields $\operatorname{dist}_{\ell}((u,1),(v,1))=2$ . Finally, we consider the case that there are two edges $e^{\prime},e^{\prime\prime}\in E\setminus\{e\}$ with $f_{e^{\prime},u},f_{e^{\prime\prime},v}\in C$ . We then have $\operatorname{dist}_{\ell}(x_{e^{\prime}},x_{e^{\prime\prime}})=2$ (via $s$ ), again implying that $\operatorname{dist}_{\ell}((u,1),(v,1))=2$ . This proves that indeed $e\in E$ , so $U(C)$ forms a clique in $G$ .

Every edge in $H$ is either incident to $s$ or to a vertex of the form $(v,1)$ , $v\in V$ . Since at most one of the edges incident to $s$ can be in $C$ , the definition of $U(C)$ shows that the size of $U(C)$ is either $|C|-1$ or $|C|$ . Therefore, to finish the proof of Claim 2, it suffices to show that $\Phi(C)\leq|C|+2$ . If $C$ contains no two edges that are connected by more than one edge in $H$ , then we have $\Phi(C)=|C|$ . Otherwise we consider two such edges $f$ and $g$ from $C$ . It is easy to check that either $f$ or $g$ must be incident to $s$ , so suppose that the edge $f$ contains $s$ . We first consider the case that $f=\{s,x_{e}\}$ for some edge $e=\{u,v\}\in E$ . In this case it follows that $g=\{(u,1),(u,2)\}$ or $g=\{(v,1),(v,2)\}$ , so we have $\Phi(C)=|C|+2$ . Now consider the case that $f=\{s,(u,2)\}$ for some vertex $u\in V$ . In this case it follows that $g=\{(u,1),x_{e}\}$ for exactly one edge $e\in E$ incident to $u$ in $G$ , showing that $\Phi(C)=|C|+2$ . In all three cases we have $\Phi(C)\leq|C|+2$ , as claimed. $\square$

Combining Claims 1 and 2 will allow us to prove the following claim:

Claim 3: If there is an $n^{1/2-\varepsilon}$ -approximation algorithm for Contraction, then there is an $n^{1-\varepsilon/2}$ -approximation algorithm for Clique.

Proof of Claim 3: Suppose for the sake of contradiction that such an approximation algorithm for Contraction exists. We use it to compute a clique in a given instance $G$ of Clique as follows: We construct $\mathcal{I}=(H(G),\ell,\varphi)$ and compute a solution $C$ of Contraction for this instance, and we define the clique $U(C)$ as before (recall Claim 2). If $U(C)\neq\emptyset$ , we return $U(C)$ , otherwise we return any vertex from $G$ . We denote the clique computed in this fashion by $U$ .

We may assume that $n(G)\geq 16^{1/\varepsilon}$ , in particular $n(H)\geq 16^{1/\varepsilon}$ . It follows that

[TABLE]

By assumption we know that

[TABLE]

where $C^{*}$ is an optimal solution of $\mathcal{I}$ . In particular, $\Phi(C)$ is positive.

Combining these observations we get

[TABLE]

where the second inequality holds because of Claim 2, and the last inequality involving the clique number $\omega(G)$ holds because of Claim 1. $\square$

As $m(H)=\Theta(n(H))$ , Claim 3 implies the theorem (using the inapproximability of Clique proved in [Zuc07]). ∎

6. Hardness for multiplicative tolerance function

By Theorem 7, the problem Weak Contraction with purely additive tolerance function $\varphi(x)=x-\beta$ is NP-hard on cycles. In this section we prove the hardness and inapproximability of this problem also in the case of a purely multiplicative tolerance function $\varphi(x)=x/\alpha$ , $\alpha\geq 1$ . Recall that the problem Contraction is trivial for this tolerance function (we may not contract any edges).

6.1. Hardness of planar Weak Contraction

To state the main result of this section recall that the girth of a graph $G$ is defined as the minimum length of a cycle in $G$ .

Theorem 11.

For any $g\geq 2$ , the problem Weak Contraction with tolerance function $\varphi(x)=x/2$ , is NP-hard for planar graphs with girth at least $3g$ and unit length edges $\ell=1$ .

Theorem 11 implies that Weak Contraction is hard for a general multiplicative tolerance function $\varphi(x)=x/\alpha$ , $\alpha\geq 1$ , but it leaves open the question whether this is true also for other fixed values of $\alpha$ other than 2 (when $\alpha$ is not part of the input). The arguments given in this section for $\alpha=2$ carry over straightforwardly to any fixed value $2\leq\alpha<3$ , but not to 3 or larger values (for $\alpha<2$ and unit length edges the problem is trivial).

We first characterize the set of feasible solutions in this special case.

Lemma 6.1.

Let $G=(V,E)$ be a graph with girth at least 6 and unit length edges $\ell=1$ , and consider the tolerance function $\varphi(x)=x/2$ . Furthermore, let $C\subseteq E$ be a set of edges such that $(V,C)$ is disconnected. Then $C$ is a weak $(2,0)$ -contraction if and only if for any two edges $e,f\in C$ either $e$ and $f$ are incident and both contain a degree-1 vertex, or any path containing $e$ and $f$ also contains at least two edges not in $C$ .

Recall that the assumption that $(V,C)$ is disconnected prevents solutions $C\subseteq E$ for which the contracted graph $G/C$ is a single vertex. Note that Lemma 6.1 does not require $G$ to be planar.

Proof.

To prove the equivalence, we need the following auxiliary claim:

Claim: If $C$ is a weak $(2,0)$ -contraction, then every component of $(V,C)$ that is not a single edge is a star with the property that each of its vertices except the center of the star has degree 1 in $G$ .

Proof of Claim: Let $M$ be a component of $(V,C)$ with more than one edge. Clearly, there must be an edge $\{u,v\}$ with vertices $u\notin V(M)$ and $v\in V(M)$ . If $M$ contains a path $P$ on two edges starting at $v$ and ending at some vertex $w$ , then $\operatorname{dist}_{\ell}(u,w)=3$ and $\operatorname{dist}_{\ell_{C}}(u,w)=1$ , a contradiction to the assumption that $C$ is a weak $(2,0)$ -contraction (note that $P\cup\{u,v\}$ is the shortest path between $u$ and $w$ , as the girth of $G$ is at least 6). Thus the edges of $M$ must form a star centered at $v$ . By the same argument, no vertex outside $M$ can be connected to any vertex of $M$ other than $v$ . This proves the claim. $\square$

We first assume that $C$ is a weak $(2,0)$ -contraction, and we need to show that any two edges $e,f\in C$ satisfy the conditions of the lemma. If $e$ and $f$ are incident, the statement follows from the auxiliary claim from before. If $e$ and $f$ are not incident, we consider an inclusion-minimal path $P$ containing both $e$ and $f$ . We let $u$ and $v$ be the end vertices of $P$ , $u^{\prime}$ the other end vertex of $e$ , and $v^{\prime}$ the other end vertex of $f$ ( $u^{\prime}$ and $v^{\prime}$ are the vertices at distance 1 from the ends of the path). If the distance between $u^{\prime}$ and $v^{\prime}$ was only 1, we have $\operatorname{dist}_{\ell}(u,v)=3$ and $\operatorname{dist}_{\ell_{C}}(u,v)=1$ (here we need again the assumption that the girth is at least 6), a contradiction to the assumption that $C$ is a weak $(2,0)$ -contraction. Therefore at least two edges lie between $u^{\prime}$ and $v^{\prime}$ . The auxiliary claim from before implies that no two incident edges on $P$ between $u$ and $v$ are contained in $C$ , therefore $P$ must contain at least two edges not in $C$ . This proves one direction of the equivalence.

To prove the other direction, we now assume that any two edges $e,f$ satisfy the conditions of the lemma, and we need to show that $C$ is a weak $(2,0)$ -contraction. Consider any two vertices $u$ and $v$ with $\operatorname{dist}_{\ell_{C}}(u,v)>0$ , and any path between $u$ and $v$ . As no inner vertex of $P$ is a leaf, we know that between any two consecutive edges from $C$ on $P$ there are at least 2 edges not in $C$ . This proves that $\operatorname{dist}_{\ell_{C}}(u,v)\geq\operatorname{dist}_{\ell}(u,v)/2$ , as desired.

This completes the proof of the lemma. ∎

For a given propositional formula $F$ in conjunctive normal form (CNF) the bipartite variable-clause graph $\Gamma(F)$ is defined as follows: The two partition classes of $\Gamma(F)$ are given by the sets of variables and clauses of $F$ , and there is an edge between a variable $x$ and a clause $c$ if $x$ appears in $c$ . If $c$ contains $x$ as a positive or negative literal, we call the corresponding edge of $\Gamma(F)$ a positive or negative edge, respectively. A planar drawing of $\Gamma(F)$ , where positive and negative edges appear in cyclically contiguous intervals around every variable vertex, is called contiguous.

We call a $k$ -CNF formula regular, if every clause contains exactly $k$ literals, no clause contains a literal twice, every variable appears at least once as a positive literal and at least once as a negative literal in the formula.

Consider now the following variant of 3SAT.

[TABLE]

Lemma 6.2.

Contiguous Planar 3SAT* is NP-complete.*

Proof.

The more general variant of Contiguous Planar 3SAT not requiring $F$ to be regular was shown to be NP-complete in [dBK12]. We now show how to reduce this generalization to Contiguous Planar 3SAT, which will prove the lemma. Given a (not necessarily regular) 3-CNF formula $F$ we first eliminate all variables appearing only as negative or only as positive literals and all clauses containing exactly one literal, as well as multiple appearances of literals in the same clause. This yields a formula $F^{\prime}$ in which all clauses have two or three literals, no clause contains a literal twice, and every variable appears at least once as a positive literal and at least once as a negative literal in $F^{\prime}$ . Moreover, since $\Gamma(F^{\prime})$ is a subgraph of $\Gamma(F)$ , we also obtain a contiguous planar drawing of $\Gamma(F^{\prime})$ . As a last step we eliminate clauses $c$ with two literals by introducing a new variable $x$ for each of them and replacing $c$ by the equivalent formula $(c\lor x)\land(c\lor\overline{x})$ . It is easy to check that the resulting formula $F^{\prime\prime}$ is regular and equisatisfiable to $F$ , and to obtain a contiguous planar drawing of $\Gamma(F^{\prime\prime})$ , see Figure 10.

∎

Proof of Theorem 11.

We first present the proof for the case $g=2$ , and then sketch how to generalize it for larger values of $g$ .

We reduce Contiguous Planar 3SAT to Weak Contraction. Consider an instance $F$ of Contiguous Planar 3SAT with variables $x_{1},x_{2},\ldots,x_{n}$ and clauses $c_{1},c_{2},\linebreak\ldots,c_{m}$ .

Given the formula $F$ , we construct from it a graph $G=G(F)$ as follows, see Figures 11 and 12. For every variable $x_{i}$ , $i\in[n]$ , we add a variable gadget $H(x_{i})$ as shown on the left hand side of Figure 11 to the graph $G$ . The vertices $u_{i}$ and $\overline{u}_{i}$ will be used later to connect this gadget to other parts of the graph. The idea of the variable gadget is that an optimal solution of our instance of Weak Contraction should contain either the four edges $T_{i}:=\{t_{i},\overline{t}_{i},t_{i}^{\prime},t_{i}^{\prime\prime}\}$ or the four edges $F_{i}:=\{f_{i},\overline{f}_{i},f_{i}^{\prime},f_{i}^{\prime\prime}\}$ , corresponding to setting $x_{i}$ to true or false, respectively.

For every clause $c_{j}$ , $j\in[m]$ , we add a clause gadget $H(c_{j})$ (a star with three edges) as shown on the right hand side of Figure 11 to the graph $G$ . The vertices $v_{j}^{1}$ , $v_{j}^{2}$ and $v_{j}^{3}$ will be used later to connect this gadget to other parts of the graph. The idea of the clause gadget is that a feasible solution contains at most one of these three edges, and if it does contain one of them, this restricts the choice we have inside the respective neighbouring variable gadget.

We connect the variable and clause gadgets in $G$ as follows (see Figure 12): For every $j\in[m]$ and $k\in[3]$ , if the $k$ -th literal in the clause $c_{j}$ is $x_{i}$ , we add an edge connecting $u_{i}$ to $v_{j}^{k}$ , and if the $k$ -th literal in the clause $c_{j}$ is $\overline{x_{i}}$ , we add an edge connecting $\overline{u}_{i}$ to $v_{j}^{k}$ . We refer to the edges added to $G$ in this step as connection edges.

This completes the definition of the graph $G=G(F)$ . It is easy to see that this graph is planar. Specifically, a planar embedding can be obtained from the given planar embedding of $\Gamma(F)$ by replacing variable vertices $x_{i}$ in $\Gamma(F)$ by the variable gadgets $H(x_{i})$ in $G$ , and by replacing clause vertices $c_{j}$ by the clause gadgets $H(c_{j})$ . Using that for each variable vertex $x_{i}$ in $\Gamma(F)$ the positive and negative edges appear in cyclically contiguous intervals around $x_{i}$ , the connection edges in $G$ (that connect the variable and clause gadgets) can also be drawn in a planar fashion.

Moreover, it is easy to check that $G$ has girth 6 and no degree-1 vertices.

Now consider the instance $\mathcal{I}:=(G,\ell,\varphi)$ of the problem Weak Contraction with $\ell=1$ (unit length edges) and the tolerance function $\varphi(x)=x/2$ .

Lemma 6.1 implies that any feasible solution of $\mathcal{I}$ is a matching, as $G$ has no vertices of degree 1. As $G$ contains no cycles of length 3 or 4, it cannot contain two edges between vertex sets of two different components of $(V,C)$ for any such feasible solution $C$ . This implies that our objective function satisfies $\Phi(C)=|C|$ .

We proceed to show that $F$ is satisfiable if and only if $\mathcal{I}$ has an optimal solution of cardinality (and thus of value) $4n+m$ . Specifically, a satisfying assignment of $F$ corresponds to a solution that contains exactly all edges of either $T_{i}$ or $F_{i}$ in $H(x_{i})$ for every variable $i\in[n]$ (corresponding to the value true or false assigned to this variable, respectively) and exactly one edge in $H(c_{j})$ for each clause $j\in[m]$ (corresponding to a literal that satisfies this clause).

Formally, for any variable assignment $\tau\colon\{x_{1},x_{2},\ldots,x_{n}\}\to\{\texttt{true},\texttt{false}\}$ , we define the set of edges $C(\tau)\subseteq E(G)$ as follows: $C(\tau)$ contains all edges of $T_{i}$ for any variable $x_{i}$ , $i\in[n]$ , that $\tau$ sets to true, and it contains all edges of $F_{i}$ for any variable $x_{i}$ that $\tau$ sets to false. Moreover, for every clause $c_{j}$ , $j\in[m]$ , that is satisfied by $\tau$ , we choose an index $k\in[3]$ of a literal in $c_{j}$ that is satisfied by $\tau$ and add the edge $e_{j}^{k}$ to $C(\tau)$ .

The following claim is an immediate consequence of Lemma 6.1.

Claim 1: Any subset $C\subseteq E(G)$ is a feasible solution if and only if every path containing two edges from $C$ also contains at least two edges not in $C$ .

By Claim 1, for every variable assignment $\tau$ of $F$ , the set $C(\tau)$ is a feasible solution of $\mathcal{I}$ . In particular, if $\tau$ satisfies $F$ , then $C(\tau)$ is a feasible solution of size $4n+m$ . The remainder of the proof is devoted to showing the converse, i.e., if $C\subseteq E(G)$ is a feasible solution of size $4n+m$ , then $F$ is satisfiable.

For all $i\in[n]$ we let $H(x_{i})^{+}$ denote the subgraph of $G$ induced by all edges of $H(x_{i})$ and all connection edges incident to either $u_{i}$ or $\overline{u}_{i}$ .

Claim 2: For any $j\in[m]$ , $C$ contains at most one edge from $H(c_{j})$ . For any $i\in[n]$ , $C$ contains at most four edges from $H(x_{i})^{+}$ . Moreover, if $C$ contains one of the connection edges incident to $u_{i}$ or $\overline{u}_{i}$ for some $i\in[n]$ , it does not contain any edges from the gadget $H(c_{j})$ that is connected to $H(x_{i})$ via this edge.

Proof of Claim 2: The first and last statement are immediate consequences of Claim 1. The argument for the second statement is as follows: For all $i\in[n]$ we let $E_{i}$ denote the set of edges $\{h_{i},t_{i},f_{i}\}$ plus the connection edges incident to $u_{i}$ , and we let $\overline{E}_{i}$ denote the set of edges $\{\overline{h}_{i},\overline{t}_{i},\overline{f}_{i}\}$ plus the connection edges incident to $\overline{u}_{i}$ . By Claim 1, $C$ contains at most two edges from $E_{i}$ , and if the intersection size is two, then $C$ must contain the edge $h_{i}$ . Similarly, $C$ contains at most two edges from $\overline{E}_{i}$ , and if the intersection size is two, then $C$ must contain the edge $\overline{h}_{i}$ . As $C$ cannot contain $h_{i}$ and $\overline{h}_{i}$ simultaneously, $C$ contains at most three edges from $E_{i}\cup\overline{E}_{i}$ , and if the intersection size is three, then $C$ must contain either $h_{i}$ or $\overline{h}_{i}$ . Again by Claim 1, $C$ contains at most two edges from the 6-cycle $\{f_{i}^{\prime},h_{i}^{\prime},t_{i}^{\prime},f_{i}^{\prime\prime},h_{i}^{\prime\prime},t_{i}^{\prime\prime}\}$ . However, if $C$ contains one of the edges $h_{i}$ or $\overline{h}_{i}$ , it contains at most one edge from this 6-cycle. This proves that $C$ indeed contains at most four edges from $H(x_{i})^{+}$ . $\square$

Note that every edge of $G$ belongs to exactly one subgraph $H(x_{i})^{+}$ or $H(c_{j})$ . So if $|C|=4n+m$ , we know by Claim 2 that $C$ contains exactly four edges from $H(x_{i})$ for all $i\in[n]$ and exactly one edge from $H(c_{j})$ for all $j\in[m]$ , and none of the connection edges in $G$ .

Claim 3: For any $i\in[n]$ , if $C$ contains four edges from $H(x_{i})$ and if $f_{i}$ is not among them, then those edges must be $T_{i}$ . On the other hand, if $\overline{t}_{i}$ is not among them, those edges must be $F_{i}$ . In particular, these two cases cannot occur simultaneously.

Proof of Claim 3: If $C$ contains four edges from $H(x_{i})$ and $f_{i}$ is not among them, Claim 1 enforces taking first the edge $t_{i}$ , then $t_{i}^{\prime}$ and $t_{i}^{\prime\prime}$ , and eventually $\overline{t}_{i}$ . This proves the first part of the statement. The argument for the second part is symmetric. The third part of the statement is a consequence of the first two. $\square$

So given a solution $C$ of $\mathcal{I}$ of size $4n+m$ , we can derive from it a satisfying assignment $\tau$ of $F$ as follows: For every clause $c_{j}$ , $j\in[m]$ , we consider the unique edge $e_{j}^{k}$ from $H(c_{j})$ that belongs to $C$ . We follow the attachment edge incident to $e_{j}^{k}$ , leading to the corresponding variable gadget $H(x_{i})$ , and connecting to either $u_{i}$ or $\overline{u}_{i}$ . If the attachment edge connects to $u_{i}$ , then by Claim 1, $f_{i}\notin C$ , so by Claim 3, the four edges of $H(x_{i})$ contained in $C$ must be $T_{i}$ , so we define $\tau(x_{i}):=\texttt{true}{}$ . If the attachment edge connects to $\overline{u}_{i}$ , then by Claim 1, $\overline{t}_{i}\notin C$ , so by Claim 3, the four edges of $H(x_{i})$ contained in $C$ must be $F_{i}$ , so we define $\tau(x_{i}):=\texttt{false}{}$ . This process does not lead to any contradicting variable assignments by the last statement of Claim 3. However, this process may leave some variables $x_{i}$ undefined, and we can set them arbitrarily, e.g., $\tau(x_{i}):=\texttt{true}{}$ . By construction, each clause receives a satisfying literal, so the assignment $\tau$ is indeed a satisfying assignment of $F$ .

This proves that $F$ is satisfiable if and only if $\mathcal{I}$ has a feasible solution of size $4n+m$ (which must be optimal by Claim 2), completing the proof of the theorem in the case $g=2$ .

For values $g\geq 2$ , the construction of the gadgets $H(x_{i})$ and $H(c_{j})$ can be generalized as follows: We subdivide each of the edges $h_{i},\overline{h}_{i}$ and $h_{i}^{\prime\prime}$ , and each of the edges $e_{j}^{1}$ , $e_{j}^{2}$ and $e_{j}^{3}$ into $1+3(g-2)$ edges. Then the resulting graph $G=G(F)$ clearly has girth $3g$ , and the above arguments can be easily modified to show that any solution $C$ of $\mathcal{I}$ contains at most $1+3(g-2)=3g-5$ edges from $H(c_{j})$ for all $j\in[m]$ , and at most $4+3(g-2)=3g-2$ edges from $H(x_{i})^{+}$ for all $i\in[n]$ , and that $F$ is satisfiable if and only if $\mathcal{I}$ has an optimal solution of size $(3g-2)n+(3g-5)m$ . This completes the proof. ∎

6.2. Inapproximability of Weak Contraction

We are able to further extend our hardness results for Weak Contraction as follows:

Theorem 12.

For any $\varepsilon>0$ , it is NP-hard to approximate the problem Weak Contraction with tolerance function $\varphi(x)=2x/3$ to within a factor of $n^{1-\varepsilon}$ .

Theorem 12 implies that Weak Contraction is hard to approximate for general multiplicative tolerance functions $\varphi(x)=x/\alpha$ , $\alpha\geq 1$ , but it leaves open the question whether this is true also for other fixed values of $\alpha$ other than $3/2$ (when $\alpha$ is not part of the input). The arguments given in this section for $\alpha=3/2$ carry over straightforwardly to any fixed value $1<\alpha<2$ , but not to 2 or larger values (for $\alpha=1$ the problem is trivial).

This time we reduce from the well-known Independent Set problem (which is equivalent to Clique by considering the complement graph). Recall that an independent set in a graph $G$ is a subset of vertices of $G$ such that no two vertices in the subset are adjacent.

[TABLE]

We use again the fact that for any $\varepsilon>0$ , Independent Set is NP-hard to approximate to within a factor of $n^{1-\varepsilon}$ [Zuc07].

Proof of Theorem 12.

Let $G=(V,E)$ be an instance of Independent Set. We construct a graph $H=H(G)$ and a length function $\ell$ on the edges of $H$ as follows, see Figure 13: We start with a copy of $G$ , and all edges of this copy receive length 2. The vertices of this copy are also denoted by $V$ . We then add additional vertices and edges to $H$ as follows: To every vertex $v\in V$ we attach two pending edges $\{v,(v,1)\}$ and $\{v,(v,2)\}$ of length 1 or 2, respectively. We may assume $G$ , and thus also $H$ , to be connected.

Now consider the instance $\mathcal{I}=(H,\ell,\varphi)$ of the problem Weak Contraction with the tolerance function $\varphi(x)=2x/3$ .

We proceed to show that $\mathcal{I}$ has a feasible solution of value $k$ if and only if $G$ has an independent set of size $k$ . This is an immediate consequence of Claim 3 below. To prove Claim 3 we need the following two auxiliary claims.

Claim 1: For any induced subgraph of $H$ that is a path on two edges, a feasible solution $C$ of $\mathcal{I}$ does not contain only the longer of the two edges (either it contains none of the two, the shorter of the two if there is one, or both).

Proof of Claim 1: Consider a path on two edges $\{u,v\}$ , $\{v,w\}$ of length 2 in $H$ such that $\{u,w\}\notin E(H)$ , and suppose for the sake of contradiction that $\{u,v\}\in C$ , but $\{v,w\}\notin C$ . Then we have $\operatorname{dist}_{\ell}(u,w)=4$ and $\operatorname{dist}_{\ell_{C}}(u,w)=2$ , violating the condition (1) for the given tolerance function. A similar contradiction arises if one of the edges has length 1 and the other length 2, and only the edge of length 2 is contracted. This proves the claim. $\square$

Claim 2: No feasible solution of $\mathcal{I}$ contains an edge of length 2.

Proof of Claim 2: Assume for the sake of contradiction that a feasible solution $C$ contains an edge $e$ of length 2. Note that any edge $f$ of $H$ may be reached from $e$ via a walk $e_{1}\dots e_{k}$ where $e_{1}=e$ and $e_{k}=f$ , and for all $i<k-1$ we have $\ell(e_{i})=2$ and the edges $e_{i}$ and $e_{i+1}$ induce a path in $H$ . Now successively applying Claim 1 to the subgraphs induced by $e_{i}$ and $e_{i+1}$ for $i<k$ shows that $C$ contracts $f$ . Thus $C$ violates the condition that a weak contraction must not contract every edge. $\square$

Claim 2 implies that our objective functions satisfies $\Phi(C)=|C|$ for every feasible solution $C$ of $\mathcal{I}$ , because $H$ never contains two edges between two different connected components of $(V,C)$ .

For any set of vertices $U\subseteq V(G)$ we define $C(U):=\{\{u,(u,1)\}:u\in U\}$ .

Claim 3: A set of edges $C\subseteq E(H)$ is a feasible solution of $\mathcal{I}$ if and only if $C=C(U)$ for an independent set $U$ in $G$ .

Proof of Claim 3: Let $C$ be a feasible solution of $\mathcal{I}$ . By Claim 2, $C$ contains only edges of length 1, so we have $C=C(U)$ for some set of vertices $U$ in $G$ . Suppose that two such vertices $u,v\in U$ are connected by an edge, then we would have $\operatorname{dist}_{\ell}(u,v)=4$ and $\operatorname{dist}_{\ell_{C}}(u,v)=2$ , violating the condition (1) for the given tolerance function. It follows that $U$ is an independent set.

To prove the other direction of the equivalence, let $U$ be an independent set in $G$ and consider the set of edges $C(U)$ in $H$ . To verify that $C$ is a weak $(3/2,0)$ -contraction, it suffices to check condition (1) between the end vertices of paths on two edges, one of length 1 from $C$ and the other of length 2, and for paths on $k$ edges that start and end with an edge of length 1 from $C$ . In the first case the contraction $C(U)$ changes the distance from 3 to 2, which is compatible with (1). In the second case the contraction $C(U)$ changes the distance from $2k-2$ to $2k-4$ , which is also compatible with (1), where we use that $k\geq 4$ because of the assumption that $U$ is an independent set. $\square$

Claim 3 implies that $\mathcal{I}$ has a feasible solution with $k$ edges if and only if $G$ has an independent set of size $k$ . As $n(H)=3n(G)=\mathcal{O}(n(G))$ , the theorem follows from the [Zuc07] result. ∎

7. Asymptotic bounds

In this section we show how to compute contractions for graphs that are not optimal, but can be computed efficiently despite our hardness results from the previous section. In this vein, the main results of this section are Theorem 13 and the corresponding (not tight) lower bound (Theorem 15) for the case of tolerance functions of the form $\varphi(x)=x/\alpha-1$ . Further we consider purely additive tolerance functions (Section 7.2) and the factor by which a contraction can reduce the number of vertices (Section 7.3). Throughout this section, we assume all graphs to have unit length edges $\ell=1$ .

7.1. Almost multiplicative contractions

As mentioned in the introduction, a purely multiplicative tolerance function ( $\beta=0$ ) forbids decreasing any distances. In this section we thus consider an ‘almost’ purely multiplicative tolerance function of the form $\varphi(x)=x/\alpha-1$ .

Theorem 13.

Let $k\geq 1$ be a real number. Any graph $G$ has a $(2k-1,1)$ -contraction $C$ such that the contracted graph $G/C$ has at most $n^{1+1/k}$ edges, and such a contraction can be computed in time $\mathcal{O}(m)$ .

Recall that here and throughout, $n$ and $m$ denote the number of vertices and edges of the input graph $G$ , not of the contracted graph $G/C$ . Setting $k:=\log_{2}n$ in Theorem 13 yields the following corollary.

Corollary 14.

Any graph $G$ has a $(2\log_{2}n-1,1)$ -contraction $C$ such that the contracted graph $G/C$ has at most $2n$ edges, and such a contraction can be computed in time $\mathcal{O}(m)$ .

To prove Theorem 13, we use a clustering approach as presented in [Awe85], yielding the next lemma. Specifically, the following crucial lemma appears in a slightly weaker form in that paper. For any real number $r\geq 1$ , we define an $r$ -partition of a graph $G=(V,E)$ as a set of clusters $P_{i}\subseteq V$ , $i\in[l]$ , with corresponding cluster centers $p_{i}\in P_{i}$ , where the sets $P_{i}$ are required to form a partition of the vertex set $V$ and where $\operatorname{dist}_{\ell}(p_{i},u)\leq r-1$ for all $u\in P_{i}$ and $i\in[l]$ . We denote the resulting $r$ -partition by $P:=\{(p_{i},P_{i}):i\in[l]\}$ . We write $\rho(P)$ for the number of pairs $1\leq i<j\leq l$ for which $P_{i}$ and $P_{j}$ are connected by at least one edge, and we refer to this quantity as the density of $P$ .

Lemma 7.1.

Let $r\geq 1$ be a real number. Any graph $G$ with unit length edges has an $r$ -partition $P$ with density $\rho(P)\leq n^{1+1/r}$ , and such a partition can be computed in time $\mathcal{O}(m)$ .

Proof.

The idea of the algorithm is to build an $r$ -partition $P$ of $G$ iteratively in rounds. In each round, we build a new cluster and remove all vertices from that cluster from the graph, processing the subgraph on the remaining vertices in the next round. The algorithm proceeds until all vertices are assigned to a cluster. In round $i$ , we choose an arbitrary vertex $p_{i}$ as a cluster center, and define layers $L_{i,0},L_{i,1},\ldots$ around the vertex $p_{i}$ , where the layer $L_{i,j}$ consists of all vertices at distance exactly $j$ from $p_{i}$ (this distance is measured in the subgraph of $G$ under consideration in this round). We continue computing these layers as long as the number of vertices in the new layer is at least the number of vertices in all previous layers times the factor $n^{1/r}$ . The cluster $P_{i}$ is defined as the union of all layers around $p_{i}$ satisfying this expansion condition. We refer to the first layer violating this condition (which is not added to $P_{i}$ anymore) as the rejected layer. We let $P$ denote the partition of the vertices of $G$ computed in this fashion.

To verify that $P$ is indeed an $r$ -partition, we proceed to show that each vertex within a cluster has distance at most $r-1$ from the center vertex of that cluster, and that the density $\rho(P)$ of the partition is at most $n^{1+1/r}$ . Intuitively, the expansion condition in the definition of the layers ensures that a cluster has few layers and that the number of edges that go to unclustered vertices is small.

Consider a cluster $P_{i}$ with center vertex $p_{i}$ and the layers $L_{i,0},L_{i,1},\ldots,L_{i,d}$ . Suppose for the sake of contradiction that $d\geq r$ . By the definition of the layers in the algorithm we know that $|L_{i,j}|\geq n^{1/r}\sum_{k=0}^{j-1}|L_{i,k}|$ holds for all $j\in[d]$ , implying that $|L_{i,j}|\geq n^{j/r}$ . Consequently, the size of the cluster satisfies $|P_{i}|=\sum_{j=0}^{d}|L_{i,j}|\geq 1+n^{r/r}=n+1$ , a contradiction.

We now show that $\rho(P)\leq n^{1+1/r}$ . The key idea is that the number of vertices in the rejected layer of a cluster $P_{i}$ is at most $n^{1/r}|P_{i}|$ . Thus the number of edges from $P_{i}$ to clusters that are created later is at most $n^{1/r}|P_{i}|$ . For every edge between two clusters we let the cluster that is created first account for that edge. Summing over all these edges between clusters yields the desired upper bound of $\rho(P)\leq n\cdot n^{1/r}=n^{1+1/r}$ .

Using breadth-first search, the partitioning algorithm described above runs in time $\mathcal{O}(m)$ (recall that $G$ is assumed to be connected). This completes the proof of the lemma. ∎

With Lemma 7.1 in hand, we are now ready to prove Theorem 13.

Proof of Theorem 13.

Given $G=(V,E)$ , we first compute a $k$ -partition $P$ into $l$ clusters as described by Lemma 7.1. We define the set $C$ of contracted edges as the union of all edges within the clusters, $C:=\{\{u,v\}\in E:u,v\in P_{i}\text{ for some }i\in[l]\}$ . We thus contract each cluster into a single vertex and remove from every set of resulting parallel edges all but a single edge.

We proceed to show that $C$ is a $(2k-1,1)$ -contraction, i.e., we show that $\operatorname{dist}_{\ell_{C}}(u,v)\geq\operatorname{dist}_{\ell}(u,v)/(2k-1)-1$ for all $u,v\in V$ . Consider two vertices $u\in P_{i}$ and $v\in P_{j}$ , where $i$ and $j$ might be equal. Let $Q_{u,v}$ be the shortest path from $u$ to $v$ in $G$ with edge lengths $\ell_{C}$ (all edges from $C$ receive length zero). The length $d$ of $Q_{u,v}$ is the number of edges on that path that connect different clusters. Note that $Q_{u,v}$ enters and leaves each of the $d+1$ visited clusters at most once, using at most $2k-2$ edges in every cluster, so in $G$ (where all edges have unit lengths) we get $\operatorname{dist}_{\ell}(u,v)\leq d+(d+1)(2k-2)$ .

Combining these observations we obtain

[TABLE]

proving the claim. It remains to show that the contracted graph $G/C$ has at most $n^{1+1/k}$ edges, which is an immediate consequence of the upper bound $m(G/C)=\rho(P)\leq n^{1+1/k}$ given by Lemma 7.1. This completes the proof of the theorem. ∎

Erdős’ girth conjecture [Erd64] asserts that there exist graphs with $\Omega(n^{1+1/k})$ edges and girth $2k+1$ . It has been verified for $k=1,2,3,5$ [Wen91] and the strongest spanner lower bounds depend on it. We derive from the conjecture the following (not tight) lower bound.

Theorem 15.

Assuming Erdős’ girth conjecture, there exists for any integer $k\geq 2$ a graph $G$ such that any $(k-1,1)$ -contraction $C$ results in a graph $G/C$ with $\Omega(n^{1+1/k})$ edges.

Proof.

For a given integer $k\geq 2$ let $G$ be a graph that is guaranteed by Erdős’ girth conjecture, i.e., $G$ has girth $2k+1$ and $\Omega(n^{1+1/k})$ edges. Consider any $(k-1,1)$ -contraction $C$ on $G$ , and consider a connected component of the graph $(V,C)$ . Applying (1) shows that $\operatorname{dist}_{\ell}(u,v)\leq k-1$ holds for any two vertices $u$ and $v$ in that component. Using that the girth of $G$ is $2k+1$ , it follows that for any cycle in $G$ , the connected component of $(V,C)$ does not contain a contiguous segment of cycle edges of length at least half of the cycle. This implies that all connected components of the graph $(V,C)$ are trees with diameter at most $k-1$ . Therefore, the total number of edges within all connected components of $(V,C)$ is at most $n$ . We will further argue that there is at most one edge between any two connected components. Suppose for the sake of contradiction that there are two components of $(V,C)$ with two different edges connecting them, say $\{u,v\}$ and $\{u^{\prime},v^{\prime}\}$ , where $u$ and $u^{\prime}$ lie in the same connected component and $v$ and $v^{\prime}$ in the other. As the diameter of each component is at most $k-1$ , it follows that in $G$ there is a path from $u$ to $u^{\prime}$ of length at most $k-1$ , and a path from $v$ to $v^{\prime}$ of length at most $k-1$ . Together with the two edges connecting the components we obtain a cycle of length at most $2(k-1)+2=2k$ , contradicting the assumption that $G$ has girth $2k+1$ .

Therefore, the resulting graph after the contraction has $\Omega(m)=\Omega(n^{1+1/k})$ edges. ∎

7.2. Additive contractions

Turning to the case of a purely additive error, we obtain the following two results.

Theorem 16.

Let $G$ be a graph with unit length edges.

(i)

For any even integer $0\leq k\leq n$ , the set of edges incident to the $k/2$ vertices of highest degrees is a $(1,k)$ -contraction $C$ in $G$ with $\Phi(C)\geq km/(2n)$ . 2. (ii)

For any real number $0<k\leq n$ , the set of edges incident to two vertices of degree at least $n/k$ is a $(1,k)$ -contraction $C$ in $G$ such that $G/C$ has $\mathcal{O}(n^{2}/k)$ edges.

These contractions can be computed in time $\mathcal{O}(m)$ .

As mentioned in the introduction, Bernstein and Chechik analyzed the contraction of Theorem 16 (ii) in [BC16] and used it in their dynamic shortest paths algorithm, so this part is already proved.

Proof of Theorem 16 (i).

Let $U$ be the set of $k$ vertices in $G$ of highest degree. Then we have

[TABLE]

Let $C$ be the set of edges incident to any vertex in $U$ . As each edge is incident to at most two vertices in $U$ , we get $|C|\geq 1/2\sum_{u\in U}\deg(u)\geq km/n$ from the previous inequality. As no shortest path visits a vertex in $U$ twice, $C$ is indeed a $(1,2k)$ -contraction. The set $C$ can be computed as follows: We first compute the degrees of all vertices in time $\mathcal{O}(m)$ , then find the $k$ -th largest element in this list in time $\mathcal{O}(n)$ , and by another linear time sweep over this list we select $k$ vertices of highest degree. Overall, the required time is $\mathcal{O}(m)$ . ∎

This result implies that the number of edges in $G/C$ is at most $m-km/n$ . If $G$ is a path, no $(1,2k)$ -contraction has an objective value greater than $2k$ , and $km/n=k(1-1/n)$ , showing that the objective value in Theorem 16 (i) can be improved by at most a factor of two.

The information theoretic lower bound in [AB16] implies that for all $\varepsilon>0$ , any contraction $C$ such that $G/C$ has $\mathcal{O}(n^{4/3-\varepsilon})$ edges does not admit a constant additive error.

7.3. Vertex reduction

All of the results above show that contractions can be effectively used to reduce the number of edges in a dense graph. But one possible advantage of using a contraction instead of a spanner is that it also has the potential to reduce the number of vertices in the graph. Unfortunately, for constant approximation errors, it is not possible to guarantee more than a constant-factor reduction in general graphs: it is not hard to see that given a path on $n$ vertices, any $(k,1)$ -contraction will still result in at least $n/(k+1)$ vertices. The same problem applies to general dense graphs, since they could still contain a long path within them. That being said, it seems likely that in practice contraction can lead to significant vertex reduction in many dense graphs. We ground this practical intuition with the following theoretical result for the special case of graphs with large minimum degree.

Theorem 17.

Let $D$ be an integer. Any graph $G$ with minimum degree at least $D$ has a $(5,1)$ -contraction $C$ such that the contracted graph $G/C$ has at most $n/D$ vertices, and such a contraction can be computed in time $\mathcal{O}(m)$ .

Proof.

Recall the definition of an $r$ -partition. For a cluster $P_{i}$ with center vertex $p_{i}$ we refer to $r$ as the radius of that cluster. This is the maximum distance of all cluster vertices from $p_{i}$ .

We will show how to construct a $3$ -partition in which the number of clusters $P_{i}$ is at most $n/D$ . Using the exact same argument as in the proof of Theorem 13, such a $3$ -partition yields the desired $(5,1)$ -contraction. Our construction first builds clusters of radius 1, and then extends them to clusters of radius 2. The clustering with radius 1 proceeds very similarly as in the proof of Lemma 7.1 before with $r=1$ . The crucial difference is that we choose as center vertices only vertices with degree at least $D$ . If no such vertices are left, the clustering process terminates, and the remaining unclustered vertices have degree strictly less than $D$ . It is easy to see that since those vertices have degree at least $D$ in the original graph, they must be adjacent to a vertex in a radius 1 cluster. We can thus assign each of those vertices to such a cluster arbitrarily, yielding a clustering of all vertices of $G$ with radius 2.

The number of clusters is at most $n/D$ because by construction every cluster contains at least $D$ vertices. This shows that the number of vertices in the contracted graph is at most $n/D$ .

This algorithm can be implemented in time $\mathcal{O}(m)$ by using an adjacency list representation where we keep track of degree information after removing an edge from the graph. ∎

To see that we cannot guarantee less than $n/D$ vertices, even with larger approximation error, consider the graph $G$ that consists of $n/D$ isolated $D$ -cliques. We now show that even if $G$ is connected, we cannot guarantee $o(n/D)$ vertices in the contracted graph, even if we allow a larger (constant) approximation error.

Theorem 18.

Let $D$ and $k$ be integers. There exists an infinite family of $n$ -vertex graphs $G$ with minimum degree $D$ such that any $(k,1)$ -contraction $C$ results in a graph $G/C$ with $n/((k+1)D)$ vertices.

Proof.

Assume for simplicity that $n$ is divisible by $D$ . We construct the graph $G$ as follows. We partition the $n$ vertices into $n/D$ layers, with each layer containing exactly $D$ vertices. For $1\leq i<n/D$ , all vertices in layer $i$ receive an edge to all vertices in layer $i+1$ . Clearly all vertices in the resulting graph have degree at least $D$ . Let $u$ and $v$ be two vertices in layers $i$ and $j$ , respectively. Then clearly we have $\operatorname{dist}_{\ell}(u,v)\geq|j-i|$ . Now let $C$ be any $(k,1)$ -contraction on $G$ , and consider the connected components of the graph $(V,C)$ . Applying (1) shows that $\operatorname{dist}_{\ell}(u,v)\leq k$ holds for any two vertices $u$ and $v$ in the same component. Combining these two inequalities shows that every connected component contains vertices from at most $k+1$ layers. As there are $n/D$ layers, the contracted graph has at least $n/((k+1)D)$ vertices. ∎

Acknowledgements

We thank Martin Skutella for stimulating discussions about the problems treated in this paper. We also thank the anonymous referees for their valuable suggestions that helped improving the presentation of results.

Bibliography37

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AB 16] A. Abboud and G. Bodwin. The 4/3 additive spanner exponent is tight. In STOC’16—Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing , pages 351–361. ACM, New York, 2016.
2[ACIM 99] D. Aingworth, C. Chekuri, P. Indyk, and R. Motwani. Fast estimation of diameter and shortest paths (without matrix multiplication). SIAM J. Comput. , 28(4):1167–1181, 1999.
3[ADD + 93] I. Althöfer, G. Das, D. Dobkin, D. Joseph, and J. Soares. On sparse spanners of weighted graphs. Discrete Comput. Geom. , 9(1):81–100, 1993.
4[Awe 85] B. Awerbuch. Complexity of network synchronization. J. Assoc. Comput. Mach. , 32(4):804–823, 1985.
5[BBV 00] T. C. Biedl, B. Brejová, and T. Vinař. Simplifying flow networks. In Mathematical foundations of computer science 2000 (Bratislava) , volume 1893 of Lecture Notes in Comput. Sci. , pages 192–201. Springer, Berlin, 2000.
6[BC 16] A. Bernstein and S. Chechik. Deterministic decremental single source shortest paths: beyond the O ( m n ) 𝑂 𝑚 𝑛 O(mn) bound. In STOC’16—Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing , pages 389–397. ACM, New York, 2016.
7[BDD + 18] A. Bernstein, K. Däubel, Y. Disser, M. Klimm, T. Mütze, and F. Smolny. Distance-preserving graph contractions. In 9th Innovations in Theoretical Computer Science Conference, ITCS 2018, January 11-14, 2018, Cambridge, MA, USA , pages 51:1–51:14, 2018. Preprint available at ar Xiv:1705.04544 .
8[BKMP 05] S. Baswana, T. Kavitha, K. Mehlhorn, and S. Pettie. New constructions of ( α , β ) 𝛼 𝛽 (\alpha,\beta) -spanners and purely additive spanners. In Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms , pages 672–681. ACM, New York, 2005.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Abstract.

1. Introduction

1.1. Our results

Algorithmic results

Hardness results

Asymptotic bounds

1.2. Comparison with previous results

1.3. Further related work

1.4. Outline of this paper

2. Preliminaries

Theorem 1**.**

Proof.

3. Greedy algorithms

3.1. Paths with unit length edges

Theorem 2**.**

Proof.

3.2. Cycles with unit length edges

Theorem 3**.**

Lemma 3.1**.**

Proof.

Proof of Theorem 3.

3.3. Trees with unit length edges and additive error

Theorem 4**.**

Proof.

4. Dynamic programs for general trees

4.1. Contraction on trees

Theorem 5**.**

Lemma 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

Proof of Theorem 5.

4.2. Weak Contraction on trees

Theorem 6**.**

Lemma 4.3**.**

Proof.

Lemma 4.4**.**

Proof.

Lemma 4.5**.**

Proof.

Lemma 4.6**.**

Proof.

Lemma 4.7**.**

Proof.

Proof of Theorem 6.

5. Hardness for additive tolerance functions

5.1. Hardness of Contraction and Weak Contraction

Theorem 7**.**

Theorem 8**.**

Lemma 5.1**.**

Proof.

Proof of Theorem 8.

5.2. Inapproximability of Contraction

Theorem 9**.**

Theorem 10**.**

Lemma 5.2**.**

Proof.

Proof of Theorem 9.

Lemma 5.3**.**

Proof.

Proof of Theorem 10.

6. Hardness for multiplicative tolerance function

6.1. Hardness of planar Weak Contraction

Theorem 11**.**

Lemma 6.1**.**

Proof.

Lemma 6.2**.**

Proof.

Proof of Theorem 11.

6.2. Inapproximability of Weak Contraction

Theorem 12**.**

Proof of Theorem 12.

7. Asymptotic bounds

7.1. Almost multiplicative contractions

Theorem 1.

Theorem 2.

Theorem 3.

Lemma 3.1.

Theorem 4.

Theorem 5.

Lemma 4.1.

Lemma 4.2.

Theorem 6.

Lemma 4.3.

Lemma 4.4.

Lemma 4.5.

Lemma 4.6.

Lemma 4.7.

Theorem 7.

Theorem 8.

Lemma 5.1.

Theorem 9.

Theorem 10.

Lemma 5.2.

Lemma 5.3.

Theorem 11.

Lemma 6.1.

Lemma 6.2.

Theorem 12.

Theorem 13.

Corollary 14.

Lemma 7.1.

Theorem 15.

Theorem 16.

Theorem 17.

Theorem 18.