On Approximating Degree-Bounded Network Design Problems

Xiangyu Guo; Guy Kortsarz; Bundit Laekhanukit; Shi Li; Daniel Vaz,; Jiayi Xian

arXiv:1907.11404·cs.DS·April 28, 2020

On Approximating Degree-Bounded Network Design Problems

Xiangyu Guo, Guy Kortsarz, Bundit Laekhanukit, Shi Li, Daniel Vaz,, Jiayi Xian

PDF

TL;DR

This paper introduces a quasi-polynomial time bicriteria approximation algorithm for the Degree-Bounded Directed Steiner Tree problem, achieving near-optimal cost guarantees while allowing controlled violations of degree constraints.

Contribution

It provides the first non-trivial approximation algorithm for the degree-bounded DST problem with explicit degree violation bounds.

Findings

01

Achieves an $O( ext{log} n ext{log} k)$ cost approximation.

02

Violates degree bounds by at most $O( ext{log}^2 n)$ factor.

03

Improves degree violation bounds for the special case of Degree-Bounded Group Steiner Tree on trees.

Abstract

Directed Steiner Tree (DST) is a central problem in combinatorial optimization and theoretical computer science: Given a directed graph $G = (V, E)$ with edge costs $c \in R_{\geq 0}^{E}$ , a root $r \in V$ and $k$ terminals $K \subseteq V$ , we need to output the minimum-cost arborescence in $G$ that contains an $r$ \textrightarrow $t$ path for every $t \in K$ . Recently, Grandoni, Laekhanukit and Li, and independently Ghuge and Nagarajan, gave quasi-polynomial time $O (lo g^{2} k / lo g lo g k)$ -approximation algorithms for the problem, which are tight under popular complexity assumptions. In this paper, we consider the more general Degree-Bounded Directed Steiner Tree (DB-DST) problem, where we are additionally given a degree bound $d_{v}$ on each vertex $v \in V$ , and we require that every vertex $v$ in the output tree has at most $d_{v}$ children. We give a quasi-polynomial time $(O(\log n…

Equations56

\displaystyle\textstyle{\operatorname*{\mathbb{E}}\left[\exp\big{(}s\cdot(\text{number of copies of $v$ in $T$})\big{)}\right]\leq 1+O\left(\frac{1}{\log n}\right)}.

\displaystyle\textstyle{\operatorname*{\mathbb{E}}\left[\exp\big{(}s\cdot(\text{number of copies of $v$ in $T$})\big{)}\right]\leq 1+O\left(\frac{1}{\log n}\right)}.

\displaystyle\operatorname*{\mathbb{E}}\left[\exp\big{(}s\cdot(\text{\# copies of $v$ in $T_{1},\cdots,T_{Q}$})\big{)}\right]

\displaystyle\operatorname*{\mathbb{E}}\left[\exp\big{(}s\cdot(\text{\# copies of $v$ in $T_{1},\cdots,T_{Q}$})\big{)}\right]

\displaystyle\Pr\left[\exp\big{(}s\cdot(\text{\# copies of $v$ in $T_{1},\cdots,T_{Q}$})\big{)}\geq\exp(M)\right]\leq\frac{1}{10n}.

\displaystyle\Pr\left[\exp\big{(}s\cdot(\text{\# copies of $v$ in $T_{1},\cdots,T_{Q}$})\big{)}\geq\exp(M)\right]\leq\frac{1}{10n}.

min o \in V_{base}^{\circ} \sum x_{o} c (o)

min o \in V_{base}^{\circ} \sum x_{o} c (o)

q \in Λ_{T^{\circ}} (p) \sum x_{q}

q \in Λ_{T^{\circ}} (p) \sum x_{q}

x_{p}

x_{p}

o \in Λ_{T^{\circ}}^{*} (p) \cap O_{t} \sum x_{o}

o \in Λ_{T^{\circ}}^{*} (p) \cap O_{t} \sum x_{o}

o \in O_{t} \sum x_{o}

Pr [V \cap O_{t}^{'} \neq = \emptyset] \geq \frac{1}{h + 1} .

Pr [V \cap O_{t}^{'} \neq = \emptyset] \geq \frac{1}{h + 1} .

\displaystyle\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}

\displaystyle\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}

\displaystyle=\prod_{q\in\Lambda_{\mathbf{T}^{\circ}}(p)}\left[1+\frac{x_{q}}{x_{p}}\Big{(}\operatorname*{\mathbb{E}}[\mathbf{e}^{sm_{q}}|q\in\mathbf{V}]-1\Big{)}\right].

\displaystyle\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}

\displaystyle\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}

\displaystyle\leq\prod_{q\in\Lambda_{\mathbf{T}^{\circ}}(p)}\left[1+\frac{x_{q}}{x_{p}}\Big{(}\operatorname*{\mathbb{E}}[\mathbf{e}^{sm_{q}}|q\in\mathbf{V}]-1\Big{)}\right].

\displaystyle\quad\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}\leq\prod_{q\in\Lambda_{\mathbf{T}^{\circ}}(p)}\left[1+\frac{x_{q}}{x_{p}}\Big{(}\operatorname*{\mathbb{E}}[\mathbf{e}^{sm_{q}}|q\in\mathbf{V}]-1\Big{)}\right]

\displaystyle\quad\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}\leq\prod_{q\in\Lambda_{\mathbf{T}^{\circ}}(p)}\left[1+\frac{x_{q}}{x_{p}}\Big{(}\operatorname*{\mathbb{E}}[\mathbf{e}^{sm_{q}}|q\in\mathbf{V}]-1\Big{)}\right]

\displaystyle\leq\prod_{q\in\Lambda_{\mathbf{T}^{\circ}}(p)}\left[1+\frac{x_{q}}{x_{p}}\big{(}\alpha_{i-1}^{z_{q}/x_{q}}-1\big{)}\right]

\displaystyle\leq\exp\left[\sum_{q\in\Lambda_{\mathbf{T}^{\circ}}(p)}\frac{x_{q}}{x_{p}}\big{(}\alpha_{i-1}^{z_{q}/x_{q}}-1\big{)}\right]\leq\exp\left[\frac{z_{p}}{x_{p}}(\alpha_{i-1}-1)\right]=\alpha_{i}^{z_{p}/x_{p}}.

α_{i}

α_{i}

= 1 + \frac{2 h ^{'} - i + 2}{( 2 h ^{'} - i + 1 ) ^{2}} \leq 1 + \frac{1}{2 h ^{'} - i} .

min u \in V^{\circ} \sum c_{u} x_{u} s.t.

min u \in V^{\circ} \sum c_{u} x_{u} s.t.

x_{v}

x_{v}

o \in O_{t} \sum x_{o}

o \in O_{t} \cap Λ_{u}^{*} \sum x_{o}

v \in Λ_{u} \sum x_{v}

v \in Λ_{u} \sum x_{v}

x_{u}

x_{u}^{'} = 2^{m i n {ℓ_{u}, γ}} x_{u}, for every u \in V^{\circ} .

x_{u}^{'} = 2^{m i n {ℓ_{u}, γ}} x_{u}, for every u \in V^{\circ} .

z_{u} = o \in O_{t} \cap Λ_{u}^{*} \sum x_{o} .

z_{u} = o \in O_{t} \cap Λ_{u}^{*} \sum x_{o} .

\displaystyle\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}

\displaystyle\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}

\geq 1 - v \in Λ_{u} \prod exp (- \frac{1}{2 ( L - ℓ )} \cdot \frac{z _{v}}{x _{u}}) = 1 - exp (- \frac{1}{2 ( L - ℓ )} \cdot \frac{z _{u}}{x _{u}})

\geq \frac{1}{2 ( L - ℓ )} \cdot \frac{z _{u}}{x _{u}} - \frac{1}{2} (\frac{1}{2 ( L - ℓ )} \cdot \frac{z _{u}}{x _{u}})^{2} \geq \frac{1}{2 ( L - ℓ )} \cdot \frac{z _{u}}{x _{u}} - (\frac{1}{2 ( L - ℓ )})^{2} \frac{z _{u}}{x _{u}}

= (\frac{2 ( L - ℓ ) - 1}{( 2 ( L - ℓ ) ) ^{2}}) \frac{z _{u}}{x _{u}} \geq \frac{1}{2 ( L + 1 - ℓ )} \cdot \frac{z _{u}}{x _{u}} .

α_{ℓ} = 2 α_{ℓ + 1} - 4 α_{ℓ + 1}^{2} = 2 α_{ℓ + 1} (1 - 2 α_{ℓ + 1}) \geq 2 α_{ℓ + 1} (1 - 2 \times \frac{2 ^{γ - ℓ - 1}}{2 L}) = 2 α_{ℓ + 1} (1 - \frac{2 ^{γ - ℓ - 1}}{L}) .

α_{ℓ} = 2 α_{ℓ + 1} - 4 α_{ℓ + 1}^{2} = 2 α_{ℓ + 1} (1 - 2 α_{ℓ + 1}) \geq 2 α_{ℓ + 1} (1 - 2 \times \frac{2 ^{γ - ℓ - 1}}{2 L}) = 2 α_{ℓ + 1} (1 - \frac{2 ^{γ - ℓ - 1}}{L}) .

α_{0}

α_{0}

\displaystyle\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}

\displaystyle\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}

\geq 1 - v \in Λ_{u} \prod exp (- 2 α_{ℓ + 1} \frac{z _{v}}{x _{u}}) = 1 - exp (- 2 α_{ℓ + 1} \frac{z _{u}}{x _{u}})

\geq 2 α_{ℓ + 1} \frac{z _{u}}{x _{u}} - \frac{1}{2} (2 α_{ℓ + 1} \frac{z _{u}}{x _{u}})^{2} \geq 2 α_{ℓ + 1} \frac{z _{u}}{x _{u}} - (2 α_{ℓ + 1})^{2} \frac{z _{u}}{x _{u}} = α_{ℓ} \frac{z _{u}}{x _{u}} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On Approximating Degree-Bounded Network Design Problems

Xiangyu Guo

Dept. of Comp. Sci. and Eng.

University at Buffalo, USA

[email protected]

Guy Kortsarz

Dept. of Comp. Sci.

Rutgers University Camden, USA

[email protected]

Bundit Laekhanukit

ITCS,

SUFE, China

[email protected]

Shi Li

Dept. of Comp. Sci. and Eng.

University at Buffalo, USA

[email protected]

Daniel Vaz

Operations Research Group,

TU Munich, Germany

[email protected]

Jiayi Xian

Dept. of Comp. Sci. and Eng.

University at Buffalo, USA

[email protected]

Abstract

Directed Steiner Tree (DST) is a central problem in combinatorial optimization and theoretical computer science: Given a directed graph $G=(V,E)$ with edge costs $c\in\mathbb{R}_{\geq 0}^{E}$ , a root $r\in V$ and $k$ terminals $K\subseteq V$ , we need to output the minimum-cost arborescence in $G$ that contains an $r$ → $t$ path for every $t\in K$ . Recently, Grandoni, Laekhanukit and Li, and independently Ghuge and Nagarajan, gave quasi-polynomial time $O(\log^{2}k/\log\log k)$ -approximation algorithms for the problem, which are tight under popular complexity assumptions.

In this paper, we consider the more general Degree-Bounded Directed Steiner Tree (DB-DST) problem, where we are additionally given a degree bound $d_{v}$ on each vertex $v\in V$ , and we require that every vertex $v$ in the output tree has at most $d_{v}$ children. We give a quasi-polynomial time $(O(\log n\log k),O(\log^{2}n))$ -bicriteria approximation: The algorithm produces a solution with cost at most $O(\log n\log k)$ times the cost of the optimum solution that violates the degree constraints by at most a factor of $O(\log^{2}n)$ . This is the first non-trivial result for the problem.

While our cost-guarantee is nearly optimal, the degree violation factor of $O(\log^{2}n)$ is an $O(\log n)$ -factor away from the approximation lower bound of $\Omega(\log n)$ from the set-cover hardness. The hardness result holds even on the special case of the Degree-Bounded Group Steiner Tree problem on trees (DB-GST-T). With the hope of closing the gap, we study the question of whether the degree violation factor can be made tight for this special case. We answer the question in the affirmative by giving an $(O(\log n\log k),O(\log n))$ -bicriteria approximation algorithm for DB-GST-T.

1 Introduction

Network design is a central problem in combinatorial optimization and computer science. To capture more practical situations, the more general model of network design with degree-constraints was suggested in the early 90’s [21, 8] and has attracted researchers in both theory and practice for decades. One of the most famous examples is the Degree-Bounded Minimum Spanning Tree (DB-MST) problem, which models the problem of designing a multi-casting network in which each node only has enough power to broadcast to a bounded number of its neighbors. This problem has been studied in a sequence of works (see, e.g.,[15, 17, 11, 23]), leading to the breakthrough result of Goemans [11] followed by the work of Singh and Lau [23], which settled down the problem by giving an algorithm that outputs a solution with optimum cost, while violating the degree bound by an additive factor of +1 [23]. Since the works on DB-MST, many works have been dedicated to the study the generalizations of the problem: the Degree-Bounded Steiner Tree problem, in which the goal is to find a minimum-cost subgraph that connects all the terminals, while meeting the given degree bounds, was studied in [16, 20]. The Survivable Network Design problem, where each pair of nodes $v,w$ are required to have at least $\lambda_{vw}$ edge-disjoint $v$ - $w$ paths, has also been studied in literature; see, e.g., [19, 20].

Recently, degree-bounded network design problems have been studied in the online setting [4, 3, 5]. Besides the standard (also called point-to-point) network design problems, Dehghani et al. [4] also studied the Degree-Bounded Group Steiner Tree problem (DB-GST). They gave a negative result, which shows that it is not possible to approximate both cost and weight of the Online DB-GST problem simultaneously, even when the input graph is a star. More specifically, there exists an input demand sequence that forces any algorithm to pay a factor of $\Omega(n)$ either in the cost or in the degree violation. To date there was no non-trivial approximation algorithm for DB-GST, either in the online or offline setting, and even when all the edges have zero-cost. This was listed as an open problem by Hajiaghayi [13] at the 8th Flexible Network Design Workshop (FND 2016).

In this paper, we study a degree-bounded variant of the classic network design problem, the Degree-Bounded Directed Steiner Tree problem (DB-DST). Formally, in DB-DST, we are given an $n$ -vertex directed graph $G=(V,E)$ with costs on edges, a root vertex $r$ , a set of $k$ terminals $K$ , and degree bounds $d_{v}$ for each vertex $v$ . The goal is to find a minimum-cost rooted tree $T\subseteq G$ that contains a path from the root $r$ to every terminal $t\in K$ , while respecting the degree bound, i.e., the out-degree of each vertex $v$ in $T$ is at most $d_{v}$ . Despite being a classic problem, there was no previous positive result on DB-DST as it is a generalization of DB-GST.

The barriers in obtaining any non-trivial approximation algorithm for DB-GST and DB-DST are similar. Most of the previous algorithms to these two problems either run on the metric closure of the input graph [9, 7, 22], require metric-tree embedding [9, 1, 6] or use height-reduction techniques [24, 2, 12, 10], all of which lose track of the degree of the solution subgraph.

We solve the open problem of Hajiaghayi [13], by presenting a bi-criteria $(O(\log k\log n),O(\log^{2}n))$ -approximation algorithm for DB-DST that runs in quasi-polynomial-time (see Section 1.1 for the definition). Our technique expands upon the recent result of Grandoni, Laekhanukit and Li [12] for the Directed Steiner Tree problem. We observe that the algorithm in [12] can be easily extended to the problem with degree bounds. Nevertheless, to amend the degree-constrained problem into their framework, we are required to prove a concentration bound for the degrees, which is rather non-trivial. Notice that the $O(\log n\log k)$ -approximation factor on the cost of the tree is almost tight due to the hardness of $\Omega(\log^{2-\epsilon}n)$ in [14] for Directed Steiner Tree and the slightly improved hardness of $\Omega({\log^{2}n}/\log\log n)$ in [12].

While our result for DB-DST is (almost) tight on the cost guarantee, the degree violation factor $O(\log^{2}n)$ is an $O(\log n)$ factor away from the approximation lower bound of $\Omega(\log n)$ from the set-cover hardness. To understand if the gap can be reduced, we study the special case of DB-DST obtained from the hardness construction in [14], namely the Degree-Bounded Group Steiner Tree problem on trees (DB-GST-T). In this problem, we are given an (undirected) tree $T^{\circ}=(V^{\circ},E^{\circ})$ with edge-costs, a root $r$ , $k$ subsets of vertices (called groups) $O_{1},\ldots,O_{k}\subseteq V$ and a degree bound $d_{v}$ for each vertex $v\in V^{\circ}$ . The goal is to find a minimum-cost subtree $T\subseteq T^{\circ}$ that joins $r$ to at least one vertex from each group $O_{t}$ , for every $t\in[k]$ , while respecting the degree bound, i.e., the number of children of each vertex $v$ in $T$ is at most $d_{v}$ . We present an $(O(\log k\log n),O(\log n))$ -bicriteria approximation algorithm for DB-GST-T. So, the degree violation of our algorithm is tight and the cost-guarantee is almost tight. This improves upon the $O(\log n\log k,\log n\log k)$ -bicriteria approximation algorithm due to Kortsarz and Nutov [18] who observe that the randomized rounding algorithm in [9] also gives a guarantee on degree-violation.

1.1 Our Results

Our first result is an $(O(\log k\log n),O(\log^{2}n))$ -bicriteria approximation for DB-DST that runs in quasi-polynomial time: We say that a randomized algorithm is an $(\alpha,\beta)$ -bicriteria-approximation algorithm if it outputs a tree $T$ containing an $r$ → $t$ path for every terminal $t\in K$ such that the number of children of every vertex $v$ in $T$ is at most $\beta\cdot d_{v}$ , and the expected cost of the tree is at most $\alpha$ times the cost of the optimum tree that does not violate the degree constraints.

Theorem 1.1.

There is a randomized $(O(\log n\log k),O(\log^{2}n))$ -bicriteria approximation algorithm for the degree-bounded directed Steiner tree problem in $n^{O(\log n)}$ -time.

To the best of our knowledge, our result for DB-DST is the first non-trivial bicriteria approximation for the problem. As we mentioned, the $O(\log n\log k)$ -factor for the cost is almost tight due to the hardness results of [14] and [12] for DST. There is a hardness of $\Omega(\log n)$ for the degree violation factor from the set-cover problem, even if we ignore the cost of the output tree.

Remark

As in [12, 10], we could save a factor of $\log\log n$ in the approximation factor for the problem, with a slight increase in the running time. However, this complicates the algorithmic framework. To deliver the algorithmic idea in a cleaner way, we choose to present the results with $O(\log n\log k)$ approximation ratios.

Our second result is for the degree-bounded group Steiner tree problem on trees (DB-GST-T). We obtain an $\big{(}O(\log n\log k),O(\log n)\big{)}$ -bicriteria approximation, which is (almost) tight on both factors:

Theorem 1.2.

There is a randomized $\big{(}O(\log n\log k),O(\log n)\big{)}$ -bicriteria approximation for the degree-bounded group Steiner tree problem on trees.

1.2 Our Techniques

Our algorithm for degree-bounded directed Steiner tree takes ingredients from both [12] and [10]. As in these papers, we consider an optimum solution, and recursively partition it into balanced sub-trees; we then assign a “state” to each of these sub-trees. The tree structure of this recursive partition, as well as all of the states, form what we call a state tree. We solve the problem indirectly, by finding a good state tree, which we can transform back into a corresponding good solution. The state of a sub-tree contains a set of special vertices in the sub-tree that we call portals; these were used in [10] to obtain their improved approximation algorithm for DST. We construct a super-tree $\mathbf{T}^{\circ}$ that contains all possible state trees as sub-trees and reduce the problem considered into that of finding a good sub-tree of small cost in $\mathbf{T}^{\circ}$ . This can be done by formulating a linear program (LP) relaxation and rounding the LP solution using a recursive procedure. The construction of the super-tree and the LP rounding techniques are similar to those in [12]. To extend the algorithm to DB-DST, we need to store the degrees of all of the portals in the state.

This algorithmic framework outputs a so-called “multi-tree”: This is a tree where a vertex or an edge can appear multiple times. Repeating the procedure for $Q=O(\log n\log k)$ times, we obtain a set of $Q$ multi-trees. This process violates the degree requirements and thus we obtain bicriteria approximation results. The analysis of this process is non-trivial as we need to prove a concentration bound on the number of times a vertex appears in a multi-tree.

Our technique for DB-GST-T is in observing that the rounding algorithm for GST-T (no degree bounds) in [9] is indeed a generalization of random walk. As we slightly boost the branching probability by a constant factor, this (almost) does not affect the degree bound, but the probability of connecting the root vertex to each group is amplified dramatically. A drawback is that it also incurs a huge blow-up in the cost. To handle the blow-up, we stop amplifying the branching probability when the connecting probability is sufficiently large. The best (but inaccurate) way to illustrate our algorithm is by considering a random walk from the root vertex to a group $O_{t}$ . We change the random process by branching into two directions simultaneously in each step, and then stop the extra branching when it generates $\Theta(\log n)$ simultaneous random walks. Since we have $O(\log n)$ simultaneous random walks, the cost incurred by the process is blown-up by a factor $O(\log n)$ , but the degree-violation is blown-up by only a factor $2$ . At the same time, the probability of reaching the group $O_{t}$ goes up by a factor $\Omega(\log n)$ . Thus, if we need $O(\log k\log n)$ rounds to reach every group, then we now need only $O(\log k)$ rounds. There is no difference in the cost for running the algorithm for $O(\log k\log n)$ rounds or $O(\log k)$ rounds (with an extra $O(\log n)$ factor in the cost), but it saves a factor in the degree-violation of $O(\log n)$ .

2 Preliminaries for Degree-Bounded Directed Steiner Tree

2.1 Notations and Assumptions

In our algorithm and analysis for the DB-DST problem, a tree is always an out-arborescence. Given a tree $T$ , we use $\mathrm{root}(T)$ to denote its root. Given $T$ and a vertex $v$ in $T$ , we use $\Lambda_{T}(v)$ to denote the set of children of $v$ , and $\Lambda^{*}_{T}(v)$ to denote the set of descendants of $v$ (including $v$ itself) in the tree $T$ . A sub-tree $T^{\prime}$ of $T$ is a weakly-connected sub-graph of $T$ ; such a $T^{\prime}$ must be an out-arborescence. Sometimes, we shall use left and right children to refer to the two children of a vertex in a tree; in this case, the order of the two children is important and will be clearly specified. For an edge $e=(u,v)$ , we use ${\mathrm{tail}}(e)=v$ to denote its tail. For a triple $\xi=(u,v,v^{\prime})$ of three vertices, we use ${\mathrm{second}}(\xi)=v$ and ${\mathrm{third}}(\xi)=v^{\prime}$ to denote the second and third parameter of $\xi$ .

Our input digraph is $G$ . Let $d_{\max}=\max_{v\in V}d_{v}$ . We shall assume each terminal $t\in K$ has only one incoming edge and no outgoing edges in $G$ . This can be assumed w.l.o.g using the following simple operation: For every terminal $t\in K$ that does not satisfy the condition, we add a new vertex $t^{\prime}$ , an edge $(t,t^{\prime})$ and replace $t$ with $t^{\prime}$ in $K$ . We increase $d_{t}$ by 1 and set $d_{t^{\prime}}=0$ .

One more assumption we can make is that each non-terminal $u\in V\setminus K$ has at most 2 outgoing edges in $G$ . To make sure that this holds, we focus on some non-terminal $u$ with $b\geq 3$ outgoing edges. We replace the star centered at $u$ with its $b$ outgoing edges by a gadget which is a full binary-tree rooted at $u$ with $b$ leaves being the out-neighbors of $u$ . For every newly added vertex $u$ , we set $d_{u}=d_{\max}$ . This way every vertex in $G$ will have at most $2$ outgoing edges. The cost of the edges in the gadget can be naturally defined. However, this operation changes the degree of vertices. To address this issue, we define a simple transformation function $\phi_{v}:\mathbb{Z}\to\mathbb{Z}$ for every $v\in V$ as follows: If $v$ is a vertex in the original graph, then $\phi_{v}$ is identically 1. Otherwise, $v$ is a non-root internal vertex of some gadget and we define $\phi_{v}$ to be the identity function. Then we can compute the original degree $\rho_{u}$ of a vertex $u$ in a tree $T$ of $G$ recursively as follows: $\rho_{u}=0$ if $u$ is a leaf, and $\rho_{u}=\sum_{v\in\Lambda_{T}(u)}\phi_{v}(\rho_{v})$ otherwise. So, we require that for every $v$ in the output tree $T$ , the original degree $\rho_{v}$ of $v$ is at most $d_{v}$ .

2.2 Balanced Tree Partition

We shall use the following basic tool as the starting point of our algorithm design. Its proof is elementary and deferred to Appendix A.

Lemma 2.1.

*Let $T=(V_{T},E_{T})$ be an $n$ -vertex binary tree. Then there exists a vertex $v\in V_{T}$ with $n/3<|\Lambda^{*}_{T}(v)|\leq 2n/3+1$ . *

Given a tree $T=(V_{T},E_{T})$ as in the lemma, we can partition it into two trees $T_{1}=(V_{T_{1}},E_{T_{1}})$ and $T_{2}=(V_{T_{2}},E_{T_{2}})$ , where $T_{2}$ contains vertices in $\Lambda^{*}_{T}(v)$ and $T_{1}$ contains vertices in $V_{T}\setminus(\Lambda^{*}_{T}(v)\setminus\{v\})$ . First assume $n\geq 4$ . Since $2n/3+1<n$ , we know that $v\neq\mathrm{root}(T)$ , thus implying $\mathrm{root}(T_{1})=\mathrm{root}(T)\neq\mathrm{root}(T_{2})=v$ , which is a leaf in $T_{1}$ . Consequently, we have $E_{T_{1}}\uplus E_{T_{2}}=E_{T}$ and $V_{T_{1}}\cup V_{T_{2}}=V_{T},V_{T_{1}}\cap V_{T_{2}}=\{\mathrm{root}(T_{2})\}$ . Moreover, $|V_{T_{1}}|,|V_{T_{2}}|\leq 2n/3+1$ , which is strictly less than $n$ . Thus, $T_{1}$ and $T_{2}$ are sub-trees that form a balanced partition of (the edges of) $T$ . We call this procedure the balanced tree partitioning on $T$ .

When $n=3$ , there are 2 types of trees. If the root has two children, then we could not make both $|V_{T_{1}}|$ and $|V_{T_{2}}|$ to be smaller than $3$ . If the tree is a path of 2 edges, then we can choose $v$ to be the middle vertex and the procedure partitions the tree into two edges. Later, we shall apply the balanced tree partitioning procedure recursively. We stop the recursion when the tree is either an edge, or only contains the root and its 2 children. In other words, the tree has only 1 level of edges.

2.3 Multi-Tree

We define a multi-tree in $G$ as an intermediate structure. It is simply a tree over multi-sets of vertices and edges in $G$ :

Definition 2.2 (Multi-Tree).

Given the input digraph $G=(V,E)$ , a multi-tree in $G$ is a tree $T=(V_{T},E_{T})$ where every vertex $a\in V_{T}$ is associated with a label $\mathrm{label}(a)\in V$ such that for every $(a,b)\in E_{T}$ , we have $(\mathrm{label}(a),\mathrm{label}(b))\in E$ .

We say that each vertex $a\in V_{T}$ is a copy of the vertex $\mathrm{label}(a)\in V$ and each edge $(a,b)\in E_{T}$ is a copy of the edge $(\mathrm{label}(a),\mathrm{label}(b))\in E$ . So, we say that $T$ is rooted at a copy of $v\in V$ , if $\mathrm{label}(\mathrm{root}(T))=v$ , and $T$ contains a copy of some $v\in V$ if there exists some $a\in V_{T}$ with $\mathrm{label}(a)=v$ . We extend the costs $c_{e}$ , the functions $\phi_{v}$ and the degree bounds $d_{v}$ automatically to their copies in a multi-tree. That means, for a vertex $a$ and an edge $(a,b)$ in a multi-tree, $d_{a}=d_{\mathrm{label}(a)},\phi_{a}\equiv\phi_{\mathrm{label}(a)}$ and $c_{(a,b)}=c_{(\mathrm{label}(a),\mathrm{label}(b))}$ . The cost of a multi-tree $T=(V_{T},E_{T})$ is naturally defined as $\mathrm{cost}(T)=\sum_{e\in E_{T}}c_{e}$ . Given a multi-tree $T$ , the “original degree” $\rho_{a}$ of a vertex $a$ can be computed in the same way as before.

Definition 2.3 (Good Multi-Trees).

Let $T=(V_{T},E_{T})$ be a multi-tree in $G$ . We say that $T$ is good if it is rooted at a copy of $r$ , has leaves being copies of terminals, and the original degree of any vertex $a$ in $T$ is at most $d_{a}$ .

We can then state the main theorem for DB-DST, which we prove in Sections 3 to 5.

Theorem 2.4 (Main Theorem for DB-DST).

There is an $n^{O(\log n)}$ -time randomized algorithm that outputs a good multi-tree $T=(V_{T},E_{T})$ such that

(2.4a)

$\operatorname*{\mathbb{E}}_{T}[\mathrm{cost}(T)]\leq\mathrm{opt}$ , where $\mathrm{opt}$ is the cost of the optimum solution for the instance. 2. (2.4b)

For every $t\in K$ , we have $\Pr_{T}[V_{T}\text{ contains a copy of }t]\geq\Omega(1/\log n)$ . 3. (2.4c)

For some $s=\Omega\left(\frac{1}{\log n}\right)$ , it holds, for every $v\in V$ , that

[TABLE]

We show that this implies Theorem 1.1.

Proof of Theorem 1.1.

We run the algorithm in Theorem 2.4 $Q$ times to obtain $Q$ good multi-trees $T_{1},T_{2},\cdots,T_{Q}$ , for some large enough $Q=O(\log n\log k)$ . Our output will contain all edges that appear in the $Q$ multi-trees. Notice that the output may not be a tree, but we can remove edges so that it becomes a tree. Applying union bound, all terminals appear in the union of the $Q$ trees with probability at least $0.9$ , when $Q$ is big enough. By Property (2.4c) in the theorem statement, we have for every $v$ ,

[TABLE]

The above inequality holds since the $Q$ trees are produced independently.

Thus, if $M=O(\log n)$ is big enough, by Markov’s inequality we have

[TABLE]

The event on the left side is exactly that the number of copies of $v$ in $T_{1},\cdots,T_{Q}$ is at least $M/s$ .

Thus, with probability at least $0.8$ , every terminal $t$ appears in one of the $Q$ trees and every vertex $v$ appears at most $M/s=O(\log^{2}n)$ times in $T_{1},T_{2},\cdots,T_{Q}$ . Taking the union of all trees and reflecting the edges in original graph $G$ , we have a sub-graph $G^{\prime}$ of $G$ that contains a path from $r$ to every terminal $t\in K$ . The total cost of edges in $G^{\prime}$ is at most $O(\log n\log k)\cdot\mathrm{opt}$ . For every vertex $v$ , the out-degree of $v$ in $G^{\prime}$ will be at most $(M/s)d_{v}=O(\log^{2}n)d_{v}$ . We can take an arbitrary Steiner tree $T$ in $G^{\prime}$ as the output of the algorithm. This gives us an $(O(\log n\log k),O(\log^{2}n))$ -bicriteria approximation algorithm for the degree-bounded directed Steiner tree problem. The running time of the algorithm is $n^{O(\log n)}$ . ∎

Organization

The remaining part of the paper is organized as follows. In Section 3, we define states and good state trees. In Section 4, we argue that the problem of finding a small cost valid tree can be reduced to that of finding a small cost state-tree. In Section 5, we present our linear programming rounding algorithm that finishes the proof of Theorem 2.4. Section 6 is dedicated to the proof of Theorem 1.2 for the degree-bounded group Steiner tree problem on trees (DB-GST-T).

3 States and State-Trees

Given the optimum tree $T^{*}$ (which is binary by our assumptions) for the DB-DST problem, we can apply the balanced tree partitioning recursively to obtain a decomposition tree: We start from $T^{*}$ and partition it into two trees $T_{1}$ and $T_{2}$ using the balanced-tree-partitioning procedure, and then recursively partition $T_{1}$ and $T_{2}$ until we obtain sub-trees with 1 level of edges: Such a tree contains either a single edge, or two edges from the root. Then the decomposition tree is a full binary tree where each node corresponds to a sub-tree of $T^{*}$ . Due to the balance condition, the height of the tree will be $O(\log n)$ . Throughout the paper, we shall use $h=\Theta(\log n)$ to denote an upper bound on the height of this decomposition tree.

Thanks to its small depth, the decomposition tree becomes the object of interest. However, as each node in the tree corresponds to a sub-tree of the optimum solution $T^{*}$ , it contains too much information for the algorithm to handle. Instead, we shall only extract a small piece of information from each node that we call the state of the node. On one hand, a state contains much less information than a sub-tree does, so we can afford to enumerate all possible states for a node. On the other hand, the states of nodes in the decomposition tree still contain enough information for us to check whether the correspondent multi-tree is good. We call the binary tree of states a state tree; we require in a good state tree, the states of nodes satisfy some consistency constraints. Then we can establish a two-direction connection between good multi-trees and good state trees.

Given a valid tree $T$ in $G$ and a sub-tree $T^{\prime}$ of $T$ , we now start to make definitions related to the state of $T^{\prime}$ w.r.t $T$ . It is convenient to think that $T$ is the optimum tree $T^{*}$ and $T^{\prime}$ is a sub-tree of $T=T^{*}$ obtained from the recursive balanced-partitioning procedure, since this is how we use the definitions. However, the definitions are w.r.t general $T$ and $T^{\prime}$ ; from now on till the end of Section 3, we fix any valid tree $T$ and its sub-tree $T^{\prime}$ .

3.1 Portals

Other than $\mathrm{root}(T^{\prime})$ , the state for $T^{\prime}$ w.r.t $T$ contains the set of portals of $T^{\prime}$ :

Definition 3.1.

A vertex $v$ in $T^{\prime}$ is a portal in $T^{\prime}$ , if $v$ is $\mathrm{root}(T^{\prime})$ or a non-terminal leaf of $T^{\prime}$ .

In general, the set of portals of $T^{\prime}$ can be large, but if $T^{\prime}$ is obtained from the recursive balanced-tree-partitioning procedure for $T$ , then the number of portals can be shown to be at most $h+1$ . As we shall often use the root and set of portals together, we make the following definition:

Definition 3.2 (Root-Portals-Pair).

$(r^{\prime},S)$ * is called a root-portals-pair if $r^{\prime}\in S\subseteq V\setminus K$ .*

It is easy to see that the root-portal-pairs for an internal node of the decomposition tree and its two children satisfy some properties stated in the following definition:

Definition 3.3 (Allowable Child-Pair).

Given three root-portals-pairs $(r^{\prime},S),(r^{\prime},S_{1})$ and $(r^{\prime\prime},S_{2})$ , we say $((r^{\prime},S_{1}),(r^{\prime\prime},S_{2}))$ is an allowable child-pair of $(r^{\prime},S)$ if $r^{\prime\prime}\notin S,S_{1}\cup S_{2}=S\cup\{r^{\prime\prime}\}$ and $S_{1}\cap S_{2}=\{r^{\prime\prime}\}$ .

The following claim motivates the definition of allowable child pairs:

Claim 3.4.

Assume $T^{\prime}=(V^{\prime},E^{\prime})$ contains at least 2 levels of edges. Let $T^{\prime}_{1}=(V^{\prime}_{1},E^{\prime}_{1})$ and $T^{\prime}_{2}=(V^{\prime}_{2},E^{\prime}_{2})$ be the two sub-trees obtained by applying the balanced tree partitioning on $T^{\prime}$ . Let $r^{\prime}=\mathrm{root}(T^{\prime})=\mathrm{root}(T^{\prime}_{1})$ , $r^{\prime\prime}=\mathrm{root}(T^{\prime}_{2})\neq r^{\prime}$ and $S,S_{1},S_{2}$ be the sets of portals in $T^{\prime},T^{\prime}_{1},T^{\prime}_{2}$ respectively. Then, $((r^{\prime},S_{1}),(r^{\prime\prime},S_{2}))$ is an allowable child-pair of $(r^{\prime},S)$ .

Proof.

First, $r^{\prime\prime}$ is not a portal of $T^{\prime}$ since it is a non-root internal vertex in of $T^{\prime}$ . Second, it is easy to see that $S_{1}=(S\cup\{r^{\prime\prime}\})\cap V^{\prime}_{1}$ and $S_{2}=(S\cup\{r^{\prime\prime}\})\cap V^{\prime}_{2}$ . So, $S_{1}\cup S_{2}=S\cup\{r^{\prime\prime}\}$ and $S_{1}\cap S_{2}=\{r^{\prime\prime}\}$ . ∎

3.2 Degree Vectors

The next piece of the information in a state is a degree vector:

Definition 3.5.

A degree vector for a set $S\subseteq V\setminus K$ is a vector $\rho=(\rho_{v})_{v\in S}$ , where $\rho_{v}$ is an integer in $[1,d_{v}]$ for every $v\in S$ .

Supposedly, $\rho_{v}$ will be the original degree of $v$ in the tree $T$ .

Definition 3.6 (Consistency of degree vectors).

Given a root-portals-pair $(r^{\prime},S)$ , an allowable child-pair $((r^{\prime},S_{1}),(r^{\prime\prime},S_{2}))$ of $(r^{\prime},S)$ , three degree vectors $\rho,\rho^{1}$ and $\rho^{2}$ for $S,S_{1}$ and $S_{2}$ respectively, we say $\rho^{1}$ and $\rho^{2}$ are consistent with $\rho$ , if

•

for every $v\in S_{1}\setminus\{r^{\prime\prime}\}$ , we have $\rho_{v}=\rho^{1}_{v}$ ,

•

for every $v\in S_{2}\setminus\{r^{\prime\prime}\}$ , we have $\rho_{v}=\rho^{2}_{v}$ and

•

$\rho^{1}_{r^{\prime\prime}}=\rho^{2}_{r^{\prime\prime}}$ .

So, the degree vectors are consistent if there is no contradictory information among them.

Definition 3.7 (Edge/Triple Agreeing with Degree Vector).

Given a root-portals-pair $(r^{\prime},S)$ with $|S|\leq 2$ , a degree vector $\rho$ for $S$ , and an edge $(r^{\prime},v)\in E$ with $\{r^{\prime},v\}\setminus K=S$ , we say $(r^{\prime},v)$ agrees with $\rho$ if $\rho_{r^{\prime}}=(\phi_{v}(\rho_{v})\text{ or }1)$ , where $(\phi_{v}(\rho_{v})\text{ or }1)$ denotes $\phi_{v}(\rho_{v})$ if $\rho_{v}$ is defined (i.e, if $v\in S$ ) and $1$ otherwise.

Similarly, given a root-portals-pair $(r^{\prime},S)$ with $|S|\leq 3$ , a degree vector $\rho$ for $S$ , and two edges $(r^{\prime},v),(r^{\prime},v^{\prime})\in E$ such that $\{r^{\prime},v,v^{\prime}\}\setminus K=S$ , we say the triple $(r^{\prime},v,v^{\prime})$ agrees with $\rho$ if $\rho_{r^{\prime}}=(\phi_{v}(\rho_{v})\text{ or }1)+(\phi_{v^{\prime}}(\rho_{v^{\prime}})\text{ or }1)$ .

Notice that in the above definition either $v\in S$ or $v\in K$ . In the former case, $\rho_{v}$ is defined; in the latter case $\rho_{v}$ is not defined but we know $\phi_{v}$ is identically 1. The same argument holds for $v^{\prime}$ . The definition corresponds to the case when $T^{\prime}$ is a base case of the recursive balanced tree partitioning, i.e., $T^{\prime}$ contains only 1 level of edges. If $T^{\prime}$ contains an edge $e=(r^{\prime},v)$ , then the portal set of $T^{\prime}$ is $\{r^{\prime},v\}\setminus K$ . We shall have $\rho_{r^{\prime}}=\phi_{v}(\rho_{v})\text{ or }1$ . Thus, if $\rho$ is restricted to the portal set, we have $\rho_{r^{\prime}}=(\phi_{v}(\rho_{v})\text{ or }1)$ . Similarly, if $T^{\prime}$ contains 3 vertices $(r^{\prime},v,v^{\prime})$ with $r^{\prime}$ being the root, then we must have $\rho_{r^{\prime}}=(\phi_{v}(\rho_{v})\text{ or }1)+(\phi_{v^{\prime}}(\rho_{v^{\prime}})\text{ or }1)$ .

3.3 States and Good State-Trees

With degree vectors, we can define states and good state-trees:

Definition 3.8.

A state is a tuple $(r^{\prime},S,\rho)$ where $(r^{\prime},S)$ is a root-portals-pair and $\rho$ is a degree vector for $S$ .

The state of the tree $T^{\prime}$ w.r.t $T$ is the tuple $(r^{\prime},S,\rho)$ with $r^{\prime}=\mathrm{root}(T^{\prime})$ , $S$ being the set of portals in $T^{\prime}$ , and $\rho$ being the vector of original degrees of vertices in $S$ w.r.t the tree $T$ .

Definition 3.9 (Good State Trees).

A good state tree* is a full binary tree $\tau$ of depth at most $h$ , where every node ${p}$ is associated with a state $(r^{\prime}_{p},S_{p},\rho^{p})$ , and every leaf ${o}$ is associated with either an edge $e_{o}\in E$ or a triple $\xi_{o}$ such that the following conditions hold.*

(3.9a)

$\left(r^{\prime}_{\mathrm{root}(\tau)},S_{\mathrm{root}(\tau)}\right)=(r,\{r\})$ *. * 2. (3.9b)

For any leaf ${o}$ of $\tau$ , either $e_{o}$ or $\xi_{o}$ agrees with $\rho^{o}$ . 3. (3.9c)

For an internal node ${p}$ in $\tau$ , letting ${q}$ and ${o}$ be the left and right children of ${p}$ , then the pair $((r^{\prime}_{q},S_{q}),(r^{\prime}_{o},S_{o}))$ is an allowable child-pair of $(r^{\prime}_{p},S_{p})$ (so, $r^{\prime}_{q}=r^{\prime}_{p}\neq r^{\prime}_{o}$ ), and $\rho^{q}$ and $\rho^{o}$ are consistent with $\rho^{p}$ .

We say that a terminal $t\in K$ is involved in a good state tree $\tau$ if there exists a leaf ${o}$ of $\tau$ with $t={\mathrm{tail}}(e_{o})$ , or $t\in\{{\mathrm{second}}(\xi_{o}),{\mathrm{third}}(\xi_{o})\}$ .

Given a good state tree $\tau$ , and a leaf ${o}$ in $\tau$ , we define the cost $c({o})$ as follows. If $e_{o}$ is defined, then we define $c(o)=c_{e_{o}}$ ; otherwise, define $c(o)=c_{(r^{\prime}_{o},{\mathrm{second}}(\xi_{o}))}+c_{(r^{\prime}_{o},{\mathrm{third}}(\xi_{o}))}$ . The cost of a state-tree $\tau$ is defined as $\mathrm{cost}(\tau):=\sum_{{o}\text{ leaf of }\tau}c({o})$ .

4 Reduction to Finding Good State-Trees

4.1 From a Valid Tree to a Good State-Tree Involving All Terminals

In this section, we show that the decomposition tree of the optimum tree $T^{*}$ can be turned into a good state tree $\tau^{*}$ with cost $\mathrm{cost}(\tau^{*})=\mathrm{cost}(T^{*})$ that involves all terminals. As we alluded, the state tree $\tau^{*}$ is constructed by taking the state for each node in the decomposition tree for $T^{*}$ . Formally, it is obtained by calling $\mathrm{gen\mathchar 45\relax state\mathchar 45\relax tree}(T^{*})$ (defined in Algorithm 1). In the algorithm $\rho^{T^{*}}$ is the vector of original degrees of all vertices in $T^{*}$ . The procedure is only for analysis purpose; it is not a part of our algorithm.

Lemma 4.1.

$\tau^{*}$ * is a good state tree involving all terminals and $\mathrm{cost}(\tau^{*})=\mathrm{cost}(T^{*})$ .*

Proof.

We first show that $\tau^{*}$ is a good state tree, by showing that it satisfies all the properties in Definition 3.9. Property (3.9a) trivially holds by the way we define the parameters for the root recursion of $\mathrm{gen\mathchar 45\relax state\mathchar 45\relax tree}$ . Property (3.9b) holds by that each $\rho^{p}$ is $\rho^{T^{*}}$ restricted to $S^{p}$ . Property (3.9c) follows from the same facts and Claim 3.4. $\mathrm{cost}(\tau^{*})=\sum_{e\in E_{T^{*}}}c_{e}=\mathrm{cost}(T^{*})$ since every edge in $T^{*}$ counted exactly once in $\tau^{*}$ . ∎

4.2 From a Good State Tree to a Good Multi-Tree

Now we focus on the other direction of the reduction. Suppose we are given a good state tree $\tau$ , and our goal is to construct a good multi-tree $T$ with $\mathrm{cost}(T)=\mathrm{cost}(\tau)$ . Moreover, if a terminal $t\in K$ is involved in $\tau$ , then $T$ contains a copy of $t$ .

The multi-tree $T$ is constructed by joining the edges associated with all leaf nodes ${o}$ in $\tau$ using a recursive procedure. For each node ${p}$ in $\tau$ we shall construct a multi-tree $T_{p}$ for ${p}$ , as well as a mapping $\pi_{p}$ from $S_{p}$ to vertices in $T_{p}$ . The multi-tree $T_{p}$ and the mapping $\pi_{p}$ satisfy the following properties:

(P1)

For every $v\in S^{p}$ , we have $\mathrm{label}(\pi_{{p}}(v))=v$ ; that is, $\pi_{{p}}(v)$ is a copy of $v$ . 2. (P2)

$\pi_{p}(r^{\prime}_{p})=\mathrm{root}(T_{p})$ .

In particular, the two properties imply that $\mathrm{root}(T_{p})$ is a copy of $r^{\prime}_{p}$ .

The trees and mappings are constructed from the bottom to the top of the tree $\tau$ . Focus on a leaf node ${p}$ with $e_{p}=(r^{\prime},v)$ . If $e_{p}$ is defined, then $T_{p}$ only contains a copy of the edge $(r^{\prime},v)$ . $\pi_{p}$ maps $r^{\prime}$ to the copy of $r^{\prime}$ , and if $v\notin K$ (thus, $v\in S_{p}$ ), $v$ to the copy of $v$ in $T_{p}$ . Otherwise $\xi_{p}$ is defined. Then $T_{p}$ contains a tree with two edges: a copy of $(r^{\prime}_{p},{\mathrm{second}}(\xi_{p}))$ and a copy of $(r^{\prime}_{p},{\mathrm{third}}(\xi_{p}))$ . $\pi_{p}$ can also be defined naturally.

Now consider the case that ${p}$ is an internal node and let ${q}$ and ${o}$ be its left and right children. Then, we have $r^{\prime}_{p}=r^{\prime}_{q},r^{\prime}_{o}\notin S_{p},S_{q}\cup S_{o}=S_{p}\cup\{r^{\prime}_{o}\}$ and $S_{q}\cap S_{o}=\{r^{\prime}_{o}\}$ by Property (3.9c). Then we identify $\pi_{{q}}(r^{\prime}_{o})$ with $\pi_{{o}}(r^{\prime}_{o})=\mathrm{root}(T_{{o}})$ , and then the multi-tree $T_{p}$ is the new tree containing vertices in $T_{{q}}$ and $T_{{o}}$ . Notice that both $\pi_{{q}}(r^{\prime}_{o})$ and $\pi_{{o}}(r^{\prime}_{o})$ are copies of $r^{\prime}_{o}$ ; thus the obtained $T_{p}$ can be well-defined. The mapping $\pi_{p}$ is just the combination of $\pi_{{q}}$ and $\pi_{{o}}$ : For a vertex $v\in S_{q}$ , let $\pi_{p}(v)=\pi_{{q}}(v)$ ; for a vertex $v\in S_{o}$ , let $\pi_{p}(v)=\pi_{{o}}(v)$ ; since $S_{q}\cap S_{o}=\{r^{\prime}_{o}\}$ and we identified $\pi_{{q}}(r^{\prime}_{o})$ with $\pi_{{o}}(r^{\prime}_{o})$ , the mapping is well-defined. Also, it is easy to see that (P1) and (P2) holds for $T_{p}$ and $\pi_{p}$ .

Our final multi-tree for $\tau$ will be $T=T_{\mathrm{root}(\tau)}$ . It is straightforward to see that if $t\in K$ is involved in $\tau$ , then $T$ contains a copy of $t$ . Notice that all the $\rho^{p}$ -vectors are consistent with each other, and for every leaf $o$ , $e_{o}$ or $\epsilon_{o}$ agrees with $\rho^{o}$ . Thus, aggregating all the $\rho^{p}$ vectors will recover the vector $\rho^{T}$ of original degrees of vertices in $\rho^{T}$ . So, the multi-tree $T$ is good since every $v$ in $T$ has $\rho^{T}_{v}\in[1,d_{v}]$ . The cost of $T$ is $\sum_{e\in E_{T}}c_{e}=\sum_{o:\text{ leaves of }\tau}c(o)=\mathrm{cost}(\tau)$ .

5 Finding a Good State Tree using LP Rounding

5.1 Extended State Trees and Construction of $\mathbf{T}^{0}$

With the relationship between good multi-trees and good state trees established, we can now focus on the problem of finding a good state-tree of small cost involving many terminals. We shall construct a quasi-polynomial sized tree $\mathbf{T}^{\circ}$ so that every good state-tree $\tau$ corresponds a sub-tree $\mathbf{T}$ of $\mathbf{T}^{\circ}$ satisfying some property. Roughly speaking, $\mathbf{T}^{\circ}$ is the “super-set” of all potential good state-trees $\tau$ . However, since the consistency conditions are defined over three states for a parent and its two children, it is more convenient to insert a “virtual” node between every internal node and its two children. Also, it is convenient to break a leaf state node $o$ into two nodes, one containing the state information and the other containing $e_{o}$ or $\xi_{o}$ . Formally, for a good state-tree $\tau$ , we construct a correspondent tree $\mathbf{T}$ as follows.

Let $\mathbf{T}$ be a copy of $\tau$ . All nodes in $\mathbf{T}$ are called state nodes. 2. 2.

For every internal state node $p$ in $\mathbf{T}$ with left and right children $p_{1}$ and $p_{2}$ , we create a virtual node $q$ and replace the two edges $(p,p_{1})$ and $(p,p_{2})$ with 3 edges $(p,q),(q,p_{1})$ and $(q,p_{2})$ ; $p_{1}$ is still the left child and $p_{2}$ is the right child. 3. 3.

For every leaf state node $p$ , we create a base node $o$ and let $o$ be the child of $p$ . Then we move the $e_{p}$ or $\xi_{p}$ information from the node $p$ to node $o$ : If $e_{p}$ is defined, then we let $e_{o}=e_{p}$ and undefine $e_{p}$ ; otherwise, let $\xi_{o}=\xi_{p}$ and undefine $\xi_{p}$ . 4. 4.

We add a super node ${\mathbf{r}}$ and an edge from ${\mathbf{r}}$ to the root of $\mathbf{T}$ . ${\mathbf{r}}$ will be the new root for $\mathbf{T}$ .

We call this $\mathbf{T}$ the extended state-tree for $\tau$ ; we say $\mathbf{T}$ is good if its correspondent $\tau$ is good. Clearly, there is a 1-to-1 correspondence between good state trees and good extended state trees.

Our $\mathbf{T}^{\circ}$ will be the “super-set” of all potential good extended state trees $\mathbf{T}$ . Formally, we create a super node ${\mathbf{r}}$ to be the root of $\mathbf{T}^{\circ}$ . Then, for every $\rho_{r}\in[1,d_{r}]$ , we call $\mathrm{cnstr\mathchar 45\relax}\mathbf{T}^{\circ}(0,r,\{r\},\rho=(\rho_{r}))$ to obtain a tree and let its root be a child of ${\mathbf{r}}$ .

The following claim is immediate from the construction of $\mathbf{T}^{\circ}$ .

Claim 5.1.

A subtree $\mathbf{T}$ of $\mathbf{T}^{\circ}$ with $\mathrm{root}(\mathbf{T})=\mathrm{root}(\mathbf{T}^{\circ})$ is a good extended state tree if and only if the following happens:

•

The super node in $\mathbf{T}$ has exactly one child (which is a state node).

•

Each state node in $\mathbf{T}$ has exactly one child (which is an base node or a virtual node).

•

For each virtual node $q$ in $\mathbf{T}$ , both $q$ ’s children in $\mathbf{T}^{\circ}$ are in $\mathbf{T}$ .

On the other hand, every good extended tree $\mathbf{T}$ of depth at most $h+1$ is a sub-tree of $\mathbf{T}^{\circ}$ with root being $\mathrm{root}(\mathbf{T}^{\circ})$ .

Also, we say that a vertex $v$ is involved in $\mathbf{T}$ if there is an base node $o$ in $\mathbf{T}$ with $v={\mathrm{tail}}(e_{o})$ or $v\in\{{\mathrm{second}}(\xi_{o}),{\mathrm{third}}(\xi_{o})\}$ . The cost of $\mathbf{T}$ , denoted as $\mathrm{cost}(\mathbf{T})$ , is defined the sum of $c(o)$ over all base nodes in $\mathbf{T}$ . So, the problem now becomes finding a small-cost good extended state tree in $\mathbf{T}^{\circ}$ that involves each terminal with large probability.

5.2 LP Formulation

We formulate an LP relaxation for our task. Let $\mathbf{V}^{\circ}$ be the set of nodes in $\mathbf{T}^{\circ}$ , ${\mathbf{r}}=\mathrm{root}(\mathbf{T}^{\circ})$ and let $\mathbf{V}^{\circ}_{\mathrm{state}},\mathbf{V}^{\circ}_{\mathrm{virt}}$ and $\mathbf{V}^{\circ}_{\mathrm{base}}$ be the sets of state, virtual and base nodes in $\mathbf{T}^{\circ}$ respectively. Notice that there is only one super node, which is the root ${\mathbf{r}}$ . For every $t\in K$ , let ${\mathbf{O}}_{t}=\left\{t\in\mathbf{V}^{\circ}_{\mathrm{base}}:t={\mathrm{tail}}(e_{o})\text{ or }t\in\{{\mathrm{second}}(\xi_{o}),{\mathrm{third}}(\xi_{o})\}\right\}$ be the set of base nodes involving $t$ . Let $\mathbf{T}^{*}$ be our target good extended state tree; this is the tree correspondent to the good state tree $\tau^{*}$ . Then, in our LP, we have a variable $x_{p}$ for every $p\in\mathbf{V}^{\circ}$ , that indicates whether $p$ is in the $\mathbf{T}^{*}$ or not.

[TABLE]

The objective function of LP (1) is to minimize the total cost of all leaves in $\mathbf{T}^{*}$ . (2) requires that for every state or super node $p$ in $\mathbf{T}^{*}$ , exactly one child of $p$ is in $\mathbf{T}^{*}$ . (3) requires that a virtual node $q$ in $\mathbf{T}^{*}$ has both its children in $\mathbf{T}^{*}$ . (5) says for every node $p$ in $\mathbf{T}^{*}$ and every terminal $t\in K$ , there is a most one descendant base node $o$ of $p$ that is in ${\mathbf{O}}_{t}$ . In the whole tree $\mathbf{T}^{*}$ , exactly one leaf node $o$ has $t={\mathrm{tail}}(e_{o})$ or $t\in\{{\mathrm{second}}(\xi_{o}),{\mathrm{third}}(\xi_{o})\}$ , for every $t\in K$ (Constraint (6)); in the LP, all the variables are between [math] and $1$ (Constraint (4)).

Notice that (5) for $p={\mathbf{r}}$ and any $t\in K$ and (6) for the same $t$ imply that $x_{\mathbf{r}}=1$ . (2) and (3) imply that the $x$ values over the nodes of a root-to-leaf path in $\mathbf{T}^{\circ}$ are non-increasing.

5.3 Rounding Algorithm

Given a valid solution $x$ to LP (1), our rounding algorithm will round it to obtain set $\mathbf{V}\subseteq\mathbf{V}^{\circ}$ , which induces a good state tree. The algorithm is very similar to that of [9] with the only one difference: For every state node or super-node $p$ that is added to $\mathbf{V}$ , we add exactly one child $q$ of $p$ to $\mathbf{V}$ , while the algorithm of [9] makes independent decisions for each child. The algorithm is formally described in Algorithm 3. In the main algorithm, we simply call ${\mathrm{round}}({\mathbf{r}})$ .

It is straightforward to see that the tree induced by ${\mathrm{round}}({\mathbf{r}})$ is a good extended state tree. The following claim also holds:

Claim 5.2.

Let $p\in\mathbf{V}^{\circ}$ and $q\in\Lambda^{*}_{\mathbf{T}^{\circ}}(p)$ . Let $\mathbf{V}$ be the random set returned by ${\mathrm{round}}(p)$ . Then we have $\Pr[q\in\mathbf{V}]=\frac{x_{q}}{x_{p}}$ .

Applying the above claim for $p={\mathbf{r}}$ and every $q\in\mathbf{V}^{\circ}_{\mathrm{base}}$ , we have that the expected cost of the tree induced by $\mathbf{V}$ is exactly $\mathrm{cost}(x)$ .

The main theorem we need about the rounding algorithm is as follows:

Theorem 5.3.

Let $\mathbf{V}$ be the random set returned by ${\mathrm{round}}({\mathbf{r}})$ . Then, for any terminal $t\in K$ we have

[TABLE]

Theorem 5.3 was proved [9] for the original rounding algorithm and was reproved in [22]. However, adapting the analysis to our slightly different rounding algorithm is straightforward and thus we omit the proof of the theorem here.

We now wrap up and finish the proof of the main theorem (Theorem 2.4) except for Property (2.4c), which will be proved in Section 5.4.

We solve LP(1) to obtain a solution $x$ . Notice that $\mathrm{cost}(x)\leq\mathrm{cost}(\mathbf{T}^{*})=\mathrm{cost}(\tau^{*})=\mathrm{cost}(T^{*})$ . Let $\mathbf{V}\leftarrow{\mathrm{round}}({\mathbf{r}})$ . Then by Claim 5.1 and the rounding algorithm, the tree $\mathbf{T}$ induced by $\mathbf{V}$ is a good extended state tree. Let $\tau$ be the good state tree correspondent to $\mathbf{T}$ , and let $T$ be the good multi-tree in $G$ constructed using the procedure in Section 4.2. The cost of the multi-tree $T$ is at most $\mathrm{cost}(x)$ . By Theorem 5.3, for every $t\in K$ , the probability that $t$ is involved $T$ is at least $1/(h+1)=\Omega(1/\log n)$ .

Let us consider the running time of the algorithmic framework, which is polynomial on the size of the tree $\mathbf{T}^{\circ}$ . First notice that if $((r^{\prime},S_{1}),(r^{\prime\prime},S_{2}))$ is an allowable child pair of $(r^{\prime},S)$ , then we have $|S_{1}|,|S_{2}|\leq|S|+1$ since $S_{1}\cup S_{2}=S\cup\{r^{\prime\prime}\}$ . Thus, a state-node $p$ at the $h^{\prime}$ -th level in $\mathbf{T}^{\circ}$ (the children of ${\mathbf{r}}$ have level [math] and for simplicity we do not consider super and virtual nodes when counting levels) has $|S_{p}|\leq h^{\prime}+1$ . Thus, every state node $p$ in $\mathbf{T}^{\circ}$ has $|S_{p}|\leq h+1$ .

Then we consider the degree of the tree $\mathbf{T}^{\circ}$ , which is the maximum number of possible children of a state node $p$ with $(r^{\prime}_{p},S_{p},\rho^{p})=(r^{\prime},S,\rho)$ . First, there are at most $n\times 2^{|S_{p}|}\leq n\cdot 2^{h+1}$ different allowable child pairs $((r^{\prime},S_{1}),(r^{\prime\prime},S_{2}))$ of the pair $(r^{\prime},S)$ : there are at most $n$ choices for $r^{\prime\prime}$ and $2^{h}$ ways to split $S$ into $S_{1}$ and $S_{2}$ . Then, for a fixed allowable child pair $((r^{\prime},S_{1}),(r^{\prime\prime},S_{2}))$ we consider the number of pairs of degree vectors $\big{(}\rho^{1},\rho^{2}\big{)}$ such that $\rho^{1}$ and $\rho^{2}$ are consistent with $\rho$ . This is determined by the value of $\rho^{1}_{r^{\prime\prime}}=\rho^{2}_{r^{\prime\prime}}$ , which has at most $d_{\max}$ possibilities. So, the number of virtual children of a state node is at most $n\cdot 2^{h+1}\cdot d_{\max}=O(\mathrm{poly}(n)$ since $h=O(\log n)$ . The number of child base nodes of $p$ is at most $n^{2}$ . Since the height of the tree $\mathbf{T}^{\circ}$ is at most $O(\log n)$ , its size bounded by $(\mathrm{poly}(n))^{O(\log n)}=n^{O(\log n)}$ . So the running time of the LP rounding algorithm is $n^{O(\log n)}$ . This finishes the proof of Theorems 2.4 except for Property (2.4c).

5.4 Concentration Bound on Number of Copies of a Vertex Appearing in $T$

Finally, we prove Property (2.4c) in Theorem 2.4. To this end, we shall fix a vertex $v\in V$ . For every vertex $p\in\mathbf{V}^{\circ}$ , let $z_{p}=\sum_{o\in\Lambda^{*}_{\mathbf{T}^{\circ}}(p)\cap{\mathbf{O}}_{v}}x_{o}$ . By Constraint (5), we have $z_{p}\leq x_{p}$ . Let $m_{p}=|\Lambda^{*}_{\mathbf{T}^{\circ}}(p)\cap{\mathbf{O}}_{v}\cap\mathbf{V}|$ be the total number of nodes in $\Lambda^{*}_{\mathbf{T}^{\circ}}(p)\cap{\mathbf{O}}_{v}$ that are selected by the rounding algorithm.

As is typical, we shall introduce a parameter $s>0$ and consider the expectation the random exponential variables $\mathbf{e}^{sm_{p}}$ (we use $\mathbf{e}$ for the natural constant). We shall bound $\operatorname*{\mathbb{E}}[\mathbf{e}^{sm_{p}}|p\in\mathbf{V}]$ from bottom to top by induction. So, in this proof, it is more convenient to for us to use a different definition of levels: the level of a node $p$ in $\mathbf{T}^{\circ}$ is the maximum number of edges in a path in $\mathbf{T}^{\circ}$ starting from $p$ . So, the leaves have level [math] and for an internal node $p$ in $\mathbf{T}^{\circ}$ , the level of $p$ is 1 plus the maximum of the level of $q$ over all children $q$ of $p$ . We define an $\alpha_{i}$ for every integer $i\geq 0$ as $\alpha_{0}=\mathbf{e}^{s}$ and $\alpha^{i}=\mathbf{e}^{\alpha_{i-1}-1},\forall i\geq 1$ . Notice that $\alpha_{0},\alpha_{1},\cdots$ is an increasing sequence. Thus, we can induce the following lemma.

Lemma 5.4.

For any node $p$ be in $\mathbf{T}^{\circ}$ of level at most $i$ , $\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}\leq\alpha_{i}^{z_{p}/x_{p}}.$

Proof.

We prove the lemma by induction on $i$ . If $i=0$ , then $p$ is a leaf, and thus, we have either $z_{p}=0$ or $z_{p}=x_{p}$ , depending on whether $p\in{\mathbf{O}}_{v}$ or not. If $z_{p}=0$ , then $m_{p}$ is always [math], and thus, $\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}=1=\alpha_{0}^{z_{p}/x_{p}}$ . If $z_{p}=x_{p}$ , then $m_{p}$ is always $1$ (conditioned on $p\in\mathbf{V}$ ), and thus, $\operatorname*{\mathbb{E}}\Big{[}\mathbf{e}^{sm_{p}}\big{|}p\in\mathbf{V}\Big{]}=\mathbf{e}^{s}=\alpha_{0}^{z_{p}/x_{p}}$ . So, the lemma holds if $i=0$ .

Now, let $i\geq 1$ be any integer and we assume the lemma holds for $i-1$ . We shall prove that it also holds for $i$ . Focus on a node $p$ of level at most $i$ . Then all children $q$ of $p$ have level at most $i-1$ . If $p$ is a virtual node, then $p\in\mathbf{V}$ implies that both children of $p$ in $\mathbf{V}$ . Since the two children are handled independently in the rounding algorithm, we have

[TABLE]

If $p$ is the super node or a state node, then we have $\sum_{q\in\Lambda_{\mathbf{T}^{\circ}}(p)}x_{q}=x_{p}$ . Conditioned on $p\in\mathbf{V}$ , the rounding procedure adds exactly one child $q$ of $p$ to $\mathbf{V}$ . Then, we have

[TABLE]

Thus, we always have

[TABLE]

To see the second inequality in the last line, we notice the following three facts: (i) $\alpha_{i-1}^{\theta}-1$ is a convex function of $\theta$ and when $\theta=0$ its value is [math], (ii) $z_{q}/x_{q}\in[0,1]$ for every $q$ in the summation, and (iii) $\sum_{q\in\Lambda_{\mathbf{T}^{\circ}}(p)}\frac{x_{q}}{x_{p}}\cdot\frac{z_{q}}{x_{q}}=\frac{z_{p}}{x_{p}}$ . So, the quantity inside $\exp(\cdot)$ has maximum value $\frac{z_{p}}{x_{p}}(\alpha_{i-1}^{1}-1)$ . The equality in the last line is by the definition of $\alpha_{i}$ . ∎

Let $h^{\prime}=\Theta(h)=\Theta(\log n)$ be the level of the root. Now, we set $s=\ln(1+\frac{1}{2h^{\prime}})$ . We prove inductively the following lemma:

Lemma 5.5.

For every $i\in[0,h^{\prime}]$ , we have $\alpha_{i}\leq 1+\frac{1}{2h^{\prime}-i}$ .

Proof.

By definition, $\alpha_{0}=\mathbf{e}^{s}=1+\frac{1}{2h^{\prime}}$ and thus the statement holds for $i=0$ . Let $i\in[1,h^{\prime}]$ and assume the statement holds for $i-1$ . Then, we have

[TABLE]

The first inequality used the induction hypothesis and the second one used that for every $\theta\in[0,1]$ , we have $e^{\theta}\leq 1+\theta+\theta^{2}$ . ∎

So, by Lemma 5.4 and 5.5, we have $\operatorname*{\mathbb{E}}[\mathbf{e}^{sm_{{\mathbf{r}}}}]\leq\alpha_{h^{\prime}}^{1}\leq 1+\frac{1}{h^{\prime}}=1+O\left(\frac{1}{\log n}\right)$ . This finishes the proof of Property (2.4c) in Theorem 2.4.

6 Bicriteria-Approximation Algorithm for Degree-Bounded Group Steiner Tree on Trees

In this section, we prove Theorem 1.2, which is repeated here. See 1.2

We first set up some notations for the theorem. Recall that $T^{\circ}$ is the input tree, ${V^{\circ}}$ denotes the set of vertices of $T^{\circ}$ , and $r$ denotes the root of $T^{\circ}$ . For simplicity, we assume the costs are on the vertices instead of edges: Every vertex $u\in V^{\circ}$ has a cost $c_{u}\geq 0$ . Notice that this does not change the problem. We have $k$ groups indexed by $[k]$ . For each group $t\in[k]$ , we are given a set $O_{t}\subseteq V^{\circ}$ of leaves in $T^{\circ}$ . W.l.o.g, we assume all $O_{t}$ ’s are disjoint. Every vertex $v\in V$ is given a degree bound $D_{v}$ . The goal of the problem is then to output the smallest cost subtree $T$ of $T^{\circ}$ that satisfies the degree constraints and contains the root $r$ and one vertex from each $O_{t}$ , $t\in[k]$ . Since now we only have one tree $T^{\circ}$ , we use the following notations for children and descendants: For every vertex $u\in V^{\circ}$ , let $\Lambda_{u}$ denote the set of children of $u$ in $T^{\circ}$ , and $\Lambda^{*}_{u}$ to denote the set of descendants of $u$ in $T^{\circ}$ (including $u$ itself).

Now we describe the LP relaxation we use for our problem. For every vertex $u\in T^{\circ}$ , we use $x_{u}$ to indicate whether $u$ is chosen or not (in the correspondent integer program). LP (7) is a valid LP relaxation for the DB-GST-T problem:

[TABLE]

In the correspondent integer program, the objective we try to minimize is $\sum_{u\in{V^{\circ}}}c_{u}x_{u}$ , i.e, the total cost of all verticies we choose. Constraint (8) says that if we choose a vertex $v$ then we must choose its parent $u$ . Constraint (9) requires for every group $t$ , exactly one vertex in $O_{t}$ is added to the tree. Constraint (10) holds since if $u$ is chosen, at most one vertex in $\Lambda^{*}_{u}\cap O_{t}$ is chosen for every group $t$ . Constraint (11) is the degree constraint. In the LP relaxation, we require each $x_{u}$ to take value in $[0,1]$ (Constraint (12)). Notice that (9) and (10) for the root $r$ imply that $x_{r}=1$ .

Modifying the LP solutions.

Solving LP (7), we can obtain the optimum LP solution $(x_{u})_{u\in V^{\circ}}$ . In our rounding algorithm, it would be convenient if every $x_{u}$ is a (non-positive) integer power of $2$ that is not too small. So, we shall modify the LP solution using the following operations, which may violate many of the LP constraints slightly. For every $v\in V^{\circ}$ with $x_{v}<\frac{1}{2n}$ , we change $x_{v}$ to [math]. This can only decrease the cost of the solution. It is easy to see that Constraints (8), (10) and (11) will not be violated. Constraint (9) may not hold any more, but we still have $\sum_{v\in O_{t}}x_{v}\geq 1-n\times\frac{1}{2n}\geq\frac{1}{2}$ for every $t\in[k]$ . We can remove all vertices $v$ with $x_{v}=0$ from the instance and thus assume $x_{v}\geq\frac{1}{2n}$ for every $v\in V^{\circ}$ . Next, we increase each $x_{v}$ to the smallest (non-positive) integer power of $2$ that is greater than or equal to $x_{v}$ . This will violate many constraints in the LP by a factor of $2$ . We list the properties that our new vector $(x_{u})_{u\in V^{\circ}}$ has:

(P1)

For every $u\in{V^{\circ}}$ , $x_{u}$ is an integer power of $2$ between $\frac{1}{2n}$ and $1$ . 2. (P2)

The $x$ values along any root-to-leaf path in $T^{\circ}$ is non-increasing. 3. (P3)

$\sum_{o\in O_{t}}x_{o}\in[\frac{1}{2},2]$ for every group $t\in[k]$ . 4. (P4)

$\sum_{o\in O_{t}\cap\Lambda^{*}_{u}}x_{o}\leq 2x_{u}$ for every $t\in[k]$ and $u\in{V^{\circ}}$ . 5. (P5)

$\sum_{v\in\Lambda_{u}}x_{v}\leq 2d_{u}x_{u}$ for every $u\in{V^{\circ}}$ . 6. (P6)

$\sum_{u\in{V^{\circ}}}c_{u}x_{u}\leq 2\cdot\mathrm{opt}$ , where $\mathrm{opt}$ is the cost of the optimum integer solution.

6.1 The rounding algorithm

We now describe our rounding algorithm. We define two important global parameters: $L:={\lceil\log(2n)\rceil}$ and $\gamma:=\left\lfloor\log L\right\rfloor-2$ . We say an edge $(u,v)$ with $v\in\Lambda_{u}$ has “hop value” 1 if $x_{u}<x_{v}$ and [math] if $x_{u}=x_{v}$ . For every vertex $u\in{V^{\circ}}$ , we define $\ell_{u}$ to be the sum of hop values over all edges in the path from the root to $u$ in $T^{\circ}$ . Thus, for every $u\in{V^{\circ}}$ and $v\in\Lambda_{u}$ , we have $\ell_{v}-\ell_{u}\in\{0,1\}$ , and $\ell_{v}=\ell_{u}$ if and only if $x_{v}=x_{u}$ . By Properties (P1) and (P2), we have that $\ell_{v}\in[0,L]$ for every $v\in V^{\circ}$ .

Our rounding algorithm is applied on some scaled solution $x^{\prime}$ , which is defined as follows:

[TABLE]

As we mentioned in the introduction, this change will increase the probability of choosing $v$ conditioned on choosing $u$ by a factor of $2$ , for some $u\in V^{\circ},v\in\Lambda_{u}$ with $\ell_{u}<\ell_{v}\leq\gamma$ .

We prove one important property for $x^{\prime}$ , which is necessary for us to run the recursive rounding algorithm.

Claim 6.1.

For every $u\in{V^{\circ}}$ and $v\in\Lambda_{u}$ , we have $x^{\prime}_{v}\leq x^{\prime}_{u}$ .

Proof.

If $x_{v}=x_{u}$ then we have $(u,v)$ has hop value [math] and thus $\ell_{v}=\ell_{u}$ . In this case we have $x^{\prime}_{v}=x^{\prime}_{u}$ as well. Otherwise, we have $x_{v}\leq x_{u}/2$ and $h_{v}=h_{u}+1$ . So, $\min\left\{h_{v},\gamma\right\}\leq\min\left\{h_{u},\gamma\right\}+1$ and therefore $x^{\prime}_{v}\leq x^{\prime}_{u}$ . ∎

Notice that $x^{\prime}_{r}=1$ and every $x^{\prime}_{v}$ is an integer power of $2$ between $2^{-L}$ and $1$ . Our recursive rounding algorithm is run over $x^{\prime}$ . In the procedure recursive-rounding $(u)$ , we add $u$ to our output tree and do the following: for every $v\in\Lambda_{u}$ , with probability $x^{\prime}_{v}/x^{\prime}_{u}$ independent of all other choices, we call recursive-rounding $(v)$ . In the root recursion, we shall call recursive-rounding $(r)$ .

Our final algorithm will repeat the recursive procedure $M$ times independently, for a large enough $M=O(\log k)$ . Let $T_{1},T_{2},\cdots,T_{M}$ be the $M$ trees we obtained from the $M$ repetitions. Our final tree $T$ will be the union of the $M$ trees.

We first analyze the expected cost of $T$ . First focus on the tree $T_{1}$ . It is easy to see that the probability $u$ is chosen by $T_{1}$ is exactly $x^{\prime}_{u}\leq 2^{\gamma}x_{u}=O(L)x_{u}$ . Therefore, the expected cost of $T_{1}$ is at most $O(L)\cdot\mathrm{opt}$ by Property (P6). Therefore, the expected cost of the tree $T$ is at most $O\left(ML\right)\cdot\mathrm{opt}=O(L\log k)\cdot\mathrm{opt}=O(\log n\log k)\cdot\mathrm{opt}$ .

We then analyze the degree constraints on $T$ . Given that $u$ is selected by $T_{1}$ , the probability that we select a child of $v$ of $u$ is $\frac{x^{\prime}_{v}}{x^{\prime}_{u}}\leq\frac{2x_{v}}{x_{u}}$ . By Property (P5), we have $\sum_{v\in\Lambda_{u}}\frac{x^{\prime}_{v}}{x^{\prime}_{u}}\leq\sum_{v\in\Lambda_{u}}\frac{2x_{v}}{x_{u}}\leq 4d_{u}$ . Consider all the $M$ trees $T_{1},T_{2},\cdots,T_{M}$ . Even if we condition on the event that $u$ appears in all the $M$ trees, the degree of $u$ is the summation of many independent random $\{0,1\}$ -variables. The expectation of the summation is at most $4Md_{u}=O(\log k)\cdot d_{u}$ . Using Chernoff bound, one can show that the probability that the degree of $u$ is more than $O(\log n)\cdot d_{u}$ is at most $\frac{1}{10n}$ , for some large enough $O(\log n)$ factor. Therefore, with probability at least $0.9$ , every node $u$ in $T$ has degree at most $O(\log n)\cdot d_{u}$ . Therefore, we proved that the degree violation factor of our algorithm is $O(\log n)$ , as claimed in Theorem 1.2.

6.2 Analysis of connectivity probability

It remains to show that with high probability, the tree $T$ contains a vertex from every group. This is the goal of this section. Till the end of the section, we focus on the tree $T_{1}$ and a fixed group $t$ . For every vertex $u\in{V^{\circ}}$ , we define $\mathbf{E}_{u}$ to be the event that $u$ is chosen by $T_{1}$ . Our goal is to give a lower bound on $\Pr[\bigvee_{o\in O_{t}}\mathbf{E}_{o}]$ , i.e, the probability that some vertex in $O_{t}$ is chosen by the tree $T_{1}$ .

Notice that when two adjacent nodes in $T^{\circ}$ have the same $x^{\prime}$ value, then the child is chosen whenever the parent is. Thus, we can w.l.o.g contract any sub-tree of nodes in $T^{\circ}$ with the same $x^{\prime}$ value into one single super-vertex, without changing the rounding algorithm. Notice that if two adjacent vertices $u\in{V^{\circ}},v\in\Lambda_{u}$ have $\ell_{u}=\ell_{v}$ then we have $x_{u}=x_{v}$ and thus $x^{\prime}_{u}=x^{\prime}_{v}$ . So, we contract every maximal sub-tree of vertices in $T^{\circ}$ with the same $\ell$ value. After this operation, for every $u\in V^{\circ}$ , $\ell_{u}$ is exactly the level of $u$ in the tree $T^{\circ}$ . So, for every $u\in{V^{\circ}}$ and $v\in\Lambda_{v}$ we have $\ell_{v}=\ell_{u}+1$ . A super-vertex is in $O_{t}$ if one of its vertices before contracting is in $O_{t}$ . If an internal super-vertex is in $O_{t}$ , we can remove all its descendants without changing the analysis in this section. So, again we have that $O_{t}$ only contains leaves.

For every vertex $u$ , we define

[TABLE]

Notice that $z_{u}\leq 2x_{u}$ by Property (P4).

In the following, we shall bound $\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}$ for every $u\in V^{\circ}$ from bottom to top. This is done in two stages due to the threshold $\gamma$ we used when we define $x^{\prime}$ variables. First we consider the case when $\ell_{u}\geq\gamma$ and then we focus on the case when $\ell_{u}<\gamma$ . The two stages are captured by Lemmas 6.2 and 6.3 respectively.

Lemma 6.2.

For a vertex $u$ with $\ell_{u}\geq\gamma$ , we have $\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}\geq\frac{1}{2(L+1-\ell_{u})}\frac{z_{u}}{x_{u}}$ .

Similar lemmas have been proved multiple times in many previous results. Since our parameters are slightly different, we provide the complete proof here. There are two different approaches to prove the lemma, one based on bounding the conditional second moment of the random variable for the number of chosen vertices in $O_{t}\cap\Lambda^{*}_{u}$ , and the other based on the mathematical induction on $\ell_{u}$ , which is the one we use here.

Proof of Lemma 6.2.

Suppose $u$ is a leaf. Then $z_{u}/x_{u}=1$ if $u\in O_{t}$ and $z_{u}/x_{u}=0$ otherwise. So, we have $\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}=\frac{z_{u}}{x_{u}}$ and the lemma clearly holds since we have $\ell_{u}\leq L$ .

Then, we prove the lemma by induction on $\ell_{u}$ . If $\ell_{u}=L$ then $u$ must be a leaf and thus the lemma holds. We assume the lemma holds for every $u$ with $\ell_{u}=\ell+1$ , for some $\ell\in[\gamma,L-1]$ . Then we prove the lemma for $u$ with $\ell_{u}=\ell$ . If $u$ is a leaf the lemma holds and thus we assume $u$ is not a leaf.

[TABLE]

The inequality in the first line used the induction hypothesis: $\frac{x^{\prime}_{v}}{x^{\prime}_{u}}$ is the probability that we choose $v$ in $T_{1}$ conditioned on that we choose $u$ , and $\frac{1}{2(L-\ell)}\frac{z_{v}}{x_{v}}$ is the lower bound on the probability that we choose some vertex in $O_{t}\cap\Lambda^{*}_{v}$ conditioned on that $v$ is chosen. The equality in the line used that $x^{\prime}_{u}=2^{\gamma}x_{u}$ and $x^{\prime}_{v}=2^{\gamma}x_{v}$ . The inequality in the second line used that $1-\theta\leq e^{-\theta}$ for every real number $\theta$ . The first inequality in the third line used that $e^{-\theta}\leq 1-\theta+\frac{\theta^{2}}{2}$ for every $\theta\geq 0$ . The second inequality in the line used Property (P4), which says $\frac{z_{u}}{x_{u}}\leq 2$ . The last inequality used that $(2(L-\ell)-1)\cdot 2(L-\ell+1)\geq 4(L-\ell)^{2}$ since $L-\ell\geq 1$ . ∎

The lemma implies that for every $u$ with $\ell_{u}\geq\gamma$ , we have $\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}\geq\frac{1}{2L}\cdot\frac{z_{u}}{x_{u}}$ .

Now we analyze the probability for $u$ with $\ell_{u}\leq\gamma$ . Recall that $\gamma=\left\lfloor\log L\right\rfloor-2$ and thus we have $2^{\gamma}\in(L/8,L/4]$ . Let $\alpha_{\gamma}=\frac{1}{2L}$ and for every $\ell\in[0,\gamma-1]$ , define $\alpha_{\ell}=2\alpha_{\ell+1}-4\alpha^{2}_{\ell+1}$ . It is easy to see that for every $\ell\in[0,\gamma]$ , we have $\alpha_{\ell}\leq\frac{2^{\gamma-\ell}}{2L}$ . Then, we have for every $\ell\in[0,\gamma-1]$ ,

[TABLE]

Therefore, we have

[TABLE]

The second inequality used that $1-\theta\geq e^{-2\theta}$ for every $\theta\in(0,1/2)$ . The last equality used that $\gamma=\left\lfloor\log L\right\rfloor-2$ and thus $2^{\gamma}=\Theta(L)$ .

With the $\alpha$ values defined, we prove the following lemma via mathematical induction:

Lemma 6.3.

For every vertex $\ell_{u}=\ell\leq\gamma$ , we have $\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}\geq\alpha_{\ell}\frac{z_{u}}{x_{u}}$ .

Proof.

The lemma holds if $\ell=\gamma$ as we mentioned. So, we assume $\ell<\gamma$ and the lemma holds with $\ell$ replaced by $\ell+1$ . If $u$ is a leaf, then we have $\Pr\Big{[}\bigvee_{o\in O_{t}\cap\Lambda^{*}_{u}}\mathbf{E}_{o}\big{|}\mathbf{E}_{u}\Big{]}=\frac{z_{u}}{x_{u}}$ and the lemma holds. So again we assume $u$ is not a leaf. Then,

[TABLE]

To see the equality in the first line, we notice that $x^{\prime}_{u}=2^{\ell}x_{u}$ and $x^{\prime}_{v}=2^{\ell+1}x_{v}$ for every $v\in\Lambda_{u}$ . Many other inequalities used the same arguments as in Lemma 6.2. ∎

Applying the lemma for the root $r$ of $T^{\circ}$ , we have that $\Pr\big{[}\bigvee_{o\in O_{t}}\mathbf{E}_{o}\big{]}\geq\alpha_{0}\cdot\frac{z_{r}}{x_{r}}\geq\alpha_{0}\cdot\frac{1}{2}=\Omega(1)$ .

Now we consider all the $M$ trees $T_{1},T_{2},\cdots,T_{M}$ together. The probability that $O_{t}$ is not chosen by any of the $M$ trees is at most $\left(1-\Omega(1)\right)^{M}\leq\frac{1}{10k}$ , if our $M=O(\log k)$ is big enough. Thus the probability that $T$ , the union of all trees $T_{1},T_{2},\cdots,T_{M}$ , contains an $r$ -to- $O_{t}$ path for every $t$ , is at least $0.9$ .

Acknowledgement

X. Guo, S. Li and J. Xian are partially supported by NSF grants CCF-1566356, CCF- 1717134, CCF-1844890. B. Laekhanukit is partially supported by Science and Technology Innovation 2030 –“New Generation of Artificial Intelligence” Major Project No.(2018AAA0100903), NSFC grant 61932002, Program for Innovative Research Team of Shanghai University of Finance and Economics (IRTSHUFE) and the Fundamental Research Funds for the Central Universities and by the 1000-talent award by the Chinese Government. Daniel Vaz has been supported by the Alexander von Humboldt Foundation with funds from the German Federal Ministry of Education and Research (BMBF).

Appendix A Omitted Proofs

Proof of Lemma 2.1.

We assume $n\geq 4$ ; otherwise, if $n=3$ , then we have $2n/3+1=3$ , and $\mathrm{root}(T)$ satisfies the condition. Our goal is to find a vertex $u$ with $n/3<|\Lambda^{*}(u)|\leq 2n/3+1$ . Start from $u=\mathrm{root}(T)$ in the tree, and thus, we have $\Lambda^{*}(u)>2n/3+1$ . Let $v$ be the child of $u$ with the biggest $|\Lambda^{*}(v)|$ . So, $|\Lambda^{*}(v)|\geq(|\Lambda^{*}(u)|-1)/2>n/3$ . We then replace $u$ with $v$ . So $|\lambda^{*}(u)|$ has decreased but the condition $|\Lambda^{*}(u)|>n/3$ is maintained. Thus, if we repeat the process, we will eventually find a $u$ with $n/3<|\Lambda^{*}(u)|\leq 2n/3+1$ . ∎

Bibliography24

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Yair Bartal. Probabilistic approximations of metric spaces and its algorithmic applications. In 37th Annual Symposium on Foundations of Computer Science, FOCS ’96, Burlington, Vermont, USA, 14-16 October, 1996 , pages 184–193, 1996.
2[2] Moses Charikar, Chandra Chekuri, To-Yat Cheung, Zuo Dai, Ashish Goel, Sudipto Guha, and Ming Li. Approximation algorithms for directed steiner problems. J. Algorithms , 33(1):73–91, 1999.
3[3] Sina Dehghani, Soheil Ehsani, Mohammad Taghi Hajiaghayi, Vahid Liaghat, Harald Räcke, and Saeed Seddighin. Online weighted degree-bounded steiner networks via novel online mixed packing/covering. In 43rd International Colloquium on Automata, Languages, and Programming, ICALP 2016, July 11-15, 2016, Rome, Italy , pages 42:1–42:14, 2016.
4[4] Sina Dehghani, Soheil Ehsani, Mohammad Taghi Hajiaghayi, and Vahid Liaghat. Online degree-bounded steiner network design. In Proceedings of the Twenty-seventh Annual ACM-SIAM Symposium on Discrete Algorithms , SODA ’16, pages 164–175, Philadelphia, PA, USA, 2016. Society for Industrial and Applied Mathematics.
5[5] Sina Dehghani, Soheil Ehsani, Mohammad Taghi Hajiaghayi, Vahid Liaghat, and Saeed Seddighin. Greedy algorithms for online survivable network design. In 45th International Colloquium on Automata, Languages, and Programming, ICALP 2018, July 9-13, 2018, Prague, Czech Republic , pages 152:1–152:14, 2018.
6[6] Jittat Fakcharoenphol, Satish Rao, and Kunal Talwar. A tight bound on approximating arbitrary metrics by tree metrics. J. Comput. Syst. Sci. , 69(3):485–497, 2004.
7[7] Zachary Friggstad, Jochen Könemann, Young Kun-Ko, Anand Louis, Mohammad Shadravan, and Madhur Tulsiani. Linear programming hierarchies suffice for directed steiner tree. In Integer Programming and Combinatorial Optimization - 17th International Conference, IPCO 2014, Bonn, Germany, June 23-25, 2014. Proceedings , pages 285–296, 2014.
8[8] Martin Fürer and Balaji Raghavachari. Approximating the minimum-degree steiner tree to within one of optimal. J. Algorithms , 17(3):409–423, 1994.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On Approximating Degree-Bounded Network Design Problems

Abstract

1 Introduction

1.1 Our Results

Theorem 1.1**.**

Remark

Theorem 1.2**.**

1.2 Our Techniques

2 Preliminaries for Degree-Bounded Directed Steiner Tree

2.1 Notations and Assumptions

2.2 Balanced Tree Partition

Lemma 2.1**.**

2.3 Multi-Tree

Definition 2.2** (Multi-Tree).**

Definition 2.3** (Good Multi-Trees).**

Theorem 2.4** (Main Theorem for DB-DST).**

Proof of Theorem 1.1.

Organization

3 States and State-Trees

3.1 Portals

Definition 3.1**.**

Definition 3.2** (Root-Portals-Pair).**

Definition 3.3** (Allowable Child-Pair).**

Claim 3.4**.**

Proof.

3.2 Degree Vectors

Definition 3.5**.**

Definition 3.6** (Consistency of degree vectors).**

Definition 3.7** (Edge/Triple Agreeing with Degree Vector).**

3.3 States and Good State-Trees

Definition 3.8**.**

Definition 3.9** (Good State Trees).**

4 Reduction to Finding Good State-Trees

4.1 From a Valid Tree to a Good State-Tree Involving All Terminals

Lemma 4.1**.**

Proof.

4.2 From a Good State Tree to a Good Multi-Tree

5 Finding a Good State Tree using LP Rounding

5.1 Extended State Trees and Construction of T0\mathbf{T}^{0}T0

Claim 5.1**.**

5.2 LP Formulation

5.3 Rounding Algorithm

Claim 5.2**.**

Theorem 5.3**.**

5.4 Concentration Bound on Number of Copies of a Vertex Appearing in TTT

Lemma 5.4**.**

Proof.

Lemma 5.5**.**

Proof.

6 Bicriteria-Approximation Algorithm for Degree-Bounded Group Steiner Tree on Trees

Modifying the LP solutions.

6.1 The rounding algorithm

Claim 6.1**.**

Proof.

6.2 Analysis of connectivity probability

Lemma 6.2**.**

Proof of Lemma 6.2.

Lemma 6.3**.**

Proof.

Acknowledgement

Appendix A Omitted Proofs

Proof of Lemma 2.1.

Theorem 1.1.

Theorem 1.2.

Lemma 2.1.

Definition 2.2 (Multi-Tree).

Definition 2.3 (Good Multi-Trees).

Theorem 2.4 (Main Theorem for DB-DST).

Definition 3.1.

Definition 3.2 (Root-Portals-Pair).

Definition 3.3 (Allowable Child-Pair).

Claim 3.4.

Definition 3.5.

Definition 3.6 (Consistency of degree vectors).

Definition 3.7 (Edge/Triple Agreeing with Degree Vector).

Definition 3.8.

Definition 3.9 (Good State Trees).

Lemma 4.1.

5.1 Extended State Trees and Construction of $\mathbf{T}^{0}$

Claim 5.1.

Claim 5.2.

Theorem 5.3.

5.4 Concentration Bound on Number of Copies of a Vertex Appearing in $T$

Lemma 5.4.

Lemma 5.5.

Claim 6.1.

Lemma 6.2.

Lemma 6.3.