Reducing Path TSP to TSP

Vera Traub; Jens Vygen; Rico Zenklusen

arXiv:1907.10376·cs.DM·July 25, 2019


TL;DR

This paper introduces a reduction from Path TSP to TSP, showing their approximability is essentially equivalent, and applies this to improve approximation algorithms for Graph Path TSP.

Contribution

It provides a black-box reduction from Path TSP to TSP, establishing their approximability equivalence and improving approximation ratios for Graph Path TSP.

Findings

01 Reduction from Path TSP to TSP with arbitrarily small error

02 Improved approximation ratio of 1.4+ε for Graph Path TSP

03 New techniques, including a recursive dynamic program for a new generalization of (Path) TSP



Vera Traub, Jens Vygen

Research Institute for Discrete Mathematics and Hausdorff Center for Mathematics, University of Bonn, Bonn, Germany. This research was initiated while the first two authors visited FIM at ETH Zurich.

Rico Zenklusen

Department of Mathematics, ETH Zurich, Zurich, Switzerland. Research supported in part by the Swiss National Science Foundation grants 200021_184622 and 200021_165866.

(July 24, 2019)

Abstract

We present a black-box reduction from the path version of the Traveling Salesman Problem (Path TSP) to the classical tour version (TSP). More precisely, we show that, given an $\alpha$-approximation algorithm for TSP, for any $\varepsilon>0$ there is an $(\alpha+\varepsilon)$-approximation algorithm for the more general Path TSP. This reduction implies that the approximability of Path TSP is the same as for TSP, up to an arbitrarily small error. This avoids future discrepancies between the best known approximation factors achievable for these two problems, as they have existed until very recently.

A well-studied special case of TSP, Graph TSP, asks for tours in unit-weight graphs. Our reduction shows that any $\alpha$ -approximation algorithm for Graph TSP implies an $(\alpha+\varepsilon)$ -approximation algorithm for its path version. By applying our reduction to the $1.4$ -approximation algorithm for Graph TSP by Sebő and Vygen, we obtain a polynomial-time $(1.4+\varepsilon)$ -approximation algorithm for Graph Path TSP, improving on a recent $1.497$ -approximation algorithm of Traub and Vygen.

We obtain our results through a variety of new techniques, including a novel way to set up a recursive dynamic program to guess significant parts of an optimal solution. At the core of our dynamic program we deal with instances of a new generalization of (Path) TSP which combines parity constraints with certain connectivity requirements. This problem, which we call $\Phi$ -TSP, has a constant-factor approximation algorithm and can be reduced to TSP in certain cases when the dynamic program would not make sufficient progress.

1 Introduction

The Traveling Salesman Problem (TSP) is one of the most fundamental and well-studied problems in Combinatorial Optimization with a multitude of applications. The common denominator of the numerous variants of the problem is that a set of cities have to be visited on a shortest possible tour. Its best-known variant, often just dubbed TSP, assumes that the distances between the cities are non-negative and symmetric, and the task is to find a tour beginning and ending in the same city. For our purposes it will be useful to work with an undirected graph $G=(V,E)$ with edge lengths $\ell:E\rightarrow\mathbb{R}_{\geq 0}$ . While it is often assumed that $G$ is complete and $\ell$ fulfills the triangle-inequality, we do not assume this, but allow the tour to visit cities more than once; this is easily seen (and well-known) to be equivalent. So a tour is a closed walk in $G$ visiting all vertices.

One of the best-studied extensions of TSP is Path TSP, where in addition to $G$ and $\ell$ , a fixed start $s\in V$ and end $t\in V$ are given and the task is to find a shortest walk from $s$ to $t$ visiting all vertices.

TSP and its variants are well-known to be $\mathsf{APX}$ -hard (see [20, 14] and references therein) and they have been studied very extensively under the viewpoint of approximation algorithms. While TSP and Path TSP look quite similar, there are fundamental differences. First, there is a classical $\nicefrac{{3}}{{2}}$ -approximation algorithm for TSP by Christofides [4] and Serdjukov [25]. This algorithm can easily be adapted to Path TSP, but then has only approximation ratio $\nicefrac{{5}}{{3}}$ as Hoogeveen [12] showed in the early 90’s. Second, the integrality gaps of the classical LP relaxations seem to be different. For TSP it is widely believed to be $\nicefrac{{4}}{{3}}$ , while for Path TSP it is at least $\nicefrac{{3}}{{2}}$ . In both cases there are well-known instances attaining these lower bounds on the integrality gaps, and these instances have unit lengths ( $\ell(e)=1$ for all $e\in E$ ). Therefore the unit-length special cases, Graph TSP and Graph Path TSP, have received considerable attention [1, 19, 16, 17, 24, 8, 26]. In these special cases, the integrality gaps are known to be different: it is at most $\nicefrac{{7}}{{5}}$ for Graph TSP and exactly $\nicefrac{{3}}{{2}}$ for Graph Path TSP as shown by Sebő and Vygen [24].

While Christofides’ algorithm for TSP is still unbeaten after more than four decades, the approximation ratio for Path TSP has been improved. The first improvement, about 20 years after Hoogeveen [12], was obtained by An, Kleinberg, and Shmoys [1], who devised an elegant $\nicefrac{{(1+\sqrt{5})}}{{2}}\approx 1.618$ -approximation algorithm. A sequence of successive improvements [22, 28, 10, 23, 27] culminated in Zenklusen’s recent $\nicefrac{{3}}{{2}}$ -approximation algorithm [30]. Hence, at the moment, the best known approximation ratios for TSP and Path TSP are the same.

For Graph TSP and Graph Path TSP the situation is different. The best known approximation ratios are $\nicefrac{{7}}{{5}}$ for Graph TSP [24] and $1.497$ for Graph Path TSP [26]. Since the latter result achieves an approximation ratio better than the integrality gap $\nicefrac{{3}}{{2}}$ , one might hope that Graph Path TSP is actually no harder than Graph TSP although the integrality gaps differ.

These recent developments naturally lead to the following general question regarding the relation between the approximability of Path TSP and TSP, which we address in this paper:

Is (Graph) Path TSP substantially harder to approximate than its well-known special case (Graph) TSP?

The answer is no. The main contribution of this paper is to show in a constructive way that Path TSP can be approximated equally well as TSP (up to an arbitrarily small error), by presenting a black-box reduction that transforms approximation algorithms for TSP into ones for Path TSP.

1.1 Our results

The main consequence of our reduction can be summarized as follows.

Theorem 1.

Let $\mathcal{A}$ be an $\alpha$ -approximation algorithm for TSP. Then, for any $\varepsilon>0$ , there is an $(\alpha+\varepsilon)$ -approximation algorithm for Path TSP that, for any instance $(G,\ell,s,t)$ , calls $\mathcal{A}$ a strongly polynomial number of times on TSP instances defined on subgraphs of $(G,\ell)$ , and performs further operations taking strongly polynomial time.

The following two statements are immediate consequences of the above theorem.

Corollary 2.

Let $\varepsilon>0$ and $\alpha>1$ . If there is a (strongly) polynomial-time $\alpha$ -approximation algorithm for TSP, then there is a (strongly) polynomial-time $(\alpha+\varepsilon)$ -approximation algorithm for Path TSP.

Corollary 3.

Let $\varepsilon>0$ and $\alpha>1$ . If there is a polynomial-time $\alpha$ -approximation algorithm for Graph TSP, then there is a polynomial-time $(\alpha+\varepsilon)$ -approximation algorithm for Graph Path TSP.

Notice that since Graph (Path) TSP does not involve any large numbers in its input, the notions of polynomial-time and strongly polynomial-time algorithm are identical in this context.

The above statements create a strong link between the approximability of Path TSP and TSP, as well as its graph versions. More precisely, Theorem 1 implies that such a link exists for any class of TSP instances that is closed under taking instances on subgraphs of the original instance (without changing the edge lengths). In particular, any potential future progress on the approximability of (Graph) TSP will immediately carry over to (Graph) Path TSP.

Moreover, Corollary 3 allows us to make significant progress on the currently best approximation factor of $1.497$ for Graph Path TSP [26], through a black-box reduction to the $1.4$ -approximation algorithm for Graph TSP by Sebő and Vygen [24].

Corollary 4.

For any $\varepsilon>0$ , there is a polynomial-time $(1.4+\varepsilon)$ -approximation algorithm for Graph Path TSP.

Our reduction technique is quite versatile. In particular, it applies to a pretty general problem class (the $\Phi$ -tour problem with interfaces of bounded size; see Definition 6 and Theorem 25). This includes the $T$ -tour problem for bounded $|T|$ (see [24, 3, 22] for a definition) and certain uncapacitated vehicle routing problems such as the one with a fixed number of depots studied in [29].

1.2 Organization of the paper

After some brief preliminaries in Section 2 to fix basic terminology and notation, we provide an overview of our approach in Section 3. Here, we first focus on some key aspects of our approach, which is based on a new way to employ dynamic programming by using a well-chosen auxiliary problem, which we call $\Phi$ -TSP. Moreover, we break down the problem of finding a short solution to $\Phi$ -TSP into two cases. Combining the two cases, applying the same algorithm recursively, and using a constant-factor approximation algorithm for $\Phi$ -TSP on the final recursion level will imply our main reduction result, Theorem 1.

For one case, we show in Section 4 how to reduce the problem to TSP. For the other case, we show in Section 5 how to guess a constant fraction of an optimum solution via dynamic programming. The detailed proof of Theorem 1 is in Section 6. Finally, Section 7 contains a $4$ -approximation algorithm for $\Phi$ -TSP, which is followed by concluding remarks in Section 8.

2 Preliminaries

A weighted graph is a tuple $(V,E,\ell)$ , where $V$ is the vertex set, $E$ is the edge set, which we assume w.l.o.g. not to have loops or parallel edges, and $\ell:E\to\mathbb{R}_{\geq 0}$ denotes the edge lengths. We only consider undirected graphs with non-negative edge lengths and do not always state this explicitly.

We often deal with multi-sets of edges. Although $E$ does not contain parallel edges, when we write $F\subseteq E$ , we mean a multi-set $F$ that can contain several copies of the same edge. We use the operator $\stackrel{{\scriptstyle.}}{{\cup}}$ to designate the multi-union. For a vertex set $W\subseteq V$ , we denote by $\delta(W)\subseteq E$ all edges with exactly one endpoint in $W$ , and, for $v\in V$ , we use $\delta(v)$ as a shorthand for $\delta(\{v\})$ . For a multi-set $F\subseteq E$ , we define

$$\mathrm{odd}(F)\coloneqq\{v\in V:|\delta(v)\cap F|\text{ is odd}\}.$$

For a vertex set $T\subseteq V$ , a $T$ -join is a multi-set of edges $F\subseteq E$ with $\mathrm{odd}(F)=T$ .
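As a minimal sketch of these definitions, the snippet below computes $\mathrm{odd}(F)$ for an edge multi-set and tests the $T$-join property. The representation of a multi-set as a Python list of vertex pairs, and the function names `odd` and `is_T_join`, are assumptions made for illustration only.

```python
from collections import Counter

def odd(F):
    """Return the set of vertices with odd degree in the edge multi-set F.

    F is a list of (u, v) pairs; repeated pairs model parallel copies.
    Loops are assumed not to occur, as in the text.
    """
    degree = Counter()
    for u, v in F:
        degree[u] += 1
        degree[v] += 1
    return {v for v, d in degree.items() if d % 2 == 1}

def is_T_join(F, T):
    """F is a T-join exactly when odd(F) equals T."""
    return odd(F) == set(T)

path = [("s", "a"), ("a", "t")]
print(odd(path))                    # the two endpoints of the path
print(is_T_join(path + path, []))   # doubling every edge makes all degrees even
```

Doubling an edge never changes $\mathrm{odd}(F)$, which is why multi-sets rather than plain sets are the natural objects here.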

For a set $I\subseteq V$ and a graph $G$, we denote by $G/I$ the graph obtained from $G$ by contracting the vertex set $I$. If $I$ is empty, we define $G/\emptyset:=G$. For a vertex set $W$ and an edge set $F$, we define $F[W]\coloneqq\{e\in F:\text{both endpoints of }e\text{ are in }W\}$. Moreover, $G[W]$ denotes the induced subgraph with vertex set $W$ and edge set $E[W]$.

Instead of describing tours as walks in $G$ , it is convenient to consider them as multi-edge sets. Then a solution to (Graph) TSP is a multi-edge set $F$ such that $(V,F)$ is connected and $\mathrm{odd}(F)=\emptyset$ . A solution to (Graph) Path TSP (with $s\neq t$ ) is a multi-edge set $F$ such that $(V,F)$ is connected and $\mathrm{odd}(F)=\{s,t\}$ . From such multi-edge sets—which we also call tours or $s$ - $t$ tours, respectively—we can easily recover walks by Euler’s theorem.
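The recovery of a walk from such a multi-edge set can be sketched with Hierholzer's algorithm, a standard constructive form of Euler's theorem. The sketch below is illustrative, not the authors' code; the function name `euler_walk` and the list-of-pairs representation are assumptions. It assumes the multi-set is connected and that `start` is an odd-degree vertex whenever odd-degree vertices exist.

```python
from collections import defaultdict

def euler_walk(F, start):
    """Hierholzer's algorithm on an edge multi-set F (list of (u, v) pairs).

    Assumes F is connected on its vertices and that either all degrees are
    even (closed walk) or `start` is one of the two odd-degree vertices.
    Returns the walk as a list of vertices using every copy exactly once.
    """
    adj = defaultdict(list)          # vertex -> list of (neighbour, edge index)
    for idx, (u, v) in enumerate(F):
        adj[u].append((v, idx))
        adj[v].append((u, idx))
    used = [False] * len(F)
    stack, walk = [start], []
    while stack:
        v = stack[-1]
        # discard copies that were already traversed from the other endpoint
        while adj[v] and used[adj[v][-1][1]]:
            adj[v].pop()
        if adj[v]:
            w, idx = adj[v].pop()
            used[idx] = True
            stack.append(w)
        else:
            walk.append(stack.pop())
    return walk[::-1]

# An s-t tour that visits b via a doubled edge {a, b}.
tour = [("s", "a"), ("a", "b"), ("b", "a"), ("a", "t")]
print(euler_walk(tour, "s"))
```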

We often use $\mathrm{OPT}$ as an arbitrary (but fixed) optimal solution for the problem in question. Finally, when using the notion of approximation algorithm we will not assume that the algorithm is polynomial time, but state it explicitly if this is the case.

In the interest of clarity and simplicity of the presentation, we did not try to optimize the running times of our procedures. Consequently, we often opt for weaker constants that are easier to obtain.

3 Overview of approach

A key novelty of our approach is a new way to set up a dynamic program to successively strengthen a basic algorithm by combining it with a stronger algorithm for TSP. Every time we apply our dynamic program to obtain a stronger algorithm, we end up with a more difficult problem, slowly approaching problem settings for which it is very challenging to find strong approximation algorithms. However, as we show, by guessing a well-chosen set of edges through the dynamic program, we can limit the recursion depth by a constant, which allows us to stay in a regime where our approach runs efficiently.

To introduce our approach, we start with a brief discussion of a much more basic dynamic programming idea that has previously been used in related settings. We explain the challenges this procedure faces when trying to extend it for our purposes, and outline how we overcome the barriers encountered by existing methods.

3.1 Key challenges and high-level approach

Assume we are given an $\alpha$ -approximation algorithm $\mathcal{A}$ for TSP. Then finding a short Path TSP solution using $\mathcal{A}$ as an oracle would be easy if the distance $d(s,t)$ between the start $s$ and the end $t$ was short compared to $\ell(\mathrm{OPT})$ , i.e., the length of a shortest Path TSP solution $\mathrm{OPT}$ . Indeed, in this case the length of a shortest TSP tour $\mathrm{OPT}_{\mathrm{TSP}}$ and a shortest $s$ - $t$ tour $\mathrm{OPT}$ do not differ by much because any solution of one problem can be converted to a solution of the other one by adding a shortest $s$ - $t$ path $P$ . More precisely, $\mathrm{OPT}_{\mathrm{TSP}}\stackrel{{\scriptstyle.}}{{\cup}}P$ is an $s$ - $t$ tour and $\mathrm{OPT}\stackrel{{\scriptstyle.}}{{\cup}}P$ is a TSP tour. Hence, one can simply compute an $\alpha$ -approximate TSP tour $F$ and a shortest $s$ - $t$ path $P$ and return $F\stackrel{{\scriptstyle.}}{{\cup}}P$ .
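The argument above can be written out as a short chain of inequalities. This is a sketch in the notation of the text, with $P$ a shortest $s$-$t$ path, so $\ell(P)=d(s,t)$.

```latex
\begin{align*}
\ell(\mathrm{OPT}_{\mathrm{TSP}}) &\le \ell(\mathrm{OPT}) + \ell(P)
  && \text{since } \mathrm{OPT}\,\dot\cup\,P \text{ is a TSP tour},\\
\ell(F\,\dot\cup\,P) &= \ell(F) + \ell(P)
  \le \alpha\cdot\ell(\mathrm{OPT}_{\mathrm{TSP}}) + \ell(P)\\
  &\le \alpha\cdot\ell(\mathrm{OPT}) + (\alpha+1)\cdot d(s,t).
\end{align*}
```

In particular, if $d(s,t)\le\frac{\varepsilon}{\alpha+1}\cdot\ell(\mathrm{OPT})$, the returned $s$-$t$ tour is an $(\alpha+\varepsilon)$-approximation.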

Consequently, a canonical plan would be to try to transform the Path TSP instance to another one with small $s$ - $t$ distance. It turns out that if the distance between $s$ and $t$ is very large, then such a reduction is indeed possible by using a technique based on dynamic programming that goes back to Blum, Chawla, Karger, Lane, Meyerson, and Minkoff [2], who studied variants of the Orienteering Problem. Their approach was later extended by Traub and Vygen [26] in the context of Graph Path TSP. This approach allows for reducing to Path TSP instances where the distance between $s$ and $t$ is at most $(\nicefrac{{1}}{{3}}+\varepsilon)\cdot\ell(\mathrm{OPT})$ , for some arbitrarily small constant $\varepsilon>0$ (see [26]).

However, this technique faces significant barriers when aiming at a reduction to smaller $s$ - $t$ distances. Thus our approach follows a different path. Nevertheless, it is on a high level inspired by the dynamic program in [2] and later variations and extensions thereof [27, 30, 26, 18]. We therefore start with a brief discussion of this prior technique in the context of Path TSP as used in [26], which will be helpful for the understanding of our approach.

For simplicity of exposition, consider a Graph Path TSP instance and assume that $d(s,t)\geq(\nicefrac{{1}}{{3}}+\varepsilon)\cdot|\mathrm{OPT}|$ for some constant $\varepsilon>0$ . The idea is to study the structure of edges of $\mathrm{OPT}$ in the $d(s,t)$ many $s$ - $t$ cuts $\delta(L_{0}),\ldots,\delta(L_{d(s,t)-1})$ defined by

$$L_{i}\coloneqq\{v\in V:d(s,v)\leq i\}\quad\text{for all } i\in\{0,\dots,d(s,t)-1\}.$$

The key observation is that a constant fraction of these $s$ - $t$ cuts will only contain a single edge of $\mathrm{OPT}$ , and, hence, one can try to “guess” these edges through a dynamic program. Indeed, every edge can be in at most one of the cuts $\delta(L_{0}),\delta(L_{1}),\ldots,\delta(L_{d(s,t)-1})$ . Hence, the average number of $\mathrm{OPT}$ -edges in a cut $\delta(L_{i})$ can be no higher than $\nicefrac{{|\mathrm{OPT}|}}{{d(s,t)}}$ . Using that every $s$ - $t$ cut must have an odd intersection with $\mathrm{OPT}$ , because $\mathrm{OPT}$ is an $s$ - $t$ tour, this implies that a constant fraction of the cuts contains only one edge of $\mathrm{OPT}$ . For brevity, we call a cut $\delta(L_{i})$ with $|\delta(L_{i})\cap\mathrm{OPT}|=1$ a $1$ -cut. Assume we knew all edges of $\mathrm{OPT}$ contained in $1$ -cuts. Then the problem decomposes into smaller Path TSP instances. See Figure 1 for an illustration.
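The counting step can be made explicit. Let $k$ be the number of $1$-cuts among the $d(s,t)$ cuts. Every other cut contains at least $3$ edges of $\mathrm{OPT}$ (odd intersection), and every edge lies in at most one cut, so $k+3\,(d(s,t)-k)\leq|\mathrm{OPT}|$, i.e., $k\geq\frac{1}{2}(3\,d(s,t)-|\mathrm{OPT}|)\geq\frac{3\varepsilon}{2}|\mathrm{OPT}|$. The following sketch computes the $1$-cuts of a given tour in a unit-length graph, where $L_i$ is the set of vertices at distance at most $i$ from $s$. The function names and the adjacency-dict representation are assumptions for illustration; in the actual setting $\mathrm{OPT}$ is of course unknown.

```python
from collections import deque

def bfs_dist(adj, s):
    """Unit-length distances from s; adj maps a vertex to its neighbours."""
    dist = {s: 0}
    queue = deque([s])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist

def one_cuts(adj, s, t, tour):
    """Indices i for which delta(L_i) contains exactly one edge of `tour`.

    tour: edge multi-set (list of (u, v) pairs of edges of the graph).
    Assumes t is reachable from s.
    """
    dist = bfs_dist(adj, s)
    crossings = [0] * dist[t]
    for u, v in tour:
        lo, hi = sorted((dist[u], dist[v]))
        # Endpoints of a graph edge differ in distance by at most 1, so the
        # edge lies in delta(L_lo) if hi == lo + 1 and in no cut otherwise.
        if hi == lo + 1 and lo < dist[t]:
            crossings[lo] += 1
    return [i for i, c in enumerate(crossings) if c == 1]

# s - a - t with a pendant vertex b attached to a.
adj = {"s": ["a"], "a": ["s", "b", "t"], "b": ["a"], "t": ["a"]}
tour = [("s", "a"), ("a", "b"), ("b", "a"), ("a", "t")]
print(one_cuts(adj, "s", "t", tour))
```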

Of course, the $\mathrm{OPT}$-edges in $1$-cuts are not known upfront, and hence, the problem cannot be decomposed so easily. However, one can use a dynamic program to guess the $1$-cuts from left to right, i.e., from $\delta(L_{0})$ to $\delta(L_{d(s,t)-1})$, together with the $\mathrm{OPT}$-edge in each of them. Notice that the sub-instances may not have a short start-to-end distance (e.g. $d(v_{6},u_{8})$ in Figure 1 may be substantially larger than $\nicefrac{{1}}{{3}}$ times the length of an optimum $v_{6}$-$u_{8}$ tour in $G[L_{8}\setminus L_{6}]$). As shown in [26], this issue can be addressed by applying the dynamic program recursively to the sub-instances. A key observation in [26] is that a constant recursion depth is enough to ensure that the total cost of the remaining sub-instances becomes negligible compared to the edges guessed through the recursive dynamic program.

Notice that to apply this dynamic programming idea, one crucially needs $d(s,t)\geq(\nicefrac{{1}}{{3}}+\varepsilon)\cdot|\mathrm{OPT}|$ for some constant $\varepsilon>0$. Indeed, otherwise none of the cuts $\delta(L_{i})$ may be a $1$-cut, and no decomposition into smaller Path TSP instances as above is possible. This is the reason why this technique has only been applied to reduce the start-to-end distance to about a third of the length of $\mathrm{OPT}$.

If we could guess not only $1$ -cuts, but also cuts with a larger constant number of $\mathrm{OPT}$ -edges, say up to $5$ , then we could handle instances with an $s$ - $t$ distance below $\nicefrac{{1}}{{3}}\cdot\ell(\mathrm{OPT})$ . (This idea is inspired by a recent dynamic programming approach in [18] in the context of chain-constrained spanning trees.) Our approach aims at realizing this high-level plan. However, this ostensibly simple algorithmic idea comes with several significant technical hurdles. Most importantly, if we guess more than one edge, the resulting sub-problems are not Path TSP problems anymore. More precisely, if we guess 5 edges in each of two consecutive $5$ -cuts, then we have up to 10 interface vertices, i.e., endpoints of guessed edges. See Figure 2 for an example.

An optimum $s$ - $t$ tour is not necessarily connected inside the vertex set of a sub-problem but every connected component must contain at least one interface vertex. Moreover, $\mathrm{OPT}$ needs to connect some of the interface vertices to each other. This induces connectivity constraints for the sub-problem, shown as gray sets in Figure 2. They can also be guessed since the number of interface vertices is constant. Note, however, that we cannot guess the entire connected components, as there are exponentially many options.

Clearly, these sub-problems become significantly more difficult than the original Path TSP problem. Moreover, if we try to apply such a procedure recursively, then the sub-problems can become more complex with each recursion step, because of an increase in the number of interface vertices per sub-problem. Another important issue in a recursive application to our more complex sub-problem is to identify good cuts in which we should guess edges of $\mathrm{OPT}$ . Our cuts will result from the dual of a $T$ -join problem. They will no longer form a chain, but their laminar structure still allows for a dynamic programming approach.

Moreover, it is not obvious how to reduce the problem to TSP in the case when we cannot guess edges by dynamic programming, and this will involve a careful guessing of further edges of $\mathrm{OPT}$ .

We will now describe our approach in detail. We start by defining a new problem class around which our method is centered, and which we call $\Phi$ -TSP.

3.2 $\Phi$ -TSP

As described above, when guessing edges, the endpoints of those edges play a special role in terms of how we have to connect things. We capture this through the notion of an interface. We define this notion for a general graph $G$ below and will typically use it for subgraphs of the instance we are interested in.

Definition 5 (interface).

An interface $\Phi$ of a graph $G=(V,E)$ is a triple $\Phi=(I,T,\mathcal{C})$ with

(i) $T\subseteq I\subseteq V$, where $|T|$ is even, and

(ii) $\mathcal{C}\subseteq 2^{I}$ is a partition of $I$.

For an interface $\Phi$ of $G$ , we denote by $(I_{\Phi},T_{\Phi},\mathcal{C}_{\Phi})$ its corresponding triple and call $|I_{\Phi}|$ its size.

For a given interface, we are interested in finding what we call $\Phi$ -tours, which are defined as follows.

Definition 6 ( $\Phi$ -tour).

Let $G=(V,E)$ be a graph. Let $\Phi=(I,T,\mathcal{C})$ be an interface of $G$ . A $\Phi$ -tour in $G$ is a multi-set $F\subseteq E$ with

(i) $T=\mathrm{odd}(F)$, i.e., $F$ is a $T$-join,

(ii) $(V,F)/I$ is connected, and

(iii) for any $C\in\mathcal{C}$, the vertices in $C$ lie in the same connected component of $(V,F)$.
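The three conditions translate directly into a checker. The sketch below is one possible reading of the definition, not code from the paper; the names `components` and `is_phi_tour` and the data representation (vertex set, list of edge pairs, list of sets for $\mathcal{C}$) are assumptions. Condition (ii) is tested via the equivalent statement that, for non-empty $I$, every connected component of $(V,F)$ contains a vertex of $I$.

```python
from collections import Counter

def components(vertices, edges):
    """Map each vertex to a representative of its connected component."""
    parent = {v: v for v in vertices}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]   # path halving
            v = parent[v]
        return v

    for u, v in edges:
        parent[find(u)] = find(v)
    return {v: find(v) for v in vertices}

def is_phi_tour(V, F, I, T, C):
    """Check conditions (i)-(iii) of a Phi-tour for Phi = (I, T, C)."""
    degree = Counter()
    for u, v in F:
        degree[u] += 1
        degree[v] += 1
    if {v for v in V if degree[v] % 2 == 1} != set(T):
        return False                         # (i) F must be a T-join
    comp = components(V, F)
    roots = set(comp.values())
    if I:
        # (ii) after contracting I, the graph is connected exactly when
        # every component of (V, F) contains an interface vertex
        if roots != {comp[v] for v in I}:
            return False
    elif len(roots) > 1:
        return False                         # (ii) with I empty: (V, F) connected
    # (iii) each class of the partition lies inside one component
    return all(len({comp[v] for v in cls}) == 1 for cls in C)

# Two "depots" s and t, each serving one customer by a doubled edge.
V = {"s", "t", "a", "b"}
F = [("s", "a"), ("s", "a"), ("t", "b"), ("t", "b")]
print(is_phi_tour(V, F, {"s", "t"}, set(), [{"s"}, {"t"}]))
```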

Figure 3 exemplifies the notation of an interface $\Phi$ and a $\Phi$ -tour.

The problem we focus on in the following, which we call $\Phi$ -TSP, seeks to find a shortest $\Phi$ -tour.

Definition 7 ( $\Phi$ -TSP).

Given a weighted graph $G=(V,E,\ell)$ and an interface $\Phi$ of $G$ , compute a shortest $\Phi$ -tour in $G$ or decide that none exists. In short,

$$\min\{\ell(F):F\text{ is a }\Phi\text{-tour in }G\}.$$

Note that for any distinct $s,t\in V$, by choosing the interface $\Phi=(I,T,\mathcal{C})$ with $I=T=\{s,t\}$ and $\mathcal{C}=\{\{s,t\}\}$, we have that $\Phi$-tours correspond to solutions to $s$-$t$ Path TSP. Analogously, for larger sets $T$, one captures the $T$-tour problem (see [24, 3, 22]). Another special case is the uncapacitated vehicle routing problem with a fixed number of depots, for which Xu and Rodrigues [29] gave a $\nicefrac{{3}}{{2}}$-approximation. Here, $I$ is the set of depots, $T=\emptyset$, and $\mathcal{C}$ is the partition into singletons.

Depending on the structure of the graph $G$ and the interface $\Phi$ , it may be that no $\Phi$ -tour exists. We call an interface $\Phi$ of $G$ feasible if $G$ admits a $\Phi$ -tour. The existence of a $\Phi$ -tour admits the following easy characterization, which can be checked in linear time.

Lemma 8.

Let $G=(V,E,\ell)$ be a weighted graph. Let $\Phi=(I,T,\mathcal{C})$ be an interface of $G$ . Then $G$ admits a $\Phi$ -tour if and only if all of the following conditions hold.

(i) Each connected component of $G$ contains an even number of vertices in $T$,

(ii) $G/I$ is connected, and

(iii) for every $C\in\mathcal{C}$, the vertices in $C$ lie in the same connected component of $G$.

Proof.

The three mentioned conditions are clearly necessary for $G$ to admit a $\Phi$ -tour. Moreover, if they are satisfied then, due to (i), there exists a $T$ -join $J\subseteq E$ , and points (ii) and (iii) guarantee that $E\stackrel{{\scriptstyle.}}{{\cup}}(E\setminus J)$ is a $\Phi$ -tour in $G$ . ∎
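A minimal sketch of the linear-time feasibility test from Lemma 8 follows, using a single depth-first search to label connected components. The names `component_of` and `is_feasible` and the input representation are assumptions for illustration.

```python
from collections import Counter, defaultdict

def component_of(V, E):
    """Label every vertex with a representative of its connected component."""
    adj = defaultdict(list)
    for u, v in E:
        adj[u].append(v)
        adj[v].append(u)
    label = {}
    for root in V:
        if root in label:
            continue
        label[root] = root
        stack = [root]
        while stack:
            u = stack.pop()
            for w in adj[u]:
                if w not in label:
                    label[w] = root
                    stack.append(w)
    return label

def is_feasible(V, E, I, T, C):
    """Check conditions (i)-(iii) of Lemma 8 for the interface (I, T, C)."""
    comp = component_of(V, E)
    # (i) every component contains an even number of T-vertices
    t_per_component = Counter(comp[v] for v in T)
    if any(c % 2 == 1 for c in t_per_component.values()):
        return False
    # (ii) G/I connected: every component must meet I (or G is connected if I is empty)
    all_components = set(comp.values())
    if I:
        if all_components != {comp[v] for v in I}:
            return False
    elif len(all_components) > 1:
        return False
    # (iii) each class of C lies within one component
    return all(len({comp[v] for v in cls}) == 1 for cls in C)

print(is_feasible({"s", "a", "t"}, [("s", "a"), ("a", "t")],
                  {"s", "t"}, {"s", "t"}, [{"s", "t"}]))
```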

It is crucial for our approach to start with a polynomial-time constant-factor approximation algorithm, which we will successively strengthen as discussed in the following.

A $7$ -approximation algorithm for $\Phi$ -TSP can be obtained easily as follows. Compute a minimum cost edge set $F_{1}$ satisfying (i) ( $T$ -join), a minimum cost edge set $F_{2}$ satisfying (ii) (spanning tree in $G/I$ ), and a 2-approximation $F_{3}$ of a minimum cost edge set satisfying (iii) (Steiner forest). Then the disjoint union $F_{1}\stackrel{{\scriptstyle.}}{{\cup}}F_{2}\stackrel{{\scriptstyle.}}{{\cup}}F_{2}\stackrel{{\scriptstyle.}}{{\cup}}F_{3}\stackrel{{\scriptstyle.}}{{\cup}}F_{3}$ is a $7$ -approximation.
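The factor $7$ can be verified term by term. An optimal $\Phi$-tour $\mathrm{OPT}$ satisfies each of (i), (ii), and (iii) on its own, so it bounds each of the three separate optima from above. Taking $F_{2}$ and $F_{3}$ twice leaves all degree parities as in $F_{1}$. A sketch of the calculation:

```latex
\begin{align*}
\ell\bigl(F_{1}\,\dot\cup\,F_{2}\,\dot\cup\,F_{2}\,\dot\cup\,F_{3}\,\dot\cup\,F_{3}\bigr)
  &= \ell(F_{1}) + 2\,\ell(F_{2}) + 2\,\ell(F_{3})\\
  &\le \ell(\mathrm{OPT}) + 2\,\ell(\mathrm{OPT}) + 2\cdot 2\,\ell(\mathrm{OPT})
   = 7\,\ell(\mathrm{OPT}).
\end{align*}
```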

With a little more care we can obtain a $4$ -approximation algorithm, using Jain’s iterative rounding framework [13]:

Theorem 9.

$\Phi$ -TSP admits a strongly polynomial $4$ -approximation algorithm.

We defer the proof to Section 7. In the rest of this paper, we will derive a strongly polynomial $(\alpha+\varepsilon)$ -approximation algorithm for $\Phi$ -TSP instances with bounded interface size, where $\alpha$ is the approximation guarantee for TSP; see Theorem 25.

3.3 Iterative improvement of basic algorithm

For a TSP algorithm $\mathcal{A}$ , we denote for every weighted graph $G$ by $f_{\mathcal{A}}(G)$ the maximum runtime of algorithm $\mathcal{A}$ on any subgraph of $G$ . Similarly, for a $\Phi$ -TSP algorithm $\mathcal{B}$ , we denote for every weighted graph $G$ and any $k\in\mathbb{R}_{\geq 0}$ by $f_{\mathcal{B}}(G,k)$ the maximum runtime of algorithm $\mathcal{B}$ on any instance $(G^{\prime},\Phi)$ , where $G^{\prime}$ is a subgraph of $G$ and $|I_{\Phi}|\leq k$ .

Our plan is to start with the $4$ -approximation algorithm for $\Phi$ -TSP guaranteed by Theorem 9, and successively improve it through a TSP algorithm with an approximation guarantee $\alpha$ . The following Boosting Theorem is the main technical result towards this goal and quantifies the improvement in terms of approximation factor that we are able to obtain in one improvement step.

Theorem 10 (Boosting Theorem).

Let $\alpha,\beta>1$ . Suppose we are given:

(a) an $\alpha$-approximation algorithm $\mathcal{A}$ for TSP, and

(b) a $\beta$-approximation algorithm $\mathcal{B}$ for $\Phi$-TSP.

Then there is an algorithm that, for any $\varepsilon\in(0,1]$ , any weighted graph $G=(V,E,\ell)$ , and any feasible interface $\Phi=(I,T,\mathcal{C})$ of $G$ , returns a $\Phi$ -tour $F$ in $G$ of length

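$$\ell(F)\leq\max\left\{(1+\varepsilon)\alpha,\ \beta-\frac{\varepsilon}{8}(\beta-1)\right\}\cdot\ell(\mathrm{OPT})$$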

in time $|V|^{O\left(\frac{|I|}{\varepsilon}\right)}\cdot\left(f_{\mathcal{A}}(G)+f_{\mathcal{B}}\left(G,\frac{9|I|}{\varepsilon}\right)\right)$ , where $\mathrm{OPT}$ is a shortest $\Phi$ -tour in $G$ . In particular, the algorithm makes calls to $\mathcal{B}$ only on instances with interfaces of size bounded by $\frac{9|I|}{\varepsilon}$ .

To prove Theorem 1, we start with $\beta=4$ (Theorem 9) and apply Theorem 10 repeatedly, but only a constant number of times. The approximation guarantee $\beta$ decreases until it reaches $(1+\varepsilon)\alpha$ . All interfaces will have constant size. We defer the details to Section 6.
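To see why a constant number of applications suffices, one can iterate the guarantee of Theorem 10 numerically. The sketch below assumes, for illustration only, that the same $\varepsilon$ is used in every step; the precise parameter choices are those of Section 6 and are not reproduced here. The function name `boosting_schedule` is hypothetical. Each step shrinks $\beta-1$ by a factor $(1-\nicefrac{\varepsilon}{8})$ until the target $(1+\varepsilon)\alpha$ is reached, so the number of steps depends only on $\alpha$ and $\varepsilon$.

```python
def boosting_schedule(alpha, eps, beta=4.0):
    """Successive approximation guarantees obtained by repeated boosting.

    Starts from `beta` (4 by Theorem 9) and applies
        beta <- max((1 + eps) * alpha, beta - eps / 8 * (beta - 1))
    until the target (1 + eps) * alpha is reached. Assumes alpha > 1 and
    0 < eps <= 1, and uses the same eps in every step (an assumption).
    """
    target = (1 + eps) * alpha
    betas = [beta]
    while betas[-1] > target:
        b = betas[-1]
        betas.append(max(target, b - eps / 8 * (b - 1)))
    return betas

schedule = boosting_schedule(alpha=1.5, eps=0.1)
print(len(schedule) - 1, "boosting steps; final guarantee", schedule[-1])
```

Since each step multiplies the admissible interface size by $\nicefrac{9}{\varepsilon}$, a constant number of steps keeps all interfaces of constant size.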

3.4 Proof outline of Boosting Theorem (Theorem 10)

Theorem 10 is obtained by designing two algorithms to obtain a $\Phi$ -tour and then returning the better of the $\Phi$ -tours computed by these algorithms. Each of the following two theorems summarizes the guarantee we obtain with one of the two algorithms. After that, Algorithm 1, described below, combines these two sub-procedures to obtain an algorithm that implies Theorem 10.

The following theorem yields a short $\Phi$ -tour if the length of a minimum $T_{\Phi}$ -join is small.

Theorem 11.

Let $\alpha>1$ . Assume we are given an $\alpha$ -approximation algorithm $\mathcal{A}$ for TSP. Then, for any $\delta>0$ , any weighted graph $G=(V,E,\ell)$ , and any feasible interface $\Phi=(I,T,\mathcal{C})$ of $G$ , one can determine a $\Phi$ -tour $F$ in $G$ with

$$\ell(F)\leq(1+\delta)\,\alpha\cdot\ell(\mathrm{OPT})+(\alpha+1)\cdot\ell(J)$$

in time $|V|^{O\left(\frac{|I|}{\delta}\right)}\cdot f_{\mathcal{A}}(G)$ , where $J$ is a shortest $T$ -join in $G$ and $\mathrm{OPT}$ is a shortest $\Phi$ -tour in $G$ .

We will give the proof in Section 4. The next theorem, proven in Section 5, states that we also obtain a short $\Phi$ -tour if the length of a minimum $T$ -join is large.

Theorem 12.

Let $\beta>1$ . Assume we are given a $\beta$ -approximation algorithm $\mathcal{B}$ for $\Phi$ -TSP. Then, for any $\delta>0$ , any weighted graph $G=(V,E,\ell)$ , and any feasible interface $\Phi=(I,T,\mathcal{C})$ of $G$ , one can determine a $\Phi$ -tour $F$ in $G$ with

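$$\ell(F)\leq\beta\cdot\ell(\mathrm{OPT})-(\beta-1)\cdot\left(\ell(J)-\delta\cdot\ell(\mathrm{OPT})\right)$$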

in time $|V|^{O\left(|I|+\frac{|T|}{\delta}\right)}\cdot f_{\mathcal{B}}\left(G,|I|+\frac{|T|}{\delta}\right)$ , where $J$ is a shortest $T$ -join in $G$ and $\mathrm{OPT}$ is a shortest $\Phi$ -tour in $G$ .

Lemma 13.

Given a weighted graph $G$ and a feasible interface $\Phi=(I,T,\mathcal{C})$ of $G$ , Algorithm 1 returns a $\Phi$ -tour $F$ in $G$ with the guarantees stated in Theorem 10.

Proof.

The running time guarantee stated in Theorem 10 immediately follows from Theorem 11 and Theorem 12, using $|I|+\frac{|T|}{\nicefrac{{\varepsilon}}{{8}}}\leq\frac{9|I|}{\varepsilon}$ .
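The inequality used for the running time follows from $T\subseteq I$ and $\varepsilon\leq 1$. A short derivation, written out as a sketch:

```latex
% Uses |T| <= |I| (since T is a subset of I) and |I| <= |I| / eps (since 0 < eps <= 1).
\[
  |I| + \frac{|T|}{\varepsilon/8}
  \;=\; |I| + \frac{8\,|T|}{\varepsilon}
  \;\le\; \frac{|I|}{\varepsilon} + \frac{8\,|I|}{\varepsilon}
  \;=\; \frac{9\,|I|}{\varepsilon}.
\]
```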

Let $F\in\{F_{1},F_{2}\}$ be the $\Phi$ -tour returned by Algorithm 1. To show that $F$ fulfills the approximation guarantee stated in (1), we distinguish two cases.

If $\ell(J)\leq\frac{\varepsilon}{4}\cdot\ell(\mathrm{OPT})$ , then the solution $F_{1}$ will be short enough:

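$$\ell(F)\leq\ell(F_{1})\leq\left(1+\tfrac{\varepsilon}{2}\right)\alpha\cdot\ell(\mathrm{OPT})+(\alpha+1)\cdot\ell(J)\leq\left(1+\tfrac{\varepsilon}{2}\right)\alpha\cdot\ell(\mathrm{OPT})+\tfrac{\alpha+1}{2}\cdot\tfrac{\varepsilon}{2}\cdot\ell(\mathrm{OPT})\leq(1+\varepsilon)\,\alpha\cdot\ell(\mathrm{OPT}),$$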

where we used $\nicefrac{{(\alpha+1)}}{{2}}\leq\alpha$ for the last inequality, which holds because $\alpha\geq 1$ .

If $\ell(J)\geq\frac{\varepsilon}{4}\cdot\ell(\mathrm{OPT})$ , then the $\Phi$ -tour $F_{2}$ will be short enough:

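$$\ell(F)\leq\ell(F_{2})\leq\beta\cdot\ell(\mathrm{OPT})-(\beta-1)\cdot\left(\ell(J)-\tfrac{\varepsilon}{8}\cdot\ell(\mathrm{OPT})\right)\leq\left(\beta-\tfrac{\varepsilon}{8}\,(\beta-1)\right)\cdot\ell(\mathrm{OPT}),$$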

thus completing the proof of Lemma 13. ∎

For the proof of Theorem 10, it remains to show Theorem 11 and Theorem 12.

4 Finding a short $\Phi$ -tour if there is a short $T$ -join

In this section we prove Theorem 11, i.e., how to get a short $\Phi$ -tour if the shortest $T$ -join has small length compared to $\ell(\mathrm{OPT})$ .

We start by analyzing a simple algorithm for computing a $\Phi$ -tour. However, this simple algorithm will not be sufficient to prove Theorem 11. Thus in a second step, we will refine the algorithm to obtain the desired bound.

Notice that Algorithm 2 always returns an edge set $F$ , even if the input is infeasible. We therefore show first that Algorithm 2 does return a $\Phi$ -tour whenever it is run with a feasible input.

Lemma 14.

The set $F$ returned by Algorithm 2 is a $\Phi$ -tour if and only if the input is feasible, i.e., $G$ admits a $\Phi$ -tour.

Proof.

Assume that $G$ admits a $\Phi$ -tour, which implies by Lemma 8 that the three properties (i), (ii), and (iii) listed in Lemma 8 are fulfilled. Because the set $Q$ computed in Algorithm 2 consists of TSP tours in each connected component of $G$ , the vertex sets of the connected components of $(V,Q)$ and $G$ are the same. Because $G$ fulfills (ii) and (iii), this implies that also $(V,Q)$ and $(V,Q\stackrel{{\scriptstyle.}}{{\cup}}J)$ fulfill these two conditions. Finally, $\mathrm{odd}(Q\stackrel{{\scriptstyle.}}{{\cup}}J)=\mathrm{odd}(J)=T$ , because $\mathrm{odd}(Q)=\emptyset$ and $J$ is a $T$ -join, which shows that $Q\stackrel{{\scriptstyle.}}{{\cup}}J$ is indeed a $\Phi$ -tour. ∎
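The construction used in this proof (a TSP tour in every connected component, plus a $T$-join $J$) can be sketched as follows. This is a minimal illustration, not the paper's Algorithm 2 verbatim: `doubled_tree_oracle` is a hypothetical stand-in for the TSP oracle $\mathcal{A}$ with no approximation guarantee claimed, and all function names are assumptions.

```python
from collections import Counter, defaultdict


def components(vertices, edges):
    """Vertex sets of the connected components of the graph (vertices, edges)."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, comps = set(), []
    for s in vertices:
        if s in seen:
            continue
        comp, stack = {s}, [s]
        seen.add(s)
        while stack:
            u = stack.pop()
            for w in adj[u]:
                if w not in seen:
                    seen.add(w)
                    comp.add(w)
                    stack.append(w)
        comps.append(comp)
    return comps


def doubled_tree_oracle(comp, comp_edges):
    """Hypothetical TSP oracle: doubles some spanning tree of one component."""
    reached, tree = {next(iter(comp))}, []
    grew = True
    while grew:
        grew = False
        for u, v in comp_edges:
            if (u in reached) != (v in reached):
                reached.update((u, v))
                tree.append((u, v))
                grew = True
    return tree + tree  # connected on comp, every degree even


def odd_vertices(multi_edges):
    """odd(F): vertices incident to an odd number of edges of the multiset F."""
    deg = Counter(x for e in multi_edges for x in e)
    return {v for v, d in deg.items() if d % 2 == 1}


def simple_phi_tour(vertices, edges, t_join, tsp_oracle=doubled_tree_oracle):
    """Return the multiset Q + J: one tour per component of G, plus the T-join J."""
    q = []
    for comp in components(vertices, edges):
        comp_edges = [(u, v) for u, v in edges if u in comp and v in comp]
        q += tsp_oracle(comp, comp_edges)
    return q + list(t_join)


if __name__ == "__main__":
    V = [1, 2, 3, 4, 5]
    E = [(1, 2), (2, 3), (3, 1), (4, 5)]
    J = [(1, 2), (4, 5)]  # a T-join for T = {1, 2, 4, 5}
    F = simple_phi_tour(V, E, J)
    print(sorted(odd_vertices(F)))
```

Because every tour in $Q$ has only even degrees, the odd-degree vertices of the result are exactly those of $J$, which mirrors the parity argument above.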

We now analyze the length of the $\Phi$ -tour returned by Algorithm 2.

Lemma 15.

Assume we are given an $\alpha$ -approximation algorithm $\mathcal{A}$ for TSP. Let $G=(V,E,\ell)$ be a weighted graph, $\Phi=(I,T,\mathcal{C})$ a feasible interface of $G$ , and $J$ a $T$ -join in $G$ . Then, Algorithm 2 computes a $\Phi$ -tour $F$ in $G$ with

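$$\ell(F)\leq\alpha\cdot\ell(\mathrm{OPT})+(\alpha+1)\cdot\ell(J)+2\alpha\cdot|I|\cdot\max\{\ell(e):e\in E\setminus(\mathrm{OPT}\cup J)\}$$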

in time $O(|V|\cdot f_{\mathcal{A}}(G))$ , where $\mathrm{OPT}$ is a shortest $\Phi$ -tour. Here $\max\emptyset:=0$ .

Proof.

First, observe that the running time is indeed as claimed, because the bottleneck of the algorithm is calling $\mathcal{A}$ for each connected component of $G$ ; moreover, the connected components can be found in linear time and there are at most $|V|$ many of them.

To bound $\ell(Q)$ , we transform a shortest $\Phi$ -tour $\mathrm{OPT}$ into a union of TSP solutions, one for each connected component of $G$ . Let $L\subseteq E$ be a minimal edge set such that the vertex sets of the connected components of $(V,\mathrm{OPT}\cup J\cup L)$ and $G$ are the same. Observe that the multi-set $\mathrm{OPT}\stackrel{{\scriptstyle.}}{{\cup}}J\stackrel{{\scriptstyle.}}{{\cup}}L\stackrel{{\scriptstyle.}}{{\cup}}L$ is a union of TSP solutions, one for each connected component of $G$ . Because the set $Q$ determined in Algorithm 2 was obtained through $\mathcal{A}$ , which is an $\alpha$ -approximation algorithm, we have

$$\ell(Q)\leq\alpha\cdot\left(\ell(\mathrm{OPT})+\ell(J)+2\cdot\ell(L)\right)$$

and, hence, the solution $F=Q\stackrel{{\scriptstyle.}}{{\cup}}J$ returned by the algorithm satisfies

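$$\ell(F)=\ell(Q)+\ell(J)\leq\alpha\cdot\ell(\mathrm{OPT})+(\alpha+1)\cdot\ell(J)+2\alpha\cdot\ell(L).\tag{2}$$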

Moreover, because $\mathrm{OPT}$ is a $\Phi$ -tour, we have that $(V,\mathrm{OPT})/I$ must be connected, which implies that $(V,\mathrm{OPT})$ has at most $|I|$ connected components, and thus

$$|L|\leq|I|.$$

Together with (2), this leads to the desired guarantee:

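$$\ell(F)\leq\alpha\cdot\ell(\mathrm{OPT})+(\alpha+1)\cdot\ell(J)+2\alpha\cdot|I|\cdot\max\{\ell(e):e\in L\}\leq\alpha\cdot\ell(\mathrm{OPT})+(\alpha+1)\cdot\ell(J)+2\alpha\cdot|I|\cdot\max\{\ell(e):e\in E\setminus(\mathrm{OPT}\cup J)\},$$

where the last inequality follows from $L\subseteq E\setminus(\mathrm{OPT}\cup J)$, which holds because the edges in $L$ connect different connected components of $(V,\mathrm{OPT}\cup J)$. ∎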

We now explain how to refine Algorithm 2 by a guessing step to obtain the guarantees claimed in Theorem 11. If all edges that are not contained in $\mathrm{OPT}\cup J$ have length at most $\frac{\delta\cdot\ell(\mathrm{OPT})}{2\cdot|I|}$ , Lemma 15 already implies the desired bound. To obtain this property, we delete all edges from $G$ that are heavy, i.e. have length at least $\frac{\delta\cdot\ell(\mathrm{OPT})}{2\cdot|I|}$ , and are not contained in $\mathrm{OPT}\cup J$ . We guess this set of edges to delete as follows. First we guess the set $H$ of heavy edges, which can be done in polynomial time by guessing a minimum length edge in $H$ . Then we guess the set $H^{*}=\mathrm{OPT}\cap H$ of heavy edges contained in $\mathrm{OPT}$ . Algorithm 3 formalizes this procedure and, as we show next, indeed implies Theorem 11.
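Guessing a minimum-length edge $f$ of $H$ determines $H$ as the set of all edges at least as long as $f$, so only polynomially many candidates arise. The sketch below enumerates them; it is an illustration under that reading, and the name `heavy_set_candidates` and the dictionary-based edge representation are assumptions.

```python
def heavy_set_candidates(length):
    """Enumerate candidate heavy-edge sets H.

    `length` maps each edge to its length. A candidate is either the empty
    set or {e : length[e] >= length[f]} for a guessed minimum-length edge f.
    """
    yield frozenset()
    for f in length:
        yield frozenset(e for e in length if length[e] >= length[f])


if __name__ == "__main__":
    ell = {("a", "b"): 5, ("b", "c"): 2, ("c", "d"): 7, ("a", "d"): 2}
    for H in heavy_set_candidates(ell):
        print(sorted(H))
```

Whatever the (unknown) threshold $\frac{\delta\cdot\ell(\mathrm{OPT})}{2\cdot|I|}$ is, the true set of heavy edges is one of these candidates.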

Proof of Theorem 11.

We start by observing that the running time of Algorithm 3 is indeed bounded by $|V|^{O(\nicefrac{{|I|}}{{\delta}})}\cdot f_{\mathcal{A}}(G)$ . There are at most $|V|^{2}$ possible edges $f$ that are being considered in the outer for-loop. For each of them, there are $|V|^{O(\nicefrac{{|I|}}{{\delta}})}$ possible sets $H^{*}$ considered in the inner for-loop. Thus, there are at most $|V|^{O(\nicefrac{{|I|}}{{\delta}})}$ calls to Algorithm 2. Finally, all other operations can be done in time $|V|^{O(1)}$ .

We now show that Algorithm 3 returns a $\Phi$ -tour with the guarantees claimed by Theorem 11. Let $\mathrm{OPT}$ be a shortest $\Phi$ -tour and let $H\coloneqq\{e\in E:\ell(e)\geq\nicefrac{{\delta\cdot\ell(\mathrm{OPT})}}{{2|I|}}\}$ be the set of heavy edges. Then in some iteration of the outer for-loop we consider the set $H$ . Because

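$$\ell(\mathrm{OPT})\geq\ell(H\cap\mathrm{OPT})\geq|H\cap\mathrm{OPT}|\cdot\frac{\delta\cdot\ell(\mathrm{OPT})}{2|I|},$$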

we have $|H\cap\mathrm{OPT}|\leq\nicefrac{{2|I|}}{{\delta}}$ , and thus, we consider the set $H^{*}\coloneqq H\cap\mathrm{OPT}$ in some iteration of the inner for-loop. As $D=H\setminus(H^{*}\cup J)$ does not contain any edge of $\mathrm{OPT}$ , the $\Phi$ -tour $\mathrm{OPT}$ is a feasible solution of the instance to which we apply Algorithm 2. Moreover, the set $D$ contains all heavy edges not contained in $\mathrm{OPT}\cup J$ and hence by Lemma 15, we obtain

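$$\ell(F)\leq\alpha\cdot\ell(\mathrm{OPT})+(\alpha+1)\cdot\ell(J)+2\alpha\cdot|I|\cdot\frac{\delta\cdot\ell(\mathrm{OPT})}{2|I|}=(1+\delta)\,\alpha\cdot\ell(\mathrm{OPT})+(\alpha+1)\cdot\ell(J).$$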

∎

5 Iterative improvement via dynamic programming

In this section, we show how to prove Theorem 12, i.e., how to obtain a short $\Phi$ -tour if the length of a shortest $T$ -join is large. Here, our goal is to use dynamic programming to “guess” a significant portion, in terms of total length, of edges used in $\mathrm{OPT}$ . Very recently, dynamic programming has become a strong tool in the context of Path TSP, Chain-Constrained Spanning Trees, and related problems [27, 26, 30, 18], leading to the currently best known approximation factors for these settings. The dynamic programming idea we employ combines and extends elements used in these prior dynamic programming techniques.

What we aim to achieve with dynamic programming in the context of $\Phi$ -TSP, for some interface $\Phi=(I,T,\mathcal{C})$ of $G$ , is the following. We can fix an arbitrary laminar family $\mathcal{L}$ of subsets of $V$ . Our goal is to guess what edges of $\mathrm{OPT}$ are crossing the cuts in $\mathcal{L}$ . Clearly, if $\mathrm{OPT}\cap\delta(L)$ contains many edges for some $L\in\mathcal{L}$ , it seems computationally elusive to guess them. This is the reason why we fix some constant $k$ and only guess $\mathrm{OPT}$ -edges in cuts $\delta(L)$ for $L\in\mathcal{L}$ if $|\mathrm{OPT}\cap\delta(L)|\leq k$ . We denote the sets inducing these cuts by $\mathcal{L}(\mathrm{OPT},k)\subseteq\mathcal{L}$ and the $\mathrm{OPT}$ -edges in these cuts by $\mathrm{OPT}(\mathcal{L},k)\subseteq\mathrm{OPT}$ . Formally, for any edge set $R\subseteq E$ , we define

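$$\mathcal{L}(R,k)\coloneqq\{L\in\mathcal{L}:|R\cap\delta(L)|\leq k\}\quad\text{and}\quad R(\mathcal{L},k)\coloneqq R\cap\bigcup_{L\in\mathcal{L}(R,k)}\delta(L).$$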
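The two objects just introduced, the sets of $\mathcal{L}$ whose cuts contain at most $k$ edges of $R$ and the edges of $R$ lying in those cuts, can be computed directly. The following is a small sketch; the function names and the representation of edges as vertex pairs are assumptions made for illustration.

```python
def crossing(edges, L):
    """Edges with exactly one endpoint in L, i.e. the edges of delta(L)."""
    return [(u, v) for u, v in edges if (u in L) != (v in L)]


def low_cut_sets(family, R, k):
    """Sets L of the family whose cut contains at most k edges of R."""
    return [L for L in family if len(crossing(R, L)) <= k]


def low_cut_edges(family, R, k):
    """Edges of R lying in at least one cut delta(L) with at most k edges of R."""
    chosen = set()
    for L in low_cut_sets(family, R, k):
        chosen.update(crossing(R, L))
    return chosen


if __name__ == "__main__":
    R = [(1, 2), (2, 3), (3, 4), (1, 3)]
    family = [frozenset({1}), frozenset({1, 2}), frozenset({1, 2, 3})]
    print(low_cut_sets(family, R, 1), low_cut_edges(family, R, 1))
```

In the dynamic program these edges are the ones that get guessed exactly, while all remaining edges are handled by the $\beta$-approximation algorithm.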

As we discuss in more detail later, guessing the edges $\mathrm{OPT}(\mathcal{L},k)$ can be achieved through a dynamic program that guesses the $\mathrm{OPT}$ -edges in the different cuts defined by $\mathcal{L}$ step by step, from smaller to larger sets in $\mathcal{L}$ . However, the running time of the propagation step of the dynamic program depends on the number of disjoint sets in $\mathcal{L}$ that can be contained in some larger set $L\in\mathcal{L}$ . We capture this dependency through the width $\operatorname{width}(\mathcal{L})$ of the laminar family $\mathcal{L}$ (see [18] for a similar use of this notion).

Definition 16 (width of a laminar family).

The width $\operatorname{width}(\mathcal{L})$ of a laminar family $\mathcal{L}$ is the number of minimal sets contained in the family.

Observe that the number of minimal sets of a laminar family bounds the size of any subfamily of disjoint sets.
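As a small illustration of Definition 16, the width can be computed by counting the inclusion-wise minimal members of the family. The helper below is a sketch that assumes the family is given as a collection of non-empty vertex sets; the name `width` mirrors the notation but the code is not from the paper.

```python
def width(family):
    """Number of inclusion-wise minimal sets of a (laminar) family."""
    sets = [frozenset(s) for s in family]
    # A set is minimal if no other member is a proper subset of it.
    return sum(1 for s in sets if not any(t < s for t in sets))


if __name__ == "__main__":
    fam = [{1}, {2}, {1, 2}, {3}, {1, 2, 3}, {4, 5}]
    print(width(fam))
```

Every set of a laminar family contains at least one minimal set, and disjoint sets contain different ones, which is why the width bounds the size of any subfamily of disjoint sets.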

The following theorem formalizes what we can achieve through our dynamic program, which we present later in detail. Notice that for the algorithm to be efficient, we need $\mathcal{L}$ to have width bounded by a constant.

Theorem 17.

Let $\beta>1$ . Assume there is a $\beta$ -approximation algorithm $\mathcal{B}$ for $\Phi$ -TSP. Then there is an algorithm that computes for any feasible interface $\Phi=(I,T,\mathcal{C})$ of a weighted graph $G=(V,E,\ell)$ , any $k\in\mathbb{Z}_{\geq 0}$ , and any laminar family $\mathcal{L}$ over $V$ , a $\Phi$ -tour $F$ with

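$$\ell(F)\leq\beta\cdot\ell(R)-(\beta-1)\cdot\ell(R(\mathcal{L},k))\quad\text{for every $\Phi$-tour $R$ in $G$}\tag{3}$$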

in time $|V|^{O\left(|I|+k\cdot\operatorname{width}(\mathcal{L})\right)}\cdot f_{\mathcal{B}}\left(\big{.}G,|I|+k\cdot(\operatorname{width}(\mathcal{L})+1)\right)$ . In particular, the algorithm calls $\mathcal{B}$ only on instances with interfaces of size bounded by $|I|+k\cdot(\operatorname{width}(\mathcal{L})+1)$ .

Note that the guarantee stated in (3) for $R=\mathrm{OPT}$ indeed reflects the guessing of the edges in $\mathrm{OPT}(\mathcal{L},k)$ . More precisely, by replacing $R$ by $\mathrm{OPT}$ in (3), we obtain a $\Phi$ -tour $F$ with an upper bound on its length $\ell(F)$ that decomposes into two terms:

(i)

a term $\ell(\mathrm{OPT}(\mathcal{L},k))$, i.e., each edge $e\in\mathrm{OPT}(\mathcal{L},k)$ contributes its length $\ell(e)$, and

(ii)

a term $\beta\cdot\ell(\mathrm{OPT}\setminus\mathrm{OPT}(\mathcal{L},k))$ , where the length of each other edge in $\mathrm{OPT}$ gets inflated by the approximation factor $\beta$ of the algorithm $\mathcal{B}$ .

5.1 Finding a suitable laminar family

To make significant progress through Theorem 17, we need to find a laminar family $\mathcal{L}$ over $V$ such that $\ell(\mathrm{OPT}(\mathcal{L},k))$ is large. Let $J$ be a shortest $T$ -join. If $\ell(J)$ is large, then we will construct a family $\mathcal{L}$ with the property that even for any $T$ -join $R$ , the length $\ell(R(\mathcal{L},k))$ is large. Notice that this implies what we want because $\mathrm{OPT}$ is a $T$ -join.

This statement is formalized in Lemma 19, which is derived from the dual of the natural linear program to find a shortest $T$ -join. We exploit that there is an optimal dual solution whose support corresponds to a laminar family of subsets of $V$ , which follows from combinatorial uncrossing arguments.

Lemma 18.

Let $G=(V,E,\ell)$ be a weighted graph. Moreover, let $T\subseteq V$ such that $G$ contains a $T$-join, and let $t\in T$. Then there is a strongly polynomial algorithm that computes a laminar family $\mathcal{L}$ over $V\setminus\{t\}$ and values $y\in\mathbb{R}^{\mathcal{L}}_{>0}$ such that

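$$\begin{aligned}\sum_{L\in\mathcal{L}:\,e\in\delta(L)}y_{L}&\leq\ell(e)&&\forall\,e\in E,&&\text{(4)}\\ \sum_{L\in\mathcal{L}}y_{L}&=\ell(J),&&&&\text{(5)}\\ |L\cap T|&\text{ is odd}&&\forall\,L\in\mathcal{L},&&\text{(6)}\end{aligned}$$

where $J$ is a shortest $T$-join in $G$.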

Proof.

We start with a classical linear description to find a minimum length $T$ -join, based on the dominant of the $T$ -join polytope. To this end, let $\mathcal{F}=\{Q\subseteq V\setminus\{t\}:|Q\cap T|\text{ is odd}\}$ ; these vertex sets induce all $T$ -cuts. Then, the following linear program computes the value $\ell(J)$ of a shortest $T$ -join (see, e.g., [21]).

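$$\begin{aligned}\min\ &\sum_{e\in E}\ell(e)\cdot x_{e}\\ \text{s.t.}\ &x(\delta(Q))\geq 1&&\forall\,Q\in\mathcal{F},\\ &x_{e}\geq 0&&\forall\,e\in E.\end{aligned}\tag{7}$$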

Its dual problem, which is a fractional $T$ -cut packing problem, is given below.

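$$\begin{aligned}\max\ &\sum_{Q\in\mathcal{F}}y_{Q}\\ \text{s.t.}\ &\sum_{Q\in\mathcal{F}:\,e\in\delta(Q)}y_{Q}\leq\ell(e)&&\forall\,e\in E,\\ &y_{Q}\geq 0&&\forall\,Q\in\mathcal{F}.\end{aligned}\tag{8}$$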

If $y\in\mathbb{R}^{\mathcal{F}}_{\geq 0}$ is an optimum dual solution with laminar support $\mathcal{L}$ , then $y$ and $\mathcal{L}$ have the desired properties. Here (5) follows from strong duality and (6) follows from $L\in\mathcal{F}$ .

A strongly polynomial algorithm to compute such an optimal dual solution with laminar support can be obtained by standard techniques: Using the framework of Frank and Tardos [7], one can first find in strongly polynomial time a vector $\hat{\ell}\in\mathbb{R}^{E}_{\geq 0}$ with encoding length polynomial in $|E|$ , and such that the set of optimal solutions of (7) remains the same when replacing $\ell$ by $\hat{\ell}$ . Moreover, also the set of optimal dual bases remains the same. This allows for solving (7) in strongly polynomial time through the ellipsoid method. To find an optimal dual basis, one can delete all variables from (8) that do not correspond to constraints encountered by the ellipsoid algorithm when solving (8). Now solving the reduced dual problem (8) with $\hat{\ell}$ instead of $\ell$ allows for finding an optimal dual basis, which, by the result of Frank and Tardos, remains an optimal dual basis for (8) without replacing $\ell$ by $\hat{\ell}$ . Knowing an optimal dual basis, one can obtain an optimal solution to (8) in strongly polynomial time by solving a linear equation system.

Finally, this solution can be transformed into a laminar one by uncrossing: if $y_{A}>0$ and $y_{B}>0$ for $A,B\in\mathcal{F}$ with $A\setminus B\neq\emptyset$ and $B\setminus A\neq\emptyset$ and $A\cap B\neq\emptyset$ , then either $A\cap B$ and $A\cup B$ belong to $\mathcal{F}$ or $A\setminus B$ and $B\setminus A$ belong to $\mathcal{F}$ ; we can increase the dual variables on these two sets by $\min\{y_{A},y_{B}\}$ and decrease the dual variables $y_{A}$ and $y_{B}$ by the same amount, maintaining a feasible dual solution. Karzanov [15] showed how to obtain a laminar family by a sequence of such uncrossing steps in strongly polynomial time. ∎
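The parity dichotomy used in the uncrossing step can be verified by a short counting argument. The derivation below is a sketch of that standard argument, using only that $|A\cap T|$ and $|B\cap T|$ are odd and that $t\notin A\cup B$:

```latex
% Inclusion-exclusion on T:
\[
  |A\cap T| + |B\cap T| \;=\; |(A\cap B)\cap T| + |(A\cup B)\cap T| .
\]
% The left-hand side is even, so the two terms on the right have the same parity.
% Case 1: both are odd. Then A \cap B and A \cup B belong to \mathcal{F}.
% Case 2: both are even. Then
\[
  |(A\setminus B)\cap T| = |A\cap T| - |(A\cap B)\cap T|
  \quad\text{and}\quad
  |(B\setminus A)\cap T| = |B\cap T| - |(A\cap B)\cap T|
\]
% are both odd, so A \setminus B and B \setminus A belong to \mathcal{F}.
```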

We now show that the family $\mathcal{L}$ from Lemma 18 has the desired properties.

Lemma 19.

Let $G=(V,E,\ell)$ be a weighted graph. Moreover, let $T\subseteq V$ such that $G$ admits a $T$ -join. Then, there is a strongly polynomial algorithm that computes a laminar family $\mathcal{L}$ over $V$ with $\operatorname{width}(\mathcal{L})\leq\max\{0,\,|T|-1\}$ such that for any $T$ -join $R\subseteq E$ , and any $k\in\mathbb{Z}_{\geq 0}$ , we have

$$\ell(R(\mathcal{L},k))\geq\ell(J)-\frac{1}{k+1}\cdot\ell(R),$$

where $J$ is a shortest $T$ -join in $G$ .

Proof.

If $T=\emptyset$ , we can simply set $\mathcal{L}=\emptyset$ because $\ell(J)=0$ . Otherwise, we compute $\mathcal{L}$ and $y$ as in Lemma 18 and show that $\mathcal{L}$ has the desired properties. Since every set in $\mathcal{L}$ must contain an element of $T\setminus\{t\}$ , we have $\operatorname{width}(\mathcal{L})\leq|T|-1$ .

Let now $R\subseteq E$ be a $T$ -join, and let $k\in\mathbb{Z}_{\geq 0}$ . Since $R$ is a $T$ -join, it has a non-empty intersection with every cut $\delta(L)$ with $L\in\mathcal{L}$ because of (6). Hence, by (4),

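$$\ell(R(\mathcal{L},k))\geq\sum_{e\in R(\mathcal{L},k)}\ \sum_{L\in\mathcal{L}:\,e\in\delta(L)}y_{L}\geq\sum_{L\in\mathcal{L}(R,k)}|R\cap\delta(L)|\cdot y_{L}\geq\sum_{L\in\mathcal{L}(R,k)}y_{L}.\tag{9}$$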

Again using (4), we moreover obtain

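$$\ell(R)\geq\sum_{e\in R}\ \sum_{L\in\mathcal{L}:\,e\in\delta(L)}y_{L}=\sum_{L\in\mathcal{L}}|R\cap\delta(L)|\cdot y_{L}\geq(k+1)\cdot\sum_{L\in\mathcal{L}\setminus\mathcal{L}(R,k)}y_{L}.\tag{10}$$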

Combining (5), (9), and (10), we obtain

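$$\ell(R(\mathcal{L},k))\geq\sum_{L\in\mathcal{L}(R,k)}y_{L}=\ell(J)-\sum_{L\in\mathcal{L}\setminus\mathcal{L}(R,k)}y_{L}\geq\ell(J)-\frac{1}{k+1}\cdot\ell(R),$$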

as desired. ∎

Finally, Theorem 12 is a direct consequence of Theorem 17 and Lemma 19.

Proof of Theorem 12.

If $T=\emptyset$ , we simply call the given $\beta$ -approximation algorithm $\mathcal{B}$ . Otherwise, let $k=\lfloor\nicefrac{{1}}{{\delta}}\rfloor$ . We apply Lemma 19 to obtain in strongly polynomial time a laminar family $\mathcal{L}$ over $V$ such that

(i)

$\displaystyle\ell(R(\mathcal{L},k))\geq\ell(J)-\frac{1}{k+1}\cdot\ell(R)\geq\ell(J)-\delta\cdot\ell(R)$ for every $T$-join $R\subseteq E$, and

(ii)

$\operatorname{width}(\mathcal{L})\leq\max\{0,\,|T|-1\}=|T|-1$ , where the equality follows from the assumption $T\neq\emptyset$ .

Because a shortest $\Phi$ -tour $\mathrm{OPT}$ is a $T$ -join, we have

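$$\ell(\mathrm{OPT}(\mathcal{L},k))\geq\ell(J)-\delta\cdot\ell(\mathrm{OPT}),$$

which, together with Theorem 17, implies the desired result, i.e., that one can find a $\Phi$-tour $F$ in $G$ with

$$\ell(F)\leq\beta\cdot\ell(\mathrm{OPT})-(\beta-1)\cdot\ell(\mathrm{OPT}(\mathcal{L},k))\leq\beta\cdot\ell(\mathrm{OPT})-(\beta-1)\cdot\left(\ell(J)-\delta\cdot\ell(\mathrm{OPT})\right)$$

in time

$$|V|^{O\left(|I|+k\cdot\operatorname{width}(\mathcal{L})\right)}\cdot f_{\mathcal{B}}\left(G,|I|+k\cdot(\operatorname{width}(\mathcal{L})+1)\right)\leq|V|^{O\left(|I|+\frac{|T|}{\delta}\right)}\cdot f_{\mathcal{B}}\left(G,|I|+\frac{|T|}{\delta}\right).$$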

∎

It remains to derive Theorem 17, which, as mentioned, we show through a dynamic programming approach.

5.2 Combining partial solutions

In the analysis of our dynamic programming algorithm we use the following notion of an induced interface, which allows us to analyze the algorithm with respect to interfaces coming from a shortest $\Phi$ -tour $\mathrm{OPT}$ .

Definition 20 (induced interface).

Let $G=(V,E,\ell)$ be a weighted graph. Let $\Phi=(I,T,\mathcal{C})$ be an interface of $G$ , and let $F$ be a $\Phi$ -tour in $G$ . For $W\subseteq V$ , the interface $\Phi_{W}=(I_{W},T_{W},\mathcal{C}_{W})$ induced by $(F,\Phi)$ on $W$ is defined by

(i)

$I_{W}=(I\cap W)\cup U$, where $U$ is the set of vertices in $W$ that are connected by an edge of $F$ to a vertex in $V\setminus W$,

(ii)

$T_{W}=\mathrm{odd}(F[W])$, and

(iii)

$\mathcal{C}_{W}\subseteq 2^{I_{W}}$ contains, for each connected component of $(W,F[W])$, a set including all vertices of $I_{W}$ contained in that connected component.

See Figure 4 for an example of an induced interface. Moreover, also Figure 2, which we used as an illustrative example in the introduction to showcase the guessing of multiple edges per cut, highlights an induced interface with $W=L_{3}\setminus L_{1}$ , which is induced by an $s$ - $t$ tour. We remark that the interface induced by $(F,\Phi)$ depends only on $F$ and $I$ , not on $T$ or $\mathcal{C}$ .
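Definition 20 translates directly into a small computation. The sketch below is illustrative only: the function name, the representation of $F$ as a list of vertex pairs, and the choice to drop components of $(W,F[W])$ that contain no vertex of $I_{W}$ are assumptions, not part of the paper.

```python
from collections import Counter


def induced_interface(F, I, W):
    """Interface (I_W, T_W, C_W) induced by (F, Phi) on W; only F and I matter."""
    W = set(W)
    inner = [(u, v) for u, v in F if u in W and v in W]  # F[W]
    # U: endpoints inside W of edges of F that leave W.
    U = {x for u, v in F if (u in W) != (v in W) for x in (u, v) if x in W}
    I_W = (set(I) & W) | U
    deg = Counter(x for e in inner for x in e)
    T_W = {v for v, d in deg.items() if d % 2 == 1}  # odd(F[W])
    # Connected components of (W, F[W]) by merging vertex sets along edges.
    comp = {v: {v} for v in W}
    for u, v in inner:
        if comp[u] is not comp[v]:
            merged = comp[u] | comp[v]
            for x in merged:
                comp[x] = merged
    # One class per component; components without interface vertices are dropped.
    C_W = {frozenset(c & I_W) for c in comp.values()} - {frozenset()}
    return I_W, T_W, C_W


if __name__ == "__main__":
    path = [(1, 2), (2, 3), (3, 4)]  # an s-t path with s = 1 and t = 4
    print(induced_interface(path, {1, 4}, {2, 3}))
```

On the path example, restricting to the two inner vertices makes both of them interface vertices of odd degree in one common class.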

The following lemma shows some basic properties of induced interfaces.

Lemma 21.

Let $G=(V,E,\ell)$ be a weighted graph and $\Phi=(I,T,\mathcal{C})$ an interface of $G$ . Let $F$ be a $\Phi$ -tour in $G$ and $W\subseteq V$ . Let $\Phi_{W}$ be the interface induced by $(F,\Phi)$ on $W$ . Then

(i)

$\Phi_{W}$ is an interface of $G[W]$,

(ii)

$F[W]$ is a $\Phi_{W}$-tour in $G[W]$, and

(iii)

for every $W^{\prime}\subseteq W$ , the interface induced by $(F[W],\Phi_{W})$ on $W^{\prime}$ equals the interface induced by $(F,\Phi)$ on $W^{\prime}$ .

Proof.

Let $\Phi_{W}=(I_{W},T_{W},\mathcal{C}_{W})$ . As in Definition 20 (i), let $U$ be the set of vertices in $W$ that are connected by an edge of $F$ to a vertex in $V\setminus W$ .

To prove (i), we have to observe that $T_{W}\subseteq I_{W}$. (Notice that $|T_{W}|$ is clearly even because $T_{W}=\mathrm{odd}(F[W])$.) Let $u\in T_{W}$. If $F$ contains an edge connecting $u$ with $V\setminus W$, then $u\in U$ and hence $u\in I_{W}$. Otherwise we have $(\delta(u)\cap F)\subseteq E[W]$ and hence $u\in T_{W}=\mathrm{odd}(F[W])$ implies $u\in\mathrm{odd}(F)$. Since $F$ is a $\Phi$-tour, we conclude $u\in T\subseteq I$. Moreover, $u\in T_{W}=\mathrm{odd}(F[W])\subseteq W$, so $u\in I\cap W\subseteq I_{W}$.

To prove (ii), we have to show that $(W,F[W])/I_{W}$ is connected (the other two conditions of Definition 6 trivially hold). Suppose not. Then there is a set $W^{\prime}\subseteq W\setminus I_{W}$ with $W^{\prime}\neq W$ and $F[W]\cap\delta(W^{\prime})=\emptyset$ . This implies, together with $I_{W}=(I\cap W)\cup U$ —which holds by definition of $I_{W}$ —that $W^{\prime}\subseteq V\setminus I$ with $W^{\prime}\neq V$ and $F\cap\delta(W^{\prime})=F[W]\cap\delta(W^{\prime})=\emptyset$ . This contradicts the fact that $(V,F)/I$ is connected, which has to hold because $F$ is a $\Phi$ -tour.

To show (iii), let $(I_{1},T_{1},\mathcal{C}_{1})$ be the interface induced by $(F,\Phi)$ on $W^{\prime}$ and let $(I_{2},T_{2},\mathcal{C}_{2})$ be the interface induced by $(F[W],\Phi_{W})$ on $W^{\prime}$ . Let $U_{1}$ be the set of vertices in $W^{\prime}$ that are connected by an edge of $F$ to a vertex in $V\setminus W^{\prime}$ . Let $U_{2}$ be the set of vertices in $W^{\prime}$ that are connected by an edge of $F[W]$ to a vertex in $W\setminus W^{\prime}$ . Then $U_{1}=(U\cap W^{\prime})\cup U_{2}$ . Therefore,

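$$I_{1}=(I\cap W^{\prime})\cup U_{1}=(I\cap W^{\prime})\cup(U\cap W^{\prime})\cup U_{2}=(I_{W}\cap W^{\prime})\cup U_{2}=I_{2}.$$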

Finally, because $(F[W])[W^{\prime}]=F[W^{\prime}]$ , which follows from $W^{\prime}\subseteq W$ , we have

$$T_{1}=\mathrm{odd}(F[W^{\prime}])=\mathrm{odd}\big((F[W])[W^{\prime}]\big)=T_{2},$$

and also $\mathcal{C}_{1}=\mathcal{C}_{2}$ , because these partitions of $I_{1}=I_{2}$ are both defined with respect to the connected components of $(W^{\prime},F[W^{\prime}])$ , because $(W^{\prime},(F[W])[W^{\prime}])=(W^{\prime},F[W^{\prime}])$ . ∎

Notice that given an interface $\Phi=(I,T,\mathcal{C})$ on a graph $G=(V,E)$ and a $\Phi$ -tour $F\subseteq E$ , then the interface $\Phi_{V}$ induced by $(F,\Phi)$ on $V$ is not necessarily identical to $\Phi$ . More precisely, $\Phi_{V}=(I_{V},T_{V},\mathcal{C}_{V})$ always fulfills $I_{V}=I$ and $T_{V}=T$ . However, $F$ may connect different parts of the partition $\mathcal{C}$ , which, in the interface $\Phi_{V}$ , will then only appear as one set in $\mathcal{C}_{V}$ . See the left-hand side illustration in Figure 4 for such an example where the highlighted $\Phi$ -tour would induce an interface $\Phi_{V}\neq\Phi$ on $V$ because $C_{2}\cup C_{3}$ is a single set in $\mathcal{C}_{V}$ .

In our dynamic program we will combine solutions for different subgraphs with induced interfaces. The following lemma shows sufficient conditions under which this works out.

Lemma 22.

Let $G=(V,E,\ell)$ be a weighted graph. Let $\Phi=(I,T,\mathcal{C})$ be an interface of $G$ and let $F$ be a $\Phi$ -tour in $G$ . Let $W_{0},\ldots,W_{p}$ be a partition of $V$ . For $i\in\{0,\ldots,p\}$ , let $\Phi_{i}=(I_{i},T_{i},\mathcal{C}_{i})$ be the interface induced by $(F,\Phi)$ on $W_{i}$ , and let $F_{i}$ be a $\Phi_{i}$ -tour in $G[W_{i}]$ . Then

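$$F^{\prime}\coloneqq X\stackrel{{\scriptstyle.}}{{\cup}}F_{0}\stackrel{{\scriptstyle.}}{{\cup}}F_{1}\stackrel{{\scriptstyle.}}{{\cup}}\dots\stackrel{{\scriptstyle.}}{{\cup}}F_{p}$$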

is a $\Phi$ -tour in $G$ , where $X\coloneqq F\cap\bigcup_{i=0}^{p}\delta(W_{i})$ .

Proof.

We first show point (i) of Definition 6, i.e. $\mathrm{odd}(F^{\prime})=T$ . For $i\in\{0,\ldots,p\}$ , we have $\mathrm{odd}(F_{i})=T_{i}=\mathrm{odd}(F[W_{i}])$ since $F_{i}$ is a $\Phi_{i}$ -tour. Thus

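$$\mathrm{odd}(F^{\prime})=\mathrm{odd}(X)\bigtriangleup\mathrm{odd}(F_{0})\bigtriangleup\dots\bigtriangleup\mathrm{odd}(F_{p})=\mathrm{odd}(X)\bigtriangleup\mathrm{odd}(F[W_{0}])\bigtriangleup\dots\bigtriangleup\mathrm{odd}(F[W_{p}])=\mathrm{odd}(F)=T,$$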

where $\bigtriangleup$ denotes the symmetric difference; we used $F=X\stackrel{{\scriptstyle.}}{{\cup}}F[W_{0}]\stackrel{{\scriptstyle.}}{{\cup}}\dots\stackrel{{\scriptstyle.}}{{\cup}}F[W_{p}]$ . Before proving that $F^{\prime}$ also fulfills the remaining two properties of a $\Phi$ -tour, we show the following claim. See Figure 5 for an illustration.

Claim 23.

Let $\overline{I}\coloneqq I_{0}\stackrel{{\scriptstyle.}}{{\cup}}\dots\stackrel{{\scriptstyle.}}{{\cup}}I_{p}$ and $a,b\in\overline{I}$ . Suppose $(V,F)$ contains an $a$ - $b$ path. Then $(V,F^{\prime})$ contains an $a$ - $b$ path.

Proof of Claim 23.

Suppose the claim is wrong. Then there exist vertices $a,b\in\overline{I}$ such that $(V,F)$ contains an $a$ - $b$ path $P$ , but $(V,F^{\prime})$ does not. We choose $a$ , $b$ , and $P$ such that the number of edges of $P$ is minimum. Consequently, $P$ contains no vertex of $\overline{I}\setminus\{a,b\}$ . We now distinguish two cases.

Case 1: $X\cap E(P)=\emptyset$ .

Then $P$ is completely contained in a single set $W_{i}$ for some $i\in\{0,\ldots,p\}$ , by definition of $X$ . Hence, $a,b\in W_{i}\cap\overline{I}=I_{i}$ and $a$ and $b$ are connected by the path $P$ in $(W_{i},F[W_{i}])$ . Since $\Phi_{i}$ is the interface induced by $(F,\Phi)$ on $W_{i}$ , the vertices $a$ and $b$ are contained in the same set of the partition $\mathcal{C}_{i}$ of $I_{i}$ . This implies that every $\Phi_{i}$ -tour, and in particular $F_{i}$ , must contain an $a$ - $b$ path, contradicting the assumption that $(V,F^{\prime})$ contains no $a$ - $b$ path.

Case 2: $X\cap E(P)\neq\emptyset$ .

Recall $X=F\cap\bigcup_{i=0}^{p}\delta(W_{i})$ . For $i\in\{0,\ldots,p\}$ , the set $I_{i}$ contains all vertices of $W_{i}$ that are an endpoint of an edge in $X$ , by definition of the induced interface $\Phi_{i}$ . Thus all endpoints of edges in $X$ are contained in $\overline{I}$ . Since $P$ contains no vertex of $\overline{I}\setminus\{a,b\}$ , we have $X\cap E(P)=\{\{a,b\}\}$ , i.e. the path $P$ consists only of a single edge that is contained in $X$ and thus also in $F^{\prime}$ . This contradicts our assumption that $(V,F^{\prime})$ contains no $a$ - $b$ path. ∎

(proof of Claim 23)

To show point (iii) of Definition 6, we need to show that any two vertices $a$ and $b$ that are contained in the same set of the partition $\mathcal{C}$ of $I$ are also contained in the same connected component of $(V,F^{\prime})$ . If $a$ and $b$ are contained in the same set of the partition $\mathcal{C}$ , they are contained in the same connected component of $(V,F)$ because $F$ is a $\Phi$ -tour. Hence by Claim 23 and $I\subseteq\bar{I}$ , also $(V,F^{\prime})$ contains an $a$ - $b$ path.

It remains to show point (ii) of Definition 6, i.e., we prove that $(V,F^{\prime})/I$ is connected. First observe that if $p=0$ , then the result holds because then $F^{\prime}=F_{0}$ is a $\Phi_{0}$ -tour and $I_{0}=I$ . Hence, assume from now on $p>0$ . In this case, we first observe that

$$I_{i}\neq\emptyset\qquad\forall\,i\in\{0,\ldots,p\}.\tag{11}$$

Indeed, because $(V,F)/I$ is connected, which follows from $F$ being a $\Phi$ -tour, we have for each $i\in\{0,\ldots,p\}$ that either $I\cap W_{i}\neq\emptyset$ or $\delta(W_{i})\cap F\neq\emptyset$ , both of which imply $I_{i}\neq\emptyset$ .

To conclude that $(V,F^{\prime})/I$ is connected, we will observe the following two properties, which immediately imply the result:

(a)

For each $i\in\{0,\ldots,p\}$, each vertex $v\in W_{i}$ is connected to a vertex in $I_{i}$ in the graph $(W_{i},F_{i})$.

(b)

All vertices in $\cup_{i=0}^{p}I_{i}$ are connected in $(V,F^{\prime})/I$ .

Notice that (a) is a consequence of (11) and the fact that $(W_{i},F_{i})/I_{i}$ is connected, which holds because $F_{i}$ is a $\Phi_{i}$ -tour in $G[W_{i}]$ . Finally, (b) follows from Claim 23 due to the following. Either $I=\emptyset$ , in which case $(V,F)/I=(V,F)$ is connected—because $F$ is a $\Phi$ -tour—which implies (b) by Claim 23. Or $I\neq\emptyset$ , in which case the connectivity of $(V,F)/I$ implies that in $(V,F)$ each vertex $v\in\cup_{i=0}^{p}I_{i}$ is connected to a vertex of $I$ , again implying (b) by Claim 23. ∎

5.3 The dynamic program

We now expand on the dynamic program used to show Theorem 17. The dynamic program is formally described by Algorithm 4 below. See also Figure 6 for an illustration. Before formally proving that Algorithm 4 indeed returns a $\Phi$ -tour implying Theorem 17, we provide a brief explanatory discussion outlining the core ideas of the algorithm and the line of reasoning we employ to show its correctness.

To this end, let $R$ be a $\Phi$ -tour (unknown to the algorithm), and we will show that the dynamic program returns a $\Phi$ -tour $F\subseteq E$ such that $\ell(F)\leq\beta\cdot\ell(R)-(\beta-1)\cdot\ell(R(\mathcal{L},k))$ . Conceptually, we want to consider the elements of the laminar family $\mathcal{L}(R,k)\subseteq\mathcal{L}$ from smaller to larger ones. Since we do not know the laminar family $\mathcal{L}(R,k)$ , we consider all sets in $\mathcal{L}$ in an arbitrary fixed order of non-decreasing cardinality. We then guess, for every vertex set $L\in\mathcal{L}(R,k)$ , the interface $\Phi_{L}$ induced by $(R,\Phi)$ on $L$ . Now we compute a $\Phi_{L}$ -tour $F_{L,\Phi_{L}}$ in $G[L]$ as follows.

First, we guess the children $L_{1},\dots,L_{p}$ of $L$ in the laminar family $\mathcal{L}(R,k)$ . Then we guess the set $X\subseteq R[L]$ of edges that cross the cuts $\delta(L_{1}),\dots,\delta(L_{p})$ . In other words, we guess all edges in $R(\mathcal{L},k)$ that are contained in $L$ , but not in any child of $L$ . Moreover, for each child $L_{i}$ with $i\in\{1,\ldots,p\}$ , we guess the interface $\Phi_{i}$ induced by $(R,\Phi)$ on $L_{i}$ . Because we consider the elements of the laminar family $\mathcal{L}$ in an order of non-decreasing cardinality, we have already considered $L_{i}$ before considering the current set $L$ . Hence we have already computed some $\Phi_{i}$ -tour $F_{L_{i},\Phi_{i}}$ for all $i\in\{1,\ldots,p\}$ .

We now want to extend the union of these $\Phi_{i}$-tours for all $i\in\{1,\ldots,p\}$ and the set $X$ of edges crossing the boundaries of the children $L_{1},\dots,L_{p}$ to a $\Phi_{L}$-tour in $G[L]$. To this end we define $L_{0}\coloneqq L\setminus\cup_{i=1}^{p}L_{i}$. Then $L_{0},\dots,L_{p}$ is a partition of $L$. We also guess the interface $\Phi_{0}$ that $(R,\Phi)$ induces on $L_{0}$. Then, by Lemma 22 applied to the graph $G[L]$, the union of $X$ and arbitrary $\Phi_{i}$-tours in $G[L_{i}]$ for $i\in\{0,\dots,p\}$ is a $\Phi_{L}$-tour in $G[L]$. Here we use that $\Phi_{i}$ is the interface induced by $(R[L],\Phi_{L})$ on $L_{i}$ for $i\in\{0,\dots,p\}$ (cf. Lemma 21 (iii)). Finally, we use the given algorithm $\mathcal{B}$ to compute a $\beta$-approximation $F_{0}$ of a minimum length $\Phi_{0}$-tour in the subgraph $G[L_{0}]$ and combine $X$, $F_{0}$, and the $\Phi_{i}$-tours $F_{L_{i},\Phi_{i}}$ for $i\in\{1,\ldots,p\}$ to a $\Phi_{L}$-tour $F_{L,\Phi_{L}}$.

In what follows, we now provide a rigorous proof that Algorithm 4 implies Theorem 17 by leveraging the tools from Section 5.2.

5.4 Proof of Theorem 17

We start by showing that Algorithm 4 has indeed the claimed running time, before proving its correctness.

Running time

The running time of Algorithm 4 is dominated by the $5$ -fold nested for-loops. We first determine upper bounds on the number of iterations of each for-loop separately, whenever the algorithm reaches it.

First for-loop:

It goes over all sets in $\overline{\mathcal{L}}$. Because $\overline{\mathcal{L}}$ is a laminar family over $V$, it contains $O(|V|)$ sets.

Second for-loop:

It goes over all interfaces $\Phi_{L}=(I_{L},T_{L},\mathcal{C}_{L})$ of $G[L]$ with $|I_{L}|\leq|I|+k$. There are no more than $(|L|+1)^{|I|+k}\leq(|V|+1)^{|I|+k}$ choices for $I_{L}$. Moreover, there are at most $2^{|I_{L}|}\leq 2^{|I|+k}$ choices for $T_{L}\subseteq I_{L}$. Finally, the number of partitions $\mathcal{C}_{L}$ of $I_{L}$ can be upper bounded by $|I_{L}|^{|I_{L}|}\leq|V|^{|I|+k}$. Overall, the number of iterations of any run of the second for-loop is bounded by $|V|^{O(|I|+k)}$.

Third for-loop:

It iterates over subfamilies of $\overline{\mathcal{L}}$ of disjoint proper subsets of $L$. Because the sets are disjoint, such a family can have at most $\operatorname{width}(\overline{\mathcal{L}})\leq\operatorname{width}(\mathcal{L})+1$ sets, and we can therefore bound the number of these subfamilies by $|\overline{\mathcal{L}}|^{\operatorname{width}(\overline{\mathcal{L}})}=|V|^{O(\operatorname{width}(\mathcal{L}))}$.

Fourth for-loop:

It iterates over edge sets $X\subseteq(\cup_{i=1}^{p}\delta(L_{i}))\cap E[L]$ with $|X\cap\delta(L_{i})|\leq k$ for all $i\in\{1,\ldots,p\}$; the number of such sets can be bounded as follows. Notice that $|X|\leq\sum_{i=1}^{p}|X\cap\delta(L_{i})|\leq p\cdot k\leq\operatorname{width}(\mathcal{L})\cdot k$. Hence, there are at most $(|E|+1)^{k\cdot\operatorname{width}(\mathcal{L})}=|V|^{O(k\cdot\operatorname{width}(\mathcal{L}))}$ options for $X$.

Fifth for-loop:

This loop runs for all $i\in\{0,1,\dots,p\}$ over all interfaces $\Phi_{i}=(I_{i},T_{i},\mathcal{C}_{i})$ of $G[L_{i}]$, where $L_{i}$ and $I_{i}$ are fixed. The number of interfaces $\Phi_{i}$ for a fixed $i\in\{0,\dots,p\}$ is thus bounded by $(2|I_{i}|)^{|I_{i}|}\leq(2|V|)^{|I_{i}|}$ and, hence, the total number of combinations of such interfaces, and thus also the number of iterations each time this for-loop is run, is bounded by

$$\prod_{i=0}^{p}(2|V|)^{|I_{i}|}=(2|V|)^{\sum_{i=0}^{p}|I_{i}|}.\tag{12}$$

Moreover, for $i\in\{1,\ldots,p\}$ , we have $|I_{i}|\leq k+|I_{L}\cap L_{i}|$ , which follows from the fact that each set $I_{i}$ contains the elements of $I_{L}\cap L_{i}$ together with at most $k$ endpoints of edges from $X$ because $|X\cap\delta(L_{i})|\leq k$ . This implies

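$$\sum_{i=1}^{p}|I_{i}|\leq p\cdot k+\sum_{i=1}^{p}|I_{L}\cap L_{i}|\leq k\cdot\operatorname{width}(\mathcal{L})+|I_{L}|\leq|I|+k\cdot(\operatorname{width}(\mathcal{L})+1).\tag{13}$$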

Similarly,

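$$|I_{0}|\leq|I_{L}\cap L_{0}|+|X|\leq|I_{L}|+k\cdot\operatorname{width}(\mathcal{L})\leq|I|+k\cdot(\operatorname{width}(\mathcal{L})+1).\tag{14}$$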

Combining (13) and (14) with (12), we can bound the number of iterations of the fifth for-loop by $|V|^{O(|I|+k\cdot\operatorname{width}(\mathcal{L}))}$ .
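The per-set bound $(2|I_{i}|)^{|I_{i}|}$ used for the fifth for-loop can be sanity-checked by brute force for small sizes. The following is a minimal sketch, assuming that an interface with a fixed vertex set $I_{i}$ is determined by a subset $T_{i}\subseteq I_{i}$ and a partition $\mathcal{C}_{i}$ of $I_{i}$. Counting all such pairs can only overestimate the number of interfaces if the definition imposes further restrictions. The function names are illustrative and not taken from the paper.

```python
def set_partitions(items):
    """Yield every partition of the list `items` as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        # Put `first` into each existing block in turn ...
        for idx in range(len(partition)):
            yield partition[:idx] + [[first] + partition[idx]] + partition[idx + 1:]
        # ... or into a new block of its own.
        yield [[first]] + partition


def count_candidate_interfaces(n):
    """Count pairs (T, C): T a subset and C a partition of an n-element set."""
    num_subsets = 2 ** n
    num_partitions = sum(1 for _ in set_partitions(list(range(n))))
    return num_subsets * num_partitions


# Compare the exact count with the bound (2n)^n for small n.
for n in range(1, 8):
    count = count_candidate_interfaces(n)
    bound = (2 * n) ** n
    assert count <= bound
    print(n, count, bound)
```

The count is $2^{n}$ times the number of set partitions of an $n$-element set, which stays well below $(2n)^{n}$ already for small $n$.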

The most expensive single operation performed by Algorithm 4 is the call to Algorithm $\mathcal{B}$ to find a $\Phi_{0}$ -tour, which, by assumption, takes no more than $f_{\mathcal{B}}(G,|I_{0}|)$ time. Due to the bound on $|I_{0}|\leq|I|+k\cdot(\operatorname{width}(\mathcal{L})+1)$ provided by (14), we have that the total running time is thus indeed bounded by $|V|^{O(|I|+k\cdot\operatorname{width}(\mathcal{L}))}\cdot f_{\mathcal{B}}(G,|I|+k\cdot(\operatorname{width}(\mathcal{L})+1))$ .

Correctness

We now show that, whenever $G$ admits a $\Phi$ -tour, then Algorithm 4 will find a $\Phi$ -tour $F_{V,\Phi}$ with the length guarantee claimed by Theorem 17. So let $R$ be a $\Phi$ -tour. We have to show that $F_{V,\Phi}$ computed by the algorithm is a $\Phi$ -tour (instead of Nil) and that it satisfies

[TABLE]

We prove (15) by showing the following claim from smaller to larger sets $L\in\mathcal{L}(R,k)\cup\{V\}$ .

Claim 24.

Let $L\in\mathcal{L}(R,k)\cup\{V\}$ . If $L=V$ , let $\Phi_{L}=\Phi$ . Otherwise, let $\Phi_{L}=(I_{L},T_{L},\mathcal{C}_{L})$ be the interface induced by $(R,\Phi)$ on $L$ . Then Algorithm 4 computes a $\Phi_{L}$ -tour $F_{L,\Phi_{L}}$ such that

[TABLE]

Observe that the claim immediately implies Theorem 17 by choosing $L=V$ . Hence, it remains to prove the claim.

Proof of Claim 24.

We prove the claim by induction from smaller to larger sets in $\mathcal{L}(R,k)\cup\{V\}$ . Hence, let $L\in\mathcal{L}(R,k)\cup\{V\}$ and assume that the claim holds for sets in $\mathcal{L}(R,k)\cup\{V\}$ of strictly smaller cardinality than $L$ . In particular, it holds for the children $L_{1},\ldots,L_{p}$ of $L$ in the laminar family $\mathcal{L}(R,k)\cup\{V\}$ . (Note that $L$ may also not have any children.) Let $L_{0}\coloneqq L\setminus\cup_{i=1}^{p}L_{i}$ , and for $i\in\{0,\ldots,p\}$ , let $\Phi_{i}=(I_{i},T_{i},\mathcal{C}_{i})$ be the interface induced by $(R,\Phi)$ on $L_{i}$ . By using Lemma 21 (iii) in the case $L\neq V$ , we observe that $\Phi_{i}$ is also the interface induced by $(R[L],\Phi_{L})$ on $L_{i}$ . Let $F_{0}$ be a $\Phi_{0}$ -tour obtained through Algorithm $\mathcal{B}$ . Because $L_{0},L_{1},\ldots,L_{p}$ partitions $L$ , we have by Lemma 22 that

[TABLE]

is a $\Phi_{L}$ -tour, where

[TABLE]

Before discussing that this $\Phi_{L}$ -tour $F$ will indeed be considered by Algorithm 4, we bound its length. First, $\ell(F_{0})\leq\beta\cdot\ell(R[L_{0}])$ because $\mathcal{B}$ is a $\beta$ -approximation algorithm and $R[L_{0}]$ is a $\Phi_{0}$ -tour by Lemma 21 (ii). Moreover, for $i\in\{1,\dots,p\}$ we apply the induction hypothesis to $L_{i}$ and $\Phi_{i}$ , which is possible because $L_{i}\in\mathcal{L}(R,k)$ has strictly smaller cardinality than $L$ . Hence, $F_{L_{i},\Phi_{i}}$ is a $\Phi_{i}$ -tour and fulfills the length bound stated in the claim. We therefore get

[TABLE]

where the last equality follows by observing that

[TABLE]

Due to (17), the $\Phi_{L}$ -tour $F$ fulfills the length bound of the claim. It remains to show that the $\Phi_{L}$ -tour $F$ will indeed be considered by Algorithm 4. For this, we show that the following quantities are considered in the five nested for-loops:

1st for-loop:

considers $L$,

2nd for-loop:

considers the interface $\Phi_{L}=(I_{L},T_{L},\mathcal{C}_{L})$,

3rd for-loop:

considers the children $L_{1},\ldots,L_{p}$ of $L$ in the laminar family $\mathcal{L}(R,k)\cup\{V\}$,

4th for-loop:

considers the set $X$,

5th for-loop:

considers, for $i\in\{0,\dots,p\}$, the interfaces $\Phi_{i}$ induced by $(R[L],\Phi_{L})$ on $L_{i}$.

This run would indeed produce $F$ . All that remains to be shown is that the above five quantities, to be considered within the five nested for-loops, fulfill the conditions set by the respective for-loops:

1st for-loop:

Algorithm 4 considers all sets in $\overline{\mathcal{L}}$ and hence also $L$.

2nd for-loop:

If $L=V$, the interface $\Phi$ is obviously considered. Otherwise $\Phi_{L}=(I_{L},T_{L},\mathcal{C}_{L})$ is the interface induced by $(R,\Phi)$ on $L$, and we have $I_{L}=(I\cap L)\cup U$, where $U$ is the set of vertices in $L$ connected by an edge of $R$ to a vertex in $V\setminus L$. As $L\in\mathcal{L}(R,k)\cup\{V\}$, we have $|\delta(L)\cap R|\leq k$, and hence $|U|\leq k$, which implies $|I_{L}|\leq|I|+k$ and shows that the interface $\Phi_{L}$ is considered in the second for-loop.

3rd for-loop:

We have $\{L_{1},\ldots,L_{p}\}\subseteq\overline{\mathcal{L}}$. Hence, the subfamily $\{L_{1},\ldots,L_{p}\}$ will be considered in the third nested for-loop.

4th for-loop:

The set $X$ we want to consider is given by (16). This set clearly satisfies $X\subseteq\left(\cup_{i=1}^{p}\delta(L_{i})\right)\cap E[L]$ because $R[L]\subseteq E[L]$. Moreover, for each $i\in\{1,\ldots,p\}$ we have

[TABLE]

where the last inequality follows from $L_{i}\in\mathcal{L}(R,k)$. Hence, the set $X$ will be considered during the fourth nested for-loop of the algorithm.

5th for-loop:

For $i\in\{0,\ldots,p\}$ we have that $\Phi_{i}=(I_{i},T_{i},\mathcal{C}_{i})$ is the interface of $G[L_{i}]$ induced by $(R[L],\Phi_{L})$ on $L_{i}$. Hence, $I_{i}=(I_{L}\cap L_{i})\cup U_{i}$, where $U_{i}$ are all vertices in $L_{i}$ connected by an edge of $R[L]$ to a vertex in $L\setminus L_{i}$. We have $R[L]\cap\delta(L_{i})=X\cap\delta(L_{i})$ by our choice of $X$ as described in (16) and because $\{L_{0},\dots,L_{p}\}$ is a partition of $L$. Therefore, $U_{i}$ consists exactly of the endpoints in $L_{i}$ of edges in $X$, and $I_{i}=(I_{L}\cap L_{i})\cup U_{i}$, as desired. Hence, the interfaces $\Phi_{i}$ for $i\in\{0,\dots,p\}$ indeed get considered in the fifth nested for-loop of the algorithm.

∎

As said, Claim 24 implies (15), completing the proof of Theorem 17.

We remark that Claim 24 can be slightly strengthened as follows. The statement also holds when replacing the induced interface $\Phi_{L}=(I_{L},T_{L},\mathcal{C}_{L})$ by any interface $\Phi_{L}^{\prime}=(I_{L},T_{L},\mathcal{C}_{L}^{\prime})$ where $\mathcal{C}_{L}^{\prime}$ is a refinement of $\mathcal{C}_{L}$ . However, we do not need this for our purposes.

6 Proof of the main theorem

We finally prove that the Boosting Theorem (Theorem 10) implies Theorem 1. In fact, we prove a generalization, stated below as Theorem 25, which, for $k=2$ and $\Phi=(I,T,\mathcal{C})$ with $I=T=\{s,t\}$ and $\mathcal{C}=\{\{s,t\}\}$ , yields Theorem 1.

Theorem 25.

Let $\mathcal{A}$ be an $\alpha$ -approximation algorithm for TSP. Then, for any $\varepsilon>0$ and any integer $k$ , there is an $(\alpha+\varepsilon)$ -approximation algorithm for $\Phi$ -TSP restricted to instances with $|I_{\Phi}|\leq k$ that, for any instance $(G,\Phi)$ , calls $\mathcal{A}$ a strongly polynomial number of times on TSP instances defined on subgraphs of $G$ , and performs further operations taking strongly polynomial time.

Proof.

We obtain the result by repeatedly applying the Boosting Theorem, i.e., Theorem 10, to strengthen the $4$ -approximation algorithm for $\Phi$ -TSP guaranteed by Theorem 9 through the $\alpha$ -approximation algorithm for TSP which we assume to exist. Without loss of generality $\varepsilon\leq 1$ . The Boosting Theorem will be repeated $i_{\max}$ many times with error parameter given by $\varepsilon^{\prime}=\nicefrac{{\varepsilon}}{{\alpha}}$ , where

[TABLE]

Notice that $i_{\max}$ is constant, because both $\varepsilon$ and $\alpha$ are fixed.

Let $\beta_{0}:=4$ be the approximation factor for $\Phi$ -TSP before applying the Boosting Theorem. We assume $\alpha\leq 1.5<\beta_{0}$ because Christofides’ algorithm is a strongly polynomial $1.5$ -approximation algorithm for TSP [4, 25]. Let $i\in\{1,\dots,i_{\max}\}$ . After $i$ applications of the Boosting Theorem we obtain an algorithm $\mathcal{B}_{i}$ for $\Phi$ -TSP with approximation ratio at most

[TABLE]

where we used $\varepsilon^{\prime}=\nicefrac{{\varepsilon}}{{\alpha}}$ . We therefore have

[TABLE]

where the last inequality follows by induction on $i$ . Hence,

[TABLE]

Moreover, we define real numbers $k_{i}>0$ for $i\in\{0,\ldots,i_{\max}\}$ to upper bound the size of the interfaces we have to be able to handle after $i$ boosting steps. We want the $\beta_{i_{\max}}$ -approximation algorithm $\mathcal{B}_{i_{\max}}$ , obtained after $i_{\max}$ many applications of the Boosting Theorem, to handle interfaces of size $k_{i_{\max}}\coloneqq k$ . Because $\mathcal{B}_{i_{\max}}$ was obtained by applying the Boosting Theorem to $\mathcal{B}_{i_{\max}-1}$ , we obtain that $\mathcal{B}_{i_{\max}-1}$ needs to handle interfaces of size bounded by $k_{i_{\max}-1}\coloneqq\frac{9}{\varepsilon^{\prime}}\cdot k_{i_{\max}}$ . Repeating this reasoning, we obtain upper bounds $k_{i}$ on the size of the interfaces that we have to handle with $\mathcal{B}_{i}$ that satisfy

[TABLE]

which implies

[TABLE]

Notice that because $i_{\max}$ , $k$ , and $\varepsilon^{\prime}$ are constant, also $k_{0}$ is constant.
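To see how quickly the interface sizes grow across boosting rounds, the recursion can be unrolled numerically. The sketch below assumes the relation $k_{i-1}=\frac{9}{\varepsilon^{\prime}}\cdot k_{i}$ for every $i\in\{1,\dots,i_{\max}\}$, which the text states explicitly for $i=i_{\max}$ and then repeats. The function name and the sample values are hypothetical.

```python
from fractions import Fraction


def interface_size_bounds(k, eps_prime, i_max):
    """Return [k_0, ..., k_{i_max}] with k_{i_max} = k and k_{i-1} = (9 / eps') * k_i.

    Assumption: the same factor 9 / eps' applies in every boosting round.
    """
    factor = Fraction(9) / Fraction(eps_prime)
    bounds = [Fraction(k)]           # start from k_{i_max} = k
    for _ in range(i_max):
        bounds.append(bounds[-1] * factor)
    bounds.reverse()                 # index i now holds k_i
    return bounds


# Hypothetical values: k = 2 (as for Path TSP), eps' = 1/2, three boosting rounds.
bounds = interface_size_bounds(2, Fraction(1, 2), 3)
print([int(b) for b in bounds])
```

Under this assumption $k_{0}=(9/\varepsilon^{\prime})^{i_{\max}}\cdot k$, which is large but independent of the input graph.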

For $i=i_{\max}$ , the following claim implies Theorem 25 because $\beta_{i_{\max}}=\alpha+\varepsilon$ and $i_{\max}$ , $k_{0}$ , and $\varepsilon^{\prime}$ are constant and $\mathcal{B}_{0}$ is a strongly polynomial algorithm.

Claim 26.

Let $c>0$ be the hidden constant in the big- $O$ notation in the runtime bound in Theorem 10. Let $i\in\{0,\dots,i_{\max}\}$ and let $\mathcal{A}$ be the given $\alpha$ -approximation algorithm for TSP. Then there is a $\beta_{i}$ -approximation algorithm $\mathcal{B}_{i}$ for $\Phi$ -TSP that, for every weighted graph $G$ , runs in time at most

[TABLE]

on any instance $(G^{\prime},\Phi)$ , where $G^{\prime}$ is a subgraph of $G$ and $|I_{\Phi}|\leq k_{i}$ .

We prove the claim by induction on $i$ . By Theorem 9 we have a strongly polynomial $\beta_{0}$ -approximation algorithm $\mathcal{B}_{0}$ for $\Phi$ -TSP, implying the claim for $i=0$ .

Now let $i\in\{1,\dots,i_{\max}\}$ . By our induction hypothesis, there exists a $\beta_{i-1}$ -approximation algorithm $\mathcal{B}_{i-1}$ that runs in time $f_{i-1}(G)$ on every weighted graph $G$ and every interface $\Phi$ of $G$ with $|I_{\Phi}|\leq k_{i-1}$ . Applying Theorem 10 to the algorithms $\mathcal{A}$ and $\mathcal{B}_{i-1}$ then yields a $\beta_{i}$ -approximation algorithm $\mathcal{B}_{i}$ for $\Phi$ -TSP that runs on every graph $G$ and every interface $\Phi$ with $|I_{\Phi}|\leq k_{i}$ in time at most

[TABLE]

∎

7 Strongly polynomial 4-approximation algorithm for $\Phi$ -TSP

See Theorem 9.

Proof.

Let $\Phi=(I,T,\mathcal{C})$ be an interface of $G=(V,E)$ . By Lemma 8, we can assume that $\Phi$ is feasible. The main component of our algorithm is to obtain a strongly polynomial $2$ -approximation algorithm for the problem of finding a set (not a multi-set) $F\subseteq E$ of minimum length $\ell(F)$ that satisfies the following three conditions:

(i) $(V,F)/I$ is connected;

(ii) $(V,F)$ connects all vertices within any $C\in\mathcal{C}$;

(iii) each connected component of $(V,F)$ contains an even number of vertices in $T$.

We will achieve this through an application of Jain’s iterative rounding method for the Generalized Steiner Network Problem [13] together with the elegant framework of Frank and Tardos [7] to transform certain polynomial-time algorithms into strongly polynomial ones.

Before we discuss the details of Jain’s method in our setting together with the framework of Frank and Tardos, we first assume that we can indeed find in strongly polynomial time a set $F\subseteq E$ fulfilling (i), (ii), and (iii) of length no larger than twice the length of a shortest edge set fulfilling these three conditions. Because a shortest $\Phi$-tour $\mathrm{OPT}$ must fulfill these conditions, and removing parallel edges does not destroy them, there is a subset of $\mathrm{OPT}$ that contains no parallel edges and satisfies (i), (ii), and (iii). Therefore, $\ell(F)\leq 2\ell(\mathrm{OPT})$.

Due to property (iii), the set $F$ contains a $T$-join $J\subseteq F$, which we can find in linear time through standard techniques. We then return $F\mathbin{\dot{\cup}}(F\setminus J)$, which is indeed a $\Phi$-tour and satisfies

$$\ell\bigl(F\mathbin{\dot{\cup}}(F\setminus J)\bigr)\leq 2\ell(F)\leq 4\ell(\mathrm{OPT}),$$

as desired. It remains to show how to obtain a strongly polynomial $2$ -approximation algorithm for finding a shortest edge set fulfilling (i), (ii), and (iii). We start by showing how an application of Jain’s iterative rounding method leads to a polynomial-time, but not necessarily strongly polynomial, $2$ -approximation algorithm.
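Returning briefly to the step that turns $F$ into a tour: the following minimal sketch finds a $T$-join inside $F$ and doubles the remaining edges. It assumes that $F$ is a simple edge set given as vertex pairs and that every connected component of $(V,F)$ contains an even number of vertices of $T$, as in condition (iii). The $T$-join is taken inside a spanning forest of $(V,F)$, which is one standard technique and not necessarily the one the authors have in mind. All function names are illustrative. The sketch only checks degree parities, not the remaining requirements of a $\Phi$-tour.

```python
from collections import defaultdict


def t_join_in_edge_set(vertices, F, T):
    """Return a T-join J (set of frozenset edges) contained in F.

    Assumes each connected component of (vertices, F) has an even number of T-vertices.
    """
    adj = defaultdict(list)
    for u, v in F:
        adj[u].append(v)
        adj[v].append(u)
    parent, order = {}, []
    for root in vertices:
        if root in parent:
            continue
        parent[root] = None
        stack = [root]
        while stack:  # iterative DFS that builds a spanning forest of (V, F)
            u = stack.pop()
            order.append(u)
            for w in adj[u]:
                if w not in parent:
                    parent[w] = u
                    stack.append(w)
    # odd[v] tracks the parity of the number of T-vertices in the subtree of v.
    odd = {v: (v in T) for v in vertices}
    J = set()
    for v in reversed(order):  # every vertex is handled before its parent
        if parent[v] is not None and odd[v]:
            J.add(frozenset((v, parent[v])))
            odd[parent[v]] = not odd[parent[v]]
    return J


def double_outside_join(F, J):
    """Return the multiset 'F plus a second copy of F minus J' as a list of edges."""
    return list(F) + [e for e in F if frozenset(e) not in J]


def odd_degree_vertices(edges):
    """Return the set of vertices with odd degree in the given edge multiset."""
    deg = defaultdict(int)
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return {v for v, d in deg.items() if d % 2 == 1}


vertices = [1, 2, 3, 4, 5]
F = [(1, 2), (2, 3), (3, 1), (3, 4), (4, 5)]
T = {1, 5}
J = t_join_in_edge_set(vertices, F, T)
tour = double_outside_join(F, J)
assert odd_degree_vertices(tour) == T
```

Since every edge of $F$ is used at most twice, the returned multiset has length at most $2\ell(F)$, matching the first inequality in the bound above.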

To this end, observe that a set $F\subseteq E$ satisfies (i), (ii), and (iii) if and only if

$$|F\cap\delta(S)|\geq f(S)\qquad\forall\,S\subseteq V,\tag{19}$$

where the function $f:2^{V}\to\{0,1\}$ is defined as follows. For $S\subsetneq V$ with $S\neq\emptyset$ , we set $f(S)=1$ if at least one of the following three properties holds:

(a) $S\cap I=\emptyset$;

(b) $\exists\>C\in\mathcal{C}$ s.t. $S\cap C\neq\emptyset$ and $C\setminus S\neq\emptyset$;

(c) $|S\cap T|$ is odd.

Otherwise we set $f(S)=0$ . (In particular, $f(\emptyset)=f(V)=0$ .) Indeed, the properties (a), (b), and (c) are just reformulations of (i), (ii), and (iii), respectively.
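The correspondence between conditions (i), (ii), (iii) and the cut requirements given by $f$ can be checked exhaustively on a toy instance. The sketch below is a brute-force illustration, not part of the paper's algorithm. It assumes the cut condition reads $|F\cap\delta(S)|\geq f(S)$ for all $S\subseteq V$, and it uses an instance with $|T|$ even. The instance and all function names are made up for illustration.

```python
from itertools import combinations


def requirement(S, V, I, T, partition):
    """f(S): 1 if S is nonempty, proper, and of type (a), (b), or (c); else 0."""
    if not S or S == V:
        return 0
    type_a = not (S & I)
    type_b = any((S & C) and (C - S) for C in partition)
    type_c = len(S & T) % 2 == 1
    return 1 if (type_a or type_b or type_c) else 0


def components(V, edges, merged=()):
    """Connected components of (V, edges) after identifying the vertices in `merged`."""
    parent = {v: v for v in V}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    merged = list(merged)
    for u, v in list(edges) + list(zip(merged, merged[1:])):
        parent[find(u)] = find(v)
    groups = {}
    for v in V:
        groups.setdefault(find(v), set()).add(v)
    return list(groups.values())


def satisfies_conditions(V, F, I, T, partition):
    """Check (i), (ii), and (iii) directly."""
    comps = components(V, F)
    cond_i = len(components(V, F, merged=I)) == 1
    cond_ii = all(any(C <= comp for comp in comps) for C in partition)
    cond_iii = all(len(comp & T) % 2 == 0 for comp in comps)
    return cond_i and cond_ii and cond_iii


def satisfies_cut_constraints(V, F, I, T, partition):
    """Check |F ∩ delta(S)| >= f(S) for every nonempty proper subset S of V."""
    vertices = sorted(V)
    for r in range(1, len(vertices)):
        for S in map(frozenset, combinations(vertices, r)):
            crossing = sum(1 for u, v in F if (u in S) != (v in S))
            if crossing < requirement(S, V, I, T, partition):
                return False
    return True


# Toy instance (hypothetical): T is a subset of I, the partition covers I, |T| is even.
V = frozenset({1, 2, 3, 4, 5})
E = [(1, 2), (2, 3), (3, 4), (4, 5), (1, 3), (2, 5)]
I = frozenset({1, 3, 4})
T = frozenset({1, 4})
partition = [frozenset({1, 3}), frozenset({4})]

agree = all(
    satisfies_conditions(V, F, I, T, partition)
    == satisfies_cut_constraints(V, F, I, T, partition)
    for r in range(len(E) + 1)
    for F in combinations(E, r)
)
assert agree
```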

Jain’s technique [13] leads to a polynomial-time $2$ -approximation algorithm for finding a shortest edge set $F$ satisfying (19) if, first, the function $f$ is weakly supermodular, which means

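$$f(X)+f(Y)\leq\max\bigl\{f(X\cup Y)+f(X\cap Y),\;f(X\setminus Y)+f(Y\setminus X)\bigr\}\qquad\forall\,X,Y\subseteq V,\tag{20}$$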

and, second, one can separate over the polytope

$$P\coloneqq\bigl\{y\in[0,1]^{E}:y(\delta(S))\geq f(S)\ \ \forall\,S\subseteq V\bigr\}$$

in polynomial time.

We start by showing (20). Notice that (20) clearly holds if $X\subseteq Y$ , because in this case we have $\{X,Y\}=\{X\cup Y,X\cap Y\}$ . Hence, in what follows, we always assume that $X\setminus Y\neq\emptyset$ and $Y\setminus X\neq\emptyset$ .

Let $f_{a}$ , $f_{b}$ , and $f_{c}$ be the functions from $2^{V}$ to $\{0,1\}$ that take a value of $1$ precisely for sets $S\subsetneq V,S\neq\emptyset$ that satisfy (a), (b), or (c), respectively. Hence, $f(S)=\max\{f_{a}(S),f_{b}(S),f_{c}(S)\}$ . First, one can observe that each of the functions $f_{a}$ , $f_{b}$ , and $f_{c}$ is weakly supermodular. Consider first $f_{a}$ and let $X,Y\subseteq V$ with $X\setminus Y\neq\emptyset$ and $Y\setminus X\neq\emptyset$ . If $f_{a}(X)=1$ then $f_{a}(X\setminus Y)=1$ . Similarly, if $f_{a}(Y)=1$ , then $f_{a}(Y\setminus X)=1$ . Hence, $f_{a}$ satisfies (20). The function $f_{b}$ corresponds to pairwise connectivity requirements and, as shown in [13], is therefore weakly supermodular. The function $f_{c}$ is easily seen to be a so-called proper function, which means that $f_{c}(V)=0$ , $f_{c}$ is symmetric, and $f_{c}(S_{1}\cup S_{2})\leq\max\{f_{c}(S_{1}),f_{c}(S_{2})\}$ for any pair of disjoint sets $S_{1},S_{2}\subseteq V$ . Finally, it is well-known that any proper function is weakly supermodular (see [9]).

We say that a set $S\subsetneq V$ with $S\neq\emptyset$ is of type (a), (b), or (c), if it satisfies (a), (b), or (c), respectively. Because each of the functions $f_{a}$ , $f_{b}$ , and $f_{c}$ is weakly supermodular, the inequality (20) holds whenever the sets $X$ and $Y$ are of the same type, or if $X$ or $Y$ is none of the three types. Hence, it remains to consider sets $X$ and $Y$ of two different types among the types (a), (b), and (c). Let $S_{a},S_{b},S_{c}\subseteq V$ be sets of type (a), (b), and (c), respectively. Thus, we need to show that (20) holds for the three cases where $(X,Y)$ is either $(S_{a},S_{b})$ , $(S_{a},S_{c})$ , or $(S_{b},S_{c})$ . Moreover, let $C\in\mathcal{C}$ be a set such that $S_{b}\cap C\neq\emptyset$ and $C\setminus S_{b}\neq\emptyset$ , which exists because $S_{b}$ is of type (b).

We start by considering the case $(X,Y)=(S_{a},S_{b})$ . As discussed, we assume that $S_{a}\not\subseteq S_{b}$ and $S_{b}\not\subseteq S_{a}$ ; for otherwise, (20) holds trivially. Notice that in this case we have

$$f(S_{a})+f(S_{b})=2=f(S_{a}\setminus S_{b})+f(S_{b}\setminus S_{a})$$

because $(S_{a}\setminus S_{b})\cap I\subseteq S_{a}\cap I=\emptyset$ , as well as $(S_{b}\setminus S_{a})\cap I=S_{b}\cap I$ and $C\subseteq I$ . Hence, $S_{a}\setminus S_{b}$ is of type (a) and $S_{b}\setminus S_{a}$ is of type (b).

Consider now the case $(X,Y)=(S_{a},S_{c})$ . Here, we have

$$f(S_{a})+f(S_{c})=2=f(S_{a}\setminus S_{c})+f(S_{c}\setminus S_{a})$$

because $(S_{a}\setminus S_{c})\cap I\subseteq S_{a}\cap I=\emptyset$ , implying that $S_{a}\setminus S_{c}$ is of type (a), and $|(S_{c}\setminus S_{a})\cap T|=|S_{c}\cap T|$ due to $S_{a}\cap T\subseteq S_{a}\cap I=\emptyset$ , which implies that $S_{c}\setminus S_{a}$ is of type (c).

It remains to consider the case $(X,Y)=(S_{b},S_{c})$ . We first observe that

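$$f(S_{b}\cup S_{c})+f(S_{b}\setminus S_{c})\geq 1\tag{22}$$

$$f(S_{b}\cap S_{c})+f(S_{c}\setminus S_{b})\geq 1\tag{23}$$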

due to the following. Inequality (22) holds because $S_{b}\cup S_{c}$ can be partitioned into $S_{c}$ and $S_{b}\setminus S_{c}$ . Because $|S_{c}\cap T|$ is odd, either $S_{b}\cup S_{c}$ or $S_{b}\setminus S_{c}$ must also have an odd intersection with $T$ and is thus of type (c). Inequality (23) follows from an analogous reasoning using the partition of $S_{c}$ into $S_{b}\cap S_{c}$ and $S_{c}\setminus S_{b}$ . Moreover, we have

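$$f(S_{b}\setminus S_{c})+f(S_{b}\cap S_{c})\geq 1\tag{24}$$

$$f(S_{b}\cup S_{c})+f(S_{c}\setminus S_{b})\geq 1\tag{25}$$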

because $S_{b}$ is of type (b), i.e., $S_{b}\cap C\neq\emptyset$ and $C\setminus S_{b}\neq\emptyset$ . Indeed, even without any assumptions on $S_{c}\subseteq V$ , we have that either $S_{b}\setminus S_{c}$ or $S_{b}\cap S_{c}$ is also of type (b). The same holds for either $S_{b}\cup S_{c}$ or $S_{c}\setminus S_{b}$ . Among the four expressions $f(S_{b}\cup S_{c})$ , $f(S_{b}\cap S_{c})$ , $f(S_{b}\setminus S_{c})$ , and $f(S_{c}\setminus S_{b})$ , consider any one of minimum value and sum up the two inequalities among (22), (23), (24), and (25) containing that expression. This gives the desired result. For example, if $f(S_{b}\setminus S_{c})$ achieves minimum value among the four, then (22) implies $f(S_{b}\cup S_{c})=1$ and (24) implies $f(S_{b}\cap S_{c})=1$ . Hence,

$$f(S_{b})+f(S_{c})=2=f(S_{b}\cup S_{c})+f(S_{b}\cap S_{c}),$$

as desired. This completes the proof that $f$ is weakly supermodular.
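The case analysis above can be cross-checked by brute force on a small instance. The sketch below assumes the usual definition of weak supermodularity, namely $f(X)+f(Y)\leq\max\{f(X\cup Y)+f(X\cap Y),\,f(X\setminus Y)+f(Y\setminus X)\}$ for all $X,Y\subseteq V$, and tests it for every pair of subsets. The toy interface (with $|T|$ even) and the function names are made up for illustration. Passing the check on one instance is of course no substitute for the proof.

```python
from itertools import combinations


def requirement(S, V, I, T, partition):
    """f(S): 1 if S is nonempty, proper, and of type (a), (b), or (c); else 0."""
    if not S or S == V:
        return 0
    type_a = not (S & I)
    type_b = any((S & C) and (C - S) for C in partition)
    type_c = len(S & T) % 2 == 1
    return 1 if (type_a or type_b or type_c) else 0


def all_subsets(V):
    """Yield every subset of V as a frozenset."""
    vertices = sorted(V)
    for r in range(len(vertices) + 1):
        for subset in combinations(vertices, r):
            yield frozenset(subset)


def is_weakly_supermodular(V, f):
    """Check the weak supermodularity inequality for all pairs X, Y of subsets of V."""
    subsets = list(all_subsets(V))
    for X in subsets:
        for Y in subsets:
            lhs = f(X) + f(Y)
            rhs = max(f(X | Y) + f(X & Y), f(X - Y) + f(Y - X))
            if lhs > rhs:
                return False
    return True


# Toy interface (hypothetical): T is a subset of I, the partition covers I, |T| is even.
V = frozenset({1, 2, 3, 4, 5})
I = frozenset({1, 3, 4})
T = frozenset({1, 4})
partition = [frozenset({1, 3}), frozenset({4})]

result = is_weakly_supermodular(V, lambda S: requirement(S, V, I, T, partition))
assert result
```

For contrast, a function such as "$1$ exactly on two-element sets" fails the same check, so the test is not vacuous.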

To apply Jain’s method, it remains to show that we can separate over $P$, and we will in fact give a strongly polynomial algorithm. Given $y\in[0,1]^{E}$, we will either show that all constraints $y(\delta(S))\geq f(S)$ for $S\subseteq V$ are fulfilled or return one of these constraints that is violated. Notice that, because $y\geq 0$, a constraint $y(\delta(S))\geq f(S)$ can only be violated if $f(S)=1$, i.e., $S$ is of type (a), (b), or (c). Hence, we can check these constraints for each type separately.

Checking whether there is a violated constraint of type (a) reduces to finding a minimizer of

$$\min\bigl\{y(\delta(S)):S\subsetneq V,\ S\neq\emptyset,\ S\cap I=\emptyset\bigr\}.$$

This can be solved through a global minimum cut algorithm applied to the graph $G/I$ with edge weights $y$. Indeed, this either leads to a cut $S$ with $S\cap I=\emptyset$ as desired or one where $I\subseteq S$, in which case we can replace $S$ by $V\setminus S$.

Checking whether there is a violated constraint of type (b) reduces to

$$\min\bigl\{y(\delta(S)):S\subseteq V,\ \exists\,C\in\mathcal{C}\text{ with }S\cap C\neq\emptyset\text{ and }C\setminus S\neq\emptyset\bigr\}.$$

This can be solved by performing the following for all $C\in\mathcal{C}$ with $|C|\geq 2$. Number the vertices in $C$ arbitrarily as $C=\{c_{1},\ldots,c_{k}\}$, and solve a minimum $c_{i}$-$c_{i+1}$ cut problem in $G$ with edge weights $y$ for each $i\in\{1,\ldots,k-1\}$. If any of these $s$-$t$ cut problems leads to a cut of value strictly smaller than $1$, then the minimizing cut corresponds to a violated constraint. Otherwise, there is no violated constraint $y(\delta(S))\geq f(S)$ for any set $S$ of type (b).
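The reason consecutive pairs suffice is that any set $S$ splitting $C$ must separate some $c_{i}$ from $c_{i+1}$, and conversely every $c_{i}$-$c_{i+1}$ cut splits $C$. The sketch below illustrates this on a toy instance by comparing both minima. For brevity it enumerates all cuts instead of calling a max-flow routine, so it is only suitable for very small graphs. The instance and all function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def cut_value(S, y):
    """Total weight y(delta(S)) of edges with exactly one endpoint in S."""
    return sum(w for (u, v), w in y.items() if (u in S) != (v in S))


def proper_subsets(V):
    """Yield every nonempty proper subset of V as a frozenset."""
    vertices = sorted(V)
    for r in range(1, len(vertices)):
        for subset in combinations(vertices, r):
            yield frozenset(subset)


def min_splitting_cut(V, y, C):
    """Minimum of y(delta(S)) over all S with S ∩ C and C minus S both nonempty."""
    return min(cut_value(S, y) for S in proper_subsets(V) if (S & C) and (C - S))


def min_consecutive_pair_cut(V, y, ordered_C):
    """Minimum over i of a minimum c_i-c_{i+1} cut (each found by enumeration)."""
    best = None
    for s, t in zip(ordered_C, ordered_C[1:]):
        value = min(cut_value(S, y) for S in proper_subsets(V) if s in S and t not in S)
        best = value if best is None else min(best, value)
    return best


# Toy fractional point y on a 5-cycle with one chord (hypothetical data).
V = frozenset({1, 2, 3, 4, 5})
y = {
    (1, 2): Fraction(1, 2), (2, 3): Fraction(1, 4), (3, 4): Fraction(1, 2),
    (4, 5): Fraction(1, 4), (1, 5): Fraction(1, 4), (2, 4): Fraction(1, 2),
}
C = frozenset({1, 3, 5})

direct = min_splitting_cut(V, y, C)
chained = min_consecutive_pair_cut(V, y, [1, 3, 5])
assert direct == chained
violated = direct < 1  # a value below 1 means some type (b) constraint is violated
```

The numbering of $C$ does not matter: reordering the list passed to `min_consecutive_pair_cut` yields the same minimum.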

Finally, checking whether there is a violated constraint of type (c) reduces to

$$\min\bigl\{y(\delta(S)):S\subseteq V,\ |S\cap T|\text{ is odd}\bigr\}.$$

This is a minimum weight $T$ -cut problem, for which strongly polynomial algorithms are well known (see, e.g., [21]).

In summary, the separation problem over $P$ can be solved in strongly polynomial time, and we can therefore apply Jain’s technique as claimed.

It remains to show that the overall algorithm can be transformed into a strongly polynomial one. This is a consequence of the framework of Frank and Tardos [7] (see also [11, 5, 13] for similar applications). More precisely, the only step that is not strongly polynomial in Jain’s iterative rounding method is solving linear programs on faces of $P$ with objective function given by $\ell$. Notice that the coefficients in the constraints describing $P$ are all $0$ or $1$. Hence, they have small encoding length. For such cases, Frank and Tardos [7] show how $\ell$ can be replaced (in strongly polynomial time) by another objective $\hat{\ell}$ of encoding length polynomial in the dimension $|E|$ of the problem such that the set of optimal solutions over any polytope in $|E|$ dimensions with constraints of small encoding length is the same for the two objectives $\ell$ and $\hat{\ell}$. Hence, one can find an optimal linear programming solution with respect to $\hat{\ell}$ instead of $\ell$, whenever a linear program has to be solved in Jain’s procedure. ∎

8 Conclusions and open problems

We showed that given a polynomial-time $\alpha$ -approximation algorithm for TSP we can obtain a polynomial-time $(\alpha+\varepsilon)$ -approximation algorithm for Path TSP. Feige and Singh [6] proved a similar kind of result for the asymmetric traveling salesman problem (ATSP): given a polynomial-time $\alpha$ -approximation algorithm for ATSP, there is a polynomial-time $(2\alpha+\varepsilon)$ -approximation algorithm for its path version. A natural question is whether our techniques can be used to improve on their result and avoid losing a factor of two in the approximation ratio.

For the Asymmetric Path TSP, the relatively simple dynamic program (sketched in Section 3.1) still works and could be used to reduce to the case where the distance $d(s,t)$ from $s$ to $t$ is not much more than $\nicefrac{{1}}{{2}}\cdot\ell(\mathrm{OPT})$ . (We get $\nicefrac{{1}}{{2}}$ instead of $\nicefrac{{1}}{{3}}$ because a cut can contain two forward edges and one backward edge, and backward edges can belong to many cuts.) To make further progress, we might again try to guess edges also in cuts in which $\mathrm{OPT}$ contains a larger, but constant number of edges. However, even if the distance $d(s,t)$ is very small, the distance $d(t,s)$ from $t$ to $s$ could be large. In this case we do not know how to reduce to ATSP or guess edges of significant length via dynamic programming. Another obstacle is the following: Our approach for reducing Path TSP to TSP required a constant-factor approximation algorithm for $\Phi$ -TSP. Thus, for the asymmetric case one would probably need a suitable constant-factor approximation algorithm for a directed version of $\Phi$ -TSP, and we do not know how to obtain this.

A special case of $\Phi$ -TSP that is more general than Path TSP is the $T$ -tour problem. Here $I_{\Phi}=T_{\Phi}$ is the given set $T$ and $\mathcal{C}_{\Phi}=\{T\}$ . None of the recent improvements for Path TSP seems to extend to general $T$ -tours beyond constant $|T|$ , so Sebő’s $\nicefrac{{8}}{{5}}$ -approximation [22] remains the best that we know. Another question is how well $\Phi$ -TSP can be approximated in general. We showed an approximation ratio of four, but a better ratio might be possible.

Bibliography

[1] H.-C. An, R. Kleinberg, and D. B. Shmoys. Improving Christofides’ algorithm for the $s$-$t$ path TSP. Journal of the ACM, 62(5):34:1–34:28, 2015. Short version appeared in STOC 2012.
[2] A. Blum, S. Chawla, D. R. Karger, T. Lane, A. Meyerson, and M. Minkoff. Approximation algorithms for orienteering and discounted-reward TSP. SIAM Journal on Computing, 37(2):653–670, 2007. Short version appeared in FOCS 2003.
[3] J. Cheriyan, Z. Friggstad, and Z. Gao. Approximating minimum-cost connected $T$-joins. Algorithmica, 72:126–147, 2015. Short version appeared in APPROX/RANDOM 2012.
[4] N. Christofides. Worst-case analysis of a new heuristic for the Travelling Salesman Problem. Technical Report 388, Graduate School of Industrial Administration, Carnegie Mellon University, 1976.
[5] F. Eisenbrand. Integer programming and algorithmic geometry of numbers. In M. Jünger, T. M. Liebling, D. Naddef, G. L. Nemhauser, and W. R. Pulleyblank, editors, 50 Years of Integer Programming 1958–2008, pages 505–559. Springer, 2010.
[6] U. Feige and M. Singh. Improved approximation algorithms for traveling salesperson tours and paths in directed graphs. In Proceedings of the 10th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems (APPROX), pages 104–118, 2007.
[7] A. Frank and É. Tardos. An application of simultaneous diophantine approximation in combinatorial optimization. Combinatorica, 7(1):49–65, 1987.
[8] Z. Gao. An LP-based $\frac{3}{2}$-approximation algorithm for the $s$-$t$ path graph traveling salesman problem. Operations Research Letters, 41:615–617, 2013.
