Reducing Path TSP to TSP
Vera Traub, Jens Vygen, Rico Zenklusen

TL;DR
This paper introduces a reduction from Path TSP to TSP, showing their approximability is essentially equivalent, and applies this to improve approximation algorithms for Graph Path TSP.
Contribution
It provides a black-box reduction from Path TSP to TSP, establishing their approximability equivalence and improving approximation ratios for Graph Path TSP.
Findings
Reduction from Path TSP to TSP with arbitrarily small error
Improved approximation algorithm for Graph Path TSP to 1.4+ε
New techniques including a recursive dynamic program for generalized TSP
Abstract
We present a black-box reduction from the path version of the Traveling Salesman Problem (Path TSP) to the classical tour version (TSP). More precisely, we show that given an -approximation algorithm for TSP, then, for any , there is an -approximation algorithm for the more general Path TSP. This reduction implies that the approximability of Path TSP is the same as for TSP, up to an arbitrarily small error. This avoids future discrepancies between the best known approximation factors achievable for these two problems, as they have existed until very recently. A well-studied special case of TSP, Graph TSP, asks for tours in unit-weight graphs. Our reduction shows that any -approximation algorithm for Graph TSP implies an -approximation algorithm for its path version. By applying our reduction to the -approximation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Reducing Path TSP to TSP
Vera Traub
Research Institute for Discrete Mathematics and Hausdorff Center for Mathematics, University of Bonn, Bonn, Germany. Email: \hrefmailto:[email protected]@or.uni-bonn.de, \hrefmailto:[email protected]@or.uni-bonn.de. This research was initiated while the first two authors visited FIM at ETH Zurich.
Jens Vygen11footnotemark: 1
Rico Zenklusen
Department of Mathematics, ETH Zurich, Zurich, Switzerland. Email: \hrefmailto:[email protected]@math.ethz.ch. Research supported in part by the Swiss National Science Foundation grants 200021_184622 and 200021_165866.
(July 24, 2019)
Abstract
We present a black-box reduction from the path version of the Traveling Salesman Problem (Path TSP) to the classical tour version (TSP). More precisely, we show that given an -approximation algorithm for TSP, then, for any , there is an -approximation algorithm for the more general Path TSP. This reduction implies that the approximability of Path TSP is the same as for TSP, up to an arbitrarily small error. This avoids future discrepancies between the best known approximation factors achievable for these two problems, as they have existed until very recently.
A well-studied special case of TSP, Graph TSP, asks for tours in unit-weight graphs. Our reduction shows that any -approximation algorithm for Graph TSP implies an -approximation algorithm for its path version. By applying our reduction to the -approximation algorithm for Graph TSP by Sebő and Vygen, we obtain a polynomial-time -approximation algorithm for Graph Path TSP, improving on a recent -approximation algorithm of Traub and Vygen.
We obtain our results through a variety of new techniques, including a novel way to set up a recursive dynamic program to guess significant parts of an optimal solution. At the core of our dynamic program we deal with instances of a new generalization of (Path) TSP which combines parity constraints with certain connectivity requirements. This problem, which we call -TSP, has a constant-factor approximation algorithm and can be reduced to TSP in certain cases when the dynamic program would not make sufficient progress.
1 Introduction
The Traveling Salesman Problem (TSP) is one of the most fundamental and well-studied problems in Combinatorial Optimization with a multitude of applications. The common denominator of the numerous variants of the problem is that a set of cities have to be visited on a shortest possible tour. Its best-known variant, often just dubbed TSP, assumes that the distances between the cities are non-negative and symmetric, and the task is to find a tour beginning and ending in the same city. For our purposes it will be useful to work with an undirected graph with edge lengths . While it is often assumed that is complete and fulfills the triangle-inequality, we do not assume this, but allow the tour to visit cities more than once; this is easily seen (and well-known) to be equivalent. So a tour is a closed walk in visiting all vertices.
One of the best-studied extensions of TSP is Path TSP, where in addition to and , a fixed start and end are given and the task is to find a shortest walk from to visiting all vertices.
TSP and its variants are well-known to be -hard (see [20, 14] and references therein) and they have been studied very extensively under the viewpoint of approximation algorithms. While TSP and Path TSP look quite similar, there are fundamental differences. First, there is a classical -approximation algorithm for TSP by Christofides [4] and Serdjukov [25]. This algorithm can easily be adapted to Path TSP, but then has only approximation ratio as Hoogeveen [12] showed in the early 90’s. Second, the integrality gaps of the classical LP relaxations seem to be different. For TSP it is widely believed to be , while for Path TSP it is at least . In both cases there are well-known instances attaining these lower bounds on the integrality gaps, and these instances have unit lengths ( for all ). Therefore the unit-length special cases, Graph TSP and Graph Path TSP, have received considerable attention [1, 19, 16, 17, 24, 8, 26]. In these special cases, the integrality gaps are known to be different: it is at most for Graph TSP and exactly for Graph Path TSP as shown by Sebő and Vygen [24].
While Christofides’ algorithm for TSP is still unbeaten after more than four decades, the approximation ratio for Path TSP has been improved. The first improvement, about 20 years after Hoogeveen [12], was obtained by An, Kleinberg, and Shmoys [1], who devised an elegant -approximation algorithm. A sequence of successive improvements [22, 28, 10, 23, 27] culminated in Zenklusen’s recent -approximation algorithm [30]. Hence, at the moment, the best known approximation ratios for TSP and Path TSP are the same.
For Graph TSP and Graph Path TSP the situation is different. The best known approximation ratios are for Graph TSP [24] and for Graph Path TSP [26]. Since the latter result achieves an approximation ratio better than the integrality gap , one might hope that Graph Path TSP is actually no harder than Graph TSP although the integrality gaps differ.
These recent developments naturally lead to the following general question regarding the relation between the approximability of Path TSP and TSP, which we address in this paper:
Is (Graph) Path TSP substantially harder to approximate than its well-known special case (Graph) TSP?
The answer is no. The main contribution of this paper is to show in a constructive way that Path TSP can be approximated equally well as TSP (up to an arbitrarily small error), by presenting a black-box reduction that transforms approximation algorithms for TSP into ones for Path TSP.
1.1 Our results
The main consequence of our reduction can be summarized as follows.
Theorem 1**.**
Let be an -approximation algorithm for TSP. Then, for any , there is an -approximation algorithm for Path TSP that, for any instance , calls a strongly polynomial number of times on TSP instances defined on subgraphs of , and performs further operations taking strongly polynomial time.
The following two statements are immediate consequences of the above theorem.
Corollary 2**.**
Let and . If there is a (strongly) polynomial-time -approximation algorithm for TSP, then there is a (strongly) polynomial-time -approximation algorithm for Path TSP.
Corollary 3**.**
Let and . If there is a polynomial-time -approximation algorithm for Graph TSP, then there is a polynomial-time -approximation algorithm for Graph Path TSP.
Notice that since Graph (Path) TSP does not involve any large numbers in its input, the notions of polynomial-time and strongly polynomial-time algorithm are identical in this context.
The above statements create a strong link between the approximability of Path TSP and TSP, as well as its graph versions. More precisely, Theorem 1 implies that such a link exists for any class of TSP instances that is closed under taking instances on subgraphs of the original instance (without changing the edge lengths). In particular, any potential future progress on the approximability of (Graph) TSP will immediately carry over to (Graph) Path TSP.
Moreover, Corollary 3 allows us to make significant progress on the currently best approximation factor of for Graph Path TSP [26], through a black-box reduction to the -approximation algorithm for Graph TSP by Sebő and Vygen [24].
Corollary 4**.**
For any , there is a polynomial-time -approximation algorithm for Graph Path TSP.
Our reduction technique is quite versatile. In particular, it applies to a pretty general problem class (the -tour problem with interfaces of bounded size; see Definition 6 and Theorem 25). This includes the -tour problem for bounded (see [24, 3, 22] for a definition) and certain uncapacitated vehicle routing problems such as the one with a fixed number of depots studied in [29].
1.2 Organization of the paper
After some brief preliminaries in Section 2 to fix basic terminology and notation, we provide an overview of our approach in Section 3. Here, we first focus on some key aspects of our approach, which is based on a new way to employ dynamic programming by using a well-chosen auxiliary problem, which we call -TSP. Moreover, we break down the problem of finding a short solution to -TSP into two cases. Combining the two cases, applying the same algorithm recursively, and using a constant-factor approximation algorithm for -TSP on the final recursion level will imply our main reduction result, Theorem 1.
For one case, we show in Section 4 how to reduce the problem to TSP. For the other case, we show in Section 5 how to guess a constant fraction of an optimum solution via dynamic programming. The detailed proof of Theorem 1 is in Section 6. Finally, Section 7 contains a -approximation algorithm for -TSP, which is followed by concluding remarks in Section 8.
2 Preliminaries
A weighted graph is a tuple , where is the vertex set, is the edge set, which we assume w.l.o.g. not to have loops or parallel edges, and denotes the edge lengths. We only consider undirected graphs with non-negative edge lengths and do not always state this explicitly.
We often deal with multi-sets of edges. Although does not contain parallel edges, when we write , we mean a multi-set that can contain several copies of the same edge. We use the operator to designate the multi-union. For a vertex set , we denote by all edges with exactly one endpoint in , and, for , we use as a shorthand for . For a multi-set , we define
[TABLE]
For a vertex set , a -join is a multi-set of edges with .
For a set and a graph , we denote by the graph obtained from by contracting the vertex set . If is empty, we define . For a vertex set and an edge set , we define F[W]\coloneqq\{e\in F:\text{both endpoints of eW}\}. Moreover, denotes the induced subgraph with vertex set and edge set .
Instead of describing tours as walks in , it is convenient to consider them as multi-edge sets. Then a solution to (Graph) TSP is a multi-edge set such that is connected and . A solution to (Graph) Path TSP (with ) is a multi-edge set such that is connected and . From such multi-edge sets—which we also call tours or - tours, respectively—we can easily recover walks by Euler’s theorem.
We often use as an arbitrary (but fixed) optimal solution for the problem in question. Finally, when using the notion of approximation algorithm we will not assume that the algorithm is polynomial time, but state it explicitly if this is the case.
In the interest of clarity and simplicity of the presentation, we did not try to optimize the running times of our procedures. Consequently, we often opt for weaker constants that are easier to obtain.
3 Overview of approach
A key novelty of our approach is a new way to set up a dynamic program to successively strengthen a basic algorithm by combining it with a stronger algorithm for TSP. Every time we apply our dynamic program to obtain a stronger algorithm, we end up with a more difficult problem, slowly approaching problem settings for which it is very challenging to find strong approximation algorithms. However, as we show, by guessing a well-chosen set of edges through the dynamic program, we can limit the recursion depth by a constant, which allows us to stay in a regime where our approach runs efficiently.
To introduce our approach, we start with a brief discussion of a much more basic dynamic programming idea that has previously been used in related settings. We explain the challenges this procedure faces when trying to extend it for our purposes, and outline how we overcome the barriers encountered by existing methods.
3.1 Key challenges and high-level approach
Assume we are given an -approximation algorithm for TSP. Then finding a short Path TSP solution using as an oracle would be easy if the distance between the start and the end was short compared to , i.e., the length of a shortest Path TSP solution . Indeed, in this case the length of a shortest TSP tour and a shortest - tour do not differ by much because any solution of one problem can be converted to a solution of the other one by adding a shortest - path . More precisely, is an - tour and is a TSP tour. Hence, one can simply compute an -approximate TSP tour and a shortest - path and return .
Consequently, a canonical plan would be to try to transform the Path TSP instance to another one with small - distance. It turns out that if the distance between and is very large, then such a reduction is indeed possible by using a technique based on dynamic programming that goes back to Blum, Chawla, Karger, Lane, Meyerson, and Minkoff [2], who studied variants of the Orienteering Problem. Their approach was later extended by Traub and Vygen [26] in the context of Graph Path TSP. This approach allows for reducing to Path TSP instances where the distance between and is at most , for some arbitrarily small constant (see [26]).
However, this technique faces significant barriers when aiming at a reduction to smaller - distances. Thus our approach follows a different path. Nevertheless, it is on a high level inspired by the dynamic program in [2] and later variations and extensions thereof [27, 30, 26, 18]. We therefore start with a brief discussion of this prior technique in the context of Path TSP as used in [26], which will be helpful for the understanding of our approach.
For simplicity of exposition, consider a Graph Path TSP instance and assume that for some constant . The idea is to study the structure of edges of in the many - cuts defined by
[TABLE]
The key observation is that a constant fraction of these - cuts will only contain a single edge of , and, hence, one can try to “guess” these edges through a dynamic program. Indeed, every edge can be in at most one of the cuts . Hence, the average number of -edges in a cut can be no higher than . Using that every - cut must have an odd intersection with , because is an - tour, this implies that a constant fraction of the cuts contains only one edge of . For brevity, we call a cut with a -cut. Assume we knew all edges of contained in -cuts. Then the problem decomposes into smaller Path TSP instances. See Figure 1 for an illustration.
Of course, the -edges in -cuts are not known upfront, and hence, the problem cannot be decomposed so easily. However, one can use a dynamic program to guess the -cuts from left to right, i.e., from to , together with the -edge in each of them. Notice that the sub-instances may not have a short start-to-end distance (e.g. in Figure 1 may be substantially larger than times an optimum --tour in ). As shown in [26], this issue can be addressed by applying the dynamic program recursively to the sub-instances. A key observation in [26] is that a constant recursion depth is enough to ensure that the total cost of the remaining sub-instances becomes negligible compared to the edges guessed through the recursive dynamic program.
Notice that to apply this dynamic programming idea, one crucially needs for some constant . Indeed, otherwise none of the cuts may be a -cut, and no decomposition into smaller Path TSP instances as above is possible. This is the reason why this techniques has only been applied to reduce the start-to-end distance to about a third of .
If we could guess not only -cuts, but also cuts with a larger constant number of -edges, say up to , then we could handle instances with an - distance below . (This idea is inspired by a recent dynamic programming approach in [18] in the context of chain-constrained spanning trees.) Our approach aims at realizing this high-level plan. However, this ostensibly simple algorithmic idea comes with several significant technical hurdles. Most importantly, if we guess more than one edge, the resulting sub-problems are not Path TSP problems anymore. More precisely, if we guess 5 edges in each of two consecutive -cuts, then we have up to 10 interface vertices, i.e., endpoints of guessed edges. See Figure 2 for an example.
An optimum - tour is not necessarily connected inside the vertex set of a sub-problem but every connected component must contain at least one interface vertex. Moreover, needs to connect some of the interface vertices to each other. This induces connectivity constraints for the sub-problem, shown as gray sets in Figure 2. They can also be guessed since the number of interface vertices is constant. Note, however, that we cannot guess the entire connected components, as there are exponentially many options.
Clearly, these sub-problems become significantly more difficult than the original Path TSP problem. Moreover, if we try to apply such a procedure recursively, then the sub-problems can become more complex with each recursion step, because of an increase in the number of interface vertices per sub-problem. Another important issue in a recursive application to our more complex sub-problem is to identify good cuts in which we should guess edges of . Our cuts will result from the dual of a -join problem. They will no longer form a chain, but their laminar structure still allows for a dynamic programming approach.
Moreover, it is not obvious how to reduce the problem to TSP in the case when we cannot guess edges by dynamic programming, and this will involve a careful guessing of further edges of .
We will now describe our approach in detail. We start by defining a new problem class around which our method is centered, and which we call -TSP.
3.2 -TSP
As described above, when guessing edges, the endpoints of those edges play a special role in terms of how we have to connect things. We capture this through the notion of an interface. We define this notion for a general graph below and will typically use it for subgraphs of the instance we are interested in.
Definition 5** (interface).**
An interface of a graph is a triple with
- (i)
, where is even, and 2. (ii)
* is a partition of .*
For an interface of , we denote by its corresponding triple and call its size.
For a given interface, we are interested in finding what we call -tours, which are defined as follows.
Definition 6** (-tour).**
Let be a graph. Let be an interface of . A -tour in is a multi-set with
- (i)
, i.e., is a -join, 2. (ii)
* is connected, and* 3. (iii)
for any , the vertices in lie in the same connected component of .
Figure 3 exemplifies the notation of an interface and a -tour.
The problem we focus on in the following, which we call -TSP, seeks to find a shortest -tour.
Definition 7** (-TSP).**
Given a weighted graph and an interface of , compute a shortest -tour in or decide that none exists. In short,
[TABLE]
Note that for any distinct , by choosing the interface with and , we have that -tours correspond to solutions to - Path TSP. Analogously, for larger sets , one captures the -tour problem (see [24, 3, 22]). Another special case is the uncapacitated vehicle routing problem with a fixed number of depots, for which Xu and Rodrigues [29] gave a -approximation. Here, is the set of depots, , and is the partition into singletons.
Depending on the structure of the graph and the interface , it may be that no -tour exists. We call an interface of feasible if admits a -tour. The existence of a -tour admits the following easy characterization, which can be checked in linear time.
Lemma 8**.**
Let be a weighted graph. Let be an interface of . Then admits a -tour if and only if all of the following conditions hold.
- (i)
Each connected component of contains an even number of vertices in , 2. (ii)
* is connected, and* 3. (iii)
for every , the vertices in lie in the same connected component of .
Proof.
The three mentioned conditions are clearly necessary for to admit a -tour. Moreover, if they are satisfied then, due to (i), there exists a -join , and points (ii) and (iii) guarantee that is a -tour in . ∎
It is crucial for our approach to start with a polynomial-time constant-factor approximation algorithm, which we will successively strengthen as discussed in the following.
A -approximation algorithm for -TSP can be obtained easily as follows. Compute a minimum cost edge set satisfying (i) (-join), a minimum cost edge set satisfying (ii) (spanning tree in ), and a 2-approximation of a minimum cost edge set satisfying (iii) (Steiner forest). Then the disjoint union is a -approximation.
With a little more care we can obtain a -approximation algorithm, using Jain’s iterative rounding framework [13]:
Theorem 9**.**
-TSP admits a strongly polynomial -approximation algorithm.
We defer the proof to Section 7. In the rest of this paper, we will derive a strongly polynomial -approximation algorithm for -TSP instances with bounded interface size, where is the approximation guarantee for TSP; see Theorem 25.
3.3 Iterative improvement of basic algorithm
For a TSP algorithm , we denote for every weighted graph by the maximum runtime of algorithm on any subgraph of . Similarly, for a -TSP algorithm , we denote for every weighted graph and any by the maximum runtime of algorithm on any instance , where is a subgraph of and .
Our plan is to start with the -approximation algorithm for -TSP guaranteed by Theorem 9, and successively improve it through a TSP algorithm with an approximation guarantee . The following Boosting Theorem is the main technical result towards this goal and quantifies the improvement in terms of approximation factor that we are able to obtain in one improvement step.
Theorem 10** (Boosting Theorem).**
Let . Suppose we are given:
- (a)
an -approximation algorithm for TSP, and 2. (b)
a -approximation algorithm for -TSP.
Then there is an algorithm that, for any , any weighted graph , and any feasible interface of , returns a -tour in of length
[TABLE]
in time , where is a shortest -tour in . In particular, the algorithm makes calls to only on instances with interfaces of size bounded by .
To prove Theorem 1, we start with (Theorem 9) and apply Theorem 10 repeatedly, but only a constant number of times. The approximation guarantee decreases until it reaches . All interfaces will have constant size. We defer the details to Section 6.
3.4 Proof outline of Boosting Theorem (Theorem 10)
Theorem 10 is obtained by designing two algorithms to obtain a -tour and then returning the better of the -tours computed by these algorithms. Each of the following two theorems summarizes the guarantee we obtain with one of the two algorithms. After that, Algorithm 1, described below, combines these two sub-procedures to obtain an algorithm that implies Theorem 10.
The following theorem yields a short -tour if the length of a minimum -join is small.
Theorem 11**.**
Let . Assume we are given an -approximation algorithm for TSP. Then, for any , any weighted graph , and any feasible interface of , one can determine a -tour in with
[TABLE]
in time , where is a shortest -join in and is a shortest -tour in .
We will give the proof in Section 4. The next theorem, proven in Section 5, states that we also obtain a short -tour if the length of a minimum -join is large.
Theorem 12**.**
Let . Assume we are given a -approximation algorithm for -TSP. Then, for any , any weighted graph , and any feasible interface of , one can determine a -tour in with
[TABLE]
in time , where is a shortest -join in and is a shortest -tour in .
Lemma 13**.**
Given a weighted graph and a feasible interface of , Algorithm 1 returns a -tour in with the guarantees stated in Theorem 10.
Proof.
The running time guarantee stated in Theorem 10 immediately follows from Theorem 11 and Theorem 12, using .
Let be the -tour returned by Algorithm 1. To show that fulfills the approximation guarantee stated in (1), we distinguish two cases.
If , then the solution will be short enough:
[TABLE]
where we used for the last inequality, which holds because .
If , then the -tour will be short enough:
[TABLE]
thus completing the proof of Lemma 13. ∎
For the proof of Theorem 10, it remains to show Theorem 11 and Theorem 12.
4 Finding a short -tour if there is a short -join
In this section we prove Theorem 11, i.e., how to get a short -tour if the shortest -join has small length compared to .
We start by analyzing a simple algorithm for computing a -tour. However, this simple algorithm will not be sufficient to prove Theorem 11. Thus in a second step, we will refine the algorithm to obtain the desired bound.
Notice that Algorithm 2 always returns an edge set , even if the input is infeasible. We therefore show first that Algorithm 2 does return a -tour whenever it is run with a feasible input.
Lemma 14**.**
The set returned by Algorithm 2 is a -tour if and only if the input is feasible, i.e., admits a -tour.
Proof.
Assume that admits a -tour, which implies by Lemma 8 that the three properties (i), (ii), and (iii) listed in Lemma 8 are fulfilled. Because the set computed in Algorithm 2 consists of TSP tours in each connected component of , the vertex sets of the connected components of and are the same. Because fulfills (ii) and (iii), this implies that also and fulfill these two conditions. Finally, , because and is a -join, which shows that is indeed a -tour. ∎
We now analyze the length of the -tour returned by Algorithm 2.
Lemma 15**.**
Assume we are given an -approximation algorithm for TSP. Let be a weighted graph, a feasible interface of , and a -join in . Then, Algorithm 2 computes a -tour in with
[TABLE]
in time , where is a shortest -tour. Here .
Proof.
First, observe that the running time is indeed as claimed, because the bottleneck of the algorithm is calling for each connected component of ; moreover, the connected components can be found in linear time and there are at most many of them.
To bound , we transform a shortest -tour into a union of TSP solutions, one for each connected component of . Let be a minimal edge set such that the vertex sets of the connected components of and are the same. Observe that the multi-set is a union of TSP solutions, one for each connected component of . Because the set determined in Algorithm 2 was obtained through , which is an -approximation algorithm, we have
[TABLE]
and, hence, the solution returned by the algorithm satisfies
[TABLE]
Moreover, because is a -tour, we have that must be connected, which implies that has at most connected components, and thus
[TABLE]
Together with (2), this leads to the desired guarantee:
[TABLE]
where the inequality follows from , which holds because the edges in connect different connected components of . ∎
We now explain how to refine Algorithm 2 by a guessing step to obtain the guarantees claimed in Theorem 11. If all edges that are not contained in have length at most , Lemma 15 already implies the desired bound. To obtain this property, we delete all edges from that are heavy, i.e. have length at least , and are not contained in . We guess this set of edges to delete as follows. First we guess the set of heavy edges, which can be done in polynomial time by guessing a minimum length edge in . Then we guess the set of heavy edges contained in . Algorithm 3 formalizes this procedure and, as we show next, indeed implies Theorem 11.
Proof of Theorem 11.
We start by observing that the running time of Algorithm 3 is indeed bounded by . There are at most possible edges that are being considered in the outer for-loop. For each of them, there are possible sets considered in the inner for-loop. Thus, there are at most calls to Algorithm 2. Finally, all other operations can be done in time .
We now show that Algorithm 3 returns a -tour with the guarantees claimed by Theorem 11. Let be a shortest -tour and let be the set of heavy edges. Then in some iteration of the outer for-loop we consider the set . Because
[TABLE]
we have , and thus, we consider the set in some iteration of the inner for-loop. As does not contain any edge of , the -tour is a feasible solution of the instance to which we apply Algorithm 2. Moreover, the set contains all heavy edges not contained in and hence by Lemma 15, we obtain
[TABLE]
∎
5 Iterative improvement via dynamic programming
In this section, we show how to prove Theorem 12, i.e., how to obtain a short -tour if the length of a shortest -join is large. Here, our goal is to use dynamic programming to “guess” a significant portion, in terms of total length, of edges used in . Very recently, dynamic programming has become a strong tool in the context of Path TSP, Chain-Constrained Spanning Trees, and related problems [27, 26, 30, 18], leading to the currently best known approximation factors for these settings. The dynamic programming idea we employ combines and extends elements used in these prior dynamic programming techniques.
What we aim to achieve with dynamic programming in the context of -TSP, for some interface of , is the following. We can fix an arbitrary laminar family of subsets of . Our goal is to guess what edges of are crossing the cuts in . Clearly, if contains many edges for some , it seems computationally elusive to guess them. This is the reason why we fix some constant and only guess -edges in cuts for if . We denote the sets inducing these cuts by and the -edges in these cuts by . Formally, for any edge set , we define
[TABLE]
As we discuss in more detail later, guessing the edges can be achieved through a dynamic program that guesses the -edges in the different cuts defined by step by step, from smaller to larger sets in . However, the running time of the propagation step of the dynamic program depends on the number of disjoint sets in that can be contained in some larger set . We capture this dependency through the width of the laminar family (see [18] for a similar use of this notion).
Definition 16** (width of a laminar family).**
The width of a laminar family is the number of minimal sets contained in the family.
Observe that the number of minimal sets of a laminar family bounds the size of any subfamily of disjoint sets.
The following theorem formalizes what we can achieve through our dynamic program, which we present later in detail. Notice that for the algorithm to be efficient, we need to have width bounded by a constant.
Theorem 17**.**
Let . Assume there is a -approximation algorithm for -TSP. Then there is an algorithm that computes for any feasible interface of a weighted graph , any , and any laminar family over , a -tour with
[TABLE]
in time |V|^{O\left(|I|+k\cdot\operatorname{width}(\mathcal{L})\right)}\cdot f_{\mathcal{B}}\left(\big{.}G,|I|+k\cdot(\operatorname{width}(\mathcal{L})+1)\right). In particular, the algorithm calls only on instances with interfaces of size bounded by .
Note that the guarantee stated in (3) for indeed reflects the guessing of the edges in . More precisely, by replacing by in (3), we obtain a -tour with an upper bound on its length that decomposes into two terms:
- (i)
a term , i.e., each edge contributes its length , and 2. (ii)
a term , where the length of each other edge in gets inflated by the approximation factor of the algorithm .
5.1 Finding a suitable laminar family
To make significant progress through Theorem 17, we need to find a laminar family over such that is large. Let be a shortest -join. If is large, then we will construct a family with the property that even for any -join , the length is large. Notice that this implies what we want because is a -join.
This statement is formalized in Lemma 19, which is derived from the dual of the natural linear program to find a shortest -join. We exploit that there is an optimal dual solution whose support corresponds to a laminar family of subsets of , which follows from combinatorial uncrossing arguments.
Lemma 18**.**
Let be a weighted graph. Moreover, let such that contains a -join, and let . Then there is strongly polynomial algorithm that computes a laminar family over and values such that
[TABLE]
Proof.
We start with a classical linear description to find a minimum length -join, based on the dominant of the -join polytope. To this end, let ; these vertex sets induce all -cuts. Then, the following linear program computes the value of a shortest -join (see, e.g., [21]).
[TABLE]
Its dual problem, which is a fractional -cut packing problem, is given below.
[TABLE]
If is an optimum dual solution with laminar support , then and have the desired properties. Here (5) follows from strong duality and (6) follows from .
A strongly polynomial algorithm to compute such an optimal dual solution with laminar support can be obtained by standard techniques: Using the framework of Frank and Tardos [7], one can first find in strongly polynomial time a vector with encoding length polynomial in , and such that the set of optimal solutions of (7) remains the same when replacing by . Moreover, also the set of optimal dual bases remains the same. This allows for solving (7) in strongly polynomial time through the ellipsoid method. To find an optimal dual basis, one can delete all variables from (8) that do not correspond to constraints encountered by the ellipsoid algorithm when solving (8). Now solving the reduced dual problem (8) with instead of allows for finding an optimal dual basis, which, by the result of Frank and Tardos, remains an optimal dual basis for (8) without replacing by . Knowing an optimal dual basis, one can obtain an optimal solution to (8) in strongly polynomial time by solving a linear equation system.
Finally, this solution can be transformed into a laminar one by uncrossing: if and for with and and , then either and belong to or and belong to ; we can increase the dual variables on these two sets by and decrease the dual variables and by the same amount, maintaining a feasible dual solution. Karzanov [15] showed how to obtain a laminar family by a sequence of such uncrossing steps in strongly polynomial time. ∎
We now show that the family from Lemma 18 has the desired properties.
Lemma 19**.**
Let be a weighted graph. Moreover, let such that admits a -join. Then, there is a strongly polynomial algorithm that computes a laminar family over with such that for any -join , and any , we have
[TABLE]
where is a shortest -join in .
Proof.
If , we can simply set because . Otherwise, we compute and as in Lemma 18 and show that has the desired properties. Since every set in must contain an element of , we have .
Let now be a -join, and let . Since is a -join, it has a non-empty intersection with every cut with because of (6). Hence, by (4),
[TABLE]
Again using (4), we moreover obtain
[TABLE]
Combining (5), (9), and (10), we obtain
[TABLE]
as desired. ∎
Finally, Theorem 12 is a direct consequence of Theorem 17 and Lemma 19.
Proof of Theorem 12.
If , we simply call the given -approximation algorithm . Otherwise, let . We apply Lemma 19 to obtain in strongly polynomial time a laminar family over such that
- (i)
\displaystyle\ell(R(\mathcal{L},k))\geq\ell(J)-\frac{1}{k+1}\cdot\ell(R)\geq\ell(J)-\delta\cdot\ell(R)\quad\forall\;\text{TR\subseteq E}, and 2. (ii)
, where the equality follows from the assumption .
Because a shortest -tour is a -join, we have
[TABLE]
which, together with Theorem 17 implies the desired results, i.e., that one can find a -tour in with
[TABLE]
in time
[TABLE]
∎
It remains to derive Theorem 17, which, as mentioned, we show through a dynamic programming approach.
5.2 Combining partial solutions
In the analysis of our dynamic programming algorithm we use the following notion of an induced interface, which allows us to analyze the algorithm with respect to interfaces coming from a shortest -tour .
Definition 20** (induced interface).**
Let be a weighted graph. Let be an interface of , and let be a -tour in . For , the interface induced by on is defined by
- (i)
, where is the set of vertices in that are connected by an edge of to a vertex in , 2. (ii)
, and 3. (iii)
* contains, for each connected component of , a set including all vertices of contained in that connected component.*
See Figure 4 for an example of an induced interface. Moreover, also Figure 2, which we used as an illustrative example in the introduction to showcase the guessing of multiple edges per cut, highlights an induced interface with , which is induced by an - tour. We remark that the interface induced by depends only on and , not on or .
The following lemma shows some basic properties of induced interfaces.
Lemma 21**.**
Let be a weighted graph and an interface of . Let be a -tour in and . Let be the interface induced by on . Then
- (i)
* is an interface of ,* 2. (ii)
* is a -tour in , and* 3. (iii)
for every , the interface induced by on equals the interface induced by on .
Proof.
Let . As in Definition 20 (i), let be the set of vertices in that are connected by an edge of to a vertex in .
To prove (i), we have to observe that . (Notice that we clearly have that is even because Let . If contains an edge connecting with , then and hence . Otherwise we have and hence implies . Since is a -tour, we conclude . Moreover, , so .
To prove (ii), we have to show that is connected (the other two conditions of Definition 6 trivially hold). Suppose not. Then there is a set with and . This implies, together with —which holds by definition of —that with and . This contradicts the fact that is connected, which has to hold because is a -tour.
To show (iii), let be the interface induced by on and let be the interface induced by on . Let be the set of vertices in that are connected by an edge of to a vertex in . Let be the set of vertices in that are connected by an edge of to a vertex in . Then . Therefore,
[TABLE]
Finally, because , which follows from , we have
[TABLE]
and also , because these partitions of are both defined with respect to the connected components of , because . ∎
Notice that given an interface on a graph and a -tour , then the interface induced by on is not necessarily identical to . More precisely, always fulfills and . However, may connect different parts of the partition , which, in the interface , will then only appear as one set in . See the left-hand side illustration in Figure 4 for such an example where the highlighted -tour would induce an interface on because is a single set in .
In our dynamic program we will combine solutions for different subgraphs with induced interfaces. The following lemma shows sufficient conditions under which this works out.
Lemma 22**.**
Let be a weighted graph. Let be an interface of and let be a -tour in . Let be a partition of . For , let be the interface induced by on , and let be a -tour in . Then
[TABLE]
is a -tour in , where .
Proof.
We first show point (i) of Definition 6, i.e. . For , we have since is a -tour. Thus
[TABLE]
where denotes the symmetric difference; we used . Before proving that also fulfills the remaining two properties of a -tour, we show the following claim. See Figure 5 for an illustration.
Claim 23**.**
Let and . Suppose contains an - path. Then contains an - path.
Proof of Claim 23.
Suppose the claim is wrong. Then there exist vertices such that contains an - path , but does not. We choose , , and such that the number of edges of is minimum. Consequently, contains no vertex of . We now distinguish two cases.
Case 1: .
Then is completely contained in a single set for some , by definition of . Hence, and and are connected by the path in . Since is the interface induced by on , the vertices and are contained in the same set of the partition of . This implies that every -tour, and in particular , must contain an - path, contradicting the assumption that contains no - path.
Case 2: .
Recall . For , the set contains all vertices of that are an endpoint of an edge in , by definition of the induced interface . Thus all endpoints of edges in are contained in . Since contains no vertex of , we have , i.e. the path consists only of a single edge that is contained in and thus also in . This contradicts our assumption that contains no - path. ∎
(proof of Claim 23)
To show point (iii) of Definition 6, we need to show that any two vertices and that are contained in the same set of the partition of are also contained in the same connected component of . If and are contained in the same set of the partition , they are contained in the same connected component of because is a -tour. Hence by Claim 23 and , also contains an - path.
It remains to show point (ii) of Definition 6, i.e., we prove that is connected. First observe that if , then the result holds because then is a -tour and . Hence, assume from now on . In this case, we first observe that
[TABLE]
Indeed, because is connected, which follows from being a -tour, we have for each that either or , both of which imply .
To conclude that is connected, we will observe the following two properties, which immediately imply the result:
- (a)
For each , each vertex is connected to a vertex in in the graph . 2. (b)
All vertices in are connected in .
Notice that (a) is a consequence of (11) and the fact that is connected, which holds because is a -tour in . Finally, (b) follows from Claim 23 due to the following. Either , in which case is connected—because is a -tour—which implies (b) by Claim 23. Or , in which case the connectivity of implies that in each vertex is connected to a vertex of , again implying (b) by Claim 23. ∎
5.3 The dynamic program
We now expand on the dynamic program used to show Theorem 17. The dynamic program is formally described by Algorithm 4 below. See also Figure 6 for an illustration. Before formally proving that Algorithm 4 indeed returns a -tour implying Theorem 17, we provide a brief explanatory discussion outlining the core ideas of the algorithm and the line of reasoning we employ to show its correctness.
To this end, let be a -tour (unknown to the algorithm), and we will show that the dynamic program returns a -tour such that . Conceptually, we want to consider the elements of the laminar family from smaller to larger ones. Since we do not know the laminar family , we consider all sets in in an arbitrary fixed order of non-decreasing cardinality. We then guess, for every vertex set , the interface induced by on . Now we compute a -tour in as follows.
First, we guess the children of in the laminar family . Then we guess the set of edges that cross the cuts . In other words, we guess all edges in that are contained in , but not in any child of . Moreover, for each child with , we guess the interface induced by on . Because we consider the elements of the laminar family in an order of non-decreasing cardinality, we have already considered before considering the current set . Hence we have already computed some -tour for all .
We now want to extend the union of these -tours for all and the set of edges crossing the boundaries of the children to a -tour in . To this end we define . Then is a partition of . We also guess the interface that induces on . Then, by Lemma 22 applied to the graph , the union of and arbitrary -tours in for is a -tour in . Here we use that is the interface induced by on for (cf. Lemma 21 (iii)). Finally, we use the given algorithm to compute a -approximation of a minimum length -tour in the subgraph and combine , , and the -tours for to a -tour .
In what follows, we now provide a rigorous proof that Algorithm 4 implies Theorem 17 by leveraging the tools from Section 5.2.
5.4 Proof of Theorem 17
We start by showing that Algorithm 4 has indeed the claimed running time, before proving its correctness.
Running time
The running time of Algorithm 4 is dominated by the -fold nested for-loops. We first determine upper bounds on the number of iterations of each for-loop separately, whenever the algorithm reaches it.
- \raisenthenumi for-loop:
It goes over all sets in . Because is a laminar family over , it contains sets. 2. \raisenthenumi for-loop:
It goes over all interfaces of with . There are no more than choices for choosing . Moreover, there are at most choices for . Finally, the number of partitions of can be upper bounded by . Overall, the number of iterations of any run of the second for-loop is bounded by . 3. \raisenthenumi for-loop:
It iterates over subfamilies of of disjoint proper subsets of . Because the sets are disjoint, such a family can have at most sets, and we can therefore bound the number of these subfamilies by . 4. \raisenthenumi for-loop:
It iterates over edge sets with for all , and can be bounded as follows. Notice that . Hence, there are at most options for . 5. \raisenthenumi for-loop:
This loop runs for all over all interfaces of , where and are fixed. The number of interfaces for a fixed is thus bounded by and, hence, the total number of combinations of such interfaces, and thus also on the number of iterations each time this for-loop is run, is bounded by
[TABLE]
Moreover, for , we have , which follows from the fact that each set contains the elements of together with at most endpoints of edges from because . This implies
[TABLE]
Similarly,
[TABLE]
Combining (13) and (14) with (12), we can bound the number of iterations of the fifth for-loop by .
The most expensive single operation performed by Algorithm 4 is the call to Algorithm to find a -tour, which, by assumption, takes no more than time. Due to the bound on provided by (14), we have that the total running time is thus indeed bounded by .
Correctness
We now show that, whenever admits a -tour, then Algorithm 4 will find a -tour with the length guarantee claimed by Theorem 17. So let be a -tour. We have to show that computed by the algorithm is a -tour (instead of Nil) and that it satisfies
[TABLE]
We prove (15) by showing the following claim from smaller to larger sets .
Claim 24**.**
Let . If , let . Otherwise, let be the interface induced by on . Then Algorithm 4 computes a -tour such that
[TABLE]
Observe that the claim immediately implies Theorem 17 by choosing . Hence, it remains to prove the claim.
Proof of Claim 24.
We prove the claim by induction from smaller to larger sets in . Hence, let and assume that the claim holds for sets in of strictly smaller cardinality than . In particular, it holds for the children of in the laminar family . (Note that may also not have any children.) Let , and for , let be the interface induced by on . By using Lemma 21 (iii) in the case , we observe that is also the interface induced by on . Let be a -tour obtained through Algorithm . Because partitions , we have by Lemma 22 that
[TABLE]
is a -tour, where
[TABLE]
Before discussing that this -tour will indeed be considered by Algorithm 4, we bound its length. First, because is a -approximation algorithm and is a -tour by Lemma 21 (ii). Moreover, for we apply the induction hypothesis to and , which is possible because has strictly smaller cardinality than . Hence, is a -tour and fulfills the length bound stated in the claim. We therefore get
[TABLE]
where the last equality follows by observing that
[TABLE]
Due to (17), the -tour fulfills the length bound of the claim. It remains to show that the -tour will indeed be considered by Algorithm 4. For this, we show that the following quantities are considered in the five nested for-loops:
- \raisenthenumi for-loop:
considers , 2. \raisenthenumi for-loop:
considers the interface , 3. \raisenthenumi for-loop:
considers the children of in the laminar family , 4. \raisenthenumi for-loop:
considers the set , 5. \raisenthenumi for-loop:
considers, for , the interfaces induced by on .
This run would indeed produce . All that remains to be shown is that the above five quantities, to be considered within the five nested for-loops, fulfill the conditions set by the respective for-loops:
- \raisenthenumi for-loop:
Algorithm 4 considers all sets in and hence, also . 2. \raisenthenumi for-loop:
If , the interface is obviously considered. Otherwise is the interface induced by on , and we have , where is the set of vertices in connected by an edge of to a vertex in . As , we have , and hence , which implies and shows that the interface is considered in the second for-loop. 3. \raisenthenumi for-loop:
We have . Hence, the subfamily will be considered in the third nested for-loop. 4. \raisenthenumi for-loop:
The set we want to consider is given by (16). This set clearly satisfies because . Moreover, for each we have
[TABLE]
where the last inequality follows from . Hence, the set will be considered during the fourth nested for-loop of the algorithm. 5. \raisenthenumi for-loop:
For we have that is the interface of induced by on . Hence, , where are all vertices in connected by an edge of to a vertex in . We have by our choice of as described in (16) and because is a partition of . Therefore, , as desired. Hence, the interfaces for indeed get considered in the fifth nested for-loop of the algorithm.
∎
As said, Claim 24 implies (15), completing the proof of Theorem 17.
We remark that Claim 24 can be slightly strengthened as follows. The statement also holds when replacing the induced interface by any interface where is a refinement of . However, we do not need this for our purposes.
6 Proof of the main theorem
We finally prove that the Boosting Theorem (Theorem 10) implies Theorem 1. In fact, we prove a generalization, stated below as Theorem 25, which, for and with and , yields Theorem 1.
Theorem 25**.**
Let be an -approximation algorithm for TSP. Then, for any and any integer , there is an -approximation algorithm for -TSP restricted to instances with that, for any instance , calls a strongly polynomial number of times on TSP instances defined on subgraphs of , and performs further operations taking strongly polynomial time.
Proof.
We obtain the result by repeatedly applying the Boosting Theorem, i.e., Theorem 10, to strengthen the -approximation algorithm for -TSP guaranteed by Theorem 9 through the -approximation algorithm for TSP which we assume to exist. Without loss of generality . The Boosting Theorem will be repeated many times with error parameter given by , where
[TABLE]
Notice that is constant, because both and are fixed.
Let be the approximation factor for -TSP before applying the Boosting Theorem. We assume because Christofides’ algorithm is a strongly polynomial -approximation algorithm for TSP [4, 25]. Let . After applications of the Boosting Theorem we obtain an algorithm for -TSP with approximation ratio at most
[TABLE]
where we used . We therefore have
[TABLE]
where the last inequality follows by induction on . Hence,
[TABLE]
Moreover, we define real numbers for to upper bound the size of the interfaces we have to be able to handle after boosting steps. We want the -approximation algorithm , obtained after many applications of the Boosting Theorem, to handle interfaces of size . Because was obtained by applying the Boosting Theorem to , we obtain that needs to handle interfaces of size bounded by . Repeating this reasoning, we obtain upper bounds on the size of the interfaces that we have to handle with that satisfy
[TABLE]
which implies
[TABLE]
Notice that because , , and are constant, also is constant.
For , the following claim implies Theorem 25 because and , , and are constant and is a strongly polynomial algorithm.
Claim 26**.**
Let be the hidden constant in the big- notation in the runtime bound in Theorem 10. Let and let be the given -approximation algorithm for TSP. Then there is a -approximation algorithm for -TSP that, for every weighted graph , runs in time at most
[TABLE]
on any instance , where is a subgraph of and .
We prove the claim by induction on . By Theorem 9 we have a strongly polynomial -approximation algorithm for -TSP, implying the claim for .
Now let . By our induction hypothesis, there exists a -approximation algorithm that runs in time on every weighted graph and every interface of with . Applying Theorem 10 to the algorithms and then yields a -approximation algorithm for -TSP that runs on every graph and every interface with in time at most
[TABLE]
∎
7 Strongly polynomial 4-approximation algorithm for -TSP
See 9
Proof.
Let be an interface of . By Lemma 8, we can assume that is feasible. The main component of our algorithm is to obtain a strongly polynomial -approximation algorithm for the problem of finding a set (not a multi-set) of minimum length that satisfies the following three conditions:
- (i)
is connected; 2. (ii)
connects all vertices within any ; 3. (iii)
each connected component of contains an even number of vertices in .
We will achieve this through an application of Jain’s iterative rounding method for the Generalized Steiner Network Problem [13] together with the elegant framework of Frank and Tardos [7] to transform certain polynomial-time algorithms into strongly polynomial ones.
Before we discuss the details of Jain’s method in our setting together with the framework of Frank and Tardos, we first assume that we can indeed find in strongly polynomial time a set fulfilling (i), (ii), and (iii) of length no larger than twice the length of a shortest edge set fulfilling these three conditions. Because a shortest -tour must fulfill these conditions, and removing parallel edges does not destroy them, there is a subset of that contains no parallel edges and satisifies (i), (ii), and (iii). Therefore, .
Due to property (iii), the set contains a -join , which we can find in linear time through standard techniques. We then return , which is indeed a -tour and satisfies
[TABLE]
as desired. It remains to show how to obtain a strongly polynomial -approximation algorithm for finding a shortest edge set fulfilling (i), (ii), and (iii). We start by showing how an application of Jain’s iterative rounding method leads to a polynomial-time, but not necessarily strongly polynomial, -approximation algorithm.
To this end, observe that a set satisfies (i), (ii), and (iii) if and only if
[TABLE]
where the function is defined as follows. For with , we set if at least one of the following three properties holds:
- (a)
; 2. (b)
s.t. and ; 3. (c)
is odd.
Otherwise we set . (In particular, .) Indeed, the properties (a), (b), and (c) are just reformulations of (i), (ii), and (iii), respectively.
Jain’s technique [13] leads to a polynomial-time -approximation algorithm for finding a shortest edge set satisfying (19) if, first, the function is weakly supermodular, which means
[TABLE]
and, second, one can separate over the polytope
[TABLE]
in polynomial time.
We start by showing (20). Notice that (20) clearly holds if , because in this case we have . Hence, in what follows, we always assume that and .
Let , , and be the functions from to that take a value of precisely for sets that satisfy (a), (b), or (c), respectively. Hence, . First, one can observe that each of the functions , , and is weakly supermodular. Consider first and let with and . If then . Similarly, if , then . Hence, satisfies (20). The function corresponds to pairwise connectivity requirements and, as shown in [13], is therefore weakly supermodular. The function is easily seen to be a so-called proper function, which means that , is symmetric, and for any pair of disjoint sets . Finally, it is well-known that any proper function is weakly supermodular (see [9]).
We say that a set with is of type (a), (b), or (c), if it satisfies (a), (b), or (c), respectively. Because each of the functions , , and is weakly supermodular, the inequality (20) holds whenever the sets and are of the same type, or if or is none of the three types. Hence, it remains to consider sets and of two different types among the types (a), (b), and (c). Let be sets of type (a), (b), and (c), respectively. Thus, we need to show that (20) holds for the three cases where is either , , or . Moreover, let be a set such that and , which exists because is of type (b).
We start by considering the case . As discussed, we assume that and ; for otherwise, (20) holds trivially. Notice that in this case we have
[TABLE]
because , as well as and . Hence, is of type (a) and is of type (b).
Consider now the case . Here, we have
[TABLE]
because , implying that is of type (a), and due to , which implies that is of type (c).
It remains to consider the case . We first observe that
[TABLE]
due to the following. Inequality (22) holds because can be partitioned into and . Because is odd, either or must also have an odd intersection with and is thus of type (c). Inequality (23) follows from an analogous reasoning using the partition of into and . Moreover, we have
[TABLE]
because is of type (b), i.e., and . Indeed, even without any assumptions on , we have that either or is also of type (b). The same holds for either or . Among the four expressions , , , and , consider any one of minimum value and sum up the two inequalities among (22), (23), (24), and (25) containing that expression. This gives the desired result. For example, if achieves minimum value among the four, then (22) implies and (24) implies . Hence,
[TABLE]
as desired. This completes the proof that is weakly supermodular.
To apply Jain’s method, it remains to show that we can separate over , and we will in fact give a strongly polynomial algorithm. Given , we will either show that all constraints for are fulfilled or return one of these constraints that is violated. Notice that, because , a constraint can only be violated if , i.e., is either of type (a), (b), (c). Hence, we can check these constraints for each type separately.
Whether there is a violated constraints of type (a) reduces to finding a minimizer of
[TABLE]
This can be solved through a global minimum cut algorithm applied to the graph with edge weights . Indeed, this either leads to a cut with as desired or one where , in which we can replace by .
To check whether there is a violated constraint of type (b) reduces to
[TABLE]
This can be solved by performing the following for all with . Number the vertices in arbitrarily , and solve a minimum - cut problem in with edge weights for each . If any of these - cut problems leads to a cut of value strictly smaller than , then the minimizing cut corresponds to a violated constraint. Otherwise, there is no violated constraints of type for any set of type (b).
Finally, checking whether there is a violated constraint of type (c) reduces to
[TABLE]
This is a minimum weight -cut problem, for which strongly polynomial algorithms are well known (see, e.g., [21]).
In summary, the separation problem over can be solved in strongly polynomial time, and we can therefore apply Jain’s technique as claimed.
It remains to show that the overall algorithm can be transformed into a strongly polynomial one. This is a consequence of the framework of Frank and Tardos [7] (see also [11, 5, 13] for similar applications). More precisely, the only step that is not strongly polynomial in Jain’s iterative rounding method is solving linear programs on faces of with objective function given by . Notice that the coefficients in the constraints describing are all [math] or . Hence, they have small encoding length. For such cases, Frank and Tardos [7] show how can be replaced (in strongly polynomial time) by another objective of encoding length polynomial in the dimension of the problem such that the set of optimal solutions over any polytope in dimensions with constraints of small encoding length is the same for the two objectives and . Hence, one can find an optimal linear programming solution with respect to instead of , whenever a linear program has to be solved in Jain’s procedure. ∎
8 Conclusions and open problems
We showed that given a polynomial-time -approximation algorithm for TSP we can obtain a polynomial-time -approximation algorithm for Path TSP. Feige and Singh [6] proved a similar kind of result for the asymmetric traveling salesman problem (ATSP): given a polynomial-time -approximation algorithm for ATSP, there is a polynomial-time -approximation algorithm for its path version. A natural question is whether our techniques can be used to improve on their result and avoid losing a factor of two in the approximation ratio.
For the Asymmetric Path TSP, the relatively simple dynamic program (sketched in Section 3.1) still works and could be used to reduce to the case where the distance from to is not much more than . (We get instead of because a cut can contain two forward edges and one backward edge, and backward edges can belong to many cuts.) To make further progress, we might again try to guess edges also in cuts in which contains a larger, but constant number of edges. However, even if the distance is very small, the distance from to could be large. In this case we do not know how to reduce to ATSP or guess edges of significant length via dynamic programming. Another obstacle is the following: Our approach for reducing Path TSP to TSP required a constant-factor approximation algorithm for -TSP. Thus, for the asymmetric case one would probably need a suitable constant-factor approximation algorithm for a directed version of -TSP, and we do not know how to obtain this.
A special case of -TSP that is more general than Path TSP is the -tour problem. Here is the given set and . None of the recent improvements for Path TSP seems to extend to general -tours beyond constant , so Sebő’s -approximation [22] remains the best that we know. Another question is how well -TSP can be approximated in general. We showed an approximation ratio of four, but a better ratio might be possible.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H.-C. An, R. Kleinberg, and D. B. Shmoys. Improving Christofides’ algorithm for the s 𝑠 s - t 𝑡 t path TSP. Journal of the ACM , 62(5):34:1–34:28, 2015. Short version appeared in STOC 2012.
- 2[2] A. Blum, S. Chawla, D. R. Karger, T. Lane, A. Meyerson, and M. Minkoff. Approximation algorithms for orienteering and discounted-reward TSP. SIAM Journal on Computing , 37(2):653–670, 2007. Short version appeared in FOCS 2003.
- 3[3] J. Cheriyan, Z. Friggstad, and Z. Gao. Approximating minimum-cost connected T 𝑇 {T} -joins. Algorithmica , 72:126–147, 2015. Short version appeared in APPROX/RANDOM 2012.
- 4[4] N. Christofides. Worst-case analysis of a new heuristic for the Travelling Salesman Problem. Technical Report 388, Graduate School of Industrial Administration, Carnegie Mellon University, 1976.
- 5[5] F. Eisenbrand. Integer programming and algorithmic geometry of numbers. In M. Jünger, T. M. Liebling, D. Naddef, G. L. Nemhauser, and W. R. Pulleyblank, editors, 50 Years of Integer Programming 1958–2008 , pages 505–559. Springer, 2010.
- 6[6] U. Feige and M. Singh. Improved approximation algorithms for traveling salesperson tours and paths in directed graphs. In Proceedings of the 10th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems (APPROX) , pages 104–118, 2007.
- 7[7] A. Frank and É. Tardos. An application of simultaneous diophantine approximation in combinatorial optimization. Combinatorica , 7(1):49–65, 1987.
- 8[8] Z. Gao. An LP-based 3 2 3 2 \frac{3}{2} -approximation algorithm for the s 𝑠 s - t 𝑡 t path graph traveling salesman problem. Operations Research Letters , 41:615–617, 2013.
