A Polynomial-Time Approximation Scheme for Facility Location on Planar   Graphs

Vincent Cohen-Addad; Marcin Pilipczuk; Micha{\l} Pilipczuk

arXiv:1904.10680·cs.DS·April 25, 2019

A Polynomial-Time Approximation Scheme for Facility Location on Planar Graphs

Vincent Cohen-Addad, Marcin Pilipczuk, Micha{\l} Pilipczuk

PDF

TL;DR

This paper presents a polynomial-time approximation scheme for the Facility Location problem on planar graphs, achieving near-optimal solutions efficiently and resolving a long-standing open problem in the field.

Contribution

The authors develop the first PTAS for Facility Location on planar graphs, providing a significant advancement in approximation algorithms for this classic problem.

Findings

01

Achieves a (1+ε)-approximate solution in quasi-polynomial time.

02

Resolves the open problem of PTAS existence for Facility Location on planar graphs.

03

Provides a new algorithmic framework for similar geometric and graph problems.

Abstract

We consider the classic Facility Location problem on planar graphs (non-uniform, uncapacitated). Given an edge-weighted planar graph $G$ , a set of clients $C \subseteq V (G)$ , a set of facilities $F \subseteq V (G)$ , and opening costs $open : F \to R_{\geq 0}$ , the goal is to find a subset $D$ of $F$ that minimizes $\sum_{c \in C} min_{f \in D} dist (c, f) + \sum_{f \in D} open (f)$ . The Facility Location problem remains one of the most classic and fundamental optimization problem for which it is not known whether it admits a polynomial-time approximation scheme (PTAS) on planar graphs despite significant effort for obtaining one. We solve this open problem by giving an algorithm that for any $ε > 0$ , computes a solution of cost at most $(1 + ε)$ times the optimum in time $n^{2^{O (ε^{- 2} l o g (1/ ε))}}$ .

Figures2

Click any figure to enlarge with its caption.

Figure 1

Figure 2

Equations369

avgcost (f) = \frac{open ( f ) + \sum _{c \in cluster (f)} dist ( c , f )}{∣ cluster ( f ) ∣} .

avgcost (f) = \frac{open ( f ) + \sum _{c \in cluster (f)} dist ( c , f )}{∣ cluster ( f ) ∣} .

conn (S, R) = c \in S \sum f \in R min dist (c, f) and open (R) = f \in R \sum open (f) .

conn (S, R) = c \in S \sum f \in R min dist (c, f) and open (R) = f \in R \sum open (f) .

cost (D^{'}) = ε^{- 1} \cdot (∣ F ∣ + ∣ C ∣ \cdot ∣ E (G) ∣) .

cost (D^{'}) = ε^{- 1} \cdot (∣ F ∣ + ∣ C ∣ \cdot ∣ E (G) ∣) .

OPT = Θ (ε^{- 1} \cdot (∣ F ∣ + ∣ C ∣ \cdot ∣ E (G) ∣)) .

OPT = Θ (ε^{- 1} \cdot (∣ F ∣ + ∣ C ∣ \cdot ∣ E (G) ∣)) .

I = (G, C, F, ε \cdot open);

I = (G, C, F, ε \cdot open);

cost (D \cup {f}; I) ⩾ cost (D; I)

cost (D \cup {f}; I) ⩾ cost (D; I)

cost (D; I) ⩽ cost (D) = OPT .

cost (D; I) ⩽ cost (D) = OPT .

cost (D; I) ⩽ α \cdot cost (D; I)

cost (D; I) ⩽ α \cdot cost (D; I)

ε \cdot cost (D) ⩽ cost (D; I) .

ε \cdot cost (D) ⩽ cost (D; I) .

cluster (f, R) = {c \in C : dist (c, f) = g \in R min dist (c, g)} .

cluster (f, R) = {c \in C : dist (c, f) = g \in R min dist (c, g)} .

conn (S, D) ⩽ conn (S, D) + ε \cdot open (D) .

conn (S, D) ⩽ conn (S, D) + ε \cdot open (D) .

σ (f) = conn (cluster (f, D) \cap S, D) + ε \cdot open (f) .

σ (f) = conn (cluster (f, D) \cap S, D) + ε \cdot open (f) .

0

0

0 ⩽ f \in D \sum σ (f) - f \in D \sum conn (cluster (f, D) \cap S, D) = (conn (S, D) + ε \cdot open (D)) - conn (S, D) .

0 ⩽ f \in D \sum σ (f) - f \in D \sum conn (cluster (f, D) \cap S, D) = (conn (S, D) + ε \cdot open (D)) - conn (S, D) .

avgcost (f) = \frac{open ( f ) + \sum _{c \in cluster (f)} dist ( c , f )}{∣ cluster ( f ) ∣} .

avgcost (f) = \frac{open ( f ) + \sum _{c \in cluster (f)} dist ( c , f )}{∣ cluster ( f ) ∣} .

cost (D) = f \in D \sum ∣ cluster (f) ∣ \cdot avgcost (f) .

cost (D) = f \in D \sum ∣ cluster (f) ∣ \cdot avgcost (f) .

dist (f, D) > \frac{2}{∣ K ∣} \cdot (open (f) + c \in K \sum dist (c, f)) .

dist (f, D) > \frac{2}{∣ K ∣} \cdot (open (f) + c \in K \sum dist (c, f)) .

a := \frac{open ( f ) + \sum _{c \in K} dist ( c , f )}{∣ K ∣} .

a := \frac{open ( f ) + \sum _{c \in K} dist ( c , f )}{∣ K ∣} .

g \in D min dist (c, g) > 2 a - dist (c, f) .

g \in D min dist (c, g) > 2 a - dist (c, f) .

cost (D \cup {f}) - cost (D)

cost (D \cup {f}) - cost (D)

Far (f)

Far (f)

Close (f)

Far = f \in D ⋃ Far (f) and Close = f \in D ⋃ Close (f) .

Far = f \in D ⋃ Far (f) and Close = f \in D ⋃ Close (f) .

Ψ = conn (Far, D) .

Ψ = conn (Far, D) .

I^{'} = (G, C^{'}, F, open);

I^{'} = (G, C^{'}, F, open);

OPT^{'} ⩽ (1 + 6 α ε) OPT - Ψ

OPT^{'} ⩽ (1 + 6 α ε) OPT - Ψ

cost (R; I) ⩽ cost (R; I^{'}) + Ψ + 3 α ε \cdot OPT .

cost (R; I) ⩽ cost (R; I^{'}) + Ψ + 3 α ε \cdot OPT .

conn (C^{'}, D)

conn (C^{'}, D)

dist (c, x (f)) ⩽ dist (c, f) + dist (f, x (f)) ⩽ 2 ε^{2} \cdot avgcost (f) .

dist (c, x (f)) ⩽ dist (c, f) + dist (f, x (f)) ⩽ 2 ε^{2} \cdot avgcost (f) .

f \in D \sum c \in Close (f) \sum dist (c, x (f))

f \in D \sum c \in Close (f) \sum dist (c, x (f))

f \in D \sum c \in Far (f) \sum dist (c, D) = conn (Far, D) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A Polynomial-Time Approximation Scheme for Facility Location

on Planar Graphs††thanks: This work is a part of projects CUTACOMBS (Ma. Pilipczuk) and TOTAL (Mi. Pilipczuk) that have received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreements No. 714704 and No. 677651, respectively).

Vincent Cohen-Addad

Sorbonne Université, CNRS, Laboratoire d’informatique de Paris 6, LIP6, F-75252 Paris, France [email protected].

Marcin Pilipczuk

Institute of Informatics, University of Warsaw, Poland, [email protected].

Michał Pilipczuk

Institute of Informatics, University of Warsaw, Poland, [email protected].

{textblock}

20(0, 12.5)

{textblock}20(-0.25, 12.9)

We consider the classic Facility Location problem on planar graphs (non-uniform, uncapacitated). Given an edge-weighted planar graph $G$ , a set of clients $C\subseteq V(G)$ , a set of facilities $F\subseteq V(G)$ , and opening costs $\mathsf{open}\colon F\to\mathbb{R}_{\geqslant 0}$ , the goal is to find a subset $D$ of $F$ that minimizes $\sum_{c\in C}\min_{f\in D}\mathrm{dist}(c,f)+\sum_{f\in D}\mathsf{open}(f)$ .

The Facility Location problem remains one of the most classic and fundamental optimization problem for which it is not known whether it admits a polynomial-time approximation scheme (PTAS) on planar graphs despite significant effort for obtaining one. We solve this open problem by giving an algorithm that for any $\varepsilon>0$ , computes a solution of cost at most $(1+\varepsilon)$ times the optimum in time $n^{2^{\mathcal{O}(\varepsilon^{-2}\log(1/\varepsilon))}}$ .

1 Introduction

We study the classic Facility Location objective in planar metrics. Given an edge-weighted planar graph $G$ , together with a set $C$ of vertices called clients, a set $F$ of vertices called candidate facilities, and opening costs $\mathsf{open}\colon F\to\mathbb{R}_{\geqslant 0}$ , the Facility Location problem asks for a subset $D$ of $F$ that minimizes $\sum_{c\in C}\min_{f\in D}\mathrm{dist}(c,f)+\sum_{f\in D}\mathsf{open}(f)$ .

The Facility Location objective is a model of choice when trying to identify the best location for public infrastructures such as hospitals, water tanks or fire stations, or when looking for the best location for warehouses or delivery stores. More recent applications also include prepositionning transportation resources such as bikes, scooters, or cabs. This has made Facility Location a fundamental problem that attracted a lot of attention over the years, both in theoretical computer science and in operations research communities. Since the problem is NP-hard, but one is often satisfied with a near-optimum solution, a large volume of work was devoted to the design of approximation algorithms [1, 7, 10, 8], culminating with the $1.488$ -approximation algorithm by Li [9]. Unfortunately, there is no chance of going much beyond this result, as the problem is known to be NP-hard to approximate within factor better than $1.46$ -approximation [6].

Therefore, a natural route is to consider restricted metrics arising in applications. For example, when the underlying metric of the application is a road networks, the shortest path metric induced by edge-weighted planar graphs is model of choice. Thus, it has been a long standing open question whether Facility Locations admits a polynomial-time approximation scheme on planar graphs. For the uniform case, this was resolved only recently in the affirmative by Cohen-Addad et al. [3] using a simple local search algorithm: given a current set of solution $D$ , determine whether there exists a solution $D^{\prime}$ of better cost that differs from $D$ by at most $\mathcal{O}(1/\varepsilon^{2})$ centers. If so, take $D^{\prime}$ as the new solution and repeat, otherwise output $D$ . However, no such approach is known to work in the nonuniform case and, in fact, it is easy to show that the same local search heuristic would provide a solution of cost at least twice the optimum in the worst-case for planar inputs. This has been a major roadblock since local search is the only technique we know so far for obtaining approximation schemes to min-sum clustering objectives such as the classic $k$ -median, $k$ -means or for uniform facility location, despite a significant effort from the community. In fact, and perhaps surprisingly, such a situation is not unique. For the problem of computing a maximum independent set of pseudo-disks, local search yields a PTAS in the unweighted case and it remains an important open problem as whether a PTAS exists for the weighted case [2]. Thus, obtaining a PTAS for the “weighted” version of some problems seems a much harder task than for the unweighted case.

Our main result is a polynomial-time approximation scheme for the (nonuniform, uncapacitated) Facility Location problem in planar graphs. From a complexity perspective, our result refutes APX-hardness of Facility Location on planar graphs (unless $\mathsf{NP}=\mathsf{P}$ ). From a techniques perspective, we believe that our approach provides a new set of interesting tools, such as for example a “metric-Baker” layering tailored to min-sum objectives (and so of a different nature than the “metric-Baker” used for $k$ -center in recent works [5, 4]). More formally, we show that following theorem.

Theorem 1.

*Given a Facility Location instance $(G,C,F,\mathsf{open})$ , where $G$ is a planar graph, and an accuracy parameter $\varepsilon>0$ , one can in $n^{2^{\mathcal{O}(\varepsilon^{-2}\log(1/\varepsilon))}}$ time compute a solution of cost at most $(1+\varepsilon)$ times the optimum cost. *

We now describe the structure of the proof and our algorithm. To do so conveniently, let us first introduce some terminology: we define for a set $D\subseteq F$ , the connection cost of $D$ is as $\mathsf{conn}(D,C)=\sum_{c\in C}\mathrm{dist}(c,D)$ and the opening cost of $D$ as $\sum_{f\in D}\mathsf{open}(f)$ .

The first step of our algorithm is to compute an $\mathcal{O}(1)$ -approximate solution to a modified input instance where every opening cost is scaled down by a factor of $\varepsilon$ . This solution $\widetilde{D}$ is computed through a greedy procedure and has the following satisfying properties: it is still an $\mathcal{O}(\varepsilon^{-1})$ -approximation to the original instance, and interestingly it reveals a lot of structure of the input graph metric. This structure will be crucial for the proof of Theorem 1. Indeed, the proof of the theorem and our algorithm can be broken into two pieces. The first one consists in a partitioning of the instance into separate, more structured, and almost independent sub-instances (based on the output of the greedy procedure). The second one is a heavily technical dynamic programming algorithm for solving these sub-instances.

To understand how the two pieces articulate, we need to introduce a couple of definitions. Let $f\in\widetilde{D}$ be an opened facility and let $\mathsf{cluster}(f)$ be the set of clients connected to $f$ in the solution $\widetilde{D}$ (i.e., $\mathsf{cluster}(f)$ consists of these clients $c\in C$ for which $f$ is the closest facility from $\widetilde{D}$ ). The average cost of $\mathsf{cluster}(f)$ is defined as:

[TABLE]

At a high-level, the sub-instances will be defined by dividing the metric space according to the clustering induced by $\widetilde{D}$ : putting in the same instance the clusters of $\widetilde{D}$ that have roughly the same $\mathsf{avgcost}$ values. More concretely, a deep analysis of the structure of the approximate solution $\widetilde{D}$ and an intricate Baker-type layering step based on average costs of the facilities of $\widetilde{D}$ yields an instance such that (i) all values of $\mathsf{avgcost}(f)$ for $f\in\widetilde{D}$ are within constant ratio from each other, and (ii) for every $f\in\widetilde{D}$ and $c\in\mathsf{cluster}(f)$ the distance $\mathrm{dist}(c,f)$ is within constant ratio of $\mathsf{avgcost}(f)$ . This is described in Section 2. The second part of the algorithm described in Section 3, consists mainly of our technical dynamic programming algorithm for solving the instances produced in the first part.

2 Reducing to the constant scope of the average costs

Setup.

We shall work with an instance $I=(G,C,F,\mathsf{open})$ where $G$ is a planar edge-weighted graph, $C\subseteq V(G)$ is a set of clients, $F\subseteq V(G)$ is a set of facilities, and $\mathsf{open}\colon F\to\mathbb{R}_{\geqslant 0}$ defines the opening cost of facilities. We shall assume that $G$ is embedded in a sphere and that distances between pairs of vertices of $G$ are finite and pairwise distinct.

For a set of clients $S\subseteq C$ and solution $R\subseteq F$ , by $\mathsf{conn}(S,R)$ we denote the contribution of clients from $S$ to the connection cost of $R$ and by $\mathsf{open}(R)$ the opening cost of $R$ . That is,

[TABLE]

We write $\mathsf{conn}(R)$ for $\mathsf{conn}(C,R)$ . Thus, the cost of $R$ is defined as $\mathsf{cost}(R)=\mathsf{conn}(R)+\mathsf{open}(R)$ . For the remainder of this section, let us fix some optimum solution $D$ in $I$ , and we denote $\mathsf{OPT}=\mathsf{cost}(D)$ .

We consider the accuracy parameter $\varepsilon>0$ ; w.l.o.g. we assume that $\varepsilon<1/10$ . Our goal is to compute a $(1+c\varepsilon)$ -approximate solution for some constant $c$ , so that $\varepsilon$ can be scaled appropriately at the end.

Recall that the considered problem admits a constant-factor approximation for the problem: as shown by Li [9], given an instance of non-uniform facility location one can in polynomial time find a solution of cost at most $\alpha$ times the optimum, where $\alpha=1.488$ . We apply this algorithm to the input instance, obtaining a solution $D^{\prime}\subseteq F$ , and we rescale the distances and the opening costs by the same factor so that

[TABLE]

Note that this means that the total contribution of edges of length less than $1$ and facilities of opening cost less than $1$ to any solution is bounded by $|F|+|C|\cdot|E(G)|\leqslant\varepsilon\cdot\mathsf{cost}(D^{\prime})\leqslant\alpha\varepsilon\cdot\mathsf{OPT}$ , where by $\mathsf{OPT}$ we denote the optimum cost of a solution. Thus, at the cost of paying at most $\varepsilon\cdot\mathsf{cost}(D^{\prime})\leqslant\alpha\varepsilon\cdot\mathsf{OPT}$ we may assume that all edges of length less than $1$ can be traversed for free, hence we may simply contract them. Similarly, we zero the opening costs of all facilities whose opening cost is less than $1$ . Therefore, we assume that all edges in $G$ have weight at least $1$ and all opening costs are either [math] or at least $1$ , while

[TABLE]

Robust approximate solution.

Let us consider the modified instance

[TABLE]

that is, the instance is the same as $I$ but all the opening costs are scaled down by a multiplicative factor of $\varepsilon$ . For a solution $R\subseteq F$ , we denote the cost of $R$ in the instance $\widetilde{I}$ by $\mathsf{cost}(R;\widetilde{I})$ ; note that $\mathsf{cost}(R;\widetilde{I})=\mathsf{conn}(R)+\varepsilon\cdot\mathsf{open}(R)$ . Note that for any $R\subseteq F$ , we have $\varepsilon\cdot\mathsf{cost}(R)\leqslant\mathsf{cost}(R;\widetilde{I})\leqslant\mathsf{cost}(R)$ .

We apply the aforementioned $\alpha$ -approximation algorithm of Li [9] to the instance $\widetilde{I}$ . Furthermore, we will need the following property from the returned approximate solution $\widetilde{D}$ :

[TABLE]

This is trivially true for any $f\in\widetilde{D}$ and to ensure that this holds for every $f$ , we make use of the following greedy process. As long as there exists a facility $f\in F\setminus\widetilde{D}$ that violates the condition above, we add it to $\widetilde{D}$ .

Finally, at the end of this greedy process we remove from $\widetilde{D}$ all facilities that do not serve any client, that is, we remove all facilities $f\in\widetilde{D}$ such that for every $c\in C$ we have $\mathrm{dist}(c,\widetilde{D})<\mathrm{dist}(c,f)$ . Note that this step does not increase the cost of $\widetilde{D}$ and does not break property (2). We now start analysing the structure of $\widetilde{D}$ .

We start by verifying that $\widetilde{D}$ is actually an $\mathcal{O}(\varepsilon^{-1})$ -approximate solution in the original instance.

Lemma 2.

*We have $\mathsf{cost}(\widetilde{D})\leqslant\alpha\varepsilon^{-1}\cdot\mathsf{OPT}$ . *

Proof.

Recalling that $D$ is an optimum solution in $I$ , we have that

[TABLE]

On the other hand, $\widetilde{D}$ is an $\alpha$ -approximate solution in $\widetilde{I}$ , hence

[TABLE]

Finally, as observed before we have

[TABLE]

Combining the above three inequalities yields the claim. $\square$

Let $R\subseteq F$ be a nonempty set of facilities. For a facility $f\in R$ , the $R$ -cluster of $f$ , denoted $\mathsf{cluster}(f,R)$ , is the set of all clients that are served by $f$ in the solution $R$ ; that is:

[TABLE]

Note that since distances between pairs of vertices in $G$ are pairwise different, the $R$ -clusters are pairwise disjoint. In the sequel we will most often work with $\widetilde{D}$ -clusters, hence we use shorthands: a cluster means a $\widetilde{D}$ -cluster and for $f\in\widetilde{D}$ we denote $\mathsf{cluster}(f)=\mathsf{cluster}(f,\widetilde{D})$ .

The next lemma intuitively says the following: for any subset of clients, its connection cost in $\widetilde{D}$ is not much larger than its connection cost $D$ .

Lemma 3.

For any subset of clients $S\subseteq C$ we have

[TABLE]

Proof.

For any $f\in D$ , let

[TABLE]

Observe that the right hand side of the inequality is equal to $\sum_{f\in D}\sigma(f)$ .

Consider modifying the solution $\widetilde{D}$ by opening facility $f$ , for any $f\in D$ , and applying (2). If in solution $\widetilde{D}\cup\{f\}$ we consider directing all clients of $\mathsf{cluster}(f,D)\cap S$ to $f$ and all other clients as in $\widetilde{D}$ , then

[TABLE]

By summing the above inequality through all $f\in D$ , we infer that

[TABLE]

This establishes the claim. $\square$

For any $f\in\widetilde{D}$ , we define the average cost of $f$ as

[TABLE]

Recall here that $\mathsf{cluster}(f)$ is nonempty for each $f\in\widetilde{D}$ as we removed from $\widetilde{D}$ all facilites that do not serve any clients. Moreover, we have

[TABLE]

Next, we prove that for every cluster $\mathsf{cluster}(f)$ for any $f\in\widetilde{D}$ , there is always a facility of the optimum solution $D$ that is not far from $f$ , measured in terms of $\mathsf{avgcost}(f)$ . We first state the lemma in a very abstract form so that we can apply it later in various settings.

Lemma 4.

Let $I=(G,C,F,\mathsf{open})$ be a Non-Uniform Facility Location instance, $D\subseteq F$ a nonempty set of facilities, $K\subseteq C$ a nonempty set of clients, and let $f\notin D$ be a facility. Assume that

[TABLE]

*Then $\mathsf{cost}(D;I)>\mathsf{cost}(D\cup\{f\};I)$ . *

Proof.

Let

[TABLE]

Observe that every client $c\in\mathsf{cluster}(f)$ has to be served in solution $D$ by a facility that is at distance more than $2a$ from $f$ , implying by triangle inequality that

[TABLE]

Take solution $D\cup\{f\}$ . By considering directing all the clients of $K$ to $f$ , and all other clients as in $D$ , we observe that

[TABLE]

This implies that $\mathsf{cost}(D\cup\{f\})<\mathsf{cost}(D)$ as desired. $\square$

Corollary 5.

*For every $f\in\widetilde{D}$ there exists $g\in D$ such that $\mathrm{dist}(f,g)\leqslant 2\cdot\mathsf{avgcost}(f)$ . *

Proof.

The claim is obvious for $f\in D$ . Otherwise, we apply Lemma 4 to the instance $I$ , optimum solution $D$ , the facility $f$ , and $K=\mathsf{cluster}(f)$ . The optimality of $D$ implies then that $\mathrm{dist}(f,D)\leqslant 2\cdot\mathsf{avgcost}(f)$ . $\square$

Concentrating the clusters.

We now analyze every cluster $\mathsf{cluster}(f)$ for $f\in\widetilde{D}$ and show that, at the cost of changing the value of $\mathsf{OPT}$ only slightly, we may assume that all clients of $\mathsf{cluster}(f)$ have connection cost w.r.t. $\widetilde{D}$ not differing much from $\mathsf{avgcost}(f)$ . More precisely, we would like to get rid of clients that are far and close according to the following definition: for $f\in\widetilde{D}$ , let

[TABLE]

Moreover, we define

[TABLE]

Let

[TABLE]

For each $f\in\widetilde{D}$ let us pick any vertex $x(f)$ of $G$ that is at distance exactly $\varepsilon^{2}\cdot\mathsf{avgcost}(f)$ from $f$ (subdividing some edge, if a priori there is none). Construct $C^{\prime}$ from $C$ by performing the following operation for each $f\in\widetilde{D}$ : move all clients of $\mathsf{Far}(f)\cup\mathsf{Close}(f)$ to $x(f)$ , thus placing $|\mathsf{Far}(f)|+|\mathsf{Close}(f)|$ clients at $x(f)$ . Similarly, for $f\in\widetilde{D}$ we define $\mathsf{cluster}^{\prime}(f)$ to be the image of $\mathsf{cluster}(f)$ under this operation, i.e. with clients from $\mathsf{Far}(f)\cup\mathsf{Close}(f)$ replaced as above.

Let

[TABLE]

that is, $I^{\prime}$ is constructed from $I$ by replacing the client set with $C^{\prime}$ . Let $\mathsf{OPT}^{\prime}$ be the minimum cost of a solution in the instance $I^{\prime}$ . We now verify that in order to find near-optimum solution to $I$ , it suffices to find a near-optimum solution to $I^{\prime}$ .

Lemma 6.

We have

[TABLE]

Moreover, for every $R\subseteq F$ , we have

[TABLE]

Proof.

For the first inequality, note that we have

[TABLE]

Let us analyze the last summand first. Observe that for each $f\in\widetilde{D}$ and $c\in\mathsf{Close}(f)$ , we have

[TABLE]

Thus, using (3) we have

[TABLE]

We are left with analyzing the middle summand of the right hand side of (4). Observe that we have

[TABLE]

By Lemma 3 applied to $S=\mathsf{Far}$ , we infer that

[TABLE]

and thus we have

[TABLE]

For every $f\in\widetilde{D}$ , let $g(f)$ be the facility of $D$ that is closest to $f$ . By Corollary 5 we have that

[TABLE]

Now, for every $c\in\mathsf{Far}(f)$ we have

[TABLE]

where in the second step we used $\mathrm{dist}(f,x(f))=\varepsilon^{2}\cdot\mathsf{avgcost}(f)$ . Summing this inequality through all $f\in\widetilde{D}$ and $c\in\mathsf{Far}(f)$ we obtain that

[TABLE]

which means that

[TABLE]

By combining (4), (5), (6), and (7) we infer that

[TABLE]

This establishes the first inequality.

For the second inequality, again by triangle inequality we have

[TABLE]

The last summand has already been estimated in (5), so we are left with analyzing the middle summand. Observe that for each $f\in\widetilde{D}$ and $c\in\mathsf{Far}(f)$ , we have

[TABLE]

where the last inequality follows from $\mathrm{dist}(c,f)\geqslant\varepsilon^{-2}\cdot\mathsf{avgcost}(f)$ and $\mathrm{dist}(f,x(f))=\varepsilon^{2}\cdot\mathsf{avgcost}(f)$ . Thus, we have

[TABLE]

By combining (8), (5), and (9) we obtain that

[TABLE]

This concludes the proof. $\square$

Lemma 6 immediately implies the following: any near-optimum solution to $I^{\prime}$ is also a near-optimum solution to $I$ .

Corollary 7.

For any $R\subseteq F$ , if

[TABLE]

for some $\gamma,\delta\geqslant 0$ , then

[TABLE]

Proof.

First, note that $\mathsf{OPT}^{\prime}\leqslant(1+5\alpha\varepsilon)\mathsf{OPT}-\Psi\leqslant 2\cdot\mathsf{OPT}$ . Then we have

[TABLE]

as claimed. $\square$

Thus, by Corollary 7 we may focus on finding a near-optimum solution to instance $I^{\prime}$ instead of $I$ . The instance $I^{\prime}$ , however, has the following concentration property that will be useful later on: for every $f\in\widetilde{D}$ and $c\in\mathsf{cluster}^{\prime}(f)$ , we have

[TABLE]

Finally, we check that solution $\widetilde{D}$ is still not too expensive in the instance $I^{\prime}$ .

Lemma 8.

For every $f\in\widetilde{D}$ it holds that

[TABLE]

In total, we have

[TABLE]

Proof.

Recall that

[TABLE]

Thus, to show (10), it suffices to prove that

[TABLE]

For each $c\in\mathsf{Far}(f)$ , we have $\mathrm{dist}(x(f),f)=\varepsilon^{2}\cdot\mathsf{avgcost}(f)$ and $\mathrm{dist}(c,f)\geqslant\varepsilon^{-2}\cdot\mathsf{avgcost}(f)$ , hence $\mathrm{dist}(x(f),f)-\mathrm{dist}(c,f)\leqslant 0$ . On the other hand $\mathrm{dist}(x(f),f)=\varepsilon^{2}\cdot\mathsf{avgcost}(f)$ , hence for each $c\in\mathsf{Close}(f)$ we have $\mathrm{dist}(x(f),f)-\mathrm{dist}(c,f)\leqslant\varepsilon^{2}\cdot\mathsf{avgcost}(f)$ . This proves (10).

By summing (10) over all $f\in\widetilde{D}$ we obtain that

[TABLE]

as claimed. $\square$

Note that in Lemma 8, the left hand side of (11) is lower bounded by $\mathsf{cost}(\widetilde{D},I^{\prime})$ , but is not necessarily equal to it, as the clients of each cluster $\mathsf{cluster}^{\prime}(f)$ are assigned to $f$ , which may cease to be the closest facility after moving a client.

Layering on magnitudes of the average cost.

We now work with the instance $I^{\prime}$ . The goal is to use the obtained properties of clusters to break the instance into several independent ones at the cost of additionally paying $\varepsilon\mathsf{OPT}$ , so that each of the instances concerns only clients from clusters with average cost of the same magnitude. This is because such instances can be solved efficiently using the following crucial lemma, whose proof will be given later.

Lemma 9.

Suppose we are given an instance $J=(G,C,F,\mathsf{open})$ of Non-uniform Facility Location where $G$ is planar. Moreover, we are provided a real $r>1$ and a set of facilities $D^{\circ}\subseteq F$ such that the clients of $C$ can be partitioned into nonempty clusters $(\mathsf{cluster}(f))_{f\in D^{\circ}}$ so that the following properties hold for each $f\in D^{\circ}$ :

•

$1\leqslant\mathrm{dist}(c,f)\leqslant r$ * for each $c\in\mathsf{cluster}(f)$ ; and*

•

$\mathsf{open}(f)+\sum_{c\in\mathsf{cluster}(f)}\mathrm{dist}(c,f)\leqslant|\mathsf{cluster}(f)|\cdot r$ .

*Then, given $\varepsilon>0$ , one can in time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ compute a solution to $J$ with cost at most $(1+\varepsilon)\mathsf{OPT}(J)+\varepsilon\cdot M$ , where $M=\mathsf{open}(D^{\circ})+\sum_{f\in D^{\circ}}\sum_{c\in\mathsf{cluster}(f)}\mathrm{dist}(c,f)$ . *

Breaking into separate instances that can be treated using Lemma 9 will be done using layering on the levels of magnitude of average costs of facilities from $\widetilde{D}$ . While the layering itself will be quite standard, the proof of the separation property between the instances will be quite non-trivial and will require the fine understanding of properties of $\widetilde{D}$ that we have developed.

Let us partition the facilities of $\widetilde{D}$ into layers $(L_{i})_{i\in\mathbb{Z}}$ , where $L_{i}$ comprises facilities $f\in\widetilde{D}$ satisfying

[TABLE]

For $i\in\mathbb{Z}$ , let

[TABLE]

By Lemma 8, we have

[TABLE]

Let $q=\lceil\varepsilon^{-2}\rceil$ . Pick $a\in\{0,1,\ldots,q-1\}$ such that $\sum_{i\colon i\equiv a\bmod q}\ell_{i}$ is minimum. Then by (8) and the fact that $q\geqslant\varepsilon^{-2}$ we infer that

[TABLE]

Now, define

[TABLE]

Set $W_{j}$ will be called the $j$ -ring. It follows that $S$ and $(W_{j})_{j\in\mathbb{Z}}$ form a partition of $\widetilde{D}$ .

Intuitively, the idea is to construct a near optimum solution by buying all the facilities of $S$ and using them to serve all clients served by them in $\widetilde{D}$ (the cost of this is bounded by $2\alpha\varepsilon\cdot\mathsf{OPT}$ by (13)), and constructing an instance for each nonempty ring $W_{j}$ that is subsequently approximated using Lemma 9. However, we need to prepare those instances carefully so that they can be solved separately.

To this end, we heavily rely on Lemma 4 that more or less says that one needs to open a facility within $2\cdot\mathsf{avgcost}(f)$ of $f$ for every $f\in\widetilde{D}$ . This, together with the exponential scale of average costs, implies that while focusing on the ring $W_{j}$ we do not need to understand how the solution to rings $W_{j^{\prime}}$ for $j^{\prime}>j$ looks like (namely, what are the precise locations of the facilities); instead, we just put one zero-cost facility at every $f\in W_{j^{\prime}}$ that mimicks the closest opened facility, this will be satisfying up to losing a factor $(1+\varepsilon)$ .

Let us now proceed with formal details. Denote

[TABLE]

For every $j\in\mathbb{Z}$ we create the following instance $J_{j}=(G,C_{j},F_{j},\mathsf{open}_{j})$ :

•

The graph $G$ is the graph from the original instance;

•

$C_{j}=\bigcup_{f\in W_{j}}\mathsf{cluster}^{\prime}(f)$ , that is, all clients in clusters of facilities from the ring $W_{j}$ ;

•

$F_{j}=F$ are all facilities from the input;

•

$\mathsf{open}_{j}(f)=0$ for every $f\in W_{j^{\prime}}$ with $j^{\prime}>j$ and every $f\in S$ , and $\mathsf{open}_{j}(f)=\mathsf{open}(f)$ otherwise.

Note that the sets $(C_{j})_{j\in\mathbb{Z}}$ are pairwise disjoint and together with $C_{S}$ form a partition $C$ . For every $j\in\mathbb{Z}$ let $D^{\mathrm{free}}_{j}=S\cup\bigcup_{j^{\prime}>j}W_{j^{\prime}}$ be the set of facilities $f$ with $\mathsf{open}_{j}(f)$ redefined to [math] in the definition of $J_{j}$ .

Observe also that if $W_{j}=\emptyset$ , then $C_{j}=\emptyset$ : the instance is trivial and it admits the empty set as the optimum solution. The algorithm does not really need to construct these instances (and thus in fact constructs at most $n$ instances $J_{j}$ ), but we prefer to define them for the sake of clarity of notation. We henceforth call the instances $J_{j}$ trivial if $W_{j}=\emptyset$ and nontrivial otherwise.

We now verify that it suffices to solve each instance $J_{j}$ separately. This is done through two lemmas. In the first one, we show how to combine solutions to the instances $J_{j}$ into a solution to the instance $I^{\prime}$ .

Lemma 10.

Assume we are given sets $D_{j}\subseteq F_{j}$ for every nontrival instance $J_{j}$ . Then one can construct in polynomial time a set $D\subseteq F$ such that

[TABLE]

Proof.

For every nontrivial instance $J_{j}$ and for every $f\in F_{j}\setminus D_{j}$ we check whether opening $f$ would not decrease the cost of $D_{j}$ in $J_{j}$ ; if this is the case, we add $f$ to $D_{j}$ . We also add $D^{\mathrm{free}}_{j}$ to $D_{j}$ as it does not increase the cost of $D_{j}$ . Henceforth we assume that for every nontrivial instance $J_{j}$ and every $f\in F_{j}\setminus D_{j}$ it holds that

[TABLE]

We define $D_{j}=D^{\mathrm{free}}_{j}$ for every trivial instance $J_{j}$ . Note that property (15) also holds for the trivial instances. Let $D_{j}^{\prime}=D_{j}\setminus D^{\mathrm{free}}_{j}$ for every $j\in\mathbb{Z}$ ; note that $D_{j}^{\prime}=\emptyset$ for trivial $J_{j}$ . Let

[TABLE]

We claim that $D$ satisfies the requirements of the lemma; it is clearly computable in polynomial time as $D_{j}^{\prime}=\emptyset$ for trivial $J_{j}$ . Note that $D_{j}\setminus D_{j}^{\prime}=D^{\mathrm{free}}_{j}$ for every $j\in\mathbb{Z}$ .

For a facility $f\in D_{j}$ , let $\mathsf{cluster}(f,D_{j};J_{j})\subseteq C_{j}$ be the set of clients served by $f$ in the solution $D_{j}$ to $J_{j}$ ; that is, $\mathsf{cluster}(f,D_{j};J_{j})$ is the set of these $c\in C_{j}$ for which $f$ is the closest facility from $D_{j}$ . Consider redirecting, in the solution $D$ to the instance $I^{\prime}$ , all clients from $\mathsf{cluster}^{\prime}(f)$ to $f$ , for every $f\in S\subseteq D$ . Then we have:

[TABLE]

We bound the three summands in the inequality above separatedly. By (13), the first summand is bounded by $2\alpha\varepsilon\mathsf{OPT}$ . Since $D_{j}^{\prime}\subseteq D\cap D_{j}$ for every $j\in\mathbb{Z}$ , we have for the second summand:

[TABLE]

We now estimate the third summand. Consider a nontrivial instance $J_{j}$ and a facility $f\in W_{j}$ . Recall that $\mathsf{cluster}^{\prime}(f)\subseteq C_{j}$ . By applying Lemma 4 to the instance $J_{j}$ , solution $D_{j}$ , facility $f$ , and set $K=\mathsf{cluster}^{\prime}(f)$ we infer that (15) ensures that there exists $g\in D_{j}$ with

[TABLE]

Plugging now the bound of Lemma 8, we obtain

[TABLE]

We now observe the following.

Claim 1.

For every facility $f\in D_{j}$ , we have

[TABLE]

Proof.

Since all but a finite number of $D_{j}$ -s are empty, we can proceed by induction on $j$ , assuming the claim holds for all $j^{\prime}>j$ . Take any $f\in D_{j}$ . If $f\in D$ then $\mathrm{dist}(f,D)=0$ and we are done. Otherwise, $f\in D_{j}\setminus D\subseteq\bigcup_{j^{\prime}>j}D_{j^{\prime}}$ , so $f\in D_{j^{\prime}}$ for some $j^{\prime}>j$ . By (16), there exists $g\in D_{j^{\prime}}$ such that

[TABLE]

By induction assumption for $g$ , we have

[TABLE]

Hence, we have

[TABLE]

as required. $\lrcorner$

By Claim 1, for every $f\in D^{\mathrm{free}}_{j}$ and $c\in\mathsf{cluster}(f,D_{j};J_{j})$ with $c\in\mathsf{cluster}^{\prime}(f_{c})$ for some $f_{c}\in W_{j}$ we have the following:

[TABLE]

By summing the above bound through all $j\in\mathbb{Z}$ and $f\in D^{\mathrm{free}}_{j}$ we obtain

[TABLE]

Since $\mathsf{cost}(\widetilde{D})\leqslant\alpha\varepsilon^{-1}\cdot\mathsf{OPT}$ , we can combine the obtained bounds as follows:

[TABLE]

This concludes the proof. $\square$

The second lemma shows that optima in instances $J_{j}$ almost partition the optimum in $I$ .

Lemma 11.

For $j\in\mathbb{Z}$ , let $\mathsf{OPT}_{j}$ be the cost of the optimum solution of $J_{j}$ . Then

[TABLE]

Proof.

Let $D$ be an optimum solution to $I^{\prime}$ . For every $f\in D$ let $j(f)$ be the maximum value of $j$ such that there exists $g\in W_{j}$ with $\mathrm{dist}(f,g)\leqslant 3\varepsilon^{-2}\cdot\mathsf{avgcost}(g)$ . If no such $j$ exists, we set $j(f)$ to be the minimum value of $j$ for which $J_{j}$ is nontrivial. For every $j\in\mathbb{Z}$ we define

[TABLE]

note that $D_{j}^{\prime}=\emptyset$ for trivial $J_{j}$ . Our goal is to estimate $\sum_{j\in\mathbb{Z}}\mathsf{cost}(D_{j};J_{j})$ by $\mathsf{cost}(D,I^{\prime})$ plus some terms of the order of $\varepsilon\cdot\mathsf{OPT}$ . First, it is immediate from the definition that $\mathsf{open}(D)=\sum_{j\in\mathbb{Z}}\mathsf{open}_{j}(D_{j})$ . Clearly, for trivial $J_{j}$ we have $D_{j}=D^{\mathrm{free}}_{j}$ and $\mathsf{cost}(D_{j};J_{j})=0$ . Let $J_{j}$ be nontrivial. Consider a client $c\in C_{j}$ ; by the definition of $J_{j}$ , there exists $f_{0}\in W_{j}$ with $c\in\mathsf{cluster}^{\prime}(f_{0})$ .

Let $f\in D$ be the facility that serves $c$ in the solution $D$ , that is, $\mathrm{dist}(c,f)=\mathrm{dist}(c,D)$ . We consider cases depending on the relation of $j(f)$ and $j$ .

Case 1: $j(f)>j$ . By the definition of $j(f)$ , there exists $g\in W_{f(j)}\subseteq D^{\mathrm{free}}_{j}$ with $\mathrm{dist}(f,g)\leqslant 3\varepsilon^{-2}\cdot\mathsf{avgcost}(g)\leqslant 3\varepsilon^{2}\cdot\mathsf{avgcost}(f_{0})$ . Therefore

[TABLE]

Case 2: $j(f)=j$ . Here $f\in D_{j}$ and thus $\mathrm{dist}(c,D_{j})\leqslant\mathrm{dist}(c,f)=\mathrm{dist}(c,D)$ .

Case 3: $j(f)<j$ . Supposing that $f_{0}\notin D$ , Lemma 4 applied to the (optimal) solution $D$ in $I^{\prime}$ with facility $f_{0}$ and $K=\mathsf{cluster}^{\prime}(f_{0})$ yields that there exists $g_{0}\in D$ with

[TABLE]

Here, the penultimate inequality follows from Lemma 8. If $f_{0}\in D$ , then we can take $g_{0}=f_{0}$ and the above inequality also holds.

By the definition of $j(f)$ we have that $\mathrm{dist}(f,f_{0})>3\varepsilon^{-2}\cdot\mathsf{avgcost}(f_{0})$ . On the other hand, $\mathrm{dist}(c,f_{0})\leqslant\varepsilon^{-2}\cdot\mathsf{avgcost}(f_{0})$ while $\mathrm{dist}(f_{0},g_{0})\leqslant 4\cdot\mathsf{avgcost}(f_{0})\leqslant\varepsilon^{-2}\cdot\mathsf{avgcost}(f_{0})$ . Since $g_{0}\in D$ , we infer that $f$ is not the closest to $c$ facility of $D$ , a contradiction. We infer that this case is impossible.

We conclude that in any case, we have

[TABLE]

By summing this bound through all the clients and adding opening costs to both sides, we obtain

[TABLE]

where in the last inequality we use Lemma 6. This finishes the proof of the lemma. $\square$

We conclude this section with the observation that it remains to prove Lemma 9 in order to show a polynomial-time approximation scheme for Non-Uniform Facility Location in planar graphs. After initial preprocessing of the input instance $I$ , Corollary 7 asserts that it suffices to find a $(1+\mathcal{O}(\varepsilon))$ -approximate solution to $I^{\prime}$ .

To this end, we break $I^{\prime}$ into instances $(J_{j})_{j\in\mathbb{Z}}$ . For every nontrivial $J_{j}$ , we scale all the edge lengths and opening costs of $J_{j}$ by a factor of $\varepsilon^{-(4(jq+q+a)+2)}$ and define $D^{\circ}=W_{j}$ and $\mathsf{cluster}(f):=\mathsf{cluster}^{\prime}(f)$ for every $f\in D^{\circ}$ . Note that $(\mathsf{cluster}(f))_{f\in D^{\circ}}$ partitions $C_{j}$ . Let

[TABLE]

Then, since for every $f\in W_{j}$ we have

[TABLE]

and for every $c\in\mathsf{cluster}^{\prime}(f)$ it holds that

[TABLE]

we infer that after scaling the distances, $1\leqslant\mathrm{dist}(c,f)\leqslant r/2$ for every $f\in W_{j}$ and $c\in\mathsf{cluster}^{\prime}(f)$ . Furthermore, (17) together with Lemma 8 imply the second condition of Lemma 9.

Consequently, the algorithm Lemma 9 applied to $J_{j}$ prepared as above with accuracy parameter $\varepsilon^{2}$ (instead of $\varepsilon$ ) runs in time $n^{2^{\mathcal{O}(\varepsilon^{-2}\log(1/\varepsilon))}}$ and returns a solution $D_{j}$ of cost (after scaling back again all the edge weights and opening costs) satisfying

[TABLE]

where

[TABLE]

Observe that

[TABLE]

Thus Lemma 10 allows us to combine the solutions $D_{j}$ into a solution $R$ to $I^{\prime}$ of cost satisfying:

[TABLE]

By Lemma 11, this value is at most

[TABLE]

Finally, we may apply Corollary 7 to conclude that

[TABLE]

as required. Consequently, it remains to prove Lemma 9.

3 Dynamic programming algorithm

3.1 Overview

Before we proceed to the formal proof of Lemma 9, we give a short overview. The approach is based on a rather standard layering argument plus portal-based Divide&Conquer. While the formal reasoning is quite lengthy due to a number of technical details that require attention, we hope that presenting an intuitive description of consecutive steps will help the reader with guiding through the proof.

Suppose $D$ is an optimum solution to instance $J$ . The first realization is that $D$ enjoys a similar proximity property as expressed in Lemma 4. Namely, every client $c\in C$ is at distance at most $3r$ from some facility of $D$ . The argument is essentially the same: supposing all clients from some cluster $\mathsf{cluster}(f)$ for $f\in D^{\circ}$ had connection costs larger than $r$ in the solution $D$ , one could improve $D$ by opening facility $f$ and rediricting all clients from $\mathsf{cluster}(f)$ to $f$ . Otherwise, some client from $\mathsf{cluster}(f)$ is within distance at most $r$ from $D$ , which implies that all of them are at distance at most $3r$ .

This proximity property allows us to apply standard layering. We fix a vertex $s$ and classify facilities from $D^{\circ}$ of the graph into layers $(D^{\circ}_{i})_{i\in\mathbb{N}}$ of width $8r$ according to distances from $s$ : layer $D^{\circ}_{i}$ comprises facilities $f\in D^{\circ}$ satisfying $i\cdot 8r\leqslant\mathrm{dist}(s,f)<(i+1)\cdot 8r$ . With every facility $f\in D^{\circ}$ we can associate its contribution to $M$ , equal to $\mathsf{open}(f)+\sum_{c\in\mathsf{cluster}(f)}\mathrm{dist}(c,f)$ . Now, denoting $q=\lceil\varepsilon^{-1}\rceil$ , there exists $a\in\{0,1,\ldots,q-1\}$ such that the total contribution of facilities from layers $D^{\circ}_{i}$ with $i\equiv a\bmod q$ is at most $\varepsilon M$ . Hence, by paying cost $\varepsilon M$ we may open these facilities and direct all clients from their clusters to them. Now it is easy to see that we have a separation property: instance $J$ can be decomposed into instances $(J_{j})_{j\in\mathbb{N}}$ , where $J_{j}$ concerns connecting all clients from clusters of facilities of $\bigcup_{jq+a<i<(j+1)q+a}D^{\circ}_{i}$ to facilities within distance between $(jq+a)\cdot 8r-4r$ and $((j+1)q+a)\cdot 8r-4r$ from $s$ , which can be (approximately) solved separately. This is because in the optimum solution, no client-facility path used for connection crosses any of the entirely bought layers due to having length at most $3r$ .

Let us focus on one instance $J_{j}$ . We may contract all vertices at distance less than $(jq+a)\cdot 8r-8r$ onto $s$ and remove all vertices at distance more than $((j+1)q+a)\cdot 8r$ , as these vertices anyway will not participate in any shortest path used by an optimum solution. Thus, we essentially achieve a small radius property in $J_{j}$ : one may assume that all vertices are at distance at most $8qr=\mathcal{O}(\varepsilon^{-1}r)$ from $s$ .

The idea is to compute a near-optimum solution to $J_{j}$ using Divide&Conquer on balanced separators, presented as dynamic programming. Using standard separation properties of planar graphs one can show that the graph (or rather its plane embedding) admits a hierarchical decomposition into regions so that the decomposition has depth at most $\log n$ and every region is boundaried by a union of at most $6$ shortest paths, all with one endpoint in $s$ . Thus, each of these paths has length $\mathcal{O}(\varepsilon^{-1}r)$ . We apply dynamic programming over this decomposition, where we put portals on the boudaries of regions to limit the number of states. That is, along each path we put portals spaced at $\delta$ , for some parameter $\delta>0$ , and we allow paths connecting clients with facilities to cross region boundaries only through portals. Since the decomposition has depth $\log n$ , each connection path in the optimum solution can be “snapped to portals” to conform with this requirement by using at most $2\log n$ snappings, incurring a total additional cost of $2\delta\cdot\log n$ . Therefore, we put $\delta=\varepsilon/\log n$ so that this error is bounded by $\mathcal{O}(\varepsilon)$ , which summed through all clients yields an $\mathcal{O}(\varepsilon M)$ error term in total. Thus, the total number of portals on the boundary of each region is $\mathcal{O}(\delta^{-1}\varepsilon^{-1}r)=\mathcal{O}(\varepsilon^{-2}r\log n)$ .

In the dynamic programming state associated with a region $R$ , we are concerned about opening facilities within $R$ to serve all clients in $R$ . However, on the boundary of $R$ we have $\mathcal{O}(\varepsilon^{-2}r\log n)$ portals that carry information about the assumed interaction between the parts of the overall solution within $R$ and outside of $R$ . For every portal $\pi$ , this information consists of two pieces:

•

request $\mathsf{req}(\pi)$ that gives a hard request on the sought solution within $R$ : there has to be a facility opened at distance at most $\mathsf{req}(\pi)$ from $\pi$ ;

•

prediction $\mathsf{pred}(\pi)$ that gives a possibility of connecting clients to portals: every client $c$ can be connected to $\pi$ at connection cost $\mathrm{dist}(c,\pi)+\mathsf{pred}(\pi)$ .

Intuitively, predictions represent “virtual” opened facilities residing outside of $R$ , which can be accessed at an additional cost given by $\mathsf{pred}(\pi)$ , while by satisfying requests we make sure that predictions in other regions can be fulfilled. Since all client-facility paths in the optimum solution are of length at most $3r$ , we may assume that all requests and predictions in all considered states are bounded by $3r$ . At the cost of an additional error term $\mathcal{O}(\varepsilon M)$ we can also assume that requests and predictions are rounded to integer multiples of $\delta$ . Thus, for every portal $\pi$ we can limit ourselves to $\mathcal{O}(\delta^{-1}r)=\mathcal{O}(\varepsilon^{-2}r\log n)$ possibilities for $\mathsf{req}(\pi)$ and same for $\mathsf{pred}(\pi)$ .

Let us estimate the number of states constructed so far. For each of $\mathcal{O}(\varepsilon^{-2}\log n)$ portals on the boundary of $R$ we have $\mathcal{O}(\varepsilon^{-2}r\log n)$ possibilities for $\mathsf{req}(\pi)$ and for $\mathsf{pred}(\pi)$ , yielding a total number of states being $(\varepsilon^{-2}r\log n)^{\mathcal{O}(\varepsilon^{-2}r\log n)}=n^{\textrm{poly}(1/\varepsilon)\cdot r\cdot\log\log n}$ , which is quasi-polynomial. As transitions in this dynamic programming can be implemented efficiently, this already yields a QPTAS, and we are left with reducing the number of states to polynomial.

The final trick is to take a closer look at what we store in the states. Since $\mathsf{req}(\cdot)$ stores the requested distance to the closest facility opened within $R$ , it is safe to assume that $\mathsf{req}(\cdot)$ (before rounding to integer multiples of $\delta$ ) will be $1$ -Lipschitz in the following sense: for any two portals $\pi,\rho$ , we have

[TABLE]

An analogous reasoning can be applied to predictions, so we can assume that $\mathsf{pred}(\cdot)$ is $1$ -Lipschitz as well. Now consider any of the $6$ shortest paths comprising the boundary of $R$ , say $P$ . On this path we put portals spaced at $\delta$ , say $\pi_{1},\ldots,\pi_{\ell}$ for $\ell\leqslant\mathcal{O}(\varepsilon^{-2}r\log n)$ in the order on $P$ . As argued, after rounding we have $\mathcal{O}(\varepsilon^{-2}\log n)$ possibilities for $\mathsf{req}(\pi_{1})$ , but observe that once (rounded) $\mathsf{req}(\pi_{i-1})$ is chosen, there are only at most $5$ possibilites for $\mathsf{req}(\pi_{i})$ : it must be an integer multiple of $\delta$ that differs from $\mathsf{req}(\pi_{i-1})$ by at most $2\delta$ , due to $\mathrm{dist}(\pi_{i-1},\pi_{i})=\delta$ . Hence, the total number of choices for the values of requests along $P$ is bounded by $\mathcal{O}(\varepsilon^{-2}\log n)\cdot 5^{\mathcal{O}(\varepsilon^{-2}r\log n)}=n^{\mathcal{O}(\varepsilon^{-2}r)}$ . Same argument applies to predictions, and as the boundary of $R$ consists of at most $6$ such paths, the total number of states we need to consider is $n^{\mathcal{O}(\varepsilon^{-2}r)}$ .

3.2 Proof of Lemma 9

We now proceed with the formal proof of Lemma 9. For the remainder of this section, let us fix the setting and the notation from the statement of Lemma 9.

Fix an optimum solution $D\subseteq F$ in the instance $J$ . We first prove that in fact, every client is not too far from its closest facility in $D$ .

Lemma 12.

*For each $c\in C$ there exists $g\in D$ such that $\mathrm{dist}(c,g)\leqslant 3r$ . *

Proof.

Let $f\in D^{\circ}$ be such that $c\in\mathsf{cluster}(f)$ ; then $\mathrm{dist}(c,f)\leqslant r$ . We shall prove that there exists some client $d\in\mathsf{cluster}(f)$ and facility $g\in D$ such that $\mathrm{dist}(d,g)\leqslant r$ . Indeed, if this is true, then we have $\mathrm{dist}(c,g)\leqslant\mathrm{dist}(c,f)+\mathrm{dist}(f,d)+\mathrm{dist}(d,g)\leqslant r+r+r=3r$ , as required.

Suppose otherwise: for each $d\in\mathsf{cluster}(f)$ , the distance from $d$ to the closest facility from $D$ is larger than $r$ . As $\mathsf{cluster}(f)$ is nonempty, the total connection cost incurred by clients from $\mathsf{cluster}(f)$ in solution $D$ can be lower bounded as follows:

[TABLE]

This means that the solution $D\cup\{f\}$ has a strictly smaller cost than $D$ , which contradicts the optimality of $D$ . $\square$

Let $G^{\prime}$ be the subgraph of $G$ induced by all vertices whose distance from $D^{\circ}$ is at most $4r$ . Observe that all clients of $C$ are placed at vertices of $G^{\prime}$ . Lemma 12 now immediately implies the following.

Lemma 13.

*It holds that $D\subseteq V(G^{\prime})$ and for every $c\in C$ we have $\mathrm{dist}_{G^{\prime}}(c,D)=\mathrm{dist}_{G}(c,D)$ . *

Proof.

For the first assertion, by the optimality of $D$ , for every $g\in D$ there exists some client $c\in C$ such that $g$ is the facility of $D$ closest to $c$ . By Lemma 12 we have $\mathrm{dist}_{G}(c,g)\leqslant 3r$ . If now $f\in D^{\circ}$ is such that $c\in\mathsf{cluster}(f)$ , then $\mathrm{dist}_{G}(c,f)\leqslant r$ . Hence $\mathrm{dist}_{G}(f,g)\leqslant r+3r=4r$ , so $g\in V(G^{\prime})$ .

For the second assertion, observe that by Lemma 12, for every client $c\in C$ , the shortest path from $c$ to a facility of $D$ traverses only vertices that are at distance at most $4r$ from the facility $f\in D^{\circ}$ satisfying $c\in\mathsf{cluster}^{\prime}(f)$ . It follows that the distance from $c$ to $D$ is the same in $G$ as in $G^{\prime}$ $\square$

Let $F^{\prime}$ consist of all the facilities that are placed at vertices of $G^{\prime}$ , and let $J^{\prime}=(G^{\prime},C,F^{\prime},\mathsf{open})$ . We observe that Lemma 12 implies that we can work with instance $J^{\prime}$ instead of $J$ .

Corollary 14.

*For every $R\subseteq F^{\prime}$ , we have $\mathsf{cost}(R;J^{\prime})\geqslant\mathsf{cost}(R;J)$ . Moreover, we have $\mathsf{cost}(D;J^{\prime})=\mathsf{cost}(D;J)$ and consequently $\mathsf{OPT}(J^{\prime})=\mathsf{OPT}(J)$ . *

Proof.

The first assertion is straightforward, because $G^{\prime}$ is an induced subgraph of $G$ , hence distances between vertices of $G^{\prime}$ are not smaller in $G^{\prime}$ than in $G$ . For the second assertion, observe that by Lemma 12 we have $D\subseteq F^{\prime}$ and $\mathrm{dist}_{G^{\prime}}(c,D)=\mathrm{dist}_{G}(c,D)$ for every client $c\in C$ , hence the connection cost of $D$ in $J$ and in $J^{\prime}$ are the same. As the opening costs are also obviously the same, we conclude that indeed $\mathsf{cost}(D;J^{\prime})=\mathsf{cost}(D,J)$ . This, together with the first assertion, immediately entails $\mathsf{OPT}(J^{\prime})=\mathsf{OPT}(J)$ . $\square$

From now on we will assume that the graph $G^{\prime}$ is connected. This can be achieved either by connecting the connected components using edges of very large (but finite) weight, or applying the forthcoming reasoning to every connected component of $G^{\prime}$ separately and taking the union of obtained solutions.

Fix any vertex $s$ and partition the vertices of $G^{\prime}$ into layers $(\mathrm{layer}_{i})_{i\in\mathbb{N}}$ as follows: for $i\in\mathbb{N}$ we set:

[TABLE]

Let $D^{\circ}_{i}=D^{\circ}\cap\mathrm{layer}_{i}$ . Denote $q=\lceil\varepsilon^{-1}\rceil$ . Since $(D^{\circ}_{i})_{i\in\mathbb{N}}$ is a partition of $D^{\circ}$ , it follows that there exists $a\in\{0,1,\ldots,q-1\}$ such that denoting $S=\bigcup_{i\colon i\equiv a\bmod q}D^{\circ}_{i}$ , we have

[TABLE]

Moreover, obviously such $a$ can be found in polynomial time. For $j\in\mathbb{N}$ , define the $j$ -th ring as

[TABLE]

For future reference, we note that rings are separated from each other.

Lemma 15.

*For any different $j,j^{\prime}\in\mathbb{N}$ and $u\in W_{j}$ and $u^{\prime}\in W_{j^{\prime}}$ , we have $\mathrm{dist}_{G^{\prime}}(u,u^{\prime})>8r$ . *

Proof.

By the definition of $W_{j}$ and $W_{j^{\prime}}$ and since $j\neq j^{\prime}$ , we have $|\mathrm{dist}_{G^{\prime}}(u,s)-\mathrm{dist}_{G^{\prime}}(u^{\prime},s)|>8r$ . Then the statement follows by triangle inequality. $\square$

The idea now is to buy the facilities of $S$ and connect the clients from $C_{S}=\bigcup_{f\in S}\mathsf{cluster}(f)$ to the centers of their clusters — which incurs cost at most $\varepsilon\cdot M$ by (18) — and to construct a separate instance for each ring $W_{j}$ so that these instances can be solved independently. We now carefully define those instances.

Fix $j\in\mathbb{N}$ and construct graph $H_{j}$ obtained from $G^{\prime}$ in the following manner:

Remove all vertices $w$ of $G^{\prime}$ satisfying $w\in\bigcup_{\iota>jq+a}L_{\iota}$ . 2. 2.

Contract all vertices $w$ of $G^{\prime}$ satisfying $w\in\bigcup_{\iota<(j-1)q+a}L_{\iota}$ onto $s$ ; we shall use the name $s$ also for the vertex obtained as the result of this contraction. 3. 3.

For every vertex $w$ that, after the contraction explained above, becomes a neighbor of $s$ , we assign the edge $sw$ weight $\mathrm{dist}_{G^{\prime}}(s,w)$ .

Note that in the second, the set of vertices $w$ contracted onto $s$ induces a connected subgraph of $G^{\prime}$ , and thus the contraction is well-defined and preserves the planarity. We shall identify vertices of $H_{j}$ with their origins in $G^{\prime}$ in the obvious way.

In essence, graph $H_{j}$ retains all the relevant information about distances between vertices of $W_{j}$ . This is formalized in the following lemma.

Lemma 16.

The following assertions hold for each $j\in\mathbb{N}$ :

(P1)

For every pair of vertices $u,v\in V(H_{j})$ , we have $\mathrm{dist}_{H_{j}}(u,v)\geqslant\mathrm{dist}_{G^{\prime}}(u,v)$ . 2. (P2)

For every vertex $u\in V(H_{j})$ , we have $\mathrm{dist}_{H_{j}}(u,s)=\mathrm{dist}_{G^{\prime}}(u,s)$ . 3. (P3)

For every pair of vertices $u,v\in V(H_{j})$ satisfying $u\in W_{j}$ and $\mathrm{dist}_{G^{\prime}}(u,v)\leqslant 3r$ , we have $\mathrm{dist}_{H_{j}}(u,v)=\mathrm{dist}_{G^{\prime}}(u,v)$ .

Proof.

For assertion (P1), it suffices to observe that every path in $H_{j}$ with endpoints $u$ and $v$ can be lifted to a path in $G^{\prime}$ of the same length by substituting any edge incident to $s$ , say $sw$ , by the shortest path between $s$ and $w$ in $G^{\prime}$ . For assertion (P2), we already know that $\mathrm{dist}_{H_{j}}(u,s)\geqslant\mathrm{dist}_{G^{\prime}}(u,s)$ , and to see that $\mathrm{dist}_{H_{j}}(u,s)\leqslant\mathrm{dist}_{G^{\prime}}(u,s)$ we may observe that on the shortest path in $G^{\prime}$ from $s$ to $u$ , vertices contracted onto $s$ form a prefix; this prefix can be then replaced by a single edge of the same weight. For assertion (P3), the assumption that $u\in W_{j}$ implies that in $G^{\prime}$ , the vertex $u$ is at distance more than $3r$ from any vertex that is removed or contracted onto $s$ in the construction of $H_{j}$ . Hence, the shortest path from $u$ to $v$ in $G^{\prime}$ survives the construction of $H_{j}$ intact. $\square$

Fix

[TABLE]

For future reference, we also note the following observation.

Lemma 17.

*Let $Q$ be a shortest path in $H$ from $s$ to some vertex $u$ . Then the length of $Q-s$ (i.e. $Q$ with the first vertex removed) is smaller than $L$ . *

Proof.

Let $v$ be the successor of $s$ on the path $Q$ . By the construction of $H$ we have that $u,v\in\bigcup_{(j-1)q+a\leqslant\iota\leqslant jq+a}L_{\iota}$ which in particular means that

[TABLE]

Since $v$ lies on the shortest path from $s$ to $u$ , it follows that the length of the suffix of $Q$ from $v$ to $u$ (which is $Q-s$ ) is equal to the $\mathrm{dist}(v,u)$ , which in turn is smaller than $8r(jq+a+1)-8r((j-1)q+a)=8r(q+1)=L$ . $\square$

Having defined the graph $H_{j}$ , we define the facility set $F_{j}$ and client set $C_{j}$ as follows:

[TABLE]

Note that $F_{j}\subseteq V(H_{j})$ and $C_{j}\subseteq V(H_{j})$ . Finally, we put

[TABLE]

that is, the opening costs are inherited from the original instance $J$ . We now prove that by paying a small cost, we may solve instances $J_{j}$ separately.

Lemma 18.

We have

[TABLE]

Moreover, for any sequence of solutions $(R_{j})_{j\in\mathbb{N}}$ to instances $(J_{j})_{j\in\mathbb{N}}$ , respectively, we have

[TABLE]

Proof.

For each $j\in\mathbb{N}$ , let $D_{j}$ be the set consisting of all facilities $f\in D$ with the following property: there exists a client $c\in C_{j}$ for which $f$ is the closest facility from $D$ . By Lemmas 12 and 13, we have $\mathrm{dist}_{G^{\prime}}(c,D_{j})\leqslant 3r$ for all $c\in C_{j}$ , while from the definition of $D_{j}$ it further follows that $\mathrm{dist}_{G^{\prime}}(f,C_{j})\leqslant 3r$ for all $f\in D_{j}$ . Also, every client $c\in C_{j}$ is at distance at most $r$ from the center of its cluster, which is a facility of $D^{\circ}$ that resides in $W_{j}$ . Hence, every facility $f\in D_{j}$ is at distance at most $4r$ from $W_{j}$ . By Lemma 15 and triangle inequality we now infer that sets $(D_{j})_{j\in\mathbb{N}}$ are pairwise disjoint. Moreover, we have $D_{j}\subseteq F_{j}$ and thus $D_{j}$ can be treated as a solution to the instance $J_{j}$ Therefore, by Lemma 16, assertions (P1) and (P3), we have

[TABLE]

completing the proof of the first assertion.

For the second assertion, since $C_{S}$ and $(C_{j})_{j\in\mathbb{N}}$ form a partition of $C$ , we have

[TABLE]

where in the second inequality we use Lemma 16, assertion (P1), while in the last inequality we use (18). $\square$

Hence, from now on we focus on finding a near-optimum solutions to instances $J_{j}$ , for each $j\in\mathbb{N}$ for which $C_{j}\neq\emptyset$ , as such solutions can be combined into a near-optimum solution to $J^{\prime}$ using Lemma 18, which is then a near-optimum solution to $J$ by Corollary 14. This will be done by dynamic programming. Fix $j\in J$ for which $C_{j}$ is non-empty. For brevity, in the following we write $H$ for $H_{j}$ . Before we proceed, let us observe that $J_{j}$ enjoys the same proximity property as $J$ , expressed in Lemma 12.

Lemma 19.

*Suppose $D_{j}$ is an optimum solution in the instance $J_{j}$ . Then for each $c\in C_{j}$ there exists $g\in D_{j}$ such that $\mathrm{dist}_{H}(c,g)\leqslant 3r$ . *

Proof.

Apply the same reasoning as in the proof of Lemma 12, noting that all relevant vertices and paths are completely contained $H$ due to being at distance at most $3r$ from $W_{j}$ . $\square$

Getting a suitable decomposition.

Our dynamic programming will work over a suitable decomposition of the graph $H$ . To define this decomposition, we will need some structural understanding of $H$ and its embedding.

Recall that we assume that $H$ is embedded in a sphere $\Sigma$ . We shall assume that $H$ is triangulated, as we can always triangulate it using edges of weight $+\infty$ . Let $L$ be the set of faces111We use $L$ here instead of usual $F$ in order to avoid using the same letter as for facility sets. of $H$ . For future reference, we let $\xi\colon V(H)\to L$ be a function that assigns to every vertex $u$ of $H$ an arbitrary face $\xi(u)$ incident to $u$ .

Let $S$ be the spanning tree of shortest paths from $s$ . That is, if for each $v\in V(H)$ by $P_{v}$ we denote the shortest path from $v$ to $s$ in $H$ , then $S$ is the union of paths $\{P_{v}\colon v\in V(H)\}$ . Let $S^{\star}$ be the spanning subgraph of the dual $H^{\star}$ of $H$ consisting of edges of $H^{\star}$ that are dual to the edges not belonging to $S$ . It is well-known that $S^{\star}$ is then a spanning tree of $H^{\star}$ .

Let

[TABLE]

that is, for each edge $fg$ of $S^{\star}$ we add to $A$ two (oriented) arcs: $(f,g)$ and $(g,f)$ . For an arc $a\in A$ , let $L(a)\subseteq L$ denote the set of those faces of $H$ that are contained in this connected component of $S^{\star}$ with (unoriented) $a$ removed that contains the head of $a$ . For nonempty $B\subseteq A$ , we denote

[TABLE]

and we put $L(\emptyset)=L$ by convention. We may now state and prove the decomposition lemma that we shall need; in the following, all logarithms are base $2$ .

Lemma 20.

In polynomial time one can compute a rooted tree $T$ together with a labelling $\beta$ of nodes of $T$ with subsets of $A$ such that the following holds:

(T1)

$T$ * has depth at most $\log n$ ;* 2. (T2)

for each node $t$ of $T$ , we have $|\beta(t)|\leqslant 3$ ; 3. (T3)

if $t_{0}$ is the root of $T$ , then $L(\beta(t_{0}))=L$ ; 4. (T4)

for each leaf $t$ of $T$ , we have $|L(\beta(t))|=1$ ; 5. (T5)

each non-leaf node $t$ of $T$ has at most $7$ children, and if $\mathsf{chld}(t)$ denotes the set of children of $t$ , then

[TABLE]

Proof.

A subset $X$ of nodes of $S^{\star}$ is connected if it induces a connected subtree of $X$ . For a subset of nodes $X$ , by $\partial X$ we denote the set of edges of $S^{\star}$ with one endpoint in $X$ and second outside of $X$ . Let a block be any nonempty, connected subset of nodes $X$ such that $|\partial X|\leqslant 3$ . Note that since $H$ is triangulated, $S^{\star}$ is a tree with maximum degree at most $3$ , so every node of $T$ constitutes a single-node block.

We observe the following.

Claim 2.

*Every block $X$ with $|X|\geqslant 2$ admits a partition into at most $7$ blocks, each of size at most $|X|/2$ . *

Proof.

Let $Z\subseteq X$ be the set of all the nodes of $X$ that have a neighbor (in $S^{\star}$ ) outside of $X$ . Then $|Z|\leqslant 3$ and, consequently, there exists a node $x\in X$ such that every connected component of $S^{\star}[X]-x$ contains at most one node of $Z$ . Further, it is well known that in $S^{\star}[X]$ there exists a balanced node: a node $y$ such that every connected component of $S^{\star}[X]-y$ has at most $|X|/2$ nodes. Then $S^{\star}[X]-\{x,y\}$ has at most $5$ connected components, and it is straightforward to see that each of them is a block and contains at most $|X|/2$ nodes. Hence, as $|X|\geqslant 2$ , for the promised partition of $X$ into blocks we can take the node sets of the connected components of $S^{\star}[X]-\{x,y\}$ , plus blocks $\{x\}$ and $\{y\}$ (or just $\{x\}$ , in case $x=y$ ). $\lrcorner$

We now construct the tree $T$ together with labeling $\beta(\cdot)$ by recursively applying Claim 2 as follows. We start with the block $L$ and, as long as the currently decomposed block $X$ has size larger than $1$ , we apply Claim 2 to $X$ and recursively decompose all the blocks comprising the obtained partition. Then $T$ is the tree of this recursion and the nodes of $T$ can be naturally labelled with blocks decomposed in corresponding calls; thus, the root of $T$ is labelled by $L$ , while the leaves of $T$ are labelled by single-node blocks. Finally, for every node $t$ of $T$ , say associated with a block $X_{t}$ , we set $\beta(t)$ to consist of edges of $\partial X_{t}$ oriented towards endpoints belonging to $X_{t}$ . It is straightforward to verify that the obtained pair $(T,\beta)$ satisfies all of the required properties. Also, the above reasoning can be trivially translated into a polynomial-time algorithm computing $(T,\beta)$ . $\square$

Thus, Lemma 20 essentially provides a hierarchical decomposition of the face set of $H$ using separators consisting of six-tuples of shortest paths originating in $s$ : two per each arc in $\beta(t)$ . The idea is to put portals on those separators and run a bottom-up dynamic programming on the tree $T$ that assembles a near-optimum solution while snapping paths to the portals along the way. First, however, we need to understand how to put portals on paths in $H$ .

Portalization.

Let $X$ be a set of vertices of $H$ and let $f\colon X\to\mathbb{R}\cup\{+\infty\}$ be a function. For positive reals $d,\sigma$ and reals $\alpha\leqslant\beta$ , we shall say that $f$ is

•

$d$ -discrete if all its values are integer multiples of $d$ ;

•

$[\alpha,\beta]$ -bounded if every its value is either $+\infty$ or belongs to the interval $[\alpha,\beta]$ ; and

•

Lipschitz with slack $\sigma$ if

[TABLE]

A function that is $d$ -discrete, $[\alpha,\beta]$ -bounded, and Lipschitz with slack $\sigma$ will be called $(d,\alpha,\beta,\sigma)$ -normal.

For portalization of shortest paths we shall use the following lemma.

Lemma 21.

*Let $P$ be a shortest path in $H$ with one endpoint in $s$ and let $d\in\mathbb{R}_{\geqslant 0}$ . Then one can find a set $\Pi$ of at most $(L/d)+2$ vertices on $P$ with the following property: for every vertex $u$ on $P$ , there exists $\pi\in\Pi$ such that $\mathrm{dist}(u,\pi)\leqslant d$ . Moreover, for any reals $\alpha\leqslant\beta$ , the number of functions on $\Pi$ that are $(d,\alpha,\beta,d)$ -normal is at most $((\beta-\alpha)/d)^{2}\cdot 2^{\mathcal{O}(L/d)}$ , and such functions can be enumerated in time $((\beta-\alpha)/d)^{2}\cdot 2^{\mathcal{O}(L/d)}$ . *

Proof.

Let $m=\beta-\alpha$ . Let $P^{\prime}=P-s$ , i.e., $P^{\prime}$ is $P$ with the first vertex removed. Then, by Lemma 17, the length of $P^{\prime}$ is smaller than $L$ .

Let $u$ and $v$ be the endpoints of $P^{\prime}$ ; then $P^{\prime}$ is the shortest path connecting $u$ and $v$ . Partition the vertices of $P^{\prime}$ into intervals $I_{0},I_{1},I_{2},\ldots,I_{p}$ , where $p=\lfloor L/d\rfloor$ such that $I_{i}$ comprises vertices $w$ of $P^{\prime}$ satisfying $id\leqslant\mathrm{dist}(u,w)<(i+1)d$ ; since the length of $P^{\prime}$ is smaller than $L$ , each of the vertices of $P^{\prime}$ is placed in one of these intervals. Observe that vertices within every interval $I_{i}$ are pairwise at distance smaller than $d$ . Therefore, we may construct a suitable set $\Pi^{\prime}$ for the path $P^{\prime}$ by taking one vertex $\pi_{i}$ from each interval $I_{i}$ that is non-empty; thus, $\Pi^{\prime}$ has size at most $p\leqslant(L/d)+1$ . Finally, we set $\Pi=\Pi^{\prime}\cup\{s\}$ .

We now bound the number $(d,\alpha,\beta,d)$ -normal functions $f$ on $\Pi$ . Note that there are at most $m/d+2$ possibilities for the value $f(s)$ , as this value is either an integer multiple of $d$ between $\alpha$ and $\beta$ , or $+\infty$ . Therefore, it suffices to bound the number of $(d,\alpha,\beta,d)$ -normal functions on $\Pi^{\prime}$ by $(m/d)\cdot 2^{\mathcal{O}(L/d)}$ . Recall that $|\Pi^{\prime}|\leqslant(L/d)+1$ , hence there are at most $2^{(L/d)+1}$ choices on which portals will be assigned value $+\infty$ . Supposing that this choice has been made, we bound the number of choices of (finite) values on remaining portals. Let $1\leqslant i_{1}<i_{2}<\ldots<i_{q}\leqslant p$ be the indices such that portals chosen to be assigned finite values are in intervals $I_{i_{1}},\ldots,I_{i_{q}}$ . As above, there are at most $m/d+1$ possibilities for the value $f(\pi_{i_{1}})$ . However, for $j>1$ , the value $f(\pi_{i_{j}})$ must satisfy inequality

[TABLE]

As $f(\pi_{i_{j}})$ has to be an integer multiple of $d$ , once $f(\pi_{i_{j-1}})$ has been chosen, there are at most $2(i_{j}-i_{j-1})+4$ choices for the value of $f(\pi_{i_{j}})$ . Hence, having chosen $f(\pi_{i_{1}})$ , the number of choices for the remaining values $f(\pi_{i_{2}}),\ldots,f(\pi_{i_{q}})$ is bounded by

[TABLE]

Since $p\leqslant(L/d)+1$ , we conclude that the total number of $(d,\alpha,\beta,d)$ -normal functions on $\Pi^{\prime}$ is bounded by $(m/d)\cdot 2^{\mathcal{O}(L/d)}$ , as required.

The above reasoning can be trivially used to construct the promised enumeration algorithm. $\square$

Defining subproblems.

As expected, in dynamic programming we will need to solve more general subproblems, where portals on boundaries of these subproblems are taken into account. Formally, in an instance of the generalized problem we are working with:

•

The original set of available facilities $F_{j}$ , which we denote $F^{\diamond}$ for consistency; this set is always the same in all instance of the generalized problem, and is equipped with the original opening cost function $\mathsf{open}(\cdot)$ .

•

A subset of relevant clients $C^{\diamond}\subseteq C_{j}$ ; this set varies in instances of the generalized problem.

•

A set of portals $\Pi$ , which are vertices of $H$ .

•

A prediction function $\mathsf{pred}\colon\Pi\to\mathbb{R}\cup\{+\infty\}$ .

•

A request function $\mathsf{req}\colon\Pi\to\mathbb{R}\cup\{+\infty\}$ .

Whenever considering an instance of the generalized problem, all distances are measured in $H$ . Note that we allow negative requests and predictions.

Consider an instance $K=(C^{\diamond},\Pi,\mathsf{req},\mathsf{pred})$ of the generalized problem. For a solution $R\subseteq F^{\diamond}$ , the connection cost of a client $c\in C^{\diamond}$ is defined as

[TABLE]

That is, every client can be connected either to a facility of $f$ at the cost of the distance to this facility, or to a portal at the cost of the distance to this portal plus its prediction. Note that portals are always all open, so the factor $\min_{\pi\in\Pi}(\mathrm{dist}(c,\pi)+\mathsf{pred}(\pi))$ is independent of the solution $R$ . We will say that $c$ is served by the facility $f$ or portal $\pi$ for which the minimum above is attained.

A solution $R\subseteq F^{\diamond}$ is feasible if for every portal $\rho\in\Pi$ with $\mathsf{req}(\rho)\neq+\infty$ , its request is satisfied in the following sense:

[TABLE]

Note that the request of a portal has to be satisfied by a facility included in the solution; it cannot be satisfied by another portal. Again $\rho$ is served by the facility $f$ for which the minimum above is attained.

To analyze the approximation error, we will need to gradually relax the feasibility constraint. For this, for a nonnegative real $\lambda$ we shall say that a solution $R\subseteq F^{\diamond}$ is $\lambda$ -near feasible if for every portal $\rho\in\Pi$ with $\mathsf{req}(\rho)\neq+\infty$ there exists a facility $f\in R$ with $\mathrm{dist}(\rho,f)\leqslant\mathsf{req}(\rho)+\lambda$ . That is, we relax all requests by an additive factor of $\lambda$ .

Finally, for $\gamma\in\mathbb{R}_{\geqslant 0}$ , a solution $R\subseteq F^{\diamond}$ is $\gamma$ -close in $K$ if

[TABLE]

The cost of a solution $R$ is defined as

[TABLE]

Note that the connection costs of portals do not contribute to the cost of the solution. They are only used to define (near) feasibility of a solution. Thus, every portal essentially puts a hard constraint that there needs to be a facility opened within some distance from it. By $\mathsf{OPT}(K)$ we denote the minimum cost of a feasible solution to $K$ .

The intuitive meaning of predictions and requests in the dynamic programming are as follows. In the following, think of dynamic programming over the decomposition provided by Lemma 20 as a recursive algorithm that breaks the given instance into simpler ones (whose number is at most $7$ ), solves them using subcalls, and assembles the obtained solutions into a solution to the input instance. Whenever we break the instance using some separator, which constists of a constant number of shortest paths, we put portals along them using Lemma 21 in all the obtained subinstances. For every portal $\pi$ we guess in which subinstance lies the closest facility $f$ that is open in the (unknown) optimum solution, and we approximately guess the distance $d$ from $\pi$ to this facility (up to additive accuracy $\delta$ , to be defined later). This allows us to define the requests and predictions in subinstances: in the subinstance that is guessed to contain $f$ we put a request $d$ on $\pi$ to make sure that some facility at this distance is indeed open there, while in other subinstances we put a prediction $d$ on $\pi$ , so that solutions in these subinstances may use a virtual, “promised” facility at distance $d$ from $\pi$ .

Since recursion has depth $\mathcal{O}(\log n)$ by Lemma 20, condition (T1), the rounding error will accumulate through $\mathcal{O}(\log n)$ levels. Therefore, we needed to put $\delta=\mathcal{O}(\varepsilon/\log n)$ and make rounding errors of magnitude $\mathcal{O}(\delta)\cdot\mathsf{OPT}$ at each level, so that the total error is kept at $\mathcal{O}(\varepsilon)\cdot\mathsf{OPT}$ . Precisely, we fix

[TABLE]

Dynamic programming states.

Once we have defined the generalized problem with portals, we may formally define the instances solved in the dynamic programming. For every vertex $v$ of $H$ , we may apply Lemma 21 to $P_{v}$ and $d=\delta$ , thus obtaining a suitable set of vertices $\Pi_{v}\subseteq V(P_{v})$ of size at most $\delta^{-1}L+2=\mathcal{O}(\varepsilon^{-2}r\log n)$ .

For each node $t$ of $T$ , we define

[TABLE]

where $B_{t}$ is the set of edges of $H$ dual to the arcs of $\beta(t)$ . Note that by condition (T5) of Lemma 20, we have

[TABLE]

Observe also that if $t_{0}$ is the root of $T$ , then $C^{\diamond}_{t_{0}}=C_{j}$ and $\Pi_{t_{0}}=\emptyset$ . Finally, the following lemma expresses the crucial separation property provided by the decomposition $(T,\beta)$ .

Lemma 22.

Let $s$ and $t$ be nodes of $T$ that are not in the ancestor-descendant relation, and let $u\in\xi^{-1}(L(\beta(s)))$ and $v\in\xi^{-1}(L(\beta(t)))$ . Then there exists a portal $\rho\in\Pi_{t}$ such that

[TABLE]

*Furthermore, the same holds when $s$ is an ancestor of $t$ and $u\in\Pi_{s}$ . *

Proof.

Let $B$ be the set of edges of $H$ that are dual to the arcs of $\beta(t)$ , and let $Z$ be the set of endpoints of these edges. Consider removing all paths $P_{z}$ for $z\in Z$ and all edges of $B$ from the plane. Then the plane breaks into several connected components, out of which one consists of exactly the faces of $L(\beta(t))$ . It follows that every path connecting a vertex from $\xi^{-1}(L(\beta(t)))$ with a vertex that does not belong to $\xi^{-1}(L(\beta(t)))$ has to intersect one of the paths $P_{z}$ for some $z\in Z$ . Observe that $v\in\xi^{-1}(L(\beta(t)))$ . Moreover, if $s$ and $t$ are not in the ancestor-descendant relation in $T$ , then $L(\beta(s))$ and $L(\beta(t))$ are disjoint, implying $u\notin\xi^{-1}(L(\beta(t)))$ . Also, if $u\in\Pi_{s}$ and $s$ is an ancestor of $t$ , then either $u$ lies on one of the paths $P_{z}$ for $z\in Z$ , or $u\notin\xi^{-1}(L(\beta(t)))$ .

In both cases we conclude that the shortest path connecting $u$ and $v$ , call it $Q$ , has to intersect the path $P_{z}$ for some $z\in Z$ . Let $w$ be any vertex in the intersection of these two paths. Then, by Lemma 21, there exists $\rho\in\Pi_{z}\subseteq\Pi_{t}$ such that $\mathrm{dist}(w,\rho)\leqslant\delta$ . We conclude that

[TABLE]

as required. $\square$

For every node $t$ of $T$ , we define $\widetilde{\mathcal{N}}_{t}$ to be the set of all functions from $\Pi_{t}$ to $\mathbb{R}\cup\{+\infty\}$ . Further, let $\mathcal{N}_{t}\subseteq\widetilde{\mathcal{N}}_{t}$ be the subset of all those functions from $\widetilde{\mathcal{N}}_{t}$ that are $(\delta,-5\varepsilon,3r+5\varepsilon,\delta)$ -normal; in the sequel, when saying just normal we mean being $(\delta,-5\varepsilon,3r+5\varepsilon,\delta)$ -normal. While $\widetilde{\mathcal{N}}_{t}$ is infinite, $\mathcal{N}_{t}$ is finite and actually of polynomial size.

Lemma 23.

*For each node $t$ of $T$ we have that $|\mathcal{N}_{t}|\leqslant n^{\mathcal{O}(\varepsilon^{-2}r)}$ and $\mathcal{N}_{t}$ can be enumerated in time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ . *

Proof.

By Lemma 17, for each vertex $u$ of $H$ the number of normal functions on $\Pi_{u}$ is at most $(\delta^{-1}r)^{2}\cdot 2^{\mathcal{O}(\delta^{-1}L)}=n^{\mathcal{O}(\varepsilon^{-2}r)}$ . Observe that $\Pi_{t}$ is the union of at most $6$ sets of the form $\Pi_{u}$ , for vertices $u$ that are endpoints of edges dual to the arcs $\beta(t)$ . Hence every normal function on $\Pi_{t}$ can be described by a $6$ -tuple of such functions on sets of the form $\Pi_{u}$ for $u$ as above. Thus, we have $|\mathcal{N}_{t}|\leqslant n^{\mathcal{O}(\varepsilon^{-2}r)}$ as well. Moreover, since normal functions on $\Pi_{u}$ can be enumerated in time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ for each vertex $u$ , to enumerate $\mathcal{N}_{t}$ it suffices to enumerate all $6$ -tuples of functions as above, and filter out those $6$ -tuples whose union is either ill-defined or is not Lipschitz with slack $\delta$ . This takes time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ . $\square$

Now, for every $t\in V(T)$ and pair $\eta=(\mathsf{pred},\mathsf{req})\in\widetilde{\mathcal{N}}_{t}\times\widetilde{\mathcal{N}}_{t}$ , we define the instance $K_{t}(\eta)$ of the generalized problem as follows:

[TABLE]

Before the explaining how these instances are going to be solved using dynamic programming, let us verify that the subproblem at the root of $T$ corresponds to the instance $J_{j}$ that we are trying to (approximately) solve.

Lemma 24.

Suppose $t_{0}$ is the root of $T$ and, noting that $\Pi_{t_{0}}=\emptyset$ , we let $K=K_{t_{0}}((\emptyset,\emptyset))$ . Then, for any $\lambda\geqslant 0$ , every $\lambda$ -near feasible solution $R$ to $K$ satisfies

[TABLE]

In particular, we have

[TABLE]

Proof.

The first assertion follows immediately by observing that the formulas for $\mathsf{cost}(R;J_{j})$ and $\mathsf{cost}(R;K)$ are the same, because there are no portals in $K$ . The second assertion follows immediately from the first by observing that every solution $R$ to $K$ is $\lambda$ -near feasible for any $\lambda\geqslant 0$ , because in $K$ there are no portals. $\square$

Computing transitions.

We first show that the subproblems in the leaves of $T$ can be solved in polynomial time. For this, we use the following lemma.

Lemma 25.

*There is an algorithm that given an instance $K=(C^{\diamond},\Pi,\mathsf{pred},\mathsf{req})$ of the generalized problem and $\lambda\geqslant 0$ , finds the least expensive $\lambda$ -near feasible solution to $K$ in time $3^{|\Pi|+k}\cdot n^{\mathcal{O}(1)}$ , where $k$ is the total number of distinct vertices on which the clients of $C^{\diamond}$ are placed. *

Proof.

Let $W$ be the set of distinct vertices on which $C^{\diamond}$ are placed, and for $u\in W$ let $\gamma(u)$ be the number of clients placed at vertex $u$ . We perform standard dynamic programming over subsets of $\Pi$ and of $W$ , where we keep track of the cost of connecting any subset of portals and any subset of vertices of $W$ , while introducing candidate facilities one by one. Precisely, let $f_{1},\ldots,f_{p}$ be the facilities of $F^{\diamond}$ , enumerated in any order. Then for every $i\in\{0,1,\ldots,p\}$ , $A\subseteq\Pi$ , and $B\subseteq W$ , define value $\mathsf{dp}[i,A,B]$ to be the smallest cost of a $\lambda$ -near feasible solution contained in $\{f_{1},f_{2},\ldots,f_{i}\}$ , where in the near-feasibility check we consider only requests of portals from $A$ , and in the connection cost computation we consider only clients placed at vertices from $B$ . Then it is easy to see that the function $\mathsf{dp}[\cdot,\cdot,\cdot]$ satisfies the following recursive formula.

[TABLE]

Using the above formula, we can in time $3^{|\Pi|+k}\cdot n^{\mathcal{O}(1)}$ compute all the $2^{|\Pi|+k}\cdot(p+1)$ values of the function $\mathsf{dp}[\cdot,\cdot,\cdot]$ , and return $\mathsf{dp}[p,\Pi,W]$ as the sought minimum cost. A $\lambda$ -near feasible solution attaining this cost can be retrieved from dynamic programming tables by standard means within the same running time. $\square$

Corollary 26.

*Suppose $t$ is a leaf of $T$ and $\lambda\geqslant 0$ is a given real. Then, in total time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ one can compute, for each $\eta\in\mathcal{N}_{t}\times\mathcal{N}_{t}$ , the least expensive $\lambda$ -near feasible solution $R_{t,\eta}\subseteq F^{\diamond}$ to $K_{t}(\eta)$ . *

Proof.

To compute each solution $R_{t,\eta}$ , we apply the algorithm of Lemma 25 to instance $K_{t}(\eta)$ for $\eta\in\mathcal{N}_{t}\times\mathcal{N}_{t}$ and $\lambda$ . Since $t$ is a leaf of $T$ , all clients in $K_{t}(\eta)$ lie on the unique face of $L(\beta(t))$ (Lemma 20, condition (T4)), hence they are all place on distinct three vertices. Therefore, the running time used by each application of the algorithm of Lemma 25 is $3^{|\Pi_{t}|+3}\cdot n^{\mathcal{O}(1)}=n^{\mathcal{O}(\varepsilon^{-2}r)}$ . Since the number of pairs $\eta\in\mathcal{N}_{t}\times\mathcal{N}_{t}$ is $|\mathcal{N}_{t}|^{2}\leqslant n^{\mathcal{O}(\varepsilon^{-2}r)}$ , the total running time follows. $\square$

We now proceed to the main point: how to compute values for a node of $T$ based on values for its children. We first introduce even more helpful notation. For a non-leaf node $t$ of $T$ , let $\Omega_{t}=\bigcup_{t^{\prime}\in\mathsf{chld}(t)}\Pi_{t}$ ; then $\Pi_{t}\subseteq\Omega_{t}$ .

For a non-leaf node $t$ of $T$ , define

[TABLE]

For each $t^{\prime}\in\mathsf{chld}(t)$ we have a natural restriction operator $\mathsf{restrict}_{t,t^{\prime}}\colon\widetilde{\mathcal{M}}_{t}\to\widetilde{\mathcal{N}}_{t^{\prime}}$ that maps every tuple from $\widetilde{\mathcal{M}}_{t}$ to its $t^{\prime}$ -component. Next, define

[TABLE]

Operator $\mathsf{restrict}_{t,t^{\prime}}(\cdot)$ can be then regarded as an operator from $\widetilde{\mathcal{W}}_{t}$ to $\widetilde{\mathcal{U}}_{t^{\prime}}$ by considering acting coordinate-wise.

Having defined sets $\widetilde{\mathcal{M}}_{t}$ , $\widetilde{\mathcal{U}}_{t}$ , and $\widetilde{\mathcal{W}}_{t}$ , we define sets $\mathcal{M}_{t}$ , $\mathcal{U}_{t}$ , and $\mathcal{W}_{t}$ by replacing $\widetilde{\mathcal{N}}_{t}$ with $\mathcal{N}_{t}$ in the definitions. Since every node of $T$ has at most $7$ children (Lemma 20, condition (T5)), by Lemma 23 we have that $|\mathcal{M}_{t}|\leqslant n^{\mathcal{O}(\varepsilon^{-2}r)}$ and all sets $\mathcal{M}_{t}$ can be computed in time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ . Then we also have that

[TABLE]

and all the sets $\mathcal{U}_{t},\mathcal{W}_{t}$ can be computed in time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ .

We now describe tuples from $\widetilde{\mathcal{W}}_{t}$ that may be used in the dynamic programming to combine solutions from smaller subproblems into a solution to a larger subproblem. The intuition here is that when breaking a subproblem into smaller ones, we have to ensure that requests and predictions appropriately match so that solutions to smaller subproblems can be combined to a solution to the original subproblem.

Definition 27.

Consider a non-leaf node $t$ of $T$ . We shall say that a pair $\eta=(\mathsf{req},\mathsf{pred})\in\widetilde{\mathcal{U}}_{t}$ and a pair $\varphi=((\mathsf{req}_{t^{\prime}})_{t^{\prime}\in\mathsf{chld}(t)},(\mathsf{pred}_{t^{\prime}})_{t^{\prime}\in\mathsf{chld}(t)})\in\widetilde{\mathcal{W}}_{t}$ are compatible (denoted $\eta\sim\varphi$ ) if the following two conditions hold:

(C1)

For every $\pi\in\Pi_{t}$ with $\mathsf{req}(\pi)\neq+\infty$ there exists $t^{\prime}\in\mathsf{chld}(t)$ and $\rho\in\Pi_{t^{\prime}}$ such that $\mathsf{req}_{t^{\prime}}(\rho)+\mathrm{dist}(\pi,\rho)\leqslant\mathsf{req}(\pi)$ . 2. (C2)

For every $t^{\prime}\in\mathsf{chld}(t)$ and $\rho\in\Pi_{t^{\prime}}$ with $\mathsf{pred}_{t^{\prime}}(\rho)\neq+\infty$ , there either exists $\pi\in\Pi_{t}$ with $\mathsf{pred}(\pi)+\mathrm{dist}(\pi,\rho)\leqslant\mathsf{pred}_{t^{\prime}}(\rho)$ , or there exists $t^{\prime\prime}\in\mathsf{chld}(t)$ and $\rho^{\prime}\in\Pi_{t^{\prime\prime}}$ with $\mathsf{req}_{t^{\prime\prime}}(\rho^{\prime})+\mathrm{dist}(\rho^{\prime},\rho)\leqslant\mathsf{pred}_{t^{\prime}}(\rho)$ .

Observe that given $\eta\in\widetilde{\mathcal{U}}_{t}$ and $\varphi\in\widetilde{\mathcal{W}}_{t}$ , it can be verified in polynomial time whether $\eta\sim\varphi$ .

Finally, we formulate and prove two lemmas that will imply the correctness of our dynamic programming. The first one concerns combining solutions to smaller subproblems into solutions to larger subproblems. The second one concerns projecting solutions to larger subproblems to solutions to smaller subproblems.

Lemma 28.

Suppose $t$ is a non-leaf node of $T$ and let $\eta\in\widetilde{\mathcal{U}}_{t}$ and $\varphi\in\widetilde{\mathcal{W}}_{t}$ be compatible. Suppose further that, for all $t^{\prime}\in\mathsf{chld}(t)$ , $R_{t^{\prime},\eta_{t^{\prime}}}$ is a feasible solution to the instance $K_{t^{\prime}}(\eta_{t^{\prime}})$ , where $\eta_{t^{\prime}}=\mathsf{restrict}_{t,t^{\prime}}(\varphi)$ . Then

[TABLE]

is a feasible solution to the instance $K_{t}(\eta)$ and, moreover,

[TABLE]

Proof.

For brevity, we shall denote $R_{t^{\prime}}=R_{t^{\prime},\eta_{t^{\prime}}}$ and $K_{t^{\prime}}=K_{t^{\prime}}(\eta_{t^{\prime}})$ . Also, let $\eta=(\mathsf{pred},\mathsf{req})$ and $K_{t}=K_{t}(\eta)$ .

We first verify that $R$ is a feasible solution to $K_{t}$ . Take any portal $\pi\in\Pi_{t}$ with $\mathsf{req}(\pi)\neq+\infty$ . Since $\eta\sim\varphi$ , by (C1) there exists $t^{\prime}\in\mathsf{chld}(t)$ and $\rho\in\Pi_{t^{\prime}}$ such that $\mathsf{req}_{t^{\prime}}(\rho)+\mathrm{dist}(\pi,\rho)\leqslant\mathsf{req}(\pi)$ . As $R_{t^{\prime}}$ is a feasible solution to $K_{t^{\prime}}$ , there exists $f\in R_{t^{\prime}}$ such that $\mathrm{dist}(\rho,f)\leqslant\mathsf{req}_{t^{\prime}}(\rho)$ . Then $f\in R$ as well and

[TABLE]

which certifies that the request of $\pi$ is satisfied by $R$ . Hence, $R$ is indeed a feasible solution to $K_{t}$ .

We are left with proving the postulated upper bound on $\mathsf{cost}(R;K_{t})$ . Take any client $c\in C^{\diamond}_{t}$ . As $(C^{\diamond}_{t^{\prime}})_{t^{\prime}\in\mathsf{chld}(t)}$ form a partition of $C^{\diamond}_{t}$ , there exists a unique node $t^{\prime}\in\mathsf{chld}(t)$ satisfying $c\in C^{\diamond}_{t^{\prime}}$ . Then there either exists a facility $f\in R_{t^{\prime}}$ satisfying

[TABLE]

or there exists a portal $\rho\in\Pi_{t^{\prime}}$ satisfying

[TABLE]

In the former case, since $R_{t^{\prime}}\subseteq R$ we can conclude that

[TABLE]

In the latter case, by (C2) either exists $\pi\in\Pi_{t}$ with $\mathsf{pred}(\pi)+\mathrm{dist}(\pi,\rho)\leqslant\mathsf{pred}_{t^{\prime}}(\rho)$ , or there exists $t^{\prime\prime}\in\mathsf{chld}(t)$ and $\rho^{\prime}\in\Pi_{t^{\prime\prime}}$ with $\mathsf{req}_{t^{\prime\prime}}(\rho^{\prime})+\mathrm{dist}(\rho^{\prime},\rho)\leqslant\mathsf{pred}_{t^{\prime}}(\rho)$ . In the first subcase we conclude that

[TABLE]

which again establish inequality (19) in this subcase. On the other hand, in the second subcase there exists a facility $f\in R_{t^{\prime\prime}}$ with $\mathrm{dist}(\rho^{\prime},f)\leqslant\mathsf{req}_{t^{\prime\prime}}(\rho^{\prime})$ . As $f\in R$ as well, we infer that

[TABLE]

Hence, again inequality (19) is satisfied.

We conclude that in every case, inequality (19) holds. Summing this inequality through all clients $c\in C^{\diamond}_{t}$ and adding $\mathsf{open}(R)$ to both sides yields yields that $\mathsf{cost}(R;K_{t})\leqslant\sum_{t^{\prime}\in\mathsf{chld}(t)}\mathsf{cost}(R_{t^{\prime}};K_{t^{\prime}})$ , as required. $\square$

Lemma 29.

Suppose $t$ is a non-leaf node of $T$ . Suppose further that $\eta\in\widetilde{\mathcal{U}}_{t}$ is such that all predictions involved in $\eta$ are nonnegative, and $R$ is a $\lambda$ -near feasible $\gamma$ -close solution to $K_{t}(\eta)$ , for some reals $\lambda,\gamma>0$ . Then there exist $\varphi\in\widetilde{\mathcal{W}}_{t}$ that is compatible with $\eta$ and $(\lambda+5\delta)$ -near feasible $(\gamma+5\delta)$ -close solutions $R_{t^{\prime},\eta_{t^{\prime}}}\subseteq R$ to instances $K_{t^{\prime}}(\eta_{t^{\prime}})$ for $t^{\prime}\in\mathsf{chld}(t)$ , where $\eta_{t^{\prime}}=\mathsf{restrict}_{t,t^{\prime}}(\varphi)$ , such that

[TABLE]

*Moreover, all request and prediction functions involved in $\varphi$ are $(\delta,-\lambda-5\delta,\gamma+4\delta,\delta)$ -normal, and all predictions involved in $\varphi$ are nonnegative. *

Proof.

Denote $K_{t}=K_{t}(\eta)$ and $\eta=(\mathsf{pred},\mathsf{req})$ . For each $t^{\prime}\in\mathsf{chld}(t)$ , let

[TABLE]

Then $(R_{t^{\prime}})_{t^{\prime}\in\mathsf{chld}(t)}$ form a partition of $R$ .

For any $t^{\prime}\in\mathsf{chld}(t)$ and $\rho\in\Pi_{t^{\prime}}$ , we shall say that $\rho$ is facility-important if

•

there exists a facility $f\in R_{t^{\prime}}$ and a client $c\in C^{\diamond}$ served by $f$ in $R$ such that $\mathrm{dist}(c,\rho)+\mathrm{dist}(\rho,f)\leqslant\mathrm{dist}(c,f)+4\delta$ ; or

•

there exists a facility $f\in R_{t^{\prime}}$ and portal $\pi\in\Pi_{t}$ with $\mathsf{req}(\pi)\neq+\infty$ served by $f$ in $R$ such that $\mathrm{dist}(\pi,\rho)+\mathrm{dist}(\rho,f)\leqslant\mathrm{dist}(\pi,f)+2\delta$ .

Further, $\rho$ is client-important if

•

there exists a client $c\in C^{\diamond}_{t^{\prime}}$ and a facility $f\in R$ that serves $c$ in $R$ such that $\mathrm{dist}(c,\rho)+\mathrm{dist}(\rho,f)\leqslant\mathrm{dist}(c,f)+2\delta$ ; or

•

there exists a client $c\in C^{\diamond}_{t^{\prime}}$ and a portal $\pi\in\Pi_{t}$ that serves $c$ in $R$ such that $\mathrm{dist}(c,\rho)+\mathrm{dist}(\rho,\pi)\leqslant\mathrm{dist}(c,\pi)+2\delta$ .

We observe the following.

Claim 3.

Let $\rho\in\Pi_{t^{\prime}}$ for some $t^{\prime}\in\mathsf{chld}(t)$ . If $\rho$ is facility-important, then

[TABLE]

If $\rho$ is client-important, then

[TABLE]

Proof.

Recall that $R$ is $\gamma$ -close in $K_{t}$ . When $\rho$ is facility-important due to the first alternative in the definition, we have

[TABLE]

here and in the following, we assume notation from the definition. Also, when $\rho$ is facility-important due to the second alternative, we have

[TABLE]

Now, if $\rho$ is client-important due to the first alternative in the definition, then we have

[TABLE]

Also, when $\rho$ is facility-important due to the second alternative, we have

[TABLE]

This concludes the proof. $\lrcorner$

For a real $x$ , let $\mathsf{round}^{\downarrow}(x)$ be the largest integer multiple of $\delta$ that is not larger than $x$ , and $\mathsf{round}^{\uparrow}(x)$ be the smallest integer multiple of $\delta$ that not smaller than $x$ . That is,

[TABLE]

We now define $\varphi=(\mathsf{pred}_{t^{\prime}},\mathsf{req}_{t^{\prime}})_{t^{\prime}\in\mathsf{chld}(t)}$ . Consider any $t^{\prime}\in\mathsf{chld}(t)$ and $\rho\in\Pi_{t^{\prime}}$ . We put

[TABLE]

Clearly, functions $\mathsf{req}_{t^{\prime}}(\cdot)$ and $\mathsf{pred}_{t^{\prime}}(\cdot)$ are $\delta$ -discrete and, as functions of $\rho$ under rounding are Lipschitz, they are also Lipschitz with slack $\delta$ . We are left with verifying that these functions are also $[-\lambda-5\delta,\gamma+4\delta]$ -bounded, $\eta$ and $\varphi$ are compatible, $R_{t^{\prime}}$ is a $(\lambda+5\delta)$ -near feasible $(\gamma+5\delta)$ -close solution to $K_{t^{\prime}}$ for each $t^{\prime}\in\mathsf{chld}(t)$ , where $K_{t^{\prime}}=K_{t^{\prime}}(\eta_{t^{\prime}})$ , and that the postulated lower bound on $\mathsf{cost}(R;K_{t})$ holds. We prove these properties in the following claims.

Claim 4.

*For each $t^{\prime}\in\mathsf{chld}(t)$ , the function $\mathsf{req}_{t^{\prime}}(\cdot)$ is $[-\lambda-5\delta,\gamma]$ -bounded and the function $\mathsf{pred}_{t^{\prime}}(\cdot)$ is $[0,\gamma+4\delta]$ -bounded. *

Proof.

First, take any $\rho\in\Pi_{t^{\prime}}$ that is facility-important (as otherwise $\mathsf{req}_{t^{\prime}}(\rho)=+\infty$ anyway). Then $\mathsf{req}_{t^{\prime}}(\rho)\geqslant-\lambda-5\delta$ by definition and $\mathsf{req}_{t^{\prime}}(\rho)\leqslant\gamma$ by Claim 3. Next, take any $\rho\in\Pi_{t^{\prime}}$ that is client-important (as otherwise $\mathsf{pred}_{t^{\prime}}(\rho)=+\infty$ anyway). Then $\mathsf{pred}_{t^{\prime}}(\rho)\geqslant 2\delta$ by definition and $\mathsf{pred}_{t^{\prime}}(\rho)\leqslant\gamma+4\delta$ by Claim 3. $\lrcorner$

Claim 5.

*It holds that $\eta$ and $\varphi$ are compatible. *

Proof.

We first verify condition (C1). Take any $\pi\in\Pi_{t}$ with $\mathsf{req}(\pi)\neq+\infty$ . Since $R$ is a $\lambda$ -near feasible solution to instance $K_{t}$ , there exists $f\in R$ such that

[TABLE]

Then $f\in R_{t^{\prime}}$ for some $t^{\prime}\in\mathsf{chld}(t)$ , and in particular $\xi(f)\in L(\beta(t^{\prime}))$ . By Lemma 22, there exists a portal $\rho\in\Pi_{t^{\prime}}$ such that

[TABLE]

In particular $\rho$ is facility-important, so combining the above with the definition of $\mathsf{req}_{t^{\prime}}(\rho)$ we obtain

[TABLE]

this directly implies (C1).

We now verify condition (C2). Take any $\rho\in\Pi_{t^{\prime}}$ for any $t^{\prime}\in\mathsf{chld}(t)$ with $\mathsf{pred}_{t^{\prime}}(\rho)\neq+\infty$ . Then $\rho$ is client-important, so there exists a client $c\in C^{\diamond}_{t^{\prime}}$ and either a facility $f\in R$ serving $c$ and satisfying $\mathrm{dist}(c,\rho)+\mathrm{dist}(\rho,f)\leqslant\mathrm{dist}(c,f)+2\delta$ , or a portal $\pi\in\Pi_{t}$ serving $c$ such that $\mathrm{dist}(c,\rho)+\mathrm{dist}(\rho,\pi)\leqslant\mathrm{dist}(c,\pi)+2\delta$ . We consider these two cases separately.

Suppose the first case holds. Since $f$ serves $c$ in $R$ , for any $\pi^{\prime}\in\Pi_{t}$ and $f^{\prime}\in R$ , we have

[TABLE]

Then we also have

[TABLE]

and similarly

[TABLE]

Therefore, by the definition of $\mathsf{pred}_{t^{\prime}}(\rho)$ , we have

[TABLE]

As $f\in R$ , there exists $t^{\prime\prime}\in\mathsf{chld}(t)$ such that $f\in R_{t^{\prime\prime}}$ . Then, by Lemma 22, there is a portal $\rho^{\prime}\in\Pi_{t^{\prime\prime}}$ such that

[TABLE]

We note that

[TABLE]

implying that $\rho^{\prime}$ is facility-important. Therefore, by the definition of $\mathsf{req}_{t^{\prime\prime}}(\rho^{\prime})$ we infer that

[TABLE]

Combining all the above we infer that

[TABLE]

which establishes (C2) in this case.

Suppose now the second case holds. Since $\pi$ serves $c$ in $R$ , for any $\pi^{\prime}\in\Pi_{t}$ and $f^{\prime}\in R$ , we have

[TABLE]

Using the same reasoning as in the first case, but considering expression $\mathrm{dist}(c,\pi)+\mathsf{pred}(\pi)$ instead of $\mathrm{dist}(c,f)$ , we infer that

[TABLE]

which establishes (C2) in this case as well. $\lrcorner$

For the next claim, recall that $(C^{\diamond}_{t^{\prime}})_{t^{\prime}\in\mathsf{chld}(t)}$ form a partition of $C^{\diamond}_{t}$ .

Claim 6.

Let $c\in C^{\diamond}_{t}$ and let $t^{\prime}\in\mathsf{chld}(t)$ be the unique node satisfying $c\in C^{\diamond}_{t^{\prime}}$ . Then the following holds.

[TABLE]

Proof.

By the definition of $\mathsf{conn}_{K_{t}}(c,R)$ , there either exists a portal $\pi\in\Pi_{t}$ such that

[TABLE]

or there exists a facility $f\in R$ such that

[TABLE]

Suppose the first case holds. By Lemma 22, there exists a portal $\rho\in\Pi_{t^{\prime}}$ such that

[TABLE]

In particular, $\rho$ is facility-important. By the definition of $\mathsf{pred}_{t^{\prime}}(\rho)$ , we have

[TABLE]

By combining the above we conclude that

[TABLE]

This establishes (21) in this case.

Now suppose the second case holds. Since $(R_{t^{\prime}})_{t^{\prime}\in\mathsf{chld}(t)}$ is a partition of $R$ , there exists $t^{\prime\prime}\in\mathsf{chld}(t)$ such that $f\in R_{t^{\prime\prime}}$ . If $t^{\prime\prime}=t^{\prime}$ , then we have

[TABLE]

so (21) indeed holds in this situation. Assume then that $t^{\prime\prime}\neq t^{\prime}$ . By Lemma 22, there exists a portal $\rho\in\Pi_{t^{\prime}}$ such that

[TABLE]

In particular, $\rho$ is facility-important. By the definition of $\mathsf{pred}_{t^{\prime}}(\rho)$ , we have

[TABLE]

By combining the above we conclude that

[TABLE]

Hence, again (21) holds in this case. $\lrcorner$

Claim 7.

*It holds that $\mathsf{cost}(R;K_{t})\geqslant\sum_{t^{\prime}\in\mathsf{chld}(t)}\mathsf{cost}(R_{t^{\prime}};K_{t^{\prime}})-5\delta|C^{\diamond}_{t}|$ . *

Proof.

The claimed upper bound on $\mathsf{cost}(R;K_{t})$ follows by adding the thesis of Claim 6 through all clients $c\in C^{\diamond}_{t}$ , and adding the opening costs of facilities of $R$ to both sides. $\lrcorner$

Claim 8.

*For each $t^{\prime}\in\mathsf{chld}(t)$ , $R_{t^{\prime}}$ is a $(\lambda+5\delta)$ -near feasible $(\gamma+5\delta)$ -close solution to $K_{t^{\prime}}$ . *

Proof.

We first verify the $(\lambda+5\delta)$ -near feasibility. Take any $\rho\in\Pi_{t^{\prime}}$ with $\mathsf{req}_{t^{\prime}}(\rho)\neq+\infty$ ; then $\rho$ is facility-important. By the definition of $\mathsf{req}_{t^{\prime}}(\rho)$ , there exists a facility $f\in R_{t^{\prime}}$ such that

[TABLE]

as required.

We now verify the $(\gamma+5\delta)$ -closeness. Claim 6 asserts that for each $c\in C^{\diamond}_{t^{\prime}}$ we have

[TABLE]

which by $\gamma$ -closeness of $R$ implies that

[TABLE]

This is the first condition of the $(\gamma+5\delta)$ -closeness. For the second condition, consider any $\rho\in\Pi_{t^{\prime}}$ with $\mathsf{req}_{t^{\prime}}(\rho)\neq+\infty$ . In particular, $\rho$ is facility-important, so there exists a facility $f\in R_{t^{\prime}}$ and either a client $c\in C^{\diamond}$ served by $f$ such that $\mathrm{dist}(c,\rho)+\mathrm{dist}(\rho,f)\leqslant\mathrm{dist}(c,f)+4\delta$ , or a portals $\pi\in\Pi_{t}$ served by $f$ such that $\mathrm{dist}(\pi,\rho)+\mathrm{dist}(\rho,f)\leqslant\mathrm{dist}(\pi,f)+2\delta$ . By $\gamma$ -closeness of $R$ in $K$ , in the first case we have

[TABLE]

while in the second case we have

[TABLE]

In both cases, we conclude that $\mathrm{dist}(\rho,f)\leqslant\gamma+5\delta$ , as required. $\lrcorner$

Claims 4, 5, 7, and 8 conclude the proof. $\square$

The algorithm.

We are finally ready to present the whole algorithm. First, using the algorithm of Lemma 20 in polynomial time we compute the tree $T$ together with sets $\beta(t)$ for nodes $t$ of $T$ . For each node $t$ we compute the portal set $\Pi_{t}$ and the set of functions $\mathcal{N}_{t}$ , as explained before; this takes total time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ , since $T$ is of size $n^{\mathcal{O}(1)}$ . Sets $\mathcal{N}_{t}$ give rise to sets $\mathcal{U}_{t}$ and $\mathcal{W}_{t}$ as defined before.

The remaining, main part of the algorithm is summarized using pseudo-code as Algorithm $\mathtt{Solve}$ . We process the nodes of $T$ in a bottom-up manner. For each node $t$ , say at depth $i$ , and each $\eta\in\mathcal{U}_{t}$ , we construct the instance $K_{t}(\eta)$ and compute an $5\varepsilon$ -near feasible solution $R_{t,\eta}$ to it as follows. If $t$ is a leaf, we use the algorithm of Corollary 26 to compute the least expensive $5\varepsilon$ -near feasible solution $R_{t,\eta}$ . Otherwise, we iterate over all $\varphi\in\mathcal{W}_{t}$ such that $\eta$ and $\varphi$ are compatible, and consider all candidate solutions $R(\varphi)$ defined as

[TABLE]

Here, $R_{t^{\prime},\mathsf{restrict}_{t,t^{\prime}}(\varphi)}$ is the pre-computed soluton to the instance $K_{t^{\prime}}(\mathsf{restrict}_{t,t^{\prime}}(\varphi))$ . Out of these candidate solutions we take the least expensive one and we declare it as $R_{t,\eta}$ .

Finally, we return $R=R_{t_{0},(\emptyset,\emptyset)}$ as computed solution, where $t_{0}$ is the root of $T$ . This concludes the description of the algorithm and we are left with analyzing its running time and approximation guarantee.

Lemma 30.

*Algorithm $\mathtt{Solve}$ runs in time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ . *

Proof.

It suffices to observe that, by Corollary 26 and Lemma 28, the time spent on processing every node of $T$ is bounded by $n^{\mathcal{O}(\varepsilon^{-2}r)}$ . Since the number of nodes of $T$ is $n^{\mathcal{O}(1)}$ , the total running time follows. $\square$

Lemma 31.

Algorithm $\mathtt{Solve}$ returns a solution $R$ to the instance $J_{j}$ satisfying

[TABLE]

Proof.

Let $D\subseteq F_{j}$ be an optimum solution to the instance $J_{j}$ . By Lemma 24, $D$ is also an optimum feasible solution to the instance $K=K_{t_{0}}((\emptyset,\emptyset))$ , where $t_{0}$ is the root of $T$ , Furthermore, by Lemma 19 we infer that $D$ is $3r$ -close in $K$ .

By applying Lemma 29 in a top-down manner along the tree $T$ , we obtain, for every node $t$ of $T$ , an element $\eta_{t}\in\widetilde{\mathcal{U}}_{t}$ and a solution $D_{t}$ to the instance $K_{t}(\eta_{t})$ such that the following holds:

•

whenever $t$ is not a leaf, we have that $\varphi_{t}=(\eta_{t^{\prime}})_{t^{\prime}\in\mathsf{chld}(t)}$ is compatible with $\eta_{t}$ ;

•

$D_{t}$ is a $(5i\delta)$ -near feasible $(3r+5i\delta)$ -close solution in $K_{t}(\eta_{t})$ , where $i$ is the depth of $t$ in $T$ ;

•

all request and prediction functions involved in $\eta_{t}$ are $(\delta,-5i\delta,3r+5i\delta,\delta)$ -normal, and all prediction functions are nonnegative;

•

whenever $t$ is not a leaf, it holds that

[TABLE]

Recall that $T$ has depth at most $\log n$ . Therefore, $5i\delta\leqslant 5\varepsilon$ whenever $i$ is the depth of a node in $t$ , implying that all request and prediction functions involved in elements $\eta_{t}$ are $(\delta,-5\varepsilon,3r+5\varepsilon,\delta)$ -normal. We infer that

[TABLE]

Recall also that for each non-leaf node $t$ of $T$ , we have that $\{C^{\diamond}_{t^{\prime}}\colon t^{\prime}\in\mathsf{chld}(t)\}$ form a partition of $C^{\diamond}_{t}$ . Therefore, by combining inequalities (22) in a bottom-up manner along $T$ we infer that

[TABLE]

Again, as $i\delta\leqslant\varepsilon$ whenever $i\leqslant\log n$ , for each leaf $t$ of $T$ the solution $D_{t}$ is $5\varepsilon$ -near feasible in $K_{t}(\eta_{t})$ . Hence, due to (23) for each leaf $t$ the algorithm computes an $5\varepsilon$ -near feasible solution $R_{t}$ to $K_{t}(\eta_{t})$ satisfying

[TABLE]

For each non-leaf node $t$ of $T$ , define solution $R_{t}$ to instance $K_{t}(\eta_{t})$ by a bottom-up induction: $R_{t}=\bigcup_{t^{\prime}\in\mathsf{chld}(t)}R_{t^{\prime}}$ . Then by (23) and the fact that $\eta_{t}\sim\varphi_{t}$ for every non-leaf $t$ , we have that for each node $t$ , the algorithm computes a solution to $\eta_{t}$ of cost at most $\mathsf{cost}(R_{t};K_{t}(\eta_{t}))$ . In particular, if we denote $R=R_{t_{0}}$ , where $t_{0}$ is the root of $T$ , then the solution returned by the algorithm has cost at most $\mathsf{cost}(R;K)$ . Hence, we proceed with upper bounding $\mathsf{cost}(R;K)$ .

For each node $t$ of $T$ let us define tuples of functions $\eta^{\prime}_{t}$ and $\varphi^{\prime}_{t}$ (here, only when $t$ is not a leaf) as follows:

[TABLE]

That is, $\eta^{\prime}_{t}$ is obtained from $\eta_{t}$ by adding $5\varepsilon$ to all requests and all predictions on all portals of $\Pi_{t}$ , and similarly for $\varphi_{t}$ . Note that for each non-leaf node $t$ of $T$ , we still have the following properties:

•

$\eta^{\prime}_{t^{\prime}}=\mathsf{restrict}_{t,t^{\prime}}(\varphi^{\prime}_{t})$ for each $t^{\prime}\in\mathsf{chld}(t)$ , and

•

$\eta^{\prime}_{t}$ and $\varphi^{\prime}_{t}$ are compatible.

However, the $5\varepsilon$ shift in requests and predictions makes the following assertion hold for each leaf $t$ of $T$ :

[TABLE]

That is, we obtained feasibility instead of $5\varepsilon$ -near feasibility at the cost of increasing the cost of the solution.

Denoting $\mathsf{desc}(t)$ the set of leaves of $T$ that are descendants of $t$ , we may now apply Lemma 28 through a bottom-up induction along the tree $T$ to infer the following for each node $t$ of $T$ :

[TABLE]

In particular, assertion (27) holds for the root $t_{0}$ of $T$ . Then, we may use assertions (24), (25), and (26) to infer the following:

[TABLE]

It now suffices to use Lemma 24 to infer that $\mathsf{cost}(R;K)=\mathsf{cost}(R;J_{j})$ and $\mathsf{cost}(D;K)=\mathsf{cost}(D;J_{j})$ ; this combined with the above concludes the proof. $\square$

We now conclude the proof of Lemma 9. Apply Algorithm $\mathtt{Solve}$ to each instance $J_{j}$ for which $C_{j}$ is non-empty, yielding a solution $R_{j}$ . As the number of such instances is at most $n$ , by Lemma 30 this takes total time $n^{\mathcal{O}(\varepsilon^{-2}r)}$ . As the final solution return $R=S\cup\bigcup_{j\in\mathbb{N}}R_{j}$ , where we set $R_{j}=\emptyset$ whenever $C_{j}=\emptyset$ . Then, by Lemmas 18 and 31 we have

[TABLE]

Finally, we observe that since $\mathrm{dist}(c,f)\geqslant 1$ for each client $c\in\mathsf{cluster}(f)$ , we have

[TABLE]

Therefore, we conclude that

[TABLE]

It now remains to apply Corollary 14 to infer the same inequality for instance $J$ instead of $J^{\prime}$ , and to rescale $\varepsilon$ by a multiplicative factor of $11$ .

Bibliography10

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] V. Arya, N. Garg, R. Khandekar, A. Meyerson, K. Munagala, and V. Pandit. Local search heuristics for k-median and facility location problems. SIAM J. Comput. , 33(3):544–562, 2004.
2[2] T. M. Chan and S. Har-Peled. Approximation algorithms for maximum independent set of pseudo-disks. Discrete & Computational Geometry , 48(2):373–392, 2012.
3[3] V. Cohen-Addad, P. N. Klein, and C. Mathieu. Local search yields approximation schemes for k 𝑘 k -means and k 𝑘 k -median in Euclidean and minor-free metrics. In FOCS 2016 , pages 353–364. IEEE Computer Society, 2016.
4[4] D. Eisenstat, P. N. Klein, and C. Mathieu. Approximating k 𝑘 k -center in planar graphs. In SODA 2014 , pages 617–627, 2014.
5[5] E. Fox-Epstein, P. N. Klein, and A. Schild. Embedding planar graphs into low-treewidth graphs with applications to efficient approximation schemes for metric problems. In SODA 2019 , pages 1069–1088, 2019.
6[6] S. Guha and S. Khuller. Greedy strikes back: Improved facility location algorithms. J. Algorithms , 31(1):228–248, 1999.
7[7] D. S. Hochbaum. Heuristics for the fixed cost median problem. Math. Program. , 22(1):148–162, 1982.
8[8] K. Jain and V. Vazirani. Approximation algorithms for metric facility location and k -median problems using the primal-dual schema and Lagrangian relaxation. J. ACM , 48(2):274–296, 2001.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A Polynomial-Time Approximation Scheme for Facility Location

1 Introduction

Theorem 1**.**

2 Reducing to the constant scope of the average costs

Setup.

Robust approximate solution.

Lemma 2**.**

Proof.

Lemma 3**.**

Proof.

Lemma 4**.**

Proof.

Corollary 5**.**

Proof.

Concentrating the clusters.

Lemma 6**.**

Proof.

Corollary 7**.**

Proof.

Lemma 8**.**

Proof.

Layering on magnitudes of the average cost.

Lemma 9**.**

Lemma 10**.**

Proof.

Claim 1**.**

Proof.

Lemma 11**.**

Proof.

3 Dynamic programming algorithm

3.1 Overview

3.2 Proof of Lemma 9

Lemma 12**.**

Proof.

Lemma 13**.**

Proof.

Corollary 14**.**

Proof.

Lemma 15**.**

Proof.

Lemma 16**.**

Proof.

Lemma 17**.**

Proof.

Lemma 18**.**

Proof.

Lemma 19**.**

Proof.

Getting a suitable decomposition.

Lemma 20**.**

Proof.

Claim 2**.**

Proof.

Portalization.

Lemma 21**.**

Proof.

Defining subproblems.

Dynamic programming states.

Lemma 22**.**

Proof.

Lemma 23**.**

Proof.

Lemma 24**.**

Proof.

Computing transitions.

Lemma 25**.**

Proof.

Corollary 26**.**

Proof.

Definition 27**.**

Lemma 28**.**

Proof.

Lemma 29**.**

Proof.

Theorem 1.

Lemma 2.

Lemma 3.

Lemma 4.

Corollary 5.

Lemma 6.

Corollary 7.

Lemma 8.

Lemma 9.

Lemma 10.

Claim 1.

Lemma 11.

Lemma 12.

Lemma 13.

Corollary 14.

Lemma 15.

Lemma 16.

Lemma 17.

Lemma 18.

Lemma 19.

Lemma 20.

Claim 2.

Lemma 21.

Lemma 22.

Lemma 23.

Lemma 24.

Lemma 25.

Corollary 26.

Definition 27.

Lemma 28.

Lemma 29.

Claim 3.

Claim 4.

Claim 5.

Claim 6.

Claim 7.

Claim 8.

Lemma 30.

Lemma 31.