Network Utility Maximization under Maximum Delay Constraints and   Throughput Requirements

Qingyu Liu; Haibo Zeng; Minghua Chen

arXiv:1812.06169·cs.NI·December 27, 2018

Network Utility Maximization under Maximum Delay Constraints and Throughput Requirements

Qingyu Liu, Haibo Zeng, Minghua Chen

PDF

Open Access

TL;DR

This paper addresses the complex problem of optimizing network utility with maximum delay and throughput constraints, introducing a polynomial-time approximation algorithm that balances utility maximization with constraint violations, demonstrated through extensive simulations.

Contribution

It presents PASS, a novel polynomial-time approximation algorithm for network utility maximization under delay and throughput constraints, with theoretical guarantees and practical effectiveness.

Findings

01

PASS achieves up to 100% utility improvement over existing methods.

02

PASS relaxes delay and throughput constraints within acceptable ratios for practical use.

03

Extensive simulations validate PASS's effectiveness in supporting video conferencing traffic.

Abstract

We consider the problem of maximizing aggregate user utilities over a multi-hop network, subject to link capacity constraints, maximum end-to-end delay constraints, and user throughput requirements. A user's utility is a concave function of the achieved throughput or the experienced maximum delay. The problem is important for supporting real-time multimedia traffic, and is uniquely challenging due to the need of simultaneously considering maximum delay constraints and throughput requirements. We first show that it is NP-complete either (i) to construct a feasible solution strictly meeting all constraints, or (ii) to obtain an optimal solution after we relax maximum delay constraints or throughput requirements up to constant ratios. We then develop a polynomial-time approximation algorithm named PASS. The design of PASS leverages a novel understanding between non-convex…

Figures9

Click any figure to enlarge with its caption.

Tables2

Table 1. Table 1. Compare our work with existing studies.

Maximization Objective

Constraints

Networking Setting

Aggregate Throughput-

Based Utilities

Aggregate Maximum-

Delay-Based Utilities

Throughput

Requirements

Maximum Delay

Constraints

Multiple-Unicast

Many, e.g., (Kelly et al., 1998; Low and Lapsley, 1999; Wang et al., 2003; Palomar and Chiang, 2006)

✓

✗

✓

✗

✓

(Misra et al., 2009; Zhang et al., 2010; Correa et al., 2004, 2007; Liu et al., 2018)

✗

✓^∗

✓

✗

(Cao et al., 2017; Yu et al., 2018)

✓^∗∗

✗

✓

Out Work

✓

Table 2. Table 2. Information of ( d e , c e ) subscript 𝑑 𝑒 subscript 𝑐 𝑒 (d_{e},c_{e}) for each link e ∈ E 𝑒 𝐸 e\in E in the Amazon EC2 network (Hajiesmaili et al . , 2017 ; Liu et al . , 2016 ) , where d e subscript 𝑑 𝑒 d_{e} is link delay (in ms) and c e subscript 𝑐 𝑒 c_{e} is link capacity (in Mbps), (OR: Oregon, VA: Virginia, IR: Ireland, TO: Tokyo, SI: Singapore, SP: Sao Paulo).

	OR	VA	IR	TO	SI	SP
OR	N/A	(41,82)	(86,86)	(68,138)	(117,74)	(104,67)
VA	-	N/A	(54,72)	(101,41)	(127,52)	(82,70)
IR	-	-	N/A	(138,56)	(117,44)	(120,61)
TO	-	-	-	N/A	(45,166)	(151,41)
SI	-	-	-	-	N/A	(182,33)
SP	-	-	-	-	-	N/A

Equations124

d^{p} ≜ e \in E : e \in p \sum d_{e},

d^{p} ≜ e \in E : e \in p \sum d_{e},

x_{i}^{e} ≜ p \in P_{i} : e \in p \sum x^{p}

x_{i}^{e} ≜ p \in P_{i} : e \in p \sum x^{p}

x_{e} ≜ i = 1 \sum K x_{i}^{e} = p \in P : e \in p \sum x^{p} .

x_{e} ≜ i = 1 \sum K x_{i}^{e} = p \in P : e \in p \sum x^{p} .

∣ f_{i} ∣ ≜ p \in P_{i} \sum x^{p} = e \in Out (s_{i}) \sum x_{i}^{e} = e \in In (t_{i}) \sum x_{i}^{e},

∣ f_{i} ∣ ≜ p \in P_{i} \sum x^{p} = e \in Out (s_{i}) \sum x_{i}^{e} = e \in In (t_{i}) \sum x_{i}^{e},

M (f_{i}) ≜ p \in P_{i} : x^{p} > 0 max d^{p},

M (f_{i}) ≜ p \in P_{i} : x^{p} > 0 max d^{p},

T (f_{i}) ≜ p \in P_{i} \sum (x^{p} \cdot d^{p}) = e \in E \sum (x_{i}^{e} \cdot d_{e}) .

T (f_{i}) ≜ p \in P_{i} \sum (x^{p} \cdot d^{p}) = e \in E \sum (x_{i}^{e} \cdot d_{e}) .

(MUDT) : obj:

(MUDT) : obj:

∣ f_{i} ∣ \geq R_{i}, \forall i = 1, 2, ..., K,

X

X

(MUAT-T) : obj:

(MUAT-T) : obj:

∣ f_{i} ∣ \geq R_{i}, \forall i = 1, 2, ..., K,

(MUAT-M) : obj:

(MUAT-M) : obj:

∣ f_{i} ∣ = R_{i}, \forall i = 1, 2, ..., K,

T (\overset{ˉ}{f}_{i}) + ϵ \cdot \hat{f}_{i} \cdot M (\overset{ˉ}{f}_{i}) \leq T (\hat{f}_{i}) .

T (\overset{ˉ}{f}_{i}) + ϵ \cdot \hat{f}_{i} \cdot M (\overset{ˉ}{f}_{i}) \leq T (\hat{f}_{i}) .

U_{i}^{d} (σ \cdot a) \leq σ \cdot U_{i}^{d} (a), \forall i = 1, 2, ..., K,

U_{i}^{d} (σ \cdot a) \leq σ \cdot U_{i}^{d} (a), \forall i = 1, 2, ..., K,

\overset{ˉ}{f}_{i} \geq (1 - ϵ) \cdot R_{i}, \forall i = 1, 2, ..., K,

\overset{ˉ}{f}_{i} \geq (1 - ϵ) \cdot R_{i}, \forall i = 1, 2, ..., K,

M (\overset{ˉ}{f}_{i}) \leq D_{i} / ϵ, \forall i = 1, 2, ..., K,

\overset{ˉ}{f} = {\overset{ˉ}{f}_{1}, \overset{ˉ}{f}_{2}, ..., \overset{ˉ}{f}_{K}} \in X .

i = 1 \sum K U_{i}^{t} (\overset{ˉ}{f}_{i}) \geq (1 - ϵ) \cdot i = 1 \sum K U_{i}^{t} (∣ f_{i}^{*} ∣) .

i = 1 \sum K U_{i}^{t} (\overset{ˉ}{f}_{i}) \geq (1 - ϵ) \cdot i = 1 \sum K U_{i}^{t} (∣ f_{i}^{*} ∣) .

i = 1 \sum K U_{i}^{d} (M (\overset{ˉ}{f}_{i})) \leq \frac{1}{ϵ} \cdot i = 1 \sum K U_{i}^{d} (M (f_{i}^{*})) .

i = 1 \sum K U_{i}^{d} (M (\overset{ˉ}{f}_{i})) \leq \frac{1}{ϵ} \cdot i = 1 \sum K U_{i}^{d} (M (f_{i}^{*})) .

\overset{ˉ}{f}_{i} \geq (1 - ϵ_{m a x}) \cdot R_{i}, \forall i = 1, 2, ..., K,

\overset{ˉ}{f}_{i} \geq (1 - ϵ_{m a x}) \cdot R_{i}, \forall i = 1, 2, ..., K,

M (\overset{ˉ}{f}_{i}) \leq D_{i}, \forall i = 1, 2, ..., K,

\overset{ˉ}{f} = {\overset{ˉ}{f}_{1}, \overset{ˉ}{f}_{2}, ..., \overset{ˉ}{f}_{K}} \in X,

ϵ_{m a x} = 1 \leq i \leq K max {(\hat{f}_{i} - \overset{ˉ}{f}_{i}) / \hat{f}_{i}},

ϵ_{m a x} = 1 \leq i \leq K max {(\hat{f}_{i} - \overset{ˉ}{f}_{i}) / \hat{f}_{i}},

i = 1 \sum K U_{i}^{t} (\overset{ˉ}{f}_{i}) \geq (1 - ϵ_{m a x}) \cdot i = 1 \sum K U_{i}^{t} (∣ f_{i}^{*} ∣) .

i = 1 \sum K U_{i}^{t} (\overset{ˉ}{f}_{i}) \geq (1 - ϵ_{m a x}) \cdot i = 1 \sum K U_{i}^{t} (∣ f_{i}^{*} ∣) .

i = 1 \sum K U_{i}^{d} (M (\overset{ˉ}{f}_{i})) \leq \frac{1}{ϵ _{m i n}} \cdot i = 1 \sum K U_{i}^{d} (M (f_{i}^{*})),

i = 1 \sum K U_{i}^{d} (M (\overset{ˉ}{f}_{i})) \leq \frac{1}{ϵ _{m i n}} \cdot i = 1 \sum K U_{i}^{d} (M (f_{i}^{*})),

ϵ_{m i n} = 1 \leq i \leq K min {(\hat{f}_{i} - \overset{ˉ}{f}_{i}) / \hat{f}_{i}} .

ϵ_{m i n} = 1 \leq i \leq K min {(\hat{f}_{i} - \overset{ˉ}{f}_{i}) / \hat{f}_{i}} .

\overset{ˉ}{f}_{i} \geq R_{i}, \forall i = 1, 2, ..., K,

\overset{ˉ}{f}_{i} \geq R_{i}, \forall i = 1, 2, ..., K,

M (\overset{ˉ}{f}_{i}) \leq \frac{λ}{ϵ} \cdot D_{i}, \forall i = 1, 2, ..., K,

\overset{ˉ}{f} = {\overset{ˉ}{f}_{1}, \overset{ˉ}{f}_{2}, ..., \overset{ˉ}{f}_{K}} \in X,

λ = max {1, 1 \leq i \leq K max {M (\overset{ˉ}{f}_{i}) / M (\overset{g}{ˉ}_{i})}} .

λ = max {1, 1 \leq i \leq K max {M (\overset{ˉ}{f}_{i}) / M (\overset{g}{ˉ}_{i})}} .

i = 1 \sum K U_{i}^{t} (\overset{ˉ}{f}_{i}) \geq i = 1 \sum K U_{i}^{t} (∣ f_{i}^{*} ∣) .

i = 1 \sum K U_{i}^{t} (\overset{ˉ}{f}_{i}) \geq i = 1 \sum K U_{i}^{t} (∣ f_{i}^{*} ∣) .

i = 1 \sum K U_{i}^{d} (M (\overset{ˉ}{f}_{i})) \leq \frac{λ}{ϵ} \cdot i = 1 \sum K U_{i}^{d} (M (f_{i}^{*})) .

i = 1 \sum K U_{i}^{d} (M (\overset{ˉ}{f}_{i})) \leq \frac{λ}{ϵ} \cdot i = 1 \sum K U_{i}^{d} (M (f_{i}^{*})) .

max 1 \leq i \leq K min {U_{i}^{t} (∣ f_{i} ∣)},

max 1 \leq i \leq K min {U_{i}^{t} (∣ f_{i} ∣)},

max 1 \leq i \leq K min {- U_{i}^{d} (M (f_{i}))},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Traffic and Congestion Control · Software-Defined Networks and 5G · Caching and Content Delivery

Full text

Network Utility Maximization under Maximum Delay

Constraints and Throughput Requirements

Qingyu Liu, Haibo Zeng

Electrical and Computer EngineeringVirginia Tech

and

Minghua Chen

Information EngineeringThe Chinese University of Hong Kong

(2018)

Abstract.

We consider the problem of maximizing aggregate user utilities over a multi-hop network, subject to link capacity constraints, maximum end-to-end delay constraints, and user throughput requirements. A user’s utility is a concave function of the achieved throughput or the experienced maximum delay. The problem is important for supporting real-time multimedia traffic, and is uniquely challenging due to the need of simultaneously considering maximum delay constraints and throughput requirements. We first show that it is NP-complete either (i) to construct a feasible solution strictly meeting all constraints, or (ii) to obtain an optimal solution after we relax maximum delay constraints or throughput requirements up to constant ratios. We then develop a polynomial-time approximation algorithm named PASS. The design of PASS leverages a novel understanding between non-convex maximum-delay-aware problems and their convex average-delay-aware counterparts, which can be of independent interest and suggest a new avenue for solving maximum-delay-aware network optimization problems. Under realistic conditions, PASS achieves constant or problem-dependent approximation ratios, at the cost of violating maximum delay constraints or throughput requirements by up to constant or problem-dependent ratios. PASS is practically useful since the conditions for PASS are satisfied in many popular application scenarios. We empirically evaluate PASS using extensive simulations of supporting video-conferencing traffic across Amazon EC2 datacenters. Compared to existing algorithms and a conceivable baseline, PASS obtains up to $100\%$ improvement of utilities, by meeting the throughput requirements but relaxing the maximum delay constraints that are acceptable for practical video conferencing applications.

Network utility maximization, multiple-unicast network flow, delay-aware network optimization

††copyright: rightsretained††ccs: Mathematics of computing Network flows††ccs: Networks Network resources allocation††journalyear: 2018††copyright: acmcopyright††conference: The Twentieth International Symposium on Mobile Ad Hoc Networking and Computing; July 2–5, 2019; Catania, Italy††booktitle: Submission to MobiHoc ’19, July 2–5, 2019, Catania, Italy††price: 15.00††doi: x††isbn: x

1. Introduction

We consider a multiple-unicast communication scenario where each unicast source streams a network flow to its destination over a multi-hop network, possibly using multiple paths. We study the problem of maximizing aggregate user utilities, subject to link capacity constraints, maximum delay constraints, and user throughput requirements. A user’s utility is a concave function of the achieved throughput or the experienced maximum delay. The maximum delay denotes the maximum Source-to-Destination (S2D) delay, or equivalently the delay of the slowest S2D path that carries traffic.

Our study is motivated by the increasingly interests on supporting delay-critical traffic in various applications, e.g., video conferencing (Chen et al., 2011; Liu et al., 2016; Hajiesmaili et al., 2017). It is reported that 51 million users per month attend WebEx meetings, and 3 billion minutes of calls per day use Skype (Liu et al., 2018). Low S2D delay is vital for such video conferencing applications. As recommended by the International Telecommunication Union (ITU) (ITU, 2003), a delay less than 150ms can provide a transparent interactivity while delays above 400ms are unacceptable for video conferencing. We remark that the maximum S2D delay, instead of the average one, is a critical concern for provisioning low delay services, since there may exist traffic which experiences an arbitrarily large S2D delay even for the solution that minimizes average S2D delay performance (Liu et al., 2018). In sharp contrast, all the traffic can be streamed from its source to its destination timely following any solution that has an acceptable maximum S2D delay performance, because the maximum S2D delay is defined as an upper bound of S2D delays of all the traffic.

We consider a delay model where transmission over a link experiences a constant delay if the aggregate flow rate of the link is within a constant capacity, and unbounded delay otherwise. This model fits a number of practical applications, particularly the routing of delay-critical video conferencing traffic over inter-datacenter networks. Specifically, according to recent reports from Microsoft (Hong et al., [n. d.]) and Google (Jain et al., [n. d.]), most real-world inter-datacenter networks are characterized by sharing link bandwidth for different applications, with over-provisioned link capacities. (i) Real-world inter-datacenter networks nowadays are utilized to simultaneously support traffic from various services, some of which have stringent delay requirements (e.g., video conferencing) while others are bandwidth-hungry and less sensitive to delay (e.g., data maintenance). Link capacity is often reserved separately for different types of services depending on their characteristics. (ii) Cloud providers typically over-provision inter-datacenter link capacity by $2-3$ times on a dedicated backbone to guarantee reliability, and the average link-capacity utilizations (the aggregate utilization of applications, not the bandwidth-utilization of individual applications) for busy links are $30-60\%$ (Liu et al., 2016). As such, for applications whose traffic volume is within the reserved capacity for their types of service, queuing delays are negligible and the constant propagation delays dominate end-to-end delays, as evaluated by (Liu et al., 2016) in a realistic network of Amazon EC2. Otherwise, if the traffic volume exceeds the reserved capacity, the applications will start to experience substantial queuing delays and thus substantial end-to-end delays. These observations justify our link capacity and delay model, especially for the critical problem of routing video-conferencing traffic over real-world inter-datacenter networks.

1.1. Existing Studies

We summarize existing studies in Tab. 1. In the literature, there exist many network utility maximization studies with throughput concerns, e.g., (Kelly et al., 1998; Low and Lapsley, 1999; Wang et al., 2003; Palomar and Chiang, 2006), but less of them consider maximum delays. This is because the maximum delay of a single-unicast network flow is non-convex with the flow decision variables, and hence even a maximum-delay-aware problem in a simple networking scenario, e.g., the single-unicast maximum delay minimization problem, is NP-hard and thus challenging to solve (Misra et al., 2009).

Misra et al. (Misra et al., 2009) study the single-unicast maximum delay minimization problem subject to a throughput requirement, and design a Fully-Polynomial-Time Approximation Scheme (FPTAS). Zhang et al. (Zhang et al., 2010) generalize the FPTAS of (Misra et al., 2009) and develop an FPTAS to minimize maximum delay subject to throughput, reliability, and differential delay constraints also in the single-unicast scenario. We observe that both FPTASes require to solve flow problems iteratively in time-expanded networks, by employing a binary-search based idea applicable only in the single-unicast setting. It is thus unclear how to extend their techniques to the general multiple-unicast scenario where the utility of an unicast (user) can be a concave function with the experienced maximum delay.

Cao et al. (Cao et al., 2017) develop an FPTAS that can maximize throughputs subject to maximum delay constraints in a multiple-unicast setting. This FPTAS is generalized by Yu et al. (Yu et al., 2018) to design FPTASes for other throughput maximization problems for practical IoT applications. Similar to FPTASes proposed by (Misra et al., 2009; Zhang et al., 2010), to satisfy maximum delay constraints while optimizing throughputs, FPTASes of (Cao et al., 2017; Yu et al., 2018) require to solve flow problems iteratively in time-expanded networks, which is time-consuming. Moreover, the design of FPTASes in (Cao et al., 2017; Yu et al., 2018) leverages the primal-dual algorithm, where their primal problems and associated dual problems need to be casted as linear programs. It is unclear how to extend their technique to the general scenario where the utility of an unicast can be a concave function with the achieved throughput.

We note that there exist other maximum-delay-aware studies in the literature. However, they only develop heuristic approaches instead of approximation algorithms. For example, Liu et al. (Liu et al., 2016) target the multicast maximum delay optimization problems. Their heuristic approach suffers from two limitations: (i) the running time could be high because the number of variables increases exponentially in the network size, and (ii) there is not yet theoretical performance guarantee of the achieved solution.

Instead of modeling link delay as a constant within a capacity as in (Misra et al., 2009; Zhang et al., 2010; Cao et al., 2017; Yu et al., 2018; Liu et al., 2016), there exist studies which model the link delay as a link-flow-dependent function. For example, Correa et al. (Correa et al., 2004, 2007) minimize maximum delay with delay-function-dependent approximation ratios guaranteed. Liu et al. (Liu et al., 2018) minimize maximum delay with constant approximation ratios guaranteed. Our study models link delay as a constant within a capacity, which is the same as those in (Misra et al., 2009; Zhang et al., 2010; Cao et al., 2017; Yu et al., 2018; Liu et al., 2016), but different from the ones in (Correa et al., 2004, 2007; Liu et al., 2018). We remark that maximum-delay-aware problems are fundamentally different with these different link delay models, since it is APX-hard to minimize the single-unicast maximum delay (hence no PTAS exists unless P = NP) with the flow-dependent delay model (Correa et al., 2007), but an FPTAS111Unless P = NP, it holds that $\textsf{FPTAS}\subsetneq\textsf{PTAS}$ in that the runtime of a PTAS is required to be polynomial in problem input but not $1/\epsilon$ , while the runtime of an FPTAS is polynomial in both the problem input and $1/\epsilon$ (WIKI, [n. d.]). exists to minimize the single-unicast maximum delay with the constant delay model (Misra et al., 2009).

Overall, with the constant delay model, existing maximum-delay-aware studies focus on either the throughput-constrained maximum delay minimization problem or the maximum-delay-constrained throughput maximization problem, which are just special cases of our problem (Tab. 1). To design approximation algorithms, they rely on a technique of solving problems in expanded networks iteratively, leading to impractically high time complexities (e.g., at least $O(|E|^{3}|V|^{4}\mathcal{L})$ to minimize single-unicast maximum delay where $|V|$ is number of nodes, $|E|$ is number of links, and $\mathcal{L}$ is input size of the given problem instance (Misra et al., 2009)). It is unclear how to generalize their techniques to our multiple-unicast utility maximization scenario, where the utility of an unicast is a concave function of the achieved throughput or the experienced maximum delay. In sharp contrast, we develop an approximation algorithm for our problem of maximizing utilities, by leveraging a novel understanding between non-convex maximum-delay-aware problems and their convex average-delay-aware counterparts. Specifically, we solve an average-delay-aware problem only once in the input network, and then deletes certain flow rate from individual unicast flows, resulting in a small time complexity (e.g., $O(|E|^{3}\mathcal{L})$ to minimize single-unicast maximum delay in a dense network (Thm. 3.2).

1.2. Our Contributions

In this paper, we study a multiple-unicast flow problem of maximizing aggregate user utilities over a multi-hop network, subject to link capacity constraints, maximum delay constraints, and user throughput requirements. We make the following contributions.

$\rhd$ We prove that it is NP-complete either (i) to construct a feasible solution meeting all constraints, or (ii) to obtain an optimal solution after we relax maximum delay constraints or throughput requirements up to constant ratios, due to the need of simultaneously considering maximum delay constraints and user throughput requirements.

$\rhd$ We design an algorithm named PASS (Polynomial-time Algorithm Supporting utility-maximal flows Subject to throughput/delay constraints) for constructing approximate solutions to our problem in a polynomial time. Our design leverages a novel understanding between non-convex maximum-delay-aware problems and their convex average-delay-aware counterparts, which can be of independent interest and suggests a new avenue for solving maximum-delay-aware network optimization problems.

$\rhd$ We characterize sufficient conditions for PASS to solve our problem in a polynomial time, providing (i) a constant approximation ratio after relaxing throughput requirements and maximum delay constraints by constant ratios, or (ii) a problem-dependent approximation ratio satisfying maximum delay constraints, after relaxing throughput requirements by a problem-dependent ratio, or (iii) a problem-dependent approximation ratio satisfying throughput requirements, after relaxing maximum delay constraints by a problem-dependent ratio. We note that one can use pre-scaled maximum delay constraints or throughput requirements as the input to PASS to generate feasible solutions as the output.

$\rhd$ We observe that our characterized conditions are satisfied in many popular application settings, where PASS can be applied with strong theoretical performance guarantee. Representative settings include minimizing throughput-constrained maximum delay and maximizing maximum-delay-constrained network utility. We evaluate the empirical performance of PASS in simulations of supporting video-conferencing traffic across Amazon EC2 datacenters. Compared to existing algorithms as well as a conceivable baseline, PASS can obtain up to $100\%$ improvement of utilities, by meeting throughput requirements but relaxing maximum delay constraints that are acceptable for video conferencing applications.

2. System Model

2.1. Preliminary

We consider a multi-hop network modeled as a directed graph $G\triangleq(V,E)$ with $|V|$ nodes and $|E|$ links. Each link $e\in E$ has a constant capacity $c_{e}\geq 0$ and a constant delay $d_{e}\geq 0$ . For each link $e\in E$ , data streamed to $e$ experiences a delay of $d_{e}$ to pass it, and the rate of streaming data to $e$ must be within the capacity $c_{e}$ . We are given $K$ users, where for each user $i$ ( $i=1,2,...,K$ ), a source $s_{i}\in V$ needs to stream a single-unicast network flow to a destination $t_{i}\in V\backslash\{s_{i}\}$ , possibly using multiple paths.

We denote $P_{i}$ as the set of all simple paths from $s_{i}$ to $t_{i}$ , and $P\triangleq\cup_{i=1}^{K}P_{i}$ . For any $p\in P$ , its path delay $d^{p}$ is defined as

[TABLE]

i.e., the summation of link delays along the path. We denote a multiple-unicast network flow solution as $f\triangleq\{f_{i},i=1,2,...,K\}$ , where a single-unicast flow $f_{i}$ is defined as the assigned flow rate over $P_{i}$ , i.e., $f_{i}\triangleq\{x^{p}:x^{p}\geq 0,p\in P_{i}\}$ . For $f_{i}$ , we define

[TABLE]

as the aggregated link rate of $e\in E$ of the unicast $i$ (or the user $i$ equivalently). Similarly, we denote $x_{e}$ as the total aggregated link rate of link $e\in E$ , and

[TABLE]

We further denote the flow rate, or the throughput equivalently, achieved by a single-unicast flow $f_{i}$ by $|f_{i}|$ ,

[TABLE]

where $\textsf{Out}(v)$ (resp. In(v)) is the set of outgoing (resp. incoming) links of $v$ . The maximum delay experienced by $f_{i}$ is defined as

[TABLE]

i.e., the delay of the longest (slowest) path with positive rates from $s_{i}$ to $t_{i}$ 222We call a path $p\in P_{i}$ with $x^{p}>0$ as a flow-carrying path of $f_{i}$ .. The total delay of $f_{i}$ is defined as

[TABLE]

With $\mathcal{T}(f_{i})$ , we can easily define the average delay experienced by $f_{i}$ as $\mathcal{A}(f_{i})\triangleq\mathcal{T}(f_{i})/|f_{i}|$ , and we let $\mathcal{A}(f_{i})=0$ if $|f_{i}|=0$ .

For each $f_{i}$ , $i=1,2,...,K$ , we denote its throughput-based utility as $\mathcal{U}_{i}^{t}(|f_{i}|)$ , which is a function that rewards $f_{i}$ based on the achieved throughput. Similarly, we denote its maximum-delay-based utility as $-\mathcal{U}_{i}^{d}(\mathcal{M}(f_{i}))$ , where $\mathcal{U}_{i}^{d}(\mathcal{M}(f_{i}))$ is a function that penalizes $f_{i}$ based on the experienced maximum delay.

2.2. Problem Definition

In this paper, we study the following problem of Maximizing aggregate user Utilities subject to link capacity constraints, maximum Delay constraints, and Throughput requirements (MUDT),

[TABLE]

where $\mathcal{X}$ defines a feasible multiple-unicast flow $f$ meeting flow conservation constraints and link capacity constraints, i.e.,

[TABLE]

In formula (1), the objective (1b) (resp. (1b)) maximizes the aggregate throughput-based utilities (resp. maximum-delay-based utilities) of all the users, the throughput requirements (1e) require the throughput achieved by each user $i$ to be no smaller than $R_{i}$ , the maximum delay constraints (1e) restrict the maximum delay experienced by each user $i$ to be no greater than $D_{i}$ , and the feasibility constraint (1e) defines a feasible multiple-unicast network flow solution, meeting link capacity constraints.

In the end of this section, we give an important theorem of MUDT, which argues that it is impossible even to (i) construct a feasible solution meeting all constraints, or (ii) obtain an optimal solution meeting relaxed constraints, in a polynomial time, unless P = NP. Thus it is non-trivial to develop polynomial-time approximation algorithms for MUDT subject to relaxed constraints.

Theorem 2.1.

For MUDT, it is NP-complete (i) to construct a feasible solution that meets all constraints, or (ii) to obtain an optimal solution that meets throughput requirements but relaxes maximum delay constraints, or (iii) to obtain an optimal solution that meets maximum delay constraints but relaxes throughput requirements.

Proof.

Refer to our Appendix 7.3. ∎

3. Proposed Algorithm PASS

In this section we design an algorithm PASS for MUDT of maximizing aggregate user utilities. We characterize conditions of the input utility functions such that PASS theoretically gives approximate solutions in a polynomial time, meeting relaxed constraints.

3.1. Algorithmic Structure of PASS

We note that the non-convex maximum delays bring difficulties for solving MUDT. The key idea of our proposed PASS is to replace the non-convex maximum delays in MUDT by the convex average delays, and solve the average-delay-aware counterpart to obtain an approximate solution to MUDT in a polynomial time. (i) We denote the average-delay-aware counterpart of the MUDT that maximizes throughput-based utilities, i.e., problem (1) with an objective of (1b), as MUAT-T, with the following formulation

[TABLE]

(ii) Similarly, we denote the average-delay-aware counterpart of the MUDT that maximizes maximum-delay-based utilities, i.e., problem (1) with an objective of (1b), as MUAT-M. MUAT-M has the following formulation

[TABLE]

Algorithm 1 describes the details of PASS. It first solves the average-delay-aware counterpart of the MUDT and obtain the corresponding multiple-unicast flow solution $f=\{f_{i},i=1,2,...,K\}$ (line 5). Next for each $i=1,2,...,K$ , we delete a rate of $\epsilon\cdot|f_{i}|$ iteratively from the slowest flow-carrying paths of $f_{i}$ (line 8). In the end, the remaining flow is the solution returned by PASS.

3.2. PASS can Solve MUDT Approximately, Meeting Relaxed Constraints

Now we give an important lemma which will be used later to prove the approximation ratio of our PASS.

Lemma 3.1.

In Algorithm 1 with an arbitrary $\epsilon\in(0,1)$ , suppose $\hat{f}=\{\hat{f}_{i},i=1,2,...,K\}$ is the solution to the average-delay-aware counterpart of MUDT (solution achieved in line 5), and suppose $\bar{f}=\{\bar{f}_{i},i=1,2,...,K\}$ is the solution returned in the end (the remaining solution achieved in line 14). For any $i=1,2,...,K$ , we have

[TABLE]

Proof.

Refer to our Appendix 7.1. ∎

Lem. 3.1 implies that $\epsilon\cdot\mathcal{M}(\bar{f}_{i})\leq\mathcal{A}(\hat{f}_{i}),\forall i=1,2,...,K$ , i.e., the maximum delay of each single-unicast flow after deleting rate is bounded by a constant ratio as compared to the average delay of the corresponding single-unicast flow before deleting rate. With this critical observation that relates the non-convex maximum delays with the convex average delays, we can characterize conditions for PASS to solve MUDT approximately in a polynomial time.

Theorem 3.2.

Given a feasible problem (1), suppose we use PASS (Algorithm 1) with an arbitrary $\epsilon\in(0,1)$ to solve it. If the problem is feasible, meeting all conditions below

(1)

*for each $i=1,2,...,K$ , for an arbitrary $a\geq 0$ , $\mathcal{U}_{i}^{t}(a)$ is concave, non-decreasing, and non-negative with $a$ , $\mathcal{U}_{i}^{d}(a)$ is convex, non-decreasing, and non-negative with $a$ , * 2. (2)

for an arbitrary $a\geq 0$ , the following holds given any $\sigma\geq 1$

[TABLE]

then PASS must return a solution $\bar{f}=\{\bar{f}_{i},i=1,...,K\}$ in a polynomial time, meeting the following relaxed constraints

[TABLE]

Suppose $f^{*}=\{f_{i}^{*},i=1,2,...,K\}$ is the optimal solution to the problem (1). If the throughput-based utility maximization (1b) is the objective, $\bar{f}$ provides the following approximation ratio

[TABLE]

If the maximum-delay-based utility maximization (1b) is the objective, $\bar{f}$ provides the following approximation ratio

[TABLE]

Proof.

Refer to our Appendix 7.2. ∎

It is clear that PASS provides a constant approximation ratio, at the cost of violating throughput requirements (1e) by a constant ratio of $(1-\epsilon)$ , and violating maximum delay constraints (1e) by a constant ratio of $1/\epsilon$ . For certain applications, the throughput requirements or the maximum delay constraints are hard constraints that cannot be violated. We note that one can use pre-scaled maximum delay constraints and throughput requirements as the input to PASS to generate feasible solutions as the output. Moreover, in the following, by slightly modifying PASS, we respectively develop (i) an algorithm PASS-M to achieve approximate solutions that can strictly meet maximum delay constraints, and (ii) an algorithm PASS-T to achieve approximate solutions that can strictly meet throughput requirements.

3.3. Modify PASS to Strictly Meet Maximum Delay Constraints

We introduce PASS-M in Algorithm 2. Similar to PASS, PASS-M first solves the average-delay-aware counterpart of MUDT. But different from PASS that deletes $\epsilon\cdot|f_{i}|$ rate from slowest flow-carrying paths of each $f_{i}$ , PASS-M deletes rate from slowest flow-carrying paths of $f_{i}$ till the maximum delay of $f_{i}$ strictly meets the constraint $D_{i}$ . In the following theorem, we prove that PASS-M can obtain a solution with a problem-dependent approximation ratio.

Theorem 3.3.

Given a feasible problem (1), suppose it meets all conditions in Thm. 3.2. Suppose we use PASS-M (Algorithm 2) to solve it. Then PASS-M must return a solution $\bar{f}=\{\bar{f}_{i},i=1,2,...,K\}$ in a polynomial time, meeting the following relaxed constraints

[TABLE]

where $\epsilon_{\max}$ is defined as follows

[TABLE]

where $\hat{f}=\{\hat{f}_{i},i=1,2,...,K\}$ is the optimal solution to the average-delay-aware problem in line 4 of Algorithm 2. Suppose $f^{*}=\{f_{i}^{*},i=1,2,...,K\}$ is the optimal solution to problem (1). If the throughput-based utility maximization (1b) is the objective, $\bar{f}$ provides the following approximation ratio

[TABLE]

If the maximum-delay-based utility maximization (1b) is the objective, $\bar{f}$ provides the following approximation ratio

[TABLE]

where $\epsilon_{\min}$ is defined as follows

[TABLE]

Proof.

Refer to our Appendix 7.4. ∎

Comparing Thm. 3.2 of PASS with Thm. 3.3 of PASS-M, to solve MUDT, (i) PASS achieves a solution with a constant approximation ratio, at the cost of violating both throughput requirements and maximum delay constraints by constant ratios, while (ii) PASS-M obtains a solution with a problem-dependent approximation ratio, strictly meeting maximum delay constraints, but at the cost of violating throughput requirements by a problem-dependent ratio.

3.4. Modify PASS to Strictly Meet Throughput Requirements

In order to strictly meet throughput requirements, our PASS-T suggest to use the optimal solution to the average-delay-aware counterpart of MUDT directly as a solution to the maximum-delay-aware problem MUDT, i.e.,

$\rhd$ PASS-T: directly solve the average-delay-aware counterpart of the problem (1).

Theorem 3.4.

Given a feasible problem (1), suppose it meets all conditions in Thm. 3.2. We denote $\bar{g}=\{\bar{g}_{1},\bar{g}_{2},...,\bar{g}_{K}\}$ as the solution returned if we use PASS (Algorithm 1) to solve it with an $\epsilon\in(0,1)$ . Now suppose we use PASS-T to solve the problem (1). Then PASS-T must return a solution $\bar{f}=\{\bar{f}_{i},i=1,2,...,K\}$ in a polynomial time, meeting the following relaxed constraints

[TABLE]

where $\lambda$ is defined as follows

[TABLE]

Suppose $f^{*}=\{f_{i}^{*},i=1,2,...,K\}$ is the optimal solution to problem (1). If the throughput-based utility maximization (1b) is the objective, $\bar{f}$ provides the following approximation ratio

[TABLE]

If the maximum-delay-based utility maximization (1b) is the objective, $\bar{f}$ provides the following approximation ratio

[TABLE]

Proof.

Refer to our Appendix 7.5. ∎

Thm. 3.4 suggests that we can figure out an approximation ratio of PASS-T with the knowledge of an arbitrary solution of PASS. Comparing Thm. 3.2 of PASS with Thm. 3.4 of PASS-T, in order to solve MUDT, (i) PASS achieves a solution with a constant approximation ratio, at the cost of violating both throughput requirements and maximum delay constraints by constant ratios, while (ii) PASS-T obtains a solution with a problem-dependent approximation ratio, strictly meeting throughput requirements, but at the cost of violating maximum delay constraints by a problem-dependent ratio.

3.5. Our Proposed Algorithms Can Solve Other Maximum-Delay-Aware Problems

As shown in problem (1), MUDT has an objective of either (1b) or (1b), both of which maximize aggregate user utilities. Differently, another two representative user-utility-sensitive objectives are

[TABLE]

both of which maximize worst user utilities. Following same proof to Thm. 3.2, Thm. 3.3, and Thm. 3.4, it is easy to verify that as long as the conditions in Thm. 3.2 are satisfied, we can use PASS, PASS-M, and PASS-T to solve the problem with an objective of either (14a) or (14b), subject to throughput requirements (1e), maximum delay constraints (1e), and feasibility constraints (1e), approximately in a polynomial time. Our design of PASS suggests a new avenue for solving maximum-delay-aware network optimization problems.

Overall in this section, we design PASS to solve the maximum-delay-aware problem MUDT approximately in a polynomial time under practical conditions. PASS solves the average-delay-aware counterpart of MUDT only once in the input network, and then deletes certain flow rate from slowest flow-carrying paths to obtain solutions with theoretical performance guarantee. Note again that in sharp contrast, existing maximum-delay-aware problems either minimize throughput-constrained maximum delay or maximize maximum-delay-constrained throughput, which are special cases of our problem MUDT. They rely on a time-consuming technique of solving problems iteratively in the time-expanded network to provide approximate solutions. Our PASS leverages a novel understanding between non-convex maximum-delay-aware problems and their convex average-delay-aware counterparts, which can be of independent interest and suggest a new avenue for solving maximum-delay-aware network optimization problems.

4. Popular Delay-/Throughput- Aware Network Communication Scenarios

In this section we introduce several popular network communication settings that are sensitive both to the throughputs and to the maximum delays. Although associated problems are all NP-hard, we observe that they are all special cases of MUDT, and all satisfy conditions introduced in Thm. 3.2, and hence can be solved by PASS, PASS-M, and PASS-T approximately with strong theoretical performance guarantee in a polynomial time.

4.1. Throughput-Constrained Maximum Delay Minimization

The Throughput-Constrained maximum Delay Minimization problem (TCDM) aims to find a network flow to minimize the weighted summation of maximum delays of all users, subject to link capacity constraints and throughput requirements.

[TABLE]

where in the objective (15a) a non-negative weight $w_{i}\geq 0$ is associated with the maximum delay of $f_{i}$ for each $i=1,2,...,K$ .

TCDM is NP-hard, since as its special case when $K=1$ , the single-unicast maximum delay minimization problem is known to be NP-hard (Misra et al., 2009). Maximum delay minimization problems similar to TCDM have been studied in (Misra et al., 2009; Zhang et al., 2010; Correa et al., 2004, 2007; Liu et al., 2018). It is clear that TCDM satisfies our conditions introduced in Thm. 3.2. Therefore, by replacing the non-convex maximum delays with the convex average delays, we can get the average-delay-aware counterpart formulated in the way of problem (3), and thus can either (i) use PASS to solve TCDM with a constant approximation ratio while violating throughput requirements also by a constant ratio (see Thm. 3.2), or (ii) use PASS-T to solve TCDM with a problem-dependent approximation ratio, strictly meeting throughput requirements (see Thm. 3.4).

4.2. Maximum-Delay-Constrained Throughput-Based Utility Maximization

The maximum-Delay-Constrained throughput-based Utility Maximization (DCUM) problem aims to find a network flow to maximize aggregate user utilities, subject to link capacity constraints and maximum delay constraints. It has the following formulation.

[TABLE]

DCUM is NP-hard, because as its special case when $K=1$ and $\mathcal{U}_{1}^{t}|f_{1}|=|f_{1}|$ , the problem can be proved to be NP-hard following a similar proof as introduced in the Appendix of (Misra et al., 2009). Throughput-based utility maximization problems similar to DCUM have been studied in (Cao et al., 2017; Yu et al., 2018). Due to practical concerns, it is fair to assume that the throughput-based utility function of each user is concave, non-decreasing, and non-negative with the achieved throughput, thus meeting conditions introduced in our Thm. 3.2. After replacing the non-convex maximum delays with the convex average delays, we can get the average-delay-aware counterpart formulated in the way of problem (2), and thus can either (i) use PASS to solve DCUM with a constant approximation ratio while violating maximum delay constraints also by a constant ratio (see Thm. 3.2), or (ii) use PASS-M to solve DCUM with a problem-dependent approximation ratio, strictly meeting maximum delay constraints (see Thm. 3.3).

5. Performance Evaluation

We evaluate the empirical performance of our proposed algorithms, by simulating the delay-critical video conferencing traffic over a real-world continent-scale inter-datacenter network topology of 6 globally distributed Amazon EC2 datacenters (see Fig. 1). The network is modeled as a complete undirected graph. Each undirected link is treated as two directed links that operate independently and have identical delays and capacities, a common way to model an undirected graph by a directed one, e.g. in (Grimmer and Kapoor, 2016). We set link delays and capacities according to practical evaluations on Amazon EC2 from (Hajiesmaili et al., 2017; Liu et al., 2016) (see Tab. 2). We assume two unicasts, namely $K=2$ , with $s_{1}$ to be Virginia, $t_{1}$ to be Singapore, $s_{2}$ to be Oregon, and $t_{2}$ to be Tokyo. Our test environment is an Intel Core i5 (2.40 GHz) processor with 8 GB memory running Windows 64-bit operating system. All the experiments are implemented in C++ and linear programs are solved using CPLEX (IBM, 2017).

5.1. Use PASS to Minimize Maximum Delay

We now use PASS to minimize the maximum delay, subject to link capacity constraints and throughput requirements (i.e., to solve TCDM with formula (15)). We assume $w_{1}=w_{2}$ and $R_{1}=R_{2}=R$ in the formula (15).

We compare PASS with the optimal solution, a conceivable greedy baseline, and PASS-T respectively. (i) Because link delays are all integers (see Tab. 2), the delay of any path must be an integer. Therefore, we can obtain the optimal solution minimizing the summation of maximum delays, by enumerating all possible maximum delays of individual unicasts to figure out the minimal performance such that a feasible flow exists in the time-expanded network. Note that this approach theoretically has an exponential time complexity, and is the foundation of the FPTAS (Misra et al., 2009) designed for the single-unicast maximum delay minimization problem. (ii) In order to minimize delay while satisfying throughput requirements, the baseline greedily obtains the routing solution from the unicast $1$ to the unicast $K$ one by one. In the iteration of the unicast $i$ , it assigns as much rate as possible to the shortest paths from $s_{i}$ to $t_{i}$ iteratively respecting the link capacity constraints, till the throughput requirement $R_{i}$ is satisfied. Similar heuristic approaches have been used in other delay-aware network flow studies, e.g., in (Devetak et al., 2011).

First, we evaluate the summation of maximum delays of PASS with $\epsilon$ (see Fig. 2(a)). We set $R=230$ and vary $\epsilon$ from $1\%$ to $99\%$ by a step of $1\%$ . According to the figure, (i) PASS-T obtains the optimal solution to our problem, (ii) the delay of the baseline is strictly larger than optimal, and (iii) the delay of PASS is a staircase function with $\epsilon$ . We remark that the delay of PASS can be smaller than optimal in many instances because PASS can only support $(1-\epsilon)$ -fraction of the throughput requirement, while the optimal solution achieves the minimal summation of maximum delays among network flows supporting the full throughput requirement.

Second, we evaluate the summation of maximum delays of PASS with the throughput requirement $R$ (see Fig. 2(b)). We set $\epsilon=3\%$ since a $3\%$ throughput loss is very acceptable for video conferencing with protection/recovery capabilities (Weinstein, 2008). We vary $R$ from $116$ to $239$ with a unit step. We remark that $116$ Mbps is the smallest throughput when the baseline needs multiple paths to forward it for each of the two unicasts, and $239$ Mbps is the largest throughput that can be routed. From Fig. 2(b), it is clear that PASS outputs a smaller maximum delay compared with the baseline in most instances. In average, the maximum delay of the baseline ( $402$ ) is over $11\%$ more than that of the optimal ( $362$ ) and of the PASS ( $359$ ). In the worst case ( $R\in[116,138]$ ), the maximum delay of the baseline is over $40\%$ more than that of the optimal and of the PASS. In addition, PASS-T obtains the optimal solution to our problem in most instances, except for instances where $R\in[212,223]$ .

5.2. Use PASS to Maximize Throughput

We then use PASS to maximize the throughput, subject to link capacity constraints and maximum delay constraints (i.e., to solve DCUM with formula (16)). We assume $\mathcal{U}_{1}^{t}(|f_{1}|)=|f_{1}|$ , $\mathcal{U}_{2}^{t}(|f_{2}|)=|f_{2}|$ , and $D_{1}=D_{2}=D$ in the formula (16). We compare PASS with the optimal solution, a conceivable baseline, and PASS-M, respectively. Similar to the greedy approach introduced in Sec. 5.1, the baseline assigns as much rate as possible to the shortest paths respecting both link capacity constraints and maximum delay constraints iteratively from the unicast $1$ to the unicast $K$ one by one. Besides, similar to Sec. 5.1, we can obtain the optimal solution maximizing throughput by solving multiple-unicast flow problems in the time-expanded network.

We set $D=150$ due to the following two concerns. (i) An end-to-end delay less than $150$ ms can provide a transparent interactivity for video conferencing (ITU, 2003). (ii) A delay larger than $150$ ms (as long as it is less than $400$ ms) is still acceptable for video conferencing (ITU, 2003), and hence a solution that violates the maximum delay constraint (e.g., the solution of PASS) may still be useful if it can achieve a huge amount of throughput increment.

We vary $\epsilon$ from $1\%$ to $99\%$ with a step of $1\%$ . We give the throughput results in Fig. 3(a), and the achieved maximum delay ratio results, i.e., $\max\{\mathcal{M}(f_{1}),\mathcal{M}(f_{2})\}/D$ where $f$ is the solution, in Fig. 3(b). In our simulations, both the baseline and PASS-M obtain the optimal throughput strictly meeting maximum delay constraints. For $\epsilon\leq 49\%$ , the throughput of PASS is strictly larger than the optimal, while violating maximum delay constraints (e.g., $8\%$ more than $D$ when $\epsilon=49\%$ ). For $\epsilon\geq 51\%$ , the solution of PASS meets maximum delay constraints, but the achieved throughput is strictly smaller than optimal. It is impressive that with a small $\epsilon$ , e.g., $\epsilon=1\%$ , the throughput of PASS is over $90\%$ more than optimal, while in the same time the maximum delays of PASS are less than $331$ ms which is still acceptable for video conferencing. In average, we observe a $2.0\%$ throughput increment as compared to optimal, but with a $2.2\%$ violation with the maximum delay constraints, when $\epsilon$ is decreased by $1\%$ for instances where $\epsilon\leq 49\%$ .

5.3. Use PASS to Maximize Network Utility

Finally we use PASS to maximize aggregate user utilities, subject to link capacity constraints, maximum delay constraints, and throughput requirements (i.e., to solve MUDT with formula (1)). We assume the objective is (1b) where $\mathcal{U}_{i}^{t}(|f_{i}|)=w_{i}\cdot|f_{i}|,i=1,2$ . And we assume $R_{1}=R_{2}=80$ , and $D_{1}=D_{2}=150$ in the formula (1).

We vary the weight $w_{1}$ (resp. $w_{2}$ ) from $1$ to $10$ with a step of $1$ , thus leading to $100$ simulation instances each of which is characterized by a specific $\langle w_{1},w_{2}\rangle,1\leq w_{1}\leq 10,1\leq w_{2}\leq 10$ . For each instance, we respectively run PASS, PASS-M, PASS-T, and compare their solutions with the optimal. Note that we obtain the optimal solution by solving multiple-unicast flow problems in the time-expanded network, similar to Sec. 5.1 and 5.2.

We present the achieved network utilities of different algorithms of the $100$ simulation instances in Fig. 4(a). And in Fig. 4(b), we give the utility increment ( $\%$ ) of our designed algorithms as compared to the optimal utility. Note that PASS, PASS-M, and PASS-T can obtain utilities that is strictly greater than optimal, because all of the three algorithms optimize utility subject to relaxed constraints, while the optimal utility is achieved by a feasible solution strictly meeting all the constraints.

From Fig. 4 we learn that PASS and PASS-T obtain a huge utility improvement compared to optimal (over $100\%$ more than optimal), while the utility achieved by PASS-M is close-to-optimal. According to Thm. 3.2, theoretically PASS can violate both throughput requirements and maximum delay constraints. Empirically, (i) the throughput achieved by PASS is $138$ (resp. $302$ ) in average for the first unicast (resp. second unicast), both satisfying throughput requirements $R_{1}=R_{2}=80$ . (ii) The maximum delay experienced by PASS is $195$ (resp. $301$ ) in average for the first unicast (resp. second unicast), both violating maximum delay constraints $D_{1}=D_{2}=150$ . But considering that video conferencing applications can accept a delay less than $400$ ms (ITU, 2003), the solution of PASS is acceptable. According to Thm. 3.3, theoretically PASS-M can meet maximum delay constraints while violate throughput requirements. Empirically, the throughput achieved by PASS-M is $71$ (resp. $154$ ) in average for the first unicast (resp. second unicast). It is clear that the first unicast flow violates throughput requirement. According to Thm. 3.4, theoretically PASS-T can meet throughput requirements while violate maximum delay constraints. Empirically, the maximum delay experienced by PASS-T is $222$ (resp. $322$ ) in average for the first unicast (resp. second unicast), both violating maximum delay constraints but within $400$ ms that is the largest acceptable delay.

6. Conclusion

We consider the problem of maximizing aggregate user utilities subject to link capacity constraints, maximum delay constraints, and throughput requirements. A user’s utility is a concave function of the achieved throughput or the experienced maximum delay. The problem is uniquely challenging due to the need of jointly considering maximum delay constraints and throughput requirements. We first prove that it is NP-complete either (i) to construct a feasible solution meeting all constraints, or (ii) to obtain an optimal solution after we relax maximum delay constraints or throughput requirements up to constant ratios. We then design the first polynomial-time approximation algorithm named PASS to obtain solutions that (i) achieve constant or problem-dependent approximation ratios, at the cost of (ii) violating maximum delay constraints or throughput requirements up to constant or problem-dependent ratios, under realistic conditions. PASS is practically useful since our conditions are satisfied in many popular application settings. We evaluate PASS empirically using extensive simulations of routing delay-critical video-conferencing traffic over Amazon EC2 datacenters. Our design leverage a new understanding between maximum-delay-aware problems and their average-delay-aware counterparts, which can be of independent interest and suggest a new avenue for solving maximum-delay-aware network optimization problems.

7. Appendix

7.1. Proof to Lem. 3.1

Proof.

According to Algorithm 1, for any $i=1,2,...,K$ , $\bar{f}_{i}$ is obtained by iteratively deleting $\epsilon\cdot|\hat{f}_{i}|$ rate from $\hat{f}_{i}$ . Suppose that there are in total $N_{i}$ iterations to get $\bar{f}_{i}$ by deleting rate from $\hat{f}_{i}$ (namely assume $N_{i}$ to be the number of iterations of the while-loop of line 8). And we use $f_{i}^{n}$ to represent the flow of the unicast $i$ at the beginning of the $n$ -th iteration (or equivalently, at the end of the $(n-1)$ -th iteration). Obviously, $f_{i}^{1}=\hat{f}_{i}$ , $f_{i}^{N_{i}+1}=\bar{f}_{i}$ . We denote $P_{i}^{n}$ as the set of of all flow-carrying paths in flow $f_{i}^{n}$ , and $p_{i}^{n}\in P_{i}^{n}$ as the slowest flow-carrying path in $P_{i}^{n}$ . In the $n$ -th iteration of the unicast $i$ , PASS delete some rate, say $x_{i}^{n}>0$ , from $p_{i}^{n}$ .

Since all link delays are non-negative constants, the path delay cannot increase with reduced flow rate. Thus,

[TABLE]

Considering the total delay of the unicast $i$ , for any $1\leq n\leq N_{i}$ , we have the following held for any $i=1,2,...,K$

[TABLE]

In (18), equality $(a)$ holds because $\sum_{e\in p_{i}^{n}}d_{e}$ is the path delay of the slowest flow-carrying path $p_{i}^{n}$ . Equality $(b)$ holds because flow $f_{i}^{n+1}$ is the flow when $f_{i}^{n}$ deletes $x_{i}^{n}$ rate from path $p_{i}^{n}$ . Inequality $(c)$ comes from (17) and $f_{i}^{N_{i}+1}=\bar{f}_{i}$ .

We then do summation for (18) over $n\in[1,N_{i}]$ , and get

[TABLE]

which proves our lemma. ∎

7.2. Proof to Thm. 3.2

Proof.

First, we prove the polynomial time complexity. Due to condition 1, both problem (2) and (3) can be solved in polynomial time, since (i) they are convex programs with a polynomial number of variables and a polynomial number of constraints, and (ii) convex programming problems can be solved up to an arbitrarily small additive error in polynomial time (e.g., see (Potra and Ye, 1993; Grötschel et al., 2012) for details). For example, the time complexity is $O(|E|^{3}K^{3}\mathcal{L})$ where $\mathcal{L}$ is the input size of the instance of the problem (2) or (3) if they are linear programs (Ye, 1991). After solving the average-delay-aware problem, we get $K$ single-unicast flows each of which is defined on edges. By the classic flow decomposition technique (Ford and Fulkerson, 1956), we can then achieve $K$ single-unicast flows $\hat{f}=\{\hat{f}_{i},i=1,2,...,K\}$ each of which is defined on paths within a time of $O(|V|^{2}|E|K)$ . Note that the flow decomposition outputs at most $|E|$ paths for each $\hat{f}_{i}$ , and hence there are at most $|E|$ iterations to obtain each $\bar{f}_{i}$ by deleting rate from $\hat{f}_{i}$ . Overall, Algorithm 1 has a polynomial time complexity that is even independent to $\epsilon$ when all conditions are satisfied.

Second, we prove the existence of $\bar{f}$ .

(i) Suppose (1b) is the objective of the problem (1). Because problem (1) is feasible and $f^{*}$ is its optimal solution, $f^{*}$ must satisfy all the constraints of problem (1), implying that $f^{*}$ also satisfies the constraints (2d) and (2d) of the problem (2) that is the average-delay-aware counterpart of the problem (1). Now consider that we have $\mathcal{T}(g)\leq\mathcal{M}(g)\cdot|g|$ for any single-unicast flow $g$ , for any $i=1,2,...,K$ , the following holds

[TABLE]

where the inequality (a) comes from that $f^{*}$ meets the constraints (1e) of the problem (1). Therefore, $f^{*}$ is also a feasible solution to the problem (2). Due to the existence of $f^{*}$ , problem (2) must be feasible and hence Algorithm 1 must return a solution $\bar{f}$ .

(ii) Suppose (1b) is the objective of the problem (1). Because problem (1) is feasible and $f^{*}$ is its optimal solution, $f^{*}$ must meet all the constraints of problem (1), e.g., we have $|f_{i}^{*}|\geq R_{i},\forall i=1,2,...,K$ . Now we construct another network flow $f$ based on $f^{*}$ as follows: for each $i=1,2,...,K$ , we obtain $f_{i}$ directly from $f_{i}^{*}$ , by deleting flow rate from arbitrary flow-carrying paths of $f_{i}^{*}$ till $|f_{i}^{*}|=R_{i}$ . The existence of $f^{*}$ implies the existence of $f$ . For problem (3), it is clear that $f$ meets the throughput requirements (3d). Since $f^{*}$ meets the constraint (1e), $f$ must satisfy the constraint (3d). Since we delete certain flow rate from $f_{i}^{*}$ to obtain $f_{i}$ , it is clear that the maximum delay does not increase, i.e., we have

[TABLE]

further implying the following for any $i=1,2,...,K$

[TABLE]

i.e., $f$ meets the constraints (3d). Therefore, $f$ is a feasible solution to the problem (3). Due to the existence of $f$ , problem (3) must be feasible and hence Algorithm 1 must return a solution $\bar{f}$ .

Third, we prove that $\bar{f}$ satisfies the relaxed constraints (5). Suppose $\hat{f}$ is the solution to the average-delay-aware problem in line 5. Then clearly that $\hat{f}$ meets the following constraints:

[TABLE]

We know $\bar{f}_{i}$ is the solution by deleting a rate of $\epsilon\cdot|\hat{f}_{i}|$ from $\hat{f}_{i}$ for each $i=1,2,...,K$ . It is clear that $\bar{f}$ satisfies the constraints (5a) and (5c). Now we look at the constraints (5b).

According to our Lem. 3.1, for any $i=1,2,...,K$ , it holds that

[TABLE]

implying that $\mathcal{M}(\bar{f}_{i})\leq\mathcal{A}(\hat{f}_{i})/\epsilon,\forall i=1,2,...,K$ . Based on the satisfied constraints (20b), we have the following for any $i=1,2,...,K$

[TABLE]

Finally, we prove the approximation ratio of $\bar{f}$ . If (1b) is the objective of problem (1), we have

[TABLE]

where the inequality (b) holds because in the second part of this proof, we have proved that $f^{*}$ is a feasible solution to the average-delay-aware problem (2), while $\hat{f}$ is its optimal solution. Inequality (a) comes from the following inequalities for each $i=1,2,...,K$

[TABLE]

where the inequality (c) holds due to the concavity of the function $\mathcal{U}_{i}^{t}(\cdot)$ , and the inequality (d) comes from that the function $\mathcal{U}_{i}^{t}(\cdot)$ is non-negative, considering that the condition 1 is satisfied.

If (1b) is the objective of problem (1), we assume $f$ is the feasible solution to the average-delay-aware problem (3) that is constructed from $f^{*}$ as discussed in the second part of this proof. Then we have

[TABLE]

where the inequality (a) comes from the satisfied condition 2, the inequality (b) holds since $f$ is feasible to problem (3) while $\hat{f}$ is optimal to problem (3), and the inequality (c) is true because of the inequality (19) and the non-decreasing property of $\mathcal{U}_{i}^{d}(\cdot)$ . ∎

7.3. Proof to Thm. 2.1

Proof.

First, we consider the following problem that is a special case of the MUDT with relaxed maximum delay constraints,

[TABLE]

It has been proved to be NP-complete to find the optimal solution to above problem (see Appendix of (Misra et al., 2009)).

Second, we consider the following problem that is a special case of the MUDT with relaxed throughput requirements,

[TABLE]

Follow a similar proof as that in the Appendix of (Misra et al., 2009), it can be proved to be NP-complete to find the optimal solution to the aforementioned problem.

Third, also following a similar proof as that in the Appendix of (Misra et al., 2009), it can be proved that it is NP-complete even to construct a feasible solution to the following problem that is a special case of our MUDT, strictly meeting all constraints

[TABLE]

where $\mathcal{U}_{1}^{t}(|f_{1}|)=1$ which is a constant. ∎

7.4. Proof to Thm. 3.3

Proof.

First, due to the same proof to Thm. 3.2, Algorithm 2 has a polynomial time complexity, and must give a solution $\bar{f}$ .

Second, it is straightforward that constraints (8b) and (8c) are met. Now let us denote $(|\hat{f}_{i}|-|\bar{f}_{i}|)/|\hat{f}_{i}|$ as $\epsilon_{i}$ . Thus $\epsilon_{\min}\leq\epsilon_{i}\leq\epsilon_{\max}$ for any $i=1,2,...,K$ , implying the following

[TABLE]

i.e., the constraints (8b) are satisfied.

Third, following the same proof as to Thm. 3.2, the approximation ratio (9) can be proved.

As for the approximation ratio (10), let as assume $\tilde{f}$ to be the solution where for each $i=1,2,...,K$ , we delete $\epsilon_{\min}|\hat{f}_{i}|$ rate from the slowest flow-carrying paths of $\hat{f}_{i}$ to obtain $\tilde{f}_{i}$ . It is clear that

[TABLE]

because both $\bar{f}_{i}$ and $\tilde{f}_{i}$ are flows after we delete rates from the slowest flow-carrying paths of $\hat{f}_{i}$ , but the amount of deleted rate to obtain $\bar{f}_{i}$ is no smaller than the amount of deleted rate to obtain $\tilde{f}_{i}$ , for each $i=1,2,...,K$ . Therefore, we have the following

[TABLE]

where the inequality (a) comes from our Thm. 3.2, since $\tilde{f}$ is also the solution returned if we use Algorithm 1 with $\epsilon=\epsilon_{\min}$ to solve the problem (1). ∎

7.5. Proof to Thm. 3.4

Proof.

Same to the proof as that of Thm. 3.2, it holds that PASS-T must return a solution $\bar{f}$ in a polynomial time, meeting the constraints (11a), (11c), and providing the approximation ratio (12).

Because that $\bar{g}$ is the solution of PASS, we have

[TABLE]

According to the definition of $\lambda$ , we have

[TABLE]

implying the following considering the inequality (22)

[TABLE]

i.e., $\bar{f}$ satisfies the constraints (11b). We further have

[TABLE]

where the inequality (a) comes from the satisfied condition 2, and the inequality (b) holds due to the inequality (23). Thus the approximation ratio (13) holds. ∎

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1)
2Cao et al . (2017) Zizhong Cao, Paul Claisse, René-Jean Essiambre, Murali Kodialam, and TV Lakshman. 2017. Optimizing throughput in optical networks: The joint routing and power control problem. IEEE/ACM Trans. Networking 25, 1 (2017), 199–209.
3Chen et al . (2011) Xiangwen Chen, Minghua Chen, Baochun Li, Yao Zhao, Yunnan Wu, and Jin Li. 2011. Celerity: a low-delay multi-party conferencing solution. In Proc. ACM Int’l Conf. Multimedia . 493–502.
4Correa et al . (2004) Jose R Correa, Andreas S Schulz, and Nicolás E Stier Moses. 2004. Computational complexity, fairness, and the price of anarchy of the maximum latency problem. In Proc. Int’l Conf. Integer Programming and Combinatorial Optimization . 59–73.
5Correa et al . (2007) José R Correa, Andreas S Schulz, and Nicolás E Stier-Moses. 2007. Fast, fair, and efficient flows in networks. Operations Research 55, 2 (2007), 215–225.
6Devetak et al . (2011) Fabrizio Devetak, Junghwan Shin, Tricha Anjali, and Sanjiv Kapoor. 2011. Minimizing path delay in multipath networks. In Proc. IEEE Int’l Conf. Communications . 1–5.
7Ford and Fulkerson (1956) Lester R Ford and Delbert R Fulkerson. 1956. Maximal flow through a network. Canadian journal of Mathematics 8, 3 (1956), 399–404.
8Grimmer and Kapoor (2016) Benjamin Grimmer and Sanjiv Kapoor. 2016. Nash equilibrium and the price of anarchy in priority based network routing. In Proc. IEEE Int’l Conf. Computer Communications . 1–9.