Online Interval Scheduling with Predictions

Joan Boyar; Lene M. Favrholdt; Shahin Kamali; Kim S. Larsen

arXiv:2302.13701·cs.DS·January 24, 2025

Online Interval Scheduling with Predictions

Joan Boyar, Lene M. Favrholdt, Shahin Kamali, Kim S. Larsen

PDF

Open Access

TL;DR

This paper investigates online interval scheduling and disjoint path allocation using predictions, establishing bounds on algorithm performance based on prediction accuracy and validating findings with real-world experiments.

Contribution

It introduces a framework analyzing the impact of prediction errors on online scheduling algorithms, providing tight bounds and trade-offs between consistency and robustness.

Findings

01

Tight bounds on competitive ratios as a function of prediction error.

02

Asymptotically optimal trade-offs between consistency and robustness.

03

Experimental validation on real-world workloads confirming theoretical results.

Abstract

In online interval scheduling, the input is an online sequence of intervals, and the goal is to accept a maximum number of non-overlapping intervals. In the more general disjoint path allocation problem, the input is a sequence of requests, each consisting of pairs of vertices of a known graph, and the goal is to accept a maximum number of requests forming edge-disjoint paths between accepted pairs. We study a setting with a potentially erroneous prediction specifying the set of requests forming the input sequence and provide tight upper and lower bounds on the competitive ratios of online algorithms as a function of the prediction error. We also present asymptotically tight trade-offs between consistency (competitive ratio with error-free predictions) and robustness (competitive ratio with adversarial predictions) of interval scheduling algorithms. Finally, we provide experimental…

Tables1

Table 1. Table 1 : Details on the benchmarks from [ 19 ] used in our experiments.

name	input size ( $N$ )	no. timesteps ( $m$ )	max. length	avg. length
LLNL-uBGL-2006-2	13,225	16,671,553	14,403	1,933.92
NASA-iPSC-1993-3.1	18,066	7,947,562	62,643	772.21
CTC-SP2-1996-3.1	77,205	8,986,769	71,998	11,279.61
SDSC-DS-2004-2.1	84,893	31,629,689	6,589,808	7,579.36

Equations40

η (\hat{I}, I) \geq ∣ \textsc Opt (I) - \textsc Opt (\hat{I}) ∣ .

η (\hat{I}, I) \geq ∣ \textsc Opt (I) - \textsc Opt (\hat{I}) ∣ .

η (I, \hat{I}) \leq \textsc Opt (\textsc FP \cup \textsc FN) .

η (I, \hat{I}) \leq \textsc Opt (\textsc FP \cup \textsc FN) .

η (I, \hat{I} \cup {x}) = \textsc Opt (\textsc FP \cup (\textsc FN ∖ {x})) \leq \textsc Opt (\textsc FP \cup \textsc FN) = η (I, \hat{I}) .

η (I, \hat{I} \cup {x}) = \textsc Opt (\textsc FP \cup (\textsc FN ∖ {x})) \leq \textsc Opt (\textsc FP \cup \textsc FN) = η (I, \hat{I}) .

η (I, \hat{I} ∖ {y}) = \textsc Opt ((\textsc FP ∖ {y}) \cup \textsc FN) \leq \textsc Opt (\textsc FP \cup \textsc FN) = η (I, \hat{I}) .

η (I, \hat{I} ∖ {y}) = \textsc Opt ((\textsc FP ∖ {y}) \cup \textsc FN) \leq \textsc Opt (\textsc FP \cup \textsc FN) = η (I, \hat{I}) .

\textsc Opt (\textsc FP \cup \textsc FN) \geq ∣ \textsc Opt (I) - \textsc Opt (\hat{I}) ∣ .

\textsc Opt (\textsc FP \cup \textsc FN) \geq ∣ \textsc Opt (I) - \textsc Opt (\hat{I}) ∣ .

\textsc Opt (I)

\textsc Opt (I)

\leq \textsc Opt (\hat{I} \cup \textsc FN)

\leq \textsc Opt (\hat{I}) + \textsc Opt (\textsc FN),

\textsc Opt (I) - \textsc Opt (\hat{I}) \leq \textsc Opt (\textsc FN) \leq \textsc Opt (\textsc FP \cup \textsc FN) .

\textsc Opt (I) - \textsc Opt (\hat{I}) \leq \textsc Opt (\textsc FN) \leq \textsc Opt (\textsc FP \cup \textsc FN) .

∣ \textsc FP ∣ + ∣ \textsc FN ∣ = ∣ I \cup \hat{I} ∣ - ∣ I \cap \hat{I} ∣ = ∣ (I \cup \hat{I}) ∖ (I \cap \hat{I}) ∣

∣ \textsc FP ∣ + ∣ \textsc FN ∣ = ∣ I \cup \hat{I} ∣ - ∣ I \cap \hat{I} ∣ = ∣ (I \cup \hat{I}) ∖ (I \cap \hat{I}) ∣

\big{|}\big{(}\operatorname{\textsc{Opt}}(I)\cup\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})\big{)}\setminus\big{(}\operatorname{\textsc{Opt}}(I)\cap\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})\big{)}\big{|}

\big{|}\big{(}\operatorname{\textsc{Opt}}(I)\cup\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})\big{)}\setminus\big{(}\operatorname{\textsc{Opt}}(I)\cap\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})\big{)}\big{|}

\frac{∣ I \cup I ^ ∣ - ∣ I \cap I ^ ∣}{∣ I \cup I ^ ∣}

\frac{∣ I \cup I ^ ∣ - ∣ I \cap I ^ ∣}{∣ I \cup I ^ ∣}

\begin{array}[]{rcl}\operatorname{\textsc{Trust}}(\operatorname{\mathit{\hat{I}}},I)&\geq&\operatorname{\textsc{Opt}}(\operatorname{\mathit{I^{\ast}}})-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}})\\ &\geq&\operatorname{\textsc{Opt}}(I)-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}})-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}})\\ &\geq&\operatorname{\textsc{Opt}}(I)-2\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})\\ &=&\operatorname{\textsc{Opt}}(I)-2\eta(\operatorname{\mathit{\hat{I}}},I)\\ &=&(1-2\gamma(\operatorname{\mathit{\hat{I}}},I))\operatorname{\textsc{Opt}}(I)\end{array}

\begin{array}[]{rcl}\operatorname{\textsc{Trust}}(\operatorname{\mathit{\hat{I}}},I)&\geq&\operatorname{\textsc{Opt}}(\operatorname{\mathit{I^{\ast}}})-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}})\\ &\geq&\operatorname{\textsc{Opt}}(I)-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}})-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}})\\ &\geq&\operatorname{\textsc{Opt}}(I)-2\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})\\ &=&\operatorname{\textsc{Opt}}(I)-2\eta(\operatorname{\mathit{\hat{I}}},I)\\ &=&(1-2\gamma(\operatorname{\mathit{\hat{I}}},I))\operatorname{\textsc{Opt}}(I)\end{array}

\textsc Trust (\hat{I}, I_{w}) = 0 = \textsc Opt (I) - 2 \textsc Opt (\textsc FN \cup \textsc FP) = (1 - 2 γ (\hat{I}_{w}, I_{w})) \textsc Opt (I_{w}) .

\textsc Trust (\hat{I}, I_{w}) = 0 = \textsc Opt (I) - 2 \textsc Opt (\textsc FN \cup \textsc FP) = (1 - 2 γ (\hat{I}_{w}, I_{w})) \textsc Opt (I_{w}) .

\textsc TrustGreedy (\hat{I}, I) \geq (1 - γ (\hat{I}, I)) \textsc Opt (I) .

\textsc TrustGreedy (\hat{I}, I) \geq (1 - γ (\hat{I}, I)) \textsc Opt (I) .

\textsc TG (\hat{I}, I) \geq \textsc Opt (I) - \textsc Opt (\textsc FP \cup \textsc FN) = (1 - γ (\hat{I}, I)) \textsc Opt (I) :

\textsc TG (\hat{I}, I) \geq \textsc Opt (I) - \textsc Opt (\textsc FP \cup \textsc FN) = (1 - γ (\hat{I}, I)) \textsc Opt (I) :

\textsc Opt (I)

\textsc Opt (I)

\leq ∣ \textsc TG ∣ + \textsc Opt (F), since, by Lemma \ref lemma:feasible, F is feasible

\leq ∣ \textsc TG ∣ + \textsc Opt (\textsc FP \cup \textsc FN), since U \subseteq \textsc FP and \textsc Opt^{\textsc FN} \subseteq \textsc FN

E = {(1, 2), (2, 3), (3, 4), (4, 5), (5, 6), (6, 7), (7, 8), (5, 8), (1, 6), (1, 8)} .

E = {(1, 2), (2, 3), (3, 4), (4, 5), (5, 6), (6, 7), (7, 8), (5, 8), (1, 6), (1, 8)} .

V^{'} = {12, 23, 34, 45, 56, 67, 78, 18, 16, 58},

V^{'} = {12, 23, 34, 45, 56, 67, 78, 18, 16, 58},

\begin{array}[]{r@{}l}E^{\prime}=\{&(12,23),(23,34),(34,45),(45,56),(56,67),(67,78),(78,18),(18,16),\\ &(16,12),(58,18),(58,78),(58,56),(58,45),(12,18),(16,67),(16,56)\}\end{array}

\begin{array}[]{r@{}l}E^{\prime}=\{&(12,23),(23,34),(34,45),(45,56),(56,67),(67,78),(78,18),(18,16),\\ &(16,12),(58,18),(58,78),(58,56),(58,45),(12,18),(16,67),(16,56)\}\end{array}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Complexity and Algorithms in Graphs · Advanced Bandit Algorithms Research

Full text

11institutetext: University of Southern Denmark

11email: {joan,lenem,kslarsen}@imada.sdu.dk

https://imada.sdu.dk/u/{joan,lenem,kslarsen} 22institutetext: York University

22email: [email protected]

https://www.eecs.yorku.ca/~kamalis/

Online Interval Scheduling with Predictions††thanks: The first, second, and fourth authors were supported in part by the Danish Council for Independent Research grant DFF-0135-00018B.

The third author was supported in part by the Natural Sciences and Engineering Research Council of Canada (NSERC).

Joan Boyar 11

Lene M. Favrholdt 11

Shahin Kamali 22

Kim S. Larsen 11

Abstract

In online interval scheduling, the input is an online sequence of intervals, and the goal is to accept a maximum number of non-overlapping intervals. In the more general disjoint path allocation problem, the input is a sequence of requests, each involving a pair of vertices of a known graph, and the goal is to accept a maximum number of requests forming edge-disjoint paths between accepted pairs. These problems have been studied under extreme settings without information about the input or with error-free advice. We study an intermediate setting with a potentially erroneous prediction that specifies the set of intervals/requests forming the input sequence. For both problems, we provide tight upper and lower bounds on the competitive ratios of online algorithms as a function of the prediction error. For disjoint path allocation, our results rule out the possibility of obtaining a better competitive ratio than that of a simple algorithm that fully trusts predictions, whereas, for interval scheduling, we develop a superior algorithm. We also present asymptotically tight trade-offs between consistency (competitive ratio with error-free predictions) and robustness (competitive ratio with adversarial predictions) of interval scheduling algorithms. Finally, we provide experimental results on real-world scheduling workloads that confirm our theoretical analysis.

Keywords:

Online interval scheduling. Algorithms with prediction. Competitive analysis. Disjoint paths.

1 Introduction

In the interval scheduling problem, the input is a set of intervals with integral endpoints, each representing timesteps at which a process starts and ends. A scheduler’s task is to decide whether to accept or reject each job so that the intervals of accepted jobs do not overlap except possibly at one of their endpoints. The objective is to maximize the number of accepted intervals, referred to as the payoff of the scheduler. This problem is also known as fixed job scheduling and k-track assignment [33].

Interval scheduling is a special case of the disjoint path allocation problem, where the input is a graph $G$ and a set of $n$ requests, each defined by a pair of vertices in $G$ . An algorithm can accept or reject each pair, given that it can form edge-disjoint paths between vertices of accepted pairs. Interval scheduling is the particular case when $G$ is a path graph. The disjoint path allocation problem can be solved in polynomial time for trees [27] and outerplanar graphs by a combination of [23, 32, 26], but the problem is NP-complete for general graphs [25], and even on quite restricted graphs such as series-parallel graphs [40]. The disjoint path problem is the same as call control/call allocation with all bandwidths (both of the calls and the edges they would be routed on) being equal to 1 and as the maximum multi-commodity integral flow problem with edges having unit capacity.

In this work, we focus on the online variant of the problem, in which the set of requests is not known in advance but is revealed in the form of a sequence $I$ of intervals. A new request must either be irrevocably accepted or rejected, subject to maintaining disjoint paths between accepted requests. We analyze an online algorithm via a comparison with an optimal offline algorithm, $\operatorname{\textsc{Opt}}$ . The competitive ratio of an online algorithm $\operatorname{\textsc{Alg}}$ is defined as $\inf_{I}\left\{\operatorname{\textsc{Alg}}(I)/\operatorname{\textsc{Opt}}(I)\right\}$ , where $\operatorname{\textsc{Alg}}(I)$ and $\operatorname{\textsc{Opt}}(I)$ , respectively, denote the payoff of $\operatorname{\textsc{Alg}}$ and $\operatorname{\textsc{Opt}}$ for intervals in $I$ (for randomized algorithms, $\operatorname{\textsc{Alg}}(I)$ is the expected payoff of $\operatorname{\textsc{Alg}}$ ). Since we consider a maximization problem, our ratios are between zero and one.

For interval scheduling on a path graph with $m$ edges, the competitive ratios of the best deterministic and randomized algorithms are respectively $m$ and $\lceil\log m\rceil$ [16]. These results suggest that the constraints on online algorithms must be relaxed to compete with $\operatorname{\textsc{Opt}}$ . Specifically, the problem has been considered in the advice complexity model for path graphs [13, 28], trees [14], and grid graphs [15]. Under the advice model, the online algorithm can access error-free information on the input called advice. The objective is to quantify the trade-offs between the competitive ratio and the size of the advice.

In recent years, there has been an increasing interest in improving the performance of online algorithms via the notion of prediction. Here, it is assumed that the algorithm has access to machine-learned information in the form of a prediction. Unlike the advice model, the prediction may be erroneous and is quantified by an error measure $\eta$ . The objective is to design algorithms whose competitive ratio degrades gently as a function of $\eta$ . Several online optimization problems have been studied under the prediction model, including non-clairvoyant scheduling [41, 43], makespan scheduling [34], contract scheduling [4, 5], and other variants of scheduling problems [8, 37, 11, 10]. Other online problems studied under the prediction model include bin packing [2, 3], knapsack [44, 31, 17], caching [38, 42], matching problems [6, 35, 36], and various graph problems [22, 24, 21, 9, 12]. See also the survey by Mitzenmacher and Vassilvitskii [39] and the collection at [1].

1.1 Contributions

We study the disjoint path allocation problem under a setting where the scheduler is provided with a set $\operatorname{\mathit{\hat{I}}}$ of intervals predicted to form the input sequence $I$ . Given the erroneous nature of the prediction, some intervals in $\operatorname{\mathit{\hat{I}}}$ may be incorrectly predicted to be in $I$ (false positives), and some intervals in $I$ may not be included in $\operatorname{\mathit{\hat{I}}}$ (false negatives). We let the error set be the set of intervals that are false positives or false negatives and define the error parameter $\eta(\operatorname{\mathit{\hat{I}}},I)$ to be the cardinality of the largest set of non-overlapping intervals in the error set, i.e., $\eta(\operatorname{\mathit{\hat{I}}},I)=\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})$ . We explain later that this definition of $\eta$ satisfies specific desired properties for the prediction error (Proposition 1). In the following, we use $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}},I)$ to denote the payoff of an algorithm $\operatorname{\textsc{Alg}}$ for prediction $\operatorname{\mathit{\hat{I}}}$ and input $I$ . We also define $\gamma(\operatorname{\mathit{\hat{I}}},I)=\eta(\operatorname{\mathit{\hat{I}}},I)/\operatorname{\textsc{Opt}}(I)$ ; this normalized error measure is helpful in describing our results because the point of reference in the competitive analysis is $\operatorname{\textsc{Opt}}(I)$ . Our first result concerns general graphs:

•

Disjoint-Path Allocation: We first study a simple algorithm $\operatorname{\textsc{Trust}}$ , which accepts a request only if it belongs to the set of intervals in a given optimal solution for $\operatorname{\mathit{\hat{I}}}$ . We show that, for any graph $G$ , any input sequence $I$ , and any prediction $\operatorname{\mathit{\hat{I}}}$ , $\operatorname{\textsc{Trust}}(\operatorname{\mathit{\hat{I}}},I)\geq(1-2\gamma(\operatorname{\mathit{\hat{I}}},I))\operatorname{\textsc{Opt}}(I)$ (Theorem 3.1). Furthermore, for any algorithm $\operatorname{\textsc{Alg}}$ and any positive integer $p$ ,

there are worst-case input sequence $I_{w}$ and prediction set $\operatorname{\mathit{\hat{I}}}_{w}$ over a star graph with $8p$ leaves, such that $\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=p$ and $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})\leq(1-2\gamma(\operatorname{\mathit{\hat{I}}}_{w},I_{w}))\operatorname{\textsc{Opt}}(I_{w})\,$ (Theorem 3.2). Thus, $\operatorname{\textsc{Trust}}$ achieves an optimal competitive ratio in any graph that contains $S_{8}$ as a subgraph, i.e., any graph of maximum degree at least 8.

The above result demonstrates that even for trees, the problem is so hard that no algorithm can do better than the trivial $\operatorname{\textsc{Trust}}$ . Therefore, our main results concern the more interesting case of path graphs, that is, interval scheduling:

•

Interval Scheduling: We first show a negative result for deterministic interval scheduling algorithms. Given any deterministic algorithm $\operatorname{\textsc{Alg}}$ and integer $p$ , we show there are worst-case instances $I_{w}$ and predictions $\operatorname{\mathit{\hat{I}}}_{w}$ such that $\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=p$ and $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})\leq(1-\gamma(\operatorname{\mathit{\hat{I}}}_{w},I_{w}))\operatorname{\textsc{Opt}}(I_{w})$ (Theorem 4.1, setting $c=2$ ).

Next, we present a negative result for $\operatorname{\textsc{Trust}}$ . For any positive integer, $p$ , we show there are worst-case instances $I_{w}$ and predictions $\operatorname{\mathit{\hat{I}}}_{w}$ such that $\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=p$ and $\operatorname{\textsc{Trust}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=(1-2\gamma(\operatorname{\mathit{\hat{I}}}_{w},I_{w}))\operatorname{\textsc{Opt}}(I_{w})\,.$ (Theorem 4.2). This suggests that there is room for improvement over $\operatorname{\textsc{Trust}}$ .

Finally, we introduce our main technical result, a deterministic algorithm $\operatorname{\textsc{TrustGreedy}}$ that achieves an optimal competitive ratio for interval scheduling. $\operatorname{\textsc{TrustGreedy}}$ is similar to $\operatorname{\textsc{Trust}}$ in that it maintains an optimal solution for $\operatorname{\mathit{\hat{I}}}$ , but unlike $\operatorname{\textsc{Trust}}$ , it updates its planned solution to accept requests greedily when it is possible without a decrease in the payoff of the maintained solution. For any input $I$ and prediction $\operatorname{\mathit{\hat{I}}}$ , we show that $\operatorname{\textsc{TrustGreedy}}(\operatorname{\mathit{\hat{I}}},I)\geq(1-\gamma(\operatorname{\mathit{\hat{I}}},I))\operatorname{\textsc{Opt}}(I)\,$ (Theorem 4.3), which proves optimality of $\operatorname{\textsc{TrustGreedy}}$ in the light of Theorem 4.1.

•

Consistency-Robustness Trade-off: We study the trade-off between consistency and robustness, which measure an algorithm’s competitive ratios in the extreme cases of error-free prediction (consistency) and adversarial prediction (robustness) [38]. We focus on randomized algorithms because a non-trivial trade-off is infeasible for deterministic algorithms (Proposition 2). Suppose that for any input $I$ , an algorithm $\operatorname{\textsc{Alg}}$ guarantees a consistency of $\alpha<1$ and robustness of $\beta\leq\frac{1}{\lceil\log m\rceil}$ . We show $\alpha\leq 1-\frac{\lfloor\log m\rfloor-1}{2}\beta$ and $\beta\leq\frac{2}{\lfloor\log m\rfloor-1}\cdot(1-\alpha)$ (Theorem 5.1). For example, to guarantee a robustness of $\frac{1}{10\lfloor\log m\rfloor}$ , the consistency must be at most $19/20$ , and to guarantee a consistency of $\frac{2}{3}$ , the robustness must be at most $\frac{2}{3}\frac{1}{\lfloor\log m\rfloor-1}$ . We also present a family of randomized algorithms that provides an almost Pareto-optimal trade-off between consistency and robustness (Theorem 5.2).

•

Experiments on Real-World Data: We compare our algorithms with the online $\operatorname{\textsc{Greedy}}$ algorithm (which accepts an interval if and only if it does not overlap previously accepted intervals), and $\operatorname{\textsc{Opt}}$ on real-world scheduling data from [19]. Our results are in line with our theoretical analysis: both $\operatorname{\textsc{Trust}}$ and $\operatorname{\textsc{TrustGreedy}}$ are close-to-optimal for small error values; $\operatorname{\textsc{TrustGreedy}}$ is almost always better than $\operatorname{\textsc{Greedy}}$ even for large values of error, while $\operatorname{\textsc{Trust}}$ is better than $\operatorname{\textsc{Greedy}}$ only for small error values.

2 Model and Predictions

We assume that an oracle provides the online algorithm with a set $\operatorname{\mathit{\hat{I}}}$ of requests predicted to form the input sequence $I$ . One may consider alternative predictions, such as statistical information about the input. While these predictions are compact and can be efficiently learned, they cannot help achieve close-to-optimal solutions. In particular, for interval scheduling on a path with $m$ edges, since the problem is AOC-complete, one cannot achieve a competitive ratio $c\leq 1$ with fewer than $cm/(e\ln 2)$ bits [18].

In what follows, true positive (respectively, negative) intervals are correctly predicted to appear (respectively, not to appear) in the request sequence. False positives and negatives are defined analogously as those incorrectly predicted to appear or not appear. We let $\operatorname{\textsc{TP}}$ , $\operatorname{\textsc{TN}}$ , $\operatorname{\textsc{FP}}$ , $\operatorname{\textsc{FN}}$ denote the four sets containing these different types of intervals. Thus, $I=\operatorname{\textsc{TP}}\cup\operatorname{\textsc{FN}}$ and $\operatorname{\mathit{\hat{I}}}=\operatorname{\textsc{TP}}\cup\operatorname{\textsc{FP}}$ . We use $\eta(\operatorname{\mathit{\hat{I}}},I)$ , to denote the error for the input formed by the set $I$ , when the set of predictions is $\operatorname{\mathit{\hat{I}}}$ . When there is no risk of confusion, we use $\eta$ instead of $\eta(\operatorname{\mathit{\hat{I}}},I)$ .

The error measure we use here is $\eta=\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})$ , and hence, the normalized error measure is $\gamma=\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})/\operatorname{\textsc{Opt}}(I)$ . Our error measure satisfies the following desirable properties, the first two of which were strongly recommended in Im, et al. [30]: $\eta(I,\operatorname{\mathit{\hat{I}}})\leq\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})$ . In Section 2.0.1, we discuss natural error models, such as Hamming distance between the request sequence and prediction, and explain why these measures do not satisfy our desired properties.

•

Monotonicity: This property ensures that increasing the number of true positives or negatives does not increase the error. To be more precise, if we increase $|\operatorname{\textsc{TP}}|$ by one unit (decreasing $|\operatorname{\textsc{FN}}|$ by one unit) or increase $|\operatorname{\textsc{TN}}|$ by one unit (decreasing $|\operatorname{\textsc{FP}}|$ by one unit), the error must not increase. Formally, for any $I$ , $\operatorname{\mathit{\hat{I}}}$ , the following must hold.

–

For any $x\in I\setminus\operatorname{\mathit{\hat{I}}}$ , $\eta(I,\operatorname{\mathit{\hat{I}}}\cup\{x\})\leq\eta(\operatorname{\mathit{\hat{I}}},I)$ .

–

For any $y\in\operatorname{\mathit{\hat{I}}}\setminus I$ , $\eta(I,\operatorname{\mathit{\hat{I}}}\setminus\{y\})\leq\eta(\operatorname{\mathit{\hat{I}}},I)$ .

•

Lipschitz property: Let $\operatorname{\textsc{Opt}}(I)$ denote the number of requests in an optimal solution for the input sequence, and $\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})$ denote the number of requests in an optimal solution for a set of predicted requests. The Lipschitz property requires the error to be at least equal to the net difference between $\operatorname{\textsc{Opt}}(I)$ and $\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})$ , that is,

[TABLE]

Note that this property ensures that the error is not “too small”. In particular, we should not be able to decrease the error to an arbitrarily small value by adding “dummy requests”. For example, false discovery rate, defined as $\frac{|\operatorname{\textsc{FP}}|}{|\operatorname{\textsc{FP}}|+|\operatorname{\textsc{TP}}|}$ , does not satisfy Lipschitz property: an adversary can construct a bad input and then add a lot of intervals to $I\cap\operatorname{\mathit{\hat{I}}}$ , contributing to $|\operatorname{\textsc{TP}}|$ , that neither the algorithm nor $\operatorname{\textsc{Opt}}$ will choose, driving down the error.

•

Lipschitz completeness (or simply completeness): We need the error measure to ensure that the error is not “too large”. Consider the following example for the disjoint paths problem. The input is formed by a set $I=A\cup B$ of requests, with $A=\{A_{1},A_{2},\ldots,A_{k}\}$ and $B=\{B_{1},B_{2},\ldots,B_{k-1}\}$ , where the $A_{i}$ ’s are disjoint, the $B_{i}$ ’s are disjoint, and $B_{i}$ overlaps $A_{i}$ and $A_{i+1}$ . The true optimal solution is then $\operatorname{\textsc{Opt}}(I)=|A|=k$ . Suppose the prediction is $\operatorname{\mathit{\hat{I}}}=(A\setminus\{A_{1},A_{2}\})\cup B$ , and note that $\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})=|B|=k-1$ . The optimal solutions for $I$ and $\operatorname{\mathit{\hat{I}}}$ are disjoint but $|\operatorname{\textsc{Opt}}(I)-\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})|=1$ , $\operatorname{\textsc{FP}}=0$ and $\operatorname{\textsc{FN}}=2$ . In this case, the error should be relatively small, independent of $k$ . More generally, the error measure must not grow with the dissimilarity between the optimal solutions for $I$ and $\operatorname{\mathit{\hat{I}}}$ , but rather with the size of the optimal solution for $\operatorname{\textsc{FP}}$ and $\operatorname{\textsc{FN}}$ . This is guaranteed by the Lipschitz completeness, which requires

[TABLE]

Proposition 1 ()

The error measure $\eta(\operatorname{\mathit{\hat{I}}},I)=\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})$ satisfies the properties of monotonicity, Lipschitz, and Lipschitz completeness.

Proof

We check all properties listed above:

•

Monotonicity: First, consider increasing the number of true positives. Let $x\in I\setminus\operatorname{\mathit{\hat{I}}}$ . Since $x$ is a false negative, it may or may not have been counted in $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})$ , but removing it from $\operatorname{\textsc{FN}}$ (thus adding it to $\operatorname{\textsc{TP}}$ ) cannot make $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})$ larger, i.e.,

[TABLE]

Similarly, for any $y\in\operatorname{\mathit{\hat{I}}}\setminus I$ , $\operatorname{\textsc{Opt}}((\operatorname{\textsc{FP}}\setminus\{y\})\cup\operatorname{\textsc{FN}})$ cannot be larger than $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})=\eta(I,\operatorname{\mathit{\hat{I}}})$ , so

[TABLE]

•

Lipschitz property: We need to show that

[TABLE]

We note that

[TABLE]

which implies

[TABLE]

•

Lipschitz completeness: Follows trivially with the suggested bound, since $\eta=\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})$ .∎

2.0.1 Alternative Error Measures.

In what follows, we review a few alternative error measures that do not satisfy our desired properties of monotonicity, Lipschitz, and Lipschitz completeness (or simply completeness).

•

Hamming distance between the bit strings representing the request sequence and the predictions:

[TABLE]

It fails completeness.

•

Using $\operatorname{\textsc{Opt}}(I)$ and $\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}})$ instead of $I$ and $\operatorname{\mathit{\hat{I}}}$ in the above measure:

[TABLE]

also fails completeness, according to the example given in connection with the definition of completeness.

•

Either $|\operatorname{\textsc{FP}}|$ or $|\operatorname{\textsc{FN}}|$ fails Lipschitz property.

•

Normalizing the Hamming distance, we obtain the Jaccard distance:

[TABLE]

This measure is sensitive to dummy requests: The adversary can construct a bad input and then add a lot of intervals to $I\cap\operatorname{\mathit{\hat{I}}}$ that neither the algorithm nor $\operatorname{\textsc{Opt}}$ will choose, driving down the error.

•

We also considered normalizing by the total number of possible intervals (order $m^{2}$ ), but this measure fails the Lipschitz property, as we can make the error arbitrarily small by “scaling up” each edge to an arbitrarily long path, without changing algorithms’ payoffs.

3 Disjoint-Path Allocation

In this section, we show that a simple algorithm $\operatorname{\textsc{Trust}}$ for the disjoint path allocation problem has an optimal competitive ratio for any graph of maximal degree at least 8. $\operatorname{\textsc{Trust}}$ simply relies on the predictions being correct. Specifically, it computes an optimal solution $\operatorname{\mathit{I^{\ast}}}$ in $\operatorname{\mathit{\hat{I}}}$ before processing the first request. Then, it accepts any interval in $\operatorname{\mathit{I^{\ast}}}$ that arrives and rejects all others.

We first establish that, on any graph, $\operatorname{\textsc{Trust}}(\operatorname{\mathit{\hat{I}}},I)\geq\operatorname{\textsc{Opt}}(I)-2\eta(\operatorname{\mathit{\hat{I}}},I)=(1-2\gamma(\operatorname{\mathit{\hat{I}}},I))\operatorname{\textsc{Opt}}(I)$ . The proof follows by observing that (i) false negatives cause a deficit of at most $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}})$ in the schedule of $\operatorname{\textsc{Trust}}$ compared to the optimal schedule for $\operatorname{\mathit{I^{\ast}}}$ , (ii) false positives cause a deficit of at most $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}})$ in the optimal schedule of $\operatorname{\mathit{I^{\ast}}}$ , compared to the optimal schedule for $I$ , and (iii) $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}})+\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}})\leq 2\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})=2\eta$ .

Theorem 3.1 ()

For any graph $G$ , any prediction $\operatorname{\mathit{\hat{I}}}$ , and input sequence $I$ , we have $\operatorname{\textsc{Trust}}(\operatorname{\mathit{\hat{I}}},I)\geq(1-2\gamma(\operatorname{\mathit{\hat{I}}},I))\operatorname{\textsc{Opt}}(I)\,.$

Proof

Since $\operatorname{\mathit{I^{\ast}}}$ is an optimal selection from $\operatorname{\textsc{TP}}\cup\operatorname{\textsc{FP}}$ , the largest number of intervals that $\operatorname{\textsc{Opt}}$ would be able to accept from $I$ compared to $\operatorname{\mathit{I^{\ast}}}$ would be an optimal selection from $\operatorname{\textsc{FN}}$ . Thus, $\operatorname{\textsc{Opt}}(I)\leq\operatorname{\textsc{Opt}}(\operatorname{\mathit{I^{\ast}}})+\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}})$ , and so $\operatorname{\textsc{Opt}}(\operatorname{\mathit{I^{\ast}}})\geq\operatorname{\textsc{Opt}}(I)-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}})$ .

Similarly,

the largest number of intervals that can be detracted from $\operatorname{\textsc{Trust}}$ is realized when intervals that it planned to accept from $\operatorname{\mathit{I^{\ast}}}$ do not appear is $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}})$ . Therefore, $\operatorname{\textsc{Trust}}(\operatorname{\mathit{\hat{I}}},I)\geq\operatorname{\textsc{Opt}}(\operatorname{\mathit{I^{\ast}}})-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}})$ . Now,

[TABLE]

∎

The following result shows that Theorem 3.1 is tight for star graphs of degree 8. One can conclude that $\operatorname{\textsc{Trust}}$ is optimal for any graph that contains stars of degree 8 as a subgraph, i.e., any graph of maximal degree at least 8.

Theorem 3.2 ()

Let $\operatorname{\textsc{Alg}}$ be any deterministic algorithm and $p$ be any positive integer. On the star graph, $S_{8p}$ , there exists a set of predicted intervals $\operatorname{\mathit{\hat{I}}}_{w}$ and a request sequence $I_{w}$ such that $\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=p$ and $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})\leq(1-2\gamma(\operatorname{\mathit{\hat{I}}},I))\operatorname{\textsc{Opt}}(I_{w})\,$ .

Proof

We consider the non-center vertices of $S_{8p}$ in $p$ groups of eight, and handle them all identically, one group at a time, treating each group independently. The prediction is fixed, but the input sequence depends on the algorithm’s actions. For each group, we show that the error in the prediction is 1, and the payoff of $\operatorname{\textsc{Opt}}$ is at least 2 units more than that of $\operatorname{\textsc{Alg}}$ . Given that groups do not share edges between themselves, the total error and algorithms’ payoffs are summed over all groups. Hence, the total error will be equal to $\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=p$ , and we can write $\operatorname{\textsc{Alg}}(I_{w})\leq\operatorname{\textsc{Opt}}(I_{w})-2\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})$ , that is, $\operatorname{\textsc{Alg}}(I_{w})\leq(1-2\gamma(\operatorname{\mathit{\hat{I}}}_{w},I_{w}))\operatorname{\textsc{Opt}}(I_{w})$ .

Next, we explain how an adversary defines the input for each group. For group $0\leq i\leq s-1$ , the non-center vertices are $8i+j$ , where $1\leq j\leq 8$ , but we refer to these vertices by the value $j$ . Let $\operatorname{\mathit{\hat{I}}}_{w}=\left\{(1,2),(2,3),(3,4),(4,5),(6,7),(7,8)\right\}$ be the part of the prediction relevant for the current group of eight vertices. Both $(6,7)$ and $(7,8)$ are always included in the input sequence, with $(6,7)$ arriving immediately before $(7,8)$ . $\operatorname{\textsc{Alg}}$ accepts at most one of them. This is discussed in the cases below. The first request in the input is always $(2,3)$ , and $\operatorname{\textsc{Alg}}$ can either accept or reject it.

Case $\operatorname{\textsc{Alg}}$ accepts $\bm{(2,3)}$ : The next interval to arrive is $(6,7)$ . If $\operatorname{\textsc{Alg}}$ rejects this interval, the next to arrive is $(7,8)$ . If $\operatorname{\textsc{Alg}}$ also rejects this interval, then the intervals $(1,2)$ and $(3,4)$ also arrive, but $(4,5)$ is a false positive (see Figure 1(a)). Then, $\operatorname{\textsc{Opt}}$ accepts $\{(1,2),(3,4),(6,7)\}$ , $\operatorname{\textsc{Alg}}$ only accepts $\{(2,3)\}$ , and $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})=1$ . Thus, we may assume that $\operatorname{\textsc{Alg}}$ accepts at least one of $(6,7)$ and $(7,8)$ , which we call $(7,x)$ where $x\in\{6,8\}$ . We call the other of these two edges $(7,y)$ . Then, the intervals $(1,2)$ and $(3,4)$ also arrive, along with a false negative $(5,x)$ . The interval $(4,5)$ is a false positive and is not in the input (see Figure 1(b)). Since $(4,5)$ and $(5,x)$ share an edge, $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})=1$ . $\operatorname{\textsc{Alg}}$ accepts $\{(2,3),(7,x)\}$ , and $\operatorname{\textsc{Opt}}$ accepts $\{(1,2),(3,4),(5,x),(7,y)\}$ . To conclude, the error increases by 1, and $\operatorname{\textsc{Alg}}$ ’s deficit to $\operatorname{\textsc{Opt}}$ increases by 2.

Case $\operatorname{\textsc{Alg}}$ rejects $\bm{(2,3)}$ : The next interval to arrive is $(3,4)$ .

Subcase $\operatorname{\textsc{Alg}}$ accepts $\bm{(3,4)}$ : As in the previous case, we consider which of $(6,7)$ and $(7,8)$ $\operatorname{\textsc{Alg}}$ accepts. If neither is accepted, in addition to $(2,3)$ , $(4,5)$ arrives, but $(1,2)$ is a false positive (Figure 1(c)). Again, payoffs of $\operatorname{\textsc{Alg}}$ and $\operatorname{\textsc{Opt}}$ are respectively 1 and 3, and $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})=1$ . The error is increased by 1, and the net advantage of $\operatorname{\textsc{Opt}}$ over $\operatorname{\textsc{Alg}}$ is increased by at least 2.

Next, we assume that $\operatorname{\textsc{Alg}}$ accepts $(7,x)$ and rejects $(7,y)$ . Then, in addition to the intervals $(2,3)$ and $(3,4)$ , $(4,5)$ arrives, along with a false negative $(1,x)$ (Figure 1(d)). The interval $(1,2)$ is a false positive and is not in the input. Since $(1,2)$ and $(1,x)$ share an edge, $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})=1$ . $\operatorname{\textsc{Alg}}$ accepts $\{(3,4),(7,x)\}$ , and $\operatorname{\textsc{Opt}}$ accepts $\{(1,x),(2,3),(4,5),(7,y)\}$ . Again, the error is increased by 1, and the net advantage of $\operatorname{\textsc{Opt}}$ over $\operatorname{\textsc{Alg}}$ is increased by 2.

Subcase $\operatorname{\textsc{Alg}}$ rejects $\bm{(3,4)}$ : The next interval to arrive is $(1,2)$ .

Regardless of whether $\operatorname{\textsc{Alg}}$ accepts or rejects $(1,2)$ , as in the previous cases, we consider which of $(6,7)$ and $(7,8)$ $\operatorname{\textsc{Alg}}$ accepts. If neither is accepted, then $(2,3)$ and $(3,4)$ have already arrived, but $(4,5)$ is a false positive. The payoff of $\operatorname{\textsc{Alg}}$ is at most 1 if it accepts $(1,2)$ and 0 otherwise, while $\operatorname{\textsc{Opt}}$ accepts $\{(1,2),(2,3),(3,4)\}$ , and $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})=1$ . Thus, the error is increased by 1, and the net advantage of $\operatorname{\textsc{Opt}}$ over $\operatorname{\textsc{Alg}}$ is increased by 2. In what follows, we assume $\operatorname{\textsc{Alg}}$ accepts $(7,x)$ for $x\in\{6,8\}$ .

Subsubcase $\operatorname{\textsc{Alg}}$ accepts $\bm{(1,2)}$ : Then, in addition to the intervals $(2,3)$ and $(3,4)$ , a false negative, $(5,x)$ , arrives. The interval $(4,5)$ is a false positive and is not in the input. Since $(4,5)$ and $(5,x)$ share an edge, $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})=1$ . $\operatorname{\textsc{Alg}}$ accepts $\{(1,2),(7,x)\}$ , and $\operatorname{\textsc{Opt}}$ accepts $\{(1,2),(3,4),(5,x),(7,y)\}$ (Figure 1(e)). As before, the error is increased by 1, and the net advantage of $\operatorname{\textsc{Opt}}$ over $\operatorname{\textsc{Alg}}$ is increased by 2.

Subsubcase $\operatorname{\textsc{Alg}}$ rejects $\bm{(1,2)}$ : In this case, the interval $(4,5)$ is a false positive, and there are no false negatives. Thus, the payoffs of $\operatorname{\textsc{Alg}}$ and $\operatorname{\textsc{Opt}}$ are respectively 1 and 3, and $|\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})|=1$ (Figure 1(f)). That is, the error is increased by 1, and $\operatorname{\textsc{Alg}}$ ’s deficit compared to $\operatorname{\textsc{Opt}}$ is increased by 2.

This completes the proof for one group of eight vertices. Repeating it independently for each of the $s$ groups of eight vertices gives the claimed result. ∎

4 Interval Scheduling

In this section, we show tight upper and lower bounds on the competitive ratio of a deterministic algorithm for interval scheduling. As an introduction to the difficulties in designing algorithms for the problem, we start by proving a general lower bound. We show that for any deterministic algorithm $\operatorname{\textsc{Alg}}$ , there exists an input sequence $I_{w}$ and a set of predictions $\operatorname{\mathit{\hat{I}}}_{w}$ such that $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=\operatorname{\textsc{Opt}}(I_{w})-\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})$ , and that this can be established for any positive integer error. We also show that the competitive ratio of $\operatorname{\textsc{Alg}}$ is arbitrarily small.

Theorem 4.1 ()

Let $\operatorname{\textsc{Alg}}$ be any deterministic algorithm. For any positive integers $p$ and $c\in[2,m]$ , there are instances $I_{w}$ and predictions $\operatorname{\mathit{\hat{I}}}_{w}$ such that $p\leq\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})\leq(c-1)p$ and $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=(1-\gamma(\operatorname{\mathit{\hat{I}}}_{w},I_{w}))\operatorname{\textsc{Opt}}(I_{w})\leq\frac{1}{c}\operatorname{\textsc{Opt}}(I_{w})\,.$

Proof

$\operatorname{\textsc{Alg}}$ will be presented with $p$ intervals of length $c$ , and the remainder of the sequence will depend on which of these it accepts. The prediction, however, will include the following $2p$ requests: $\operatorname{\mathit{\hat{I}}}=\bigcup_{i=0}^{p-1}\big{\{}(ci,c(i+1)),(ci,ci+1)\big{\}}\,.$

The input $I_{w}$ is formed by $p$ phases, $i\in[0,p-1]$ . The $i$ th phase starts with the true positive $(ci,c(i+1))$ . There are two cases to consider:

•

If $\operatorname{\textsc{Alg}}$ accepts $(ci,c(i+1))$ , then the phase continues with

$\left\{(ci+j,ci+(j+1))\mid 0\leq j\leq c-1\right\}.$ The first of these requests is a true positive, and the other $c-1$ are false negatives. Note that $\operatorname{\textsc{Alg}}$ cannot accept any of these $c$ requests. The optimal algorithm rejects the original request $(ci,c(i+1))$ and accepts all of the $c$ following unit-length requests.

•

If $\operatorname{\textsc{Alg}}$ rejects $(ci,c(i+1))$ , the phase ends with no further requests. In this case, $(ci,ci+1)$ is a false positive.

The contribution, $\eta_{i}$ , of phase $i$ to $|\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}}|$ is $\eta_{i}=c-1$ in the first case and $\eta_{i}=1$ in the second. Since the intervals in $\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}}$ are disjoint, we can write $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})=\sum_{i=0}^{p-1}\eta_{i}$ and it follows that $p\leq\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})\leq(c-1)p$ . Moreover, the net advantage of $\operatorname{\textsc{Opt}}$ over $\operatorname{\textsc{Alg}}$ in phase $i$ is at least $\eta_{i}$ : in the first case, $\operatorname{\textsc{Opt}}$ accepts $\eta_{i}+1$ and $\operatorname{\textsc{Alg}}$ accepts one request, and in the second case, $\operatorname{\textsc{Opt}}$ accepts $\eta_{i}=1$ and $\operatorname{\textsc{Alg}}$ accepts no requests. Given that there are $p$ phases, we can write $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})\leq\operatorname{\textsc{Opt}}(I_{w})-\sum_{i=0}^{p-1}\eta_{i}=\operatorname{\textsc{Opt}}(I_{w})-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})=(1-\gamma(\operatorname{\mathit{\hat{I}}}_{w},I_{w}))\operatorname{\textsc{Opt}}(I_{w}).$

In phases where $\operatorname{\textsc{Alg}}$ accepts the first request, $\operatorname{\textsc{Opt}}$ accepts $c$ times as many requests as $\operatorname{\textsc{Alg}}$ . In phases where $\operatorname{\textsc{Alg}}$ rejects the first request, $\operatorname{\textsc{Opt}}$ accepts one interval, and $\operatorname{\textsc{Alg}}$ accepts no intervals. Thus, $\operatorname{\textsc{Opt}}(I_{w})\geq c\cdot\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})\,.$ ∎

For $c=2$ , we get $\eta(\operatorname{\mathit{\hat{I}}}_{w},I)=p$ and $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=(1-\gamma(\operatorname{\mathit{\hat{I}}}_{w},I_{w}))\operatorname{\textsc{Opt}}(\operatorname{\mathit{\hat{I}}}_{w})$ . The next theorem shows that the competitive ratio of $\operatorname{\textsc{Trust}}$ compared to the lower bound of Theorem 4.1 is not tight. The proof follows from an adversarial sequence similar to that of Theorem 4.1 in which the payoff of $\operatorname{\textsc{Opt}}$ and $\eta$ grow in phases while the payoff of $\operatorname{\textsc{Trust}}$ stays 0.

Theorem 4.2 ()

For any integer $p\geq 1$ , there exists a prediction $\operatorname{\mathit{\hat{I}}}_{w}$ and an input sequence $I_{w}$ so that $\eta(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=p$ and $\operatorname{\textsc{Trust}}(\operatorname{\mathit{\hat{I}}}_{w},I_{w})=(1-2\gamma(\operatorname{\mathit{\hat{I}}}_{w},I_{w}))\operatorname{\textsc{Opt}}(I_{w})\,.$

Proof

Let the prediction be

$\operatorname{\mathit{\hat{I}}}_{w}=\bigcup_{i=0}^{p-1}\big{\{}(3i,3i+2),(3i+1,3i+3)\big{\}}\,.$

$\operatorname{\textsc{Trust}}$ chooses an optimal solution $\operatorname{\mathit{I^{\ast}}}$ from $\operatorname{\mathit{\hat{I}}}_{w}$ . For each $i$ , $\operatorname{\mathit{I^{\ast}}}$ will contain either $(3i,3i+2)$ or $(3i+1,3i+3)$ . If $(3i,3i+2)$ is in $\operatorname{\mathit{I^{\ast}}}$ , that interval will be in $\operatorname{\textsc{FP}}$ , and $\operatorname{\textsc{Opt}}$ will select $(3i+1,3i+3)$ , which will be a $\operatorname{\textsc{TP}}$ -interval in $I_{w}$ . Further, $I_{w}$ will contain the $\operatorname{\textsc{FN}}$ -interval, $(3i,3i+1)$ .

If, instead, $(3i+1,3i+3)$ is in $\operatorname{\mathit{I^{\ast}}}$ , that interval will be in $\operatorname{\textsc{FP}}$ , and $\operatorname{\textsc{Opt}}$ will select $(3i,3i+2)$ , which will be a $\operatorname{\textsc{TP}}$ -interval in $I_{w}$ . Further, $I_{w}$ will then contain the $\operatorname{\textsc{FN}}$ -interval, $(3i+2,3i+3)$ .

Thus, $\operatorname{\textsc{Opt}}(I_{w})=2p$ , and for each $i$ , the interval in $\operatorname{\textsc{FP}}$ and the interval in $\operatorname{\textsc{FN}}$ overlap, that so $\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})=p$ . Since $\operatorname{\mathit{I^{\ast}}}=\operatorname{\textsc{FP}}$ , $\operatorname{\textsc{Trust}}$ does not accept any intervals, so

[TABLE]

∎

4.1 $\operatorname{\textsc{TrustGreedy}}$

In this section, we introduce an algorithm $\operatorname{\textsc{TrustGreedy}}$ , $\operatorname{\textsc{TG}}$ , which achieves an optimal competitive ratio for interval scheduling.

4.1.1 The algorithm.

$\operatorname{\textsc{TG}}$ starts by choosing an optimal solution offline set $\operatorname{\mathit{I^{\ast}}}$ of the schedules in $\operatorname{\mathit{\hat{I}}}$ , and plans to accept those intervals in $\operatorname{\mathit{I^{\ast}}}$ and reject all others, and it just follows its plan, except possibly when the next request is in $\operatorname{\textsc{FN}}$ . $\operatorname{\textsc{TG}}$ maintains an updated plan, $A$ . Initially, $A$ is $\operatorname{\mathit{I^{\ast}}}$ . When a request, $r$ , is in $\operatorname{\textsc{FN}}$ , $\operatorname{\textsc{TG}}$ accepts if $r$ overlaps no previously accepted intervals and can be accepted by replacing at most one other interval in $A$ that ends no earlier than $r$ . In that case, $r$ is added to $A$ , possibly replacing an overlapping interval to maintain the feasibility of $A$ (no two intervals overlap). As a comment, only the first interval from $\operatorname{\textsc{FN}}$ that replaces an interval $r$ in the current $A$ is said to “replace” it. There may be other intervals from $\operatorname{\textsc{FN}}$ that overlap $r$ and are accepted by $\operatorname{\textsc{TG}}$ , but they are not said to “replace” it. We let $U$ denote the set of intervals in $\operatorname{\mathit{I^{\ast}}}\cap\operatorname{\textsc{FP}}$ that are not replaced during the execution of $\operatorname{\textsc{TG}}$ .

4.1.2 Analysis.

Let $\operatorname{\textsc{TG}}$ denote the set of intervals chosen by $\operatorname{\textsc{TrustGreedy}}$ on input $I$ and prediction $\operatorname{\mathit{\hat{I}}}$ , and $\operatorname{\textsc{Opt}}$ the intervals chosen by the optimal algorithm. We define the following subsets of $\operatorname{\textsc{TG}}$ and $\operatorname{\textsc{Opt}}$ :

•

$\operatorname{\textsc{TG}}^{\operatorname{\textsc{FN}}}=\operatorname{\textsc{TG}}\cap\operatorname{\textsc{FN}}$ and $\operatorname{\textsc{Opt}}^{\operatorname{\textsc{FN}}}=\operatorname{\textsc{Opt}}\cap\operatorname{\textsc{FN}}$

•

$\operatorname{\textsc{TG}}^{\operatorname{\textsc{TP}}}=\operatorname{\textsc{TG}}\cap\operatorname{\mathit{\hat{I}}}=\operatorname{\textsc{TG}}\cap\operatorname{\textsc{TP}}$ and $\operatorname{\textsc{Opt}}^{\operatorname{\textsc{TP}}}=\operatorname{\textsc{Opt}}\cap\operatorname{\mathit{\hat{I}}}=\operatorname{\textsc{Opt}}\cap\operatorname{\textsc{TP}}$

Lemma 1

Each interval $i\in\operatorname{\textsc{Opt}}^{\operatorname{\textsc{TP}}}$ overlaps an interval in $\operatorname{\mathit{I^{\ast}}}$ extending no further to the right than $i$ .

Proof

Assume to the contrary that there is no interval in $\operatorname{\mathit{I^{\ast}}}$ that overlaps $i$ and ends no later than $i$ . If $i$ does not overlap anything in $\operatorname{\mathit{I^{\ast}}}$ , we could have added $i$ to $\operatorname{\mathit{I^{\ast}}}$ and have a feasible solution (non-overlapping intervals), contradicting the fact that $\operatorname{\mathit{I^{\ast}}}$ is optimal. Thus, $i$ must overlap an interval $r$ in $\operatorname{\mathit{I^{\ast}}}$ , which, by assumption, must end strictly later than $i$ . This contradicts the construction of $\operatorname{\mathit{I^{\ast}}}$ , since $i$ would have been in $\operatorname{\mathit{I^{\ast}}}$ instead of $r$ . ∎

We define a set $O^{\operatorname{\textsc{FN}}}$ consisting of a copy of each interval in $\operatorname{\textsc{Opt}}^{\operatorname{\textsc{FN}}}$ and let $\operatorname{\mathcal{F}}=O^{\operatorname{\textsc{FN}}}\cup U$ . We define a mapping $f\colon\operatorname{\textsc{Opt}}\rightarrow\operatorname{\textsc{TG}}\cup\operatorname{\mathcal{F}}$ as follows. For each $i\in\operatorname{\textsc{Opt}}$ :

If there is an interval in $\operatorname{\mathit{I^{\ast}}}$ that overlaps $i$ and ends no later than $i$ , then let $r$ be the rightmost such interval.

(a)

If $r\in U\cup\operatorname{\textsc{TG}}^{\operatorname{\textsc{TP}}}$ , then $f(i)=r$ . 2. (b)

Otherwise, $r$ has been replaced by some interval $t$ . In this case, $f(i)=t$ . 2. 2.

Otherwise, by Lemma 1, $i$ belongs to $\operatorname{\textsc{Opt}}^{\operatorname{\textsc{FN}}}$ .

(a)

If there is an interval in $\operatorname{\textsc{TG}}^{\operatorname{\textsc{FN}}}$ that overlaps $i$ and ends no later than $i$ and an interval in $U$ that overlaps $i$ ’s right endpoint, let $r$ be the rightmost interval in $\operatorname{\textsc{TG}}^{\operatorname{\textsc{FN}}}$ that overlaps $i$ and ends no later than $i$ . In this case, $f(i)=r$ . 2. (b)

Otherwise, let $o_{i}$ be the copy of $i$ in $O^{\operatorname{\textsc{FN}}}$ . In this case, $f(i)=o_{i}$ .

We let $F$ denote the subset of $\operatorname{\mathcal{F}}$ mapped to by $f$ and note that in step 1a, intervals are added to $F\cap U$ when $r\in U$ . In step 2b, all intervals are added to $F\cap O^{\operatorname{\textsc{FN}}}$ .

Lemma 2

The mapping $f$ is an injection.

Proof

Intervals in $U\cup\operatorname{\textsc{TG}}^{\operatorname{\textsc{TP}}}$ are only mapped to in step 1a. Note that $U$ and $\operatorname{\textsc{TG}}$ are disjoint. If an interval $i\in\operatorname{\textsc{Opt}}$ is mapped to an interval $r\in U\cup\operatorname{\textsc{TG}}$ in this step, $i$ overlaps the right endpoint of $r$ . There can be only one interval in $\operatorname{\textsc{Opt}}$ overlapping the right endpoint of $r$ , so this part of the mapping is injective. Intervals in $\operatorname{\textsc{TG}}^{\operatorname{\textsc{FN}}}$ are only mapped to in steps 1b and 2a. In step 1b, only intervals that replace intervals in $\operatorname{\mathit{I^{\ast}}}$ are mapped to. Since each interval in $\operatorname{\textsc{TG}}^{\operatorname{\textsc{FN}}}$ replaces at most one interval in $\operatorname{\mathit{I^{\ast}}}$ and the right endpoint of each interval in $\operatorname{\mathit{I^{\ast}}}$ overlaps at most one interval in $\operatorname{\textsc{Opt}}$ , no interval is mapped to twice in step 1b. If, in step 2a, an interval, $i$ , is mapped to an interval, $r$ , $i$ overlaps the right endpoint of $r$ . There can be only one interval in $\operatorname{\textsc{Opt}}$ overlapping the right endpoint of $r$ , so no interval is mapped to twice in step 2a.

We now argue that no interval is mapped to in both steps 1b and 2a. Assume that an interval, $i_{1}$ , is mapped to an interval, $t$ , in step 1b. Then, there is an interval, $r$ , such that $r$ overlaps the right endpoint of $t$ and $i_{1}$ overlaps the right endpoint of $r$ . This means that the right endpoint of $i_{1}$ is no further to the left than the right endpoint of $t$ . Assume for the sake of contradiction that an interval $i_{2}\neq i_{1}$ is mapped to $t$ in step 2a. Then, $i_{2}$ overlaps the right endpoint of $t$ , and there is an interval, $u\in U$ , overlapping the right endpoint of $i_{2}$ . Since $i_{2}$ overlaps $t$ , $i_{2}$ must be to the left of $i_{1}$ . Since $i_{2}$ is mapped to $t$ , $t$ extends no further to the right than $i_{2}$ . Thus, since $r$ overlaps both $t$ and $i_{1}$ , $r$ must overlap the right endpoint of $i_{2}$ , and hence, $r$ overlaps $u$ . This is a contradiction since $r$ and $u$ are both in $\operatorname{\mathit{I^{\ast}}}$ . Intervals in $F\cap O^{\operatorname{\textsc{FN}}}$ are only mapped to in step 2b and no two intervals are mapped to the same interval in this step. ∎

Lemma 3

The subset $F$ of $\operatorname{\mathcal{F}}$ mapped to by $f$ is a feasible solution.

Proof

We first note that $F\cap U$ is feasible since $F\cap U\subseteq U\subseteq\operatorname{\mathit{I^{\ast}}}$ and $\operatorname{\mathit{I^{\ast}}}$ is feasible. Moreover, $F\cap O^{\operatorname{\textsc{FN}}}$ is feasible since the intervals of $F\cap O^{\operatorname{\textsc{FN}}}$ are identical to the corresponding subsets of $\operatorname{\textsc{Opt}}$ . Thus, we need to show that no interval in $F\cap U$ overlaps any interval in $F\cap O^{\operatorname{\textsc{FN}}}$ .

Consider an interval $u\in F\cap U$ mapped to from an interval $i\in\operatorname{\textsc{Opt}}$ . Since $i$ is not mapped to its own copy in $\operatorname{\mathcal{F}}$ , its copy does not belong to $F$ . Since $i\in\operatorname{\textsc{Opt}}$ , no interval in $F\cap O^{\operatorname{\textsc{FN}}}$ overlaps $i$ . Thus, we need to argue that $F\cap O^{\operatorname{\textsc{FN}}}$ contains no interval strictly to the left of $i$ overlapping $u$ .

Assume for the sake of contradiction that there is an interval $\ell\in F\cap O^{\operatorname{\textsc{FN}}}$ to the left of $i$ overlapping $u$ . Since $\ell$ ended up in $F$ although its right endpoint is overlapped by an interval from $U$ , there is no interval in $\operatorname{\mathit{I^{\ast}}}$ (because of step 1 in the mapping algorithm) or in $\operatorname{\textsc{TG}}^{\operatorname{\textsc{FN}}}$ (because of step 2a in the mapping algorithm) overlapping $\ell$ and ending no later than $\ell$ . Thus, $\operatorname{\mathit{I^{\ast}}}\cup\operatorname{\textsc{TG}}^{\operatorname{\textsc{FN}}}$ contains no interval strictly to the left of $u$ overlapping $\ell$ . This contradicts the fact that $u$ has not been replaced since the interval in $\operatorname{\textsc{Opt}}^{\operatorname{\textsc{FN}}}$ corresponding to $\ell$ could have replaced it. ∎

The following theorem follows from Lemmas 2 and 3.

Theorem 4.3 ()

For any prediction $\operatorname{\mathit{\hat{I}}}$ and any input sequence $I$ , we have

[TABLE]

Proof

We show that

[TABLE]

∎

5 Consistency-Robustness Trade-off

We study the trade-off between the competitive ratio of the interval scheduling algorithm when predictions are error-free (consistency) and when predictions are adversarial (robustness). The following proposition shows an obvious trade-off between the consistency and robustness of deterministic algorithms.

Proposition 2 ()

If a deterministic algorithm has non-zero consistency, $\alpha$ , it has robustness $\beta\leq\frac{1}{m}$ .

Proof

Consider a prediction that indicates the input to be one long interval, $\operatorname{\mathit{\hat{I}}}=(0,m)$ . In order to have non-zero consistency, $\alpha$ , the algorithm must accept this interval, if it is first in some sequence because it might be the only interval in that sequence.

Suppose an input $\sigma$ is $(0,m),(0,1),(1,2),(2,3),\ldots,(m-1,m)\,.$ Clearly, $\operatorname{\textsc{Opt}}$ accepts the $m$ intervals of length $1$ , giving robustness $\frac{1}{m}$ . ∎

The more interesting case is randomized algorithms. The proof of the following was inspired by the proof of Theorem 13.8 in [16] for the online case without predictions, and that $\mathrm{\Omega}(\log m)$ result was originally proven in [7].

Theorem 5.1

If a (possibly randomized) algorithm $\operatorname{\textsc{Alg}}$ is both $\alpha$ -consistent and $\beta$ -robust, then $\alpha\leq 1-\frac{\lfloor\log m\rfloor-1}{2}\beta$ and $\beta\leq\frac{2}{\lfloor\log m\rfloor-1}\cdot(1-\alpha)$ .

Proof

Let $r=\lfloor\log m\rfloor-1$ and let $m^{\prime}=2^{r+1}$ . Consider a prediction $\sigma=\langle\operatorname{\mathit{\hat{I}}}_{0},\operatorname{\mathit{\hat{I}}}_{1},\ldots,\operatorname{\mathit{\hat{I}}}_{r},\operatorname{\mathit{\hat{I}}}^{\prime}\rangle$ , where $\operatorname{\mathit{\hat{I}}}^{\prime}=\langle(0,1),(1,2),\ldots,(m^{\prime}-1,m^{\prime})\rangle$ and, for $0\leq i\leq r$ , $\operatorname{\mathit{\hat{I}}}_{i}=\langle(0,m^{\prime}/2^{i}),(m^{\prime}/2^{i},2m^{\prime}/2^{i}),\ldots,(m^{\prime}-m^{\prime}/2^{i},m^{\prime})\rangle$ . Note that $\operatorname{\mathit{\hat{I}}}_{i}$ consists of $2^{i}$ disjoint intervals of length $m^{\prime}/2^{i}$ . For $0\leq i\leq r$ , let $\sigma_{i}=\langle\operatorname{\mathit{\hat{I}}}_{0},\operatorname{\mathit{\hat{I}}}_{1},\ldots,\operatorname{\mathit{\hat{I}}}_{i}\rangle$ .

In order to maximize the number of small intervals that can be accepted if they arrive, an algorithm would minimize the (expected) fraction of the line occupied by the larger intervals, to leave space for the small intervals, while maintaining $\beta$ -robustness. Since $\operatorname{\textsc{Opt}}(\sigma_{0})=1$ and $\operatorname{\textsc{Alg}}$ is $\beta$ -robust, $E[\operatorname{\textsc{Alg}}(\sigma_{0})]\geq\beta$ . For $\sigma_{i}$ with $i\geq 1$ , $\operatorname{\textsc{Opt}}$ accepts all intervals in $\operatorname{\mathit{\hat{I}}}_{i}$ , so $\operatorname{\textsc{Opt}}(\sigma_{i})=2^{i}$ . To be $\beta$ -robust, the expected number of intervals of length at most $m^{\prime}/2^{i}$ that $\operatorname{\textsc{Alg}}$ accepts is at least $2^{i}\beta$ . Inductively, for $i\geq 1$ , by the linearity of expectations, this is at least $2^{i-1}\beta$ intervals of length $m^{\prime}/2^{i}$ , and these intervals have a total expected size of at least $2^{i-1}\beta\times m^{\prime}/2^{i}=\frac{m^{\prime}}{2}\beta$ . Again, by the linearity of expectations, for $\sigma_{r}$ , the expected sum of the lengths of the accepted intervals is at least $\sum_{i=0}^{r}\frac{m^{\prime}}{2}\beta=\frac{m^{\prime}(r+1)}{2}\beta$ .

From $\sigma_{r}$ , the expected number of intervals $\operatorname{\textsc{Alg}}$ must have accepted is at least $2^{r}\beta$ . If $\sigma$ is the actual input sequence, then the predictions are correct, so for $\operatorname{\textsc{Alg}}$ to be $\alpha$ -consistent, we must have $E[\operatorname{\textsc{Alg}}(\sigma^{\prime})]\geq m^{\prime}\alpha$ . Since also $2^{r}\beta+(m^{\prime}-\frac{m^{\prime}(r+1)}{2}\beta)\geq E[\operatorname{\textsc{Alg}}(\sigma^{\prime})]$ , we can combine these two inequalities and obtain $\frac{2^{r}}{m^{\prime}}\beta+1-\frac{r+1}{2}\beta\geq\alpha$ . Since $\frac{2^{r}}{m^{\prime}}=\frac{1}{2}$ , this reduces to $\alpha\leq 1-\frac{r}{2}\beta$ . Solving for $\beta$ , $\beta\leq\frac{2}{r}(1-\alpha)$ . ∎

Note that as $\alpha$ approaches 1 (optimal consistency), $\beta$ goes to [math] (worst-case robustness) and vice-versa. Next, we present a family of algorithms, RobustTrust, which has a parameter $0\leq\alpha\leq 1$ and works as follows. With a probability of $\alpha$ , RobustTrust applies $\operatorname{\textsc{TG}}$ . (Applying $\operatorname{\textsc{Trust}}$ , instead of $\operatorname{\textsc{TG}}$ , gives the same consistency and robustness results.) With probability $1-\alpha$ , RobustTrust ignores the predictions, and applies the Classify-and-Randomly-Select ( $\operatorname{\textsc{Crs}}$ ) algorithm described in Theorem 13.7 in [16]. $\operatorname{\textsc{Crs}}$ is strictly $\lceil\log m\rceil$ -competitive (they use ratios at least one). A similar algorithm was originally proven $O(\log m)$ -competitive in [7].

For completeness, we include the $\operatorname{\textsc{Crs}}$ algorithm. To avoid the problem of $m$ possibly not being a power of $2$ , we define $j=\lceil\log m\rceil$ and $m^{\prime}=2^{j}$ . Thus, the algorithm will define its behavior for a longer line and some sequences that cannot exist.

We define a set of $\lceil\log m\rceil$ levels for the possible requests. Since $m^{\prime}$ is a power of two, there is an odd number of edges, so the middle edge, $e_{1}$ , in the line is well defined. The set $E_{1}=\{e_{1}\}$ and Level 1 consists of all intervals containing $e_{1}$ . After Levels 1 through $i$ are defined, we define $E_{i+1}$ and Level $i+1$ as follows: After removing all edges in $E_{1}\cup E_{2}\cup\cdots\cup E_{i}$ from the line, we are left with $2^{i}$ segments, each consisting of $2^{j-i}$ vertices. The set $E_{i+1}$ consists of the middle edges of these segments, and Level $i+1$ consists of all intervals, not in any of the Levels $1$ through $i$ , but containing an edge in $E_{i+1}$ . Thus, the levels create a partition of all possible intervals.

The algorithm $\operatorname{\textsc{Crs}}$ initially chooses a level $i$ between $1$ and $j$ , each with probability $\frac{1}{j}$ . It accepts any interval in Level $i$ that does not overlap an interval it already has accepted. Any intervals not in Level $i$ are rejected.

When RobustTrust applies $\operatorname{\textsc{TG}}$ and the predictions are correct, it accepts exactly as many intervals as there are in the optimal solution. From these observations, we can get the following results.

Theorem 5.2 ()

RobustTrust* ( $\operatorname{\textsc{Rt}}$ ) with parameter $\alpha$ has consistency at least $\alpha$ and robustness at least $\frac{1-\alpha}{\lceil\log m\rceil}$ .*

Proof

We investigate the RobustTrust when all predictions are correct (the consistency) and when some predictions may be incorrect (robustness).

Suppose all predictions are correct. RobustTrust applies $\operatorname{\textsc{TG}}$ with probability $\alpha$ . Since $\operatorname{\textsc{TG}}$ is optimal when all predictions are correct, the expected payoff of RobustTrust is at least $\alpha\cdot\operatorname{\textsc{Opt}}$ . Therefore, the competitive ratio (consistency) of RobustTrust is at least $\alpha$ .

Suppose some predictions are incorrect. If the intervals in Level $i$ are the only intervals given, and $\operatorname{\textsc{Crs}}$ chooses that level, $\operatorname{\textsc{Crs}}$ accepts as many intervals as $\operatorname{\textsc{Opt}}$ does, since each interval in Level $i$ contains an edge in $E_{i}$ , and no intervals containing more than one edge in $E_{i}$ exist. Since the number of levels is $\lceil\log m\rceil$ , the expected number of intervals from $\operatorname{\textsc{Opt}}$ ’s configuration that $\operatorname{\textsc{Crs}}$ accepts on any given level is $\frac{1}{\lceil\log m\rceil}$ times the number of intervals $\operatorname{\textsc{Opt}}$ accepted from that level, so by the linearity of expectations, this totals $\frac{1}{\lceil\log m\rceil}\operatorname{\textsc{Opt}}$ . $\operatorname{\textsc{Crs}}$ is chosen with probability $1-\alpha$ , so the robustness is at most $\frac{1-\alpha}{\lceil\log m\rceil}$ . ∎

6 Experimental Results

We present an experimental evaluation of $\operatorname{\textsc{Trust}}$ and $\operatorname{\textsc{TrustGreedy}}$ in comparison with the $\operatorname{\textsc{Greedy}}$ algorithm, which serves as a baseline online algorithm, and $\operatorname{\textsc{Opt}}$ , which serves as the performance upper bound. We evaluate our algorithms using real-world scheduling data for parallel machines [19]. Each benchmark from [19] specifies the start and finish times of tasks as scheduled on parallel machines with several processors. We use these tasks to generate inputs to the interval scheduling problem; Table 1 details the interval scheduling inputs we generated from benchmarks of [19]. For each benchmark with $N$ tasks, we create an instance $I$ of an interval scheduling problem by randomly selecting $n=\lfloor N/2\rfloor$ tasks from the benchmark and randomly permuting them. This sequence serves as the input to all algorithms. To generate the prediction, we consider $1000$ equally distanced values of $d\in[0,n]$ . For each value of $d$ , we initiate the prediction set $\operatorname{\mathit{\hat{I}}}$ with the set of intervals in $I$ , remove $|\operatorname{\textsc{FN}}|=d$ randomly selected intervals from $\operatorname{\mathit{\hat{I}}}$ and add to it $|\operatorname{\textsc{FP}}|=d$ randomly selected intervals from the remaining $N-n$ tasks in the benchmark. The resulting set $\operatorname{\mathit{\hat{I}}}$ is given to $\operatorname{\textsc{Trust}}$ and $\operatorname{\textsc{TrustGreedy}}$ as prediction $\operatorname{\mathit{\hat{I}}}$ . For each value of $d$ , we compute the normalized error $\gamma(\operatorname{\mathit{\hat{I}}},I)=\frac{\operatorname{\textsc{Opt}}(\operatorname{\textsc{FN}}\cup\operatorname{\textsc{FP}})}{\operatorname{\textsc{Opt}}(I)}$ , and report the payoff of $\operatorname{\textsc{Trust}}$ and $\operatorname{\textsc{TrustGreedy}}$ as a function of $\gamma$ .

Figure 2 shows the results for two representative benchmarks from [19], namely, LLNL (the workload of the BlueGene/L system installed at Lawrence Livermore National Lab), SDSC (the workload log from San Diego Supercomputer Center), NASA-iPSC (scheduling log from Numerical Aerodynamic Simulation -NAS- Systems Division at NASA Ames Research Center) and CTC-SP2 (Cornell Theory Center IBM SP2 log). These four benchmarks are selected to represent a variety of input sizes and interval lengths. These four benchmarks are selected to represent a variety of input sizes and interval lengths. The results are aligned with our theoretical findings: $\operatorname{\textsc{Trust}}$ quickly becomes worse than $\operatorname{\textsc{Greedy}}$ as the error value increases, while $\operatorname{\textsc{TrustGreedy}}$ degrades gently as a function of the prediction error. In particular, $\operatorname{\textsc{TrustGreedy}}$ is better than $\operatorname{\textsc{Greedy}}$ for almost all error values. We note that $\operatorname{\textsc{Greedy}}$ performs better when there is less overlap between the input intervals, which is the case in LLNL compared to SDSC. In an extreme case, when no two intervals overlap, $\operatorname{\textsc{Greedy}}$ is trivially optimal. Nevertheless, even for LLNL, $\operatorname{\textsc{TrustGreedy}}$ is not much worse than $\operatorname{\textsc{Greedy}}$ for extreme values of error: the payoff for the largest normalized error of $\gamma=1.87$ was 5149 and 5198 for $\operatorname{\textsc{TrustGreedy}}$ and $\operatorname{\textsc{Greedy}}$ , respectively. Note that for SDSC, where there are more overlaps between intervals, $\operatorname{\textsc{TrustGreedy}}$ is strictly better than $\operatorname{\textsc{Greedy}}$ , even for the largest error values. It is worth noting that, in an extreme case, where $\operatorname{\textsc{FP}}=\operatorname{\textsc{FN}}=n$ , the predictions contain a completely different set from the input sequence. In that case, $|\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}}|=2n$ , and $\gamma=\frac{\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})}{\operatorname{\textsc{Opt}}(I)}$ takes values in $[1.5,2]$ .

We also experiment in a setting where false positives and negatives contribute differently to the error set. We generate the input sequences in the same way as in the previous experiments. To generate the prediction set $\operatorname{\mathit{\hat{I}}}$ , we consider $1000$ equally-distanced values of $d$ in the range $[0,n]$ as before. We first consider a setting in which all error is due to false negatives; for that, we generate $\operatorname{\mathit{\hat{I}}}$ by removing $d$ randomly selected intervals from $I$ . In other words, $\operatorname{\mathit{\hat{I}}}$ is a subset of the intervals in $I$ . Figures 3(a) and 3(c) illustrate the payoff of $\operatorname{\textsc{Trust}}$ and $\operatorname{\textsc{TrustGreedy}}$ in this case. We note that $\operatorname{\textsc{TrustGreedy}}$ is strictly better than both $\operatorname{\textsc{Trust}}$ and $\operatorname{\textsc{Greedy}}$ . In an extreme case, when $d=n$ , $\operatorname{\mathit{\hat{I}}}$ becomes empty and $\operatorname{\textsc{TrustGreedy}}$ becomes $\operatorname{\textsc{Greedy}}$ ; in other words, $\operatorname{\textsc{Greedy}}$ is the same algorithm as $\operatorname{\textsc{TrustGreedy}}$ with the empty predictions set $\operatorname{\mathit{\hat{I}}}$ .

We also consider a setting in which there are no false negatives. For that, we generate $\operatorname{\mathit{\hat{I}}}$ by adding $d$ intervals to $\operatorname{\mathit{\hat{I}}}$ . In other words, $\operatorname{\mathit{\hat{I}}}$ will be a superset of intervals in $I$ . Figures 3(a) and 3(c) illustrate the payoff of $\operatorname{\textsc{Trust}}$ and $\operatorname{\textsc{TrustGreedy}}$ in this case. In this case, the payoff of $\operatorname{\textsc{Trust}}$ and $\operatorname{\textsc{TrustGreedy}}$ is similar to the setting where both false positives and negatives contributed to the error set. In particular, $\operatorname{\textsc{Trust}}$ quickly becomes worse than $\operatorname{\textsc{Greedy}}$ as the error increases, while $\operatorname{\textsc{TrustGreedy}}$ degrades gently as a function of the prediction error.

7 Related Problems: Matching and Independent Set

In [27], the authors observe that finding disjoint paths on stars is equivalent to finding maximal matchings on general graphs, where each request in the input to the disjoint path selection bijects to an edge in the input graph for the matching problem. Therefore, we can extend the results of Section 3 to the following online matching problem. The input is a graph $G=(V,E)$ , where $V$ is known, and edges in $E$ appear in an online manner; upon arrival of an edge, it must be added to the matching or rejected. The prediction is a set $\operatorname{\mathit{\hat{E}}}$ that specifies edges in $E$ . As before, we use $\operatorname{\textsc{FP}}$ and $\operatorname{\textsc{FN}}$ to indicate the set of false positives and false negatives and define $\gamma(\operatorname{\mathit{\hat{E}}},E)=\frac{\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})}{\operatorname{\textsc{Opt}}(E)}$ , where $\operatorname{\textsc{Opt}}(S)$ indicates the size of the matching for graph $G=(V,S)$ .

The correspondence between the two problems is as follows: Consider a set of intervals on a star. Each such interval is a pair of vertices. We can assume no pair contains the star’s center since all such intervals should be accepted if they can be. For the matching problem, the pairs of vertices from the disjoint paths problem on the star can be the edges in the graph. A feasible solution to the disjoint paths problem corresponds to matching and vice versa. One can similarly consider an instance of a matching problem, and the endpoints of the edges can be the non-center vertices of the star in the disjoint paths problem.

Using this correspondence between disjoint paths on a star and matchings in general graphs, for the star $S_{8}$ , we get the following graph for matching: $G=(V,E)$ , where $V=\left\{1,2,3,4,5,6,7,8\right\}$ and

[TABLE]

See also Figure 4. Note that the edges in this graph correspond to the intervals that are used in the proof of Theorem 3.2. The proof can be simulated in this new setting so that the number of intervals accepted in the different cases in Theorem 3.2 is the same as the number of edges in the matchings found in the corresponding subgraphs of $G$ . Thus, the same result holds for matchings in any graph class containing this graph.

All edges have one even-numbered endpoint and one odd, so this includes the bipartite graph class. It is also planar but not an interval or chordal graph.

Given the correspondence between interval scheduling and the matching problem, the following is immediate from Theorems 3.1 and 3.2.

Proposition 3

For any instance $G=(V,E)$ of the online matching problem under the edge-arrival model and a prediction set $\operatorname{\mathit{\hat{E}}}$ , there is an algorithm $\operatorname{\textsc{Trust}}$ that matches at least $(1-2\gamma(\operatorname{\mathit{\hat{E}}},E))\operatorname{\textsc{Opt}}(G)$ edges. Moreover, there are instances $G_{w}=(V,E_{w})$ of the matching problem, along with predictions $\operatorname{\mathit{\hat{E}}}_{w}$ for which any deterministic algorithm matches at most $(1-2\gamma(\operatorname{\mathit{\hat{E}}},E)_{w})\operatorname{\textsc{Opt}}(G_{w})$ edges.

Using the correspondence between matchings in a graph, $G$ , and an independent set in the line graph of $G$ , we can get the same result for the independent set. The line graph of a graph, $G$ , has a vertex for each edge in $G$ and an edge between two vertices if the corresponding edges in $G$ share a vertex.

The line graph $G^{\prime}=(V^{\prime},E^{\prime})$ of the graph above used for matching is defined by

[TABLE]

where, for brevity, we use the notation $12$ to denote the vertex corresponding to the edge $(1,2)$ from $G$ . The set of edges is then

[TABLE]

Intervals from the proof in Theorem 3.2 correspond to vertices here. See also Figure 4.

We note that the graph $G^{\prime}$ is planar, but not outerplanar, since, contracting $67$ and $78$ into one vertex, $67\textrm{-}78$ , the sets $\left\{16,58\right\}$ and $\left\{18,56,67\textrm{-}78\right\}$ form a $K_{2,3}$ minor, which is a so-called forbidden subgraph for outerplanarity [20, 29]. Also, it is not chordal. However, the lower bound from Theorem 4.1, that for any deterministic algorithm $\operatorname{\textsc{Alg}}$ , there are instances $I$ and predictions $\operatorname{\mathit{\hat{I}}}$ such that $\operatorname{\textsc{Alg}}(\operatorname{\mathit{\hat{I}}},I)=\operatorname{\textsc{Opt}}(I)-\operatorname{\textsc{Opt}}(\operatorname{\textsc{FP}}\cup\operatorname{\textsc{FN}})$ clearly holds for independent sets in interval graphs, too, by considering the interval graph corresponding to a set of intervals on the line.

Using the correspondence between matchings in a graph, $G$ , and the independent set in the line graph of $G$ , we can get a similar result for the independent set under the vertex-arrival model.

Bibliography44

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Algorithms with predictions. https://algorithms-with-predictions.github.io/ , accessed: 2023-02-19
2[2] Angelopoulos, S., Dürr, C., Jin, S., Kamali, S., Renault, M.: Online computation with untrusted advice. In: Proc. ITCS. pp. 52:1–52:15 (2020)
3[3] Angelopoulos, S., Kamali, S., Shadkami, K.: Online bin packing with predictions. In: Proc. IJCAI. pp. 4574–4580 (2022)
4[4] Angelopoulos, S., Kamali, S.: Contract scheduling with predictions. In: 35th AAAI Conference on Artificial Intelligence (AAAI), 33rd Conference on Innovative Applications of Artificial Intelligence (IAAI), 11th Symposium on Educational Advances in Artificial Intelligence (EAAI). pp. 11726–11733. AAAI Press (2021)
5[5] Angelopoulos, S., Arsénio, D., Kamali, S.: Competitive sequencing with noisy advice. Co RR abs/2111.05281 (2021)
6[6] Antoniadis, A., Gouleakis, T., Kleer, P., Kolev, P.: Secretary and online matching problems with machine learned advice. In: Proc. Neur IPS (2020)
7[7] Awerbuch, B., Bartal, Y., Fiat, A., Rosén, A.: Competitive non-preemptive call control. In: Proc. (SODA). pp. 312–320 (1994)
8[8] Azar, Y., Leonardi, S., Touitou, N.: Flow time scheduling with uncertain processing time. In: Proc. STOC (2021)

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Online Interval Scheduling with Predictions††thanks: The first, second, and fourth authors were supported in part by the Danish Council for Independent Research grant DFF-0135-00018B.

Abstract

Keywords:

1 Introduction

1.1 Contributions

2 Model and Predictions

Proposition 1 ()

Proof

2.0.1 Alternative Error Measures.

3 Disjoint-Path Allocation

Theorem 3.1 ()

Proof

Theorem 3.2 ()

Proof

4 Interval Scheduling

Theorem 4.1 ()

Proof

Theorem 4.2 ()

Proof

4.1 \textscTrustGreedy⁡\operatorname{\textsc{TrustGreedy}}\textscTrustGreedy

4.1.1 The algorithm.

4.1.2 Analysis.

Lemma 1

Proof

Lemma 2

Proof

Lemma 3

Proof

Theorem 4.3 ()

Proof

5 Consistency-Robustness Trade-off

Proposition 2 ()

Proof

Theorem 5.1

Proof

Theorem 5.2 ()

Proof

6 Experimental Results

7 Related Problems: Matching and Independent Set

Proposition 3

4.1 $\operatorname{\textsc{TrustGreedy}}$