Loading paper

Newton-type algorithms for inverse optimization I: weighted bottleneck Hamming distance and $\ell_\infty$-norm objectives | Tomesphere

arXiv:2302.13411·math.OC·March 1, 2023

Newton-type algorithms for inverse optimization I: weighted bottleneck Hamming distance and $\ell_\infty$-norm objectives

Krist\'of B\'erczi, Lydia Mirabel Mendoza-Cadena, Kitti Varga

TL;DR

This paper develops efficient algorithms for inverse optimization problems focusing on weighted bottleneck Hamming distance and weighted _-norm objectives, providing combinatorial and min-max solutions with polynomial complexity.

Contribution

It introduces new polynomial-time algorithms for inverse optimization with specific distance measures, extending to multiple cost functions.

Findings

01

Purely combinatorial algorithm for weighted bottleneck Hamming distance

02

Min-max characterization and pseudo-polynomial algorithm for weighted _-norm

03

Extension methods for multiple cost functions

Abstract

In minimum-cost inverse optimization problems, we are given a feasible solution to an underlying optimization problem together with a linear cost function, and the goal is to modify the costs by a small deviation vector so that the input solution becomes optimal. The difference between the new and the original cost functions can be measured in several ways. In this paper, we focus on two objectives: the weighted bottleneck Hamming distance and the weighted $ℓ_{\infty}$ -norm. We consider a general model in which the coordinates of the deviation vector are required to fall within given lower and upper bounds. For the weighted bottleneck Hamming distance objective, we present a simple, purely combinatorial algorithm that determines an optimal deviation vector in strongly polynomial time. For the weighted $ℓ_{\infty}$ -norm objective, we give a min-max characterization for the optimal…

Equations106

m : = max ⎩ ⎨ ⎧ 0, F \in F max c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) < 0 \sum u (s) + s \in F ∖ F^{*} ℓ (s) > 0 \sum ℓ (s) ⎭ ⎬ ⎫ .

m : = max ⎩ ⎨ ⎧ 0, F \in F max c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) < 0 \sum u (s) + s \in F ∖ F^{*} ℓ (s) > 0 \sum ℓ (s) ⎭ ⎬ ⎫ .

p_{[δ ∣ ℓ, u ∣ w]} (s) : = ⎩ ⎨ ⎧ u (s) m ℓ (s) - m 0 if s \in F^{*}, u (s) \neq = + \infty and w (s) \leq δ, if s \in F^{*}, u (s) = + \infty and w (s) \leq δ, if s \in S ∖ F^{*}, ℓ (s) \neq = - \infty and w (s) \leq δ, if s \in S ∖ F^{*}, ℓ (s) = - \infty and w (s) \leq δ, otherwise .

p_{[δ ∣ ℓ, u ∣ w]} (s) : = ⎩ ⎨ ⎧ u (s) m ℓ (s) - m 0 if s \in F^{*}, u (s) \neq = + \infty and w (s) \leq δ, if s \in F^{*}, u (s) = + \infty and w (s) \leq δ, if s \in S ∖ F^{*}, ℓ (s) \neq = - \infty and w (s) \leq δ, if s \in S ∖ F^{*}, ℓ (s) = - \infty and w (s) \leq δ, otherwise .

(c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

(c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

= (c (F^{*}) - s \in F^{*} \sum p_{[δ ∣ ℓ, u ∣ w]} (s)) - (c (F) - s \in F \sum p_{[δ ∣ ℓ, u ∣ w]} (s))

= c (F^{*}) - c (F) - s \in F^{*} ∖ F \sum p_{[δ ∣ ℓ, u ∣ w]} (s) + s \in F ∖ F^{*} \sum p_{[δ ∣ ℓ, u ∣ w]} (s)

= c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty w (s) \leq δ \sum u (s) - s \in F^{*} ∖ F u (s) = + \infty w (s) \leq δ \sum m + s \in F ∖ F^{*} ℓ (s) \neq = - \infty w (s) \leq δ \sum ℓ (s) + s \in F ∖ F^{*} ℓ (s) = - \infty w (s) \leq δ \sum (- m) .

(c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

(c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

= c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty w (s) \leq δ \sum u (s) + s \in F ∖ F^{*} ℓ (s) \neq = - \infty w (s) \leq δ \sum ℓ (s)

\leq c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty w (s) \leq δ \sum p (s) + s \in F ∖ F^{*} ℓ (s) \neq = - \infty w (s) \leq δ \sum p (s)

= (c - p) (F^{*}) - (c - p) (F)

\leq 0.

(c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

(c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

= c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty w (s) \leq δ \sum u (s) - s \in F^{*} ∖ F u (s) = + \infty w (s) \leq δ \sum m + s \in F ∖ F^{*} ℓ (s) \neq = - \infty w (s) \leq δ \sum ℓ (s) + s \in F ∖ F^{*} ℓ (s) = - \infty w (s) \leq δ \sum (- m)

\leq c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty w (s) \leq δ \sum u (s) + s \in F ∖ F^{*} ℓ (s) \neq = - \infty w (s) \leq δ \sum ℓ (s) - 1 \cdot m

\leq c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) < 0 w (s) \leq δ \sum u (s) + s \in F ∖ F^{*} ℓ (s) > 0 w (s) \leq δ \sum ℓ (s) - m

= c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) < 0 \sum u (s) + s \in F ∖ F^{*} ℓ (s) > 0 \sum ℓ (s) - m

\leq 0,

(c - p_{[δ^{'} ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ^{'} ∣ ℓ, u ∣ w]}) (F)

(c - p_{[δ^{'} ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ^{'} ∣ ℓ, u ∣ w]}) (F)

= c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty w (s) \leq δ^{'} \sum u (s) - s \in F^{*} ∖ F u (s) = + \infty w (s) \leq δ^{'} \sum m + s \in F ∖ F^{*} ℓ (s) \neq = - \infty w (s) \leq δ^{'} \sum ℓ (s) + s \in F ∖ F^{*} ℓ (s) = - \infty w (s) \leq δ^{'} \sum (- m)

\leq c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty w (s) \leq δ \sum u (s) - s \in F^{*} ∖ F u (s) = + \infty w (s) \leq δ \sum m + s \in F ∖ F^{*} ℓ (s) \neq = - \infty w (s) \leq δ \sum ℓ (s) + s \in F ∖ F^{*} ℓ (s) = - \infty w (s) \leq δ \sum (- m)

= (c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

\leq 0,

p_{[δ ∣ ℓ, u ∣ w]} (s) : = ⎩ ⎨ ⎧ ℓ (s) δ / w (s) u (s) ℓ (s) - δ / w (s) u (s) if s \in F^{*} and δ / w (s) < ℓ (s), if s \in F^{*} and ℓ (s) \leq δ / w (s) \leq u (s), if s \in F^{*} and u (s) < δ / w (s), if s \in S ∖ F^{*} and - δ / w (s) < ℓ (s), if s \in S ∖ F^{*} and ℓ (s) \leq - δ / w (s) \leq u (s), if s \in S ∖ F^{*} and u (s) < - δ / w (s) .

p_{[δ ∣ ℓ, u ∣ w]} (s) : = ⎩ ⎨ ⎧ ℓ (s) δ / w (s) u (s) ℓ (s) - δ / w (s) u (s) if s \in F^{*} and δ / w (s) < ℓ (s), if s \in F^{*} and ℓ (s) \leq δ / w (s) \leq u (s), if s \in F^{*} and u (s) < δ / w (s), if s \in S ∖ F^{*} and - δ / w (s) < ℓ (s), if s \in S ∖ F^{*} and ℓ (s) \leq - δ / w (s) \leq u (s), if s \in S ∖ F^{*} and u (s) < - δ / w (s) .

(c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

(c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

= (c (F^{*}) - s \in F^{*} \sum p_{[δ ∣ ℓ, u ∣ w]} (s)) - (c (F) - s \in F \sum p_{[δ ∣ ℓ, u ∣ w]} (s))

= c (F^{*}) - c (F) - s \in F^{*} ∖ F \sum p_{[δ ∣ ℓ, u ∣ w]} (s) + s \in F ∖ F^{*} \sum p_{[δ ∣ ℓ, u ∣ w]} (s)

\displaystyle{}~{}~{}=c(F^{*})-c(F)-\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)\geq\ell(s)\\ \delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)\geq\ell(s)\\ \delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)\geq\ell(s)\\ \delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)\geq\ell(s)\\ \delta/w(s)\leq u(s)\end{subarray}$}}}\frac{\delta}{w(s)}-\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta/w(s)\end{subarray}$}}}u(s)+\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)<\ell(s)\end{subarray}$}}}\ell(s)+\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)\geq\ell(s)\\ -\delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)\geq\ell(s)\\ -\delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)\geq\ell(s)\\ -\delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)\geq\ell(s)\\ -\delta/w(s)\leq u(s)\end{subarray}$}}}\left(-\,\frac{\delta}{w(s)}\right)

\leq c (F^{*}) - c (F) - s \in F^{*} ∖ F \sum p (s) + s \in F ∖ F^{*} \sum p (s)

= (c - p) (F^{*}) - (c - p) (F)

\leq 0,

(c - p_{[δ^{'} ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ^{'} ∣ ℓ, u ∣ w]}) (F)

(c - p_{[δ^{'} ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ^{'} ∣ ℓ, u ∣ w]}) (F)

\displaystyle{}~{}~{}=c(F^{*})-c(F)-\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta^{\prime}/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta^{\prime}/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta^{\prime}/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta^{\prime}/w(s)<\ell(s)\end{subarray}$}}}\ell(s)-\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta^{\prime}/w(s)\geq\ell(s)\\ \delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta^{\prime}/w(s)\geq\ell(s)\\ \delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta^{\prime}/w(s)\geq\ell(s)\\ \delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta^{\prime}/w(s)\geq\ell(s)\\ \delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}}\frac{\delta^{\prime}}{w(s)}-\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta^{\prime}/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta^{\prime}/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta^{\prime}/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta^{\prime}/w(s)\end{subarray}$}}}u(s)

\displaystyle{}~{}~{}~{}~{}+\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta^{\prime}/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta^{\prime}/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta^{\prime}/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta^{\prime}/w(s)<\ell(s)\end{subarray}$}}}\ell(s)+\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta^{\prime}/w(s)\geq\ell(s)\\ -\delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta^{\prime}/w(s)\geq\ell(s)\\ -\delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta^{\prime}/w(s)\geq\ell(s)\\ -\delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta^{\prime}/w(s)\geq\ell(s)\\ -\delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}}\left(-\,\frac{\delta^{\prime}}{w(s)}\right)+\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F\setminus F^{*}\\ u(s)<-\delta^{\prime}/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ u(s)<-\delta^{\prime}/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ u(s)<-\delta^{\prime}/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ u(s)<-\delta^{\prime}/w(s)\end{subarray}$}}}\ell(s)

\displaystyle{}~{}~{}\leq c(F^{*})-c(F)-\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)<\ell(s)\end{subarray}$}}}\ell(s)-\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)\geq\ell(s)\\ \delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)\geq\ell(s)\\ \delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)\geq\ell(s)\\ \delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ \delta/w(s)\geq\ell(s)\\ \delta^{\prime}/w(s)\leq u(s)\end{subarray}$}}}\frac{\delta}{w(s)}-\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F^{*}\setminus F\\ u(s)<\delta/w(s)\end{subarray}$}}}u(s)

\displaystyle{}~{}~{}~{}~{}+\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)<\ell(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)<\ell(s)\end{subarray}$}}}\ell(s)+\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)\geq\ell(s)\\ -\delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)\geq\ell(s)\\ -\delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)\geq\ell(s)\\ -\delta/w(s)\leq u(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ -\delta/w(s)\geq\ell(s)\\ -\delta/w(s)\leq u(s)\end{subarray}$}}}\left(-\frac{\delta}{w(s)}\right)+\sum_{\mathchoice{\makebox[0.8pt]{$\displaystyle\begin{subarray}{c}s\in F\setminus F^{*}\\ u(s)<-\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\textstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ u(s)<-\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ u(s)<-\delta/w(s)\end{subarray}$}}{\makebox[0.8pt]{$\scriptscriptstyle\begin{subarray}{c}s\in F\setminus F^{*}\\ u(s)<-\delta/w(s)\end{subarray}$}}}\ell(s)

= (c - p_{[δ ∣ ℓ, u ∣ w]}) (F^{*}) - (c - p_{[δ ∣ ℓ, u ∣ w]}) (F)

\leq 0,

W (F) : = ⎩ ⎨ ⎧ \frac{1}{s \in F ^{*} ∖ F u ( s ) = + \infty \sum \frac{1}{w ( s )} + s \in F ∖ F ^{*} ℓ ( s ) = - \infty \sum \frac{1}{w ( s )}} 0 if the divisor is not 0, otherwise,

W (F) : = ⎩ ⎨ ⎧ \frac{1}{s \in F ^{*} ∖ F u ( s ) = + \infty \sum \frac{1}{w ( s )} + s \in F ∖ F ^{*} ℓ ( s ) = - \infty \sum \frac{1}{w ( s )}} 0 if the divisor is not 0, otherwise,

m_{1}

m_{1}

m_{2}

m_{3}

m : = max {0, m_{1}, m_{2}, m_{3}} .

m : = max {0, m_{1}, m_{2}, m_{3}} .

0

0

= c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty \sum u (s) - s \in F^{*} ∖ F u (s) = + \infty \sum \frac{m}{w ( s )} + s \in F ∖ F^{*} ℓ (s) \neq = - \infty \sum ℓ (s) + s \in F ∖ F^{*} ℓ (s) = - \infty \sum (- \frac{m}{w ( s )})

= c (F^{*}) - c (F) - s \in F^{*} ∖ F u (s) \neq = + \infty \sum u (s) + s \in F ∖ F^{*} ℓ (s) \neq = - \infty \sum ℓ (s) - m s \in F^{*} ∖ F u (s) = + \infty \sum \frac{1}{w ( s )} + s \in F ∖ F^{*} ℓ (s) = - \infty \sum \frac{1}{w ( s )} .

0

0

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Matrix Theory and Algorithms

Full text

Newton-type algorithms for inverse optimization I:

weighted bottleneck Hamming distance and $\ell_{\infty}$ -norm objectives

Kristóf Bérczi

MTA-ELTE Momentum Matroid Optimization Research Group and ELKH-ELTE Egerváry Research Group, Department of Operations Research, Eötvös Loránd University, Budapest, Hungary. Email: [email protected], [email protected], [email protected].

Lydia Mirabel Mendoza-Cadena

MTA-ELTE Momentum Matroid Optimization Research Group and ELKH-ELTE Egerváry Research Group, Department of Operations Research, Eötvös Loránd University, Budapest, Hungary. Email: [email protected], [email protected], [email protected].

Kitti Varga

MTA-ELTE Momentum Matroid Optimization Research Group and ELKH-ELTE Egerváry Research Group, Department of Operations Research, Eötvös Loránd University, Budapest, Hungary. Email: [email protected], [email protected], [email protected].

Abstract

In minimum-cost inverse optimization problems, we are given a feasible solution to an underlying optimization problem together with a linear cost function, and the goal is to modify the costs by a small deviation vector so that the input solution becomes optimal.

The difference between the new and the original cost functions can be measured in several ways. In this paper, we focus on two objectives: the weighted bottleneck Hamming distance and the weighted $\operatorname{\ell_{\infty}}$ -norm. We consider a general model in which the coordinates of the deviation vector are required to fall within given lower and upper bounds. For the weighted bottleneck Hamming distance objective, we present a simple, purely combinatorial algorithm that determines an optimal deviation vector in strongly polynomial time. For the weighted $\operatorname{\ell_{\infty}}$ -norm objective, we give a min-max characterization for the optimal solution, and provide a pseudo-polynomial algorithm for finding an optimal deviation vector that runs in strongly polynomial time in the case of unit weights. For both objectives, we assume that an algorithm with the same time complexity for solving the underlying combinatorial optimization problem is available.

For both objectives, we also show how to extend the results to inverse optimization problems with multiple cost functions.

Keywords: Algorithm, Bottleneck Hamming distance, Infinite norm, Inverse optimization, Min-max theorem

1 Introduction

Inverse optimization problems have long been the focus of research due to their wide applicability in both theory and practice. The roots of inverse optimization go back to the work of Burton and Toit [6] who studied the inverse shortest paths problem, that is, the problem of recovering the edge costs given some information about the shortest paths in the graph. Since their pioneering work, countless of applications and extensions emerged; we refer the interested reader to [23] for the basics and to [14, 8] for surveys.

In a classical optimization problem, we are given a set of feasible solutions together with a linear cost function, and the goal is to find a feasible solution that minimizes or maximizes the cost. In contrast, in an inverse optimization problem we are also given a fixed feasible solution, and the goal is to modify the costs ‘as little as possible’ so that the input solution becomes optimal. There are various ways to measure the deviation of the new cost function from the original one, and, as one would expect, the choice of the objective greatly affects the complexity of the problem. In order to avoid confusion, we refer to the solutions of the inverse optimization problem and those of the underlying combinatorial optimization problem as feasible deviation vectors and solutions, respectively.

In the past decades, inverse optimization problems found numerous applications. As an example, let us briefly describe the Pathway concordance problem [7]. A clinical pathway describes a standardized sequence of steps for managing a clinical process in the delivery of care for a specific disease, with the aim of optimizing the outcome on a patient or population-level. These processes are determined by multidisciplinary medical experts, and have been shown to efficiently improve e.g. patient survival and satisfaction, wait times, and cost of care. However, patients’ journeys through the healthcare system can differ significantly from the recommended pathways, which raises the problem of measuring the concordance of patient-traversed pathways against the recommended ones. The problem can be modeled by a directed graph whose vertices correspond to activities that the patient can undertake, and the arcs indicate that a patient went from one activity to another. The ‘cost’ of a patient undertaking or missing certain activities and traversing arcs can be modeled by arc costs. The goal is to determine arc costs such that the reference pathways are optimal, that is, they are shortest paths between the corresponding start and end vertices. Then, assuming such arc costs are available, the journey of any patient can be scored based on the cost of the associated directed walk through the network.

The present work is the first member of a series of papers. Our general goal is to give min-max characterizations and simple algorithms for inverse optimization problems under various objectives. Here we set up the basics of a general framework that uses a Newton-type approach for finding an optimal deviation vector, and derive algorithms for the weighted bottleneck Hamming distance and weighted $\operatorname{\ell_{\infty}}$ -norm objectives that follow the proposed scheme.

In the second part [5], we focus on a novel objective called the weighted span that aims at finding a ‘balanced’ or ‘fair’ deviation vector, and we propose an analogous algorithm for that objective. Nevertheless, the analysis of the algorithm there is much more involved due to the different nature of the span compared to the $\operatorname{\ell_{\infty}}$ -norm.

Previous work.

Inverse optimization under the weighted bottleneck Hamming distance objective111This objective is sometimes called weighted bottleneck-type Hamming distance objective in the literature. has been of great interest recently. One of the earliest results is due to Duin and Volgenant [10] who considered the inverse minimum spanning tree, the inverse shortest path tree and the linear assignment problems. Liu and Yao [18] gave a strongly polynomial algorithm for the weighted inverse maximum perfect matching problem. An algorithm with an improved running time based on a binary search technique was later given by Tayyebi [24], and an analogous algorithm for the inverse matroid problem was given by Aman, Hassanpour and Tayyebi [3]. Mohaghegh and Baroughi Bonab [20] showed that the inverse min-max spanning $r$ -arborescence problem is solvable in strongly polynomial time. Guan, He, Pardalos and Zhang [13] presented a mathematical model for the inverse max+sum spanning tree problem, together with a method to check feasibility and a binary search algorithm for solving it. Karimi, Aman and Dolati [16] studied the inverse shortest $s$ - $t$ path problem and provided an LP-based algorithm which can be applied for some inverse multiobjective problem as well. Tayyebi and Aman [25] considered a general inverse linear programming problem, and proposed an algorithm that is based on a binary search technique. As an application, they specialized the method for solving the corresponding inverse minimum-cost flow problem in strongly polynomial time. Nguyen and Hung [21] studied the so-called inverse connected $p$ -median problem under the unweighted bottleneck Hamming distance objective. In this problem, the goal is to modify vertex weights of a block graph at minimum total cost so that a predetermined set of $p$ connected vertices becomes a connected $p$ -median on the perturbed block graph. They formulated the problem as a quasiconvex univariate optimization problem, and developed a combinatorial algorithm that solves the problem in polynomial time. Jiang, Liu and Peng [15] presented a strongly polynomial algorithm for the inverse minimum flow problem. Dong, Li and Yang [9] addressed the partial inverse min-max spanning tree problem, and presented two algorithms to solve the problem in polynomial time.

Inverse problems under the $\ell_{\infty}$ -norm have been studied in various settings. Xiaoguang [26] considered the inverse optimization problem of submodular functions on digraphs, and gave an LP-based algorithm that solves most inverse network optimization problems in polynomial time. Zhang and Liu [29] suggested a method for solving a general inverse LP problem including upper and lower bound constraints. In a later paper [19], the same authors studied the inverse maximum-weight matching problem in non-bipartite graphs under the $\ell_{\infty}$ -norm objective. They showed that the problem can be formulated as a maximum-mean alternating cycle problem in an undirected network, and can be solved in polynomial time by a binary search algorithm and in strongly polynomial time by an ascending algorithm. Using LP descriptions, Ahuja and Orlin [2] proved that if an optimization problem can be modeled as an LP, then the same holds for the underlying inverse optimization problem under $\ell_{1}$ - or $\ell_{\infty}$ -norm objectives. Furthermore, if the optimization problem is polynomially solvable for linear cost functions, then the inverse counterparts with $\ell_{1}$ - and $\ell_{\infty}$ -norms are also polynomially solvable. In [30], Zhang and Liu proposed a model that generalizes numerous inverse combinatorial optimization problems when no bounds are given on the coordinates of the deviation vector. Yang and Zhang [27] presented strongly polynomial algorithms to solve the inverse min-max spanning tree and the inverse maximum capacity path problems when bounds are also given on the coordinates of the deviation vector. Lasserre [17] considered the inverse optimization problem associated with the polynomial program and a given current feasible solution, and provided a systematic numerical scheme to compute an inverse optimal solution. Ahmadian, Bhaskar, Sanità, and Swamy [1] studied integral inverse optimization problems from an approximation point of view. They obtained tight or nearly-tight approximation guarantees for various inverse optimization problems, and some of their results apply for $\ell_{\infty}$ -norm as well. Zhang, Guan, and Zhang [28] provided a mathematical model of the inverse spanning tree problem, gave a characterization of optimal solutions, and developed a strongly polynomial algorithm for determining an optimal deviation vector. Recently, the authors [4] introduced inverse optimization problems with multiple cost functions, and studied the inverse minimum-cost $s$ - $t$ path, $r$ -arborescence, and bipartite perfect matching problems.

Most papers on inverse optimization consider algorithmic aspects, and so they do not provide a min-max characterization for the optimum value in question. Recently, Frank and Murota [12] developed a general min-max formula for the minimum of an integer-valued separable discrete convex function, where the minimum is taken over the set of integral elements of a box total dual integral polyhedron. Their approach covers and even extends a wide class of inverse combinatorial optimization problems. Nevertheless, our problems do not fit in the box-TDI framework as neither the bottleneck Hamming distance nor the $\ell_{\infty}$ -norm is separable convex.

Problem definitions.

We denote the sets of real and positive real numbers by $\mathbb{R}$ and $\mathbb{R}_{+}$ , respectively. For a positive integer $k$ , we use $[k]\coloneqq\{1,\dots,k\}$ . Let $S$ be a ground set of size $n$ . Given subsets $X,Y\subseteq S$ , the symmetric difference of $X$ and $Y$ is denoted by $X\triangle Y\coloneqq(X\setminus Y)\cup(Y\setminus X)$ . For a weight function $w\in\mathbb{R}_{+}^{S}$ , the total sum of its values over $X$ is denoted by $w(X)\coloneqq\sum\{w(s)\mid s\in X\}$ , where the sum over the empty set is always considered to be [math]. Furthermore, we define $\frac{1}{w}(X)\coloneqq\sum\big{\{}\frac{1}{w(s)}\bigm{|}s\in X\big{\}}$ , and set $\|w\|_{-1}\coloneqq\frac{1}{w}(S)$ . When the weights are rational numbers, then the values can be re-scaled as to satisfy $1/w(s)$ being an integer for each $s\in S$ . Throughout the paper, we assume that $w$ is given in such a form without explicitly mentioning it, implying that $\frac{1}{w}(X)$ is a non-negative integer for every $X\subseteq S$ . By convention, we define $\min\{\emptyset\}=+\infty$ and $\max\{\emptyset\}=-\infty$ .

Let $S$ be a finite ground set, $\mathcal{F}\subseteq 2^{S}$ be a collection of feasible solutions for an underlying optimization problem, $F^{*}\in\mathcal{F}$ be an input solution, $c\in\mathbb{R}^{S}$ be a cost function, $w\in\mathbb{R}_{+}^{S}$ be a positive weight function, and $\ell\colon S\to\mathbb{R}\cup\{-\infty\}$ and $u\colon S\to\mathbb{R}\cup\{+\infty\}$ be lower and upper bounds, respectively, such that $\ell\leq u$ . We assume that an oracle $\mathcal{O}$ is also available that determines an optimal solution of the underlying optimization problem $(S,\mathcal{F},c^{\prime})$ for any cost function $c^{\prime}\in\mathbb{R}^{S}$ .

In the constrained minimum-cost inverse optimization problem under the weighted bottleneck Hamming distance objective $\big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ , we seek a deviation vector $p\in\mathbb{R}^{S}$ such that

(a)

$F^{*}$ is a minimum cost member of $\mathcal{F}$ with respect to $c-p$ , 2. (b)

$p$ is within the bounds $\ell\leq p\leq u$ , and 3. (c)

$\mathrm{H}_{\infty,w}(p)\coloneqq\max\left\{w(s)\mid s\in S,\,p(s)\neq 0\right\}$ is minimized.

In the constrained minimum-cost inverse optimization problem under weighted $\ell_{\infty}$ -norm objective $(S,\mathcal{F},F^{*},c,\ell,u,\|\cdot\|_{\infty,w})$ , condition (c) modifies to

(c’)

$\|p\|_{\infty,w}\coloneqq\max\left\{w(s)\cdot|p(s)|\bigm{|}s\in S\right\}$

Due to the lower and upper bounds $\ell$ and $u$ , it might happen that there exists no deviation vector $p$ satisfying the requirements. A deviation vector is called feasible if it satisfies conditions (a) and (b), and optimal if in addition it attains the minimum in (c) or (c’). We denote the problems by $\big{(}S,\mathcal{F},F^{*},c,-\infty,+\infty,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ and $(S,\mathcal{F},F^{*},c,-\infty,+\infty,\|\cdot\|_{\infty,w})$ when no bounds are given on the coordinates of $p$ at all, and call these problems unconstrained.

As an extension, we also consider multiple underlying optimization problems at the same time. In this setting, instead of a single cost function, we are given $k$ cost functions $c^{1},\dots,c^{k}$ together with an input solution $F^{*}$ , and our goal is to find a single deviation vector $p$ with $\ell\leq p\leq u$ such that $F^{*}$ has minimum cost with respect to $c^{j}-p$ for all $j\in[k]$ . In other words, condition (a) modifies to

(a’)

$F^{*}$ is a minimum cost member of $\mathcal{F}$ with respect to $c^{j}-p$ for $j\in[k]$ .

In case of multiple cost functions, we use $\{c^{j}\}_{j\in[k]}$ instead of $c$ when denoting the problems.

Our results.

Our main results are simple, purely combinatorial algorithms that efficiently solve the above, general problems. For the weighted bottleneck Hamming distance objective, we present an algorithm that makes $O(n)$ calls to the oracle $\mathcal{O}$ . In particular, the algorithm runs in strongly polynomial time, assuming that a strongly polynomial algorithm for the underlying optimization problem is available.

For the weighted $\operatorname{\ell_{\infty}}$ -norm objective, we give an algorithm for finding an optimal deviation vector that makes $O(n\cdot\|w\|_{-1})$ calls to the oracle $\mathcal{O}$ . In particular, the algorithm runs in strongly polynomial time for unit weights if the oracle $\mathcal{O}$ for the underlying optimization problem can be realized by a strongly polynomial algorithm. Furthermore, we provide a min-max characterization for the minimum size of an optimal deviation vector in the unconstrained setting, i.e. when $\ell\equiv-\infty$ and $u\equiv+\infty$ .

For both objectives, we show how to solve the problem when multiple cost functions are given instead of a single one.

The proposed algorithms do not rely on the standard techniques commonly used in the literature, i.e. binary search and LP-based methods. Instead, we suggest a Newton-type algorithm that iteratively updates the cost function, resembling the approach of Zhang and Liu [30] for the $\operatorname{\ell_{\infty}}$ -norm objective. They showed that if the inverse optimization problem can be reformulated as a certain maximization problem using dominant sets, then Radzik’s method [22] provides a strongly polynomial algorithm for finding an optimal solution. In contrast, our algorithms apply to general inverse optimization problems. Furthermore, we consider the constrained setting in which the coordinates of the deviation vector are ought to fall within given lower and upper bounds, hence the cost function has to be updated carefully. For these reasons, Radzik’s method cannot be applied to get a strongly polynomial algorithm.

A high-level description of the algorithm is given by the following scheme.

Choose $p_{0}$ minimizing the objective such that $\ell\leq p_{0}\leq u$ , set $c_{0}\coloneqq c-p_{0}$ and $i\coloneqq 0$ .
Let $F_{i}$ be an optimal solution of the underlying optimization problem with respect to $c_{i}$ .
If $c_{i}(F^{*})=c_{i}(F_{i})$ , then $p_{i}$ is an optimal deviation vector and stop. Otherwise, find $p_{i+1}$ satisfying $\ell\leq p_{i+1}\leq u$ and $(c-p_{i+1})(F^{*})=(c-p_{i+1})(F_{i})$ , and minimizing the objective. If no such $p_{i+1}$ exists, then the problem is infeasible and stop. Otherwise set $i\leftarrow i+1$ and go back to Step 2.

The rest of the paper is organized as follows. Section 2 presents a strongly polynomial algorithm for the weighted bottleneck Hamming distance objective, including the case of multiple cost functions. The weighted $\operatorname{\ell_{\infty}}$ -norm objective is discussed in Section 3, where first we provide a min-max characterization for the weighted $\operatorname{\ell_{\infty}}$ -norm of an optimal deviation vector in the unconstrained setting, then give an algorithm for the constrained setting, including the case of multiple cost functions.

2 Weighted bottleneck Hamming distance objective

As an illustration of our technique, we first consider the problem of minimizing the weighted bottleneck Hamming distance of the original and the modified cost functions. In Section 2.1, we show that there exists an optimal deviation vector having a restricted structure. We characterize the feasibility of the problem in Section 2.2. The algorithm for the case of a single cost function is presented in Section 2.3. We explain how to extend the algorithm for multiple cost functions in Section 2.4.

2.1 Optimal deviation vectors

Consider an instance $\big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ of the constrained minimum-cost inverse optimization problem under the weighted bottleneck Hamming distance objective, where ${w\in\mathbb{R}^{S}_{+}}$ is a positive weight function. For ease of discussion, we define

[TABLE]

Recall that $\mathcal{O}$ denotes an algorithm that determines an optimal solution of the underlying optimization problem $(S,\mathcal{F},c^{\prime})$ for any cost function $c^{\prime}$ . Observe that if $\mathcal{O}$ runs in strongly polynomial time, then the value of $m$ can be determined in strongly polynomial time. For any $\delta\geq 0$ , let $p_{[\delta|\ell,u|w]}\colon S\to\mathbb{R}$ be defined as

[TABLE]

The following lemma shows that there exists an optimal deviation vector of special form.

Lemma 1.

Let $\big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ be a feasible minimum-cost inverse optimization problem and let $p$ be an optimal deviation vector. Then $p_{[\delta|\ell,u|w]}$ is also an optimal deviation vector, where $\delta\coloneqq\mathrm{H}_{\infty,w}(p)$ .

Proof.

The lower and upper bounds $\ell\leq p_{[\delta|\ell,u|w]}\leq u$ hold by definition, hence (b) is satisfied.

Now we show that (a) holds. Let $F\in\mathcal{F}$ be an arbitrary solution. Then

[TABLE]

If $\big{\{}s\in F^{*}\setminus F\bigm{|}u(s)=+\infty,\ w(s)\leq\delta\big{\}}\cup\big{\{}s\in F\setminus F^{*}\bigm{|}\ell(s)=-\infty,\ w(s)\leq\delta\big{\}}=\emptyset$ , then

[TABLE]

Otherwise $\big{\{}s\in F^{*}\setminus F\bigm{|}u(s)=+\infty,\ w(s)\leq\delta\big{\}}\cup\big{\{}s\in F\setminus F^{*}\bigm{|}\ell(s)=-\infty,\ w(s)\leq\delta\big{\}}\neq\emptyset$ . Note that, by the feasibility of $p$ and by the definition of $\delta$ , we have $\ell(s)\leq 0\leq u(s)$ whenever $w(s)>\delta$ . Thus we obtain

[TABLE]

where the last inequality holds by the definition of $m$ . Therefore, (a) is indeed satisfied.

Finally, to see (c), observe that $\mathrm{H}_{\infty,w}(p_{[\delta|\ell,u|w]})\leq\delta=\mathrm{H}_{\infty,w}(p)$ , hence $p_{[\delta|\ell,u|w]}$ is also optimal. ∎

By Lemma 1, it suffices to look for the optimal deviation vector among vectors of special form. It turns out that the value of $\delta$ can be chosen from the values of the weight function $w$ .

Lemma 2.

Let $\big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ be a feasible minimum-cost inverse optimization problem and let $\delta\geq 0$ be such that $p_{[\delta|\ell,u|w]}$ is a feasible deviation vector. Then the following hold.

(i)

There exists $s\in S$ with $w(s)\leq\delta$ for which $p_{[w(s)|\ell,u|w]}$ is also a feasible deviation vector. 2. (ii)

For any $\delta^{\prime}\geq\delta$ , the deviation vector $p_{[\delta^{\prime}|\ell,u|w]}$ is also feasible.

Proof.

To see (i), let $s\in S$ be such an element for which $w(s)=\max\{w(s^{\prime})\mid s^{\prime}\in S,\,w(s^{\prime})\leq\delta\}$ holds. Then by definition, we have $p_{[\delta|\ell,u|w]}=p_{[w(s)|\ell,u|w]}$ .

For (ii), let $F\in\mathcal{F}$ be arbitrary. Note that, by the feasibility of $p_{[\delta|\ell,u|w]}$ , we have $\ell(s)\leq 0\leq u(s)$ whenever $w(s)>\delta$ . Then

[TABLE]

concluding the proof of the lemma. ∎

2.2 Characterizing feasibility

We give a necessary and sufficient condition for the feasibility of the minimum-cost inverse optimization problem $\big{(}S,\mathcal{F},F^{*},c,\ell,\allowbreak u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ .

Lemma 3.

Let $\big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ be a minimum-cost inverse optimization problem. Then the problem is feasible if and only if $p_{[w_{\max}|\ell,u|w]}$ is a feasible deviation vector, where $w_{\max}\coloneqq\max\left\{w(s)\bigm{|}s\in S\right\}$ .

Proof.

Clearly, if $p_{[w_{\max}|\ell,u|w]}$ is feasible, then so is the problem.

To see the other direction, suppose to the contrary that $p_{[w_{\max}|\ell,u|w]}$ is not feasible, but there exists a feasible deviation vector $p$ . If, in addition, $p$ is chosen to be optimal, then, by Lemma 1, the deviation vector $p_{[\delta|\ell,u|w]}$ is also optimal for $\delta\coloneqq\mathrm{H}_{\infty,w}(p)$ . Obviously, $\delta\leq w_{\max}$ holds. By Lemma 2, this implies the feasibility of $p_{[w_{\max}|\ell,u|w]}$ , a contradiction. ∎

2.3 Algorithm

We turn to the description of the algorithm and its analysis. The high-level idea is as described in the introduction. In each iteration, we determine an optimal solution $F\in\mathcal{F}$ using the oracle $\mathcal{O}$ as a black box. If the cost of $F$ equals that of $F^{*}$ , then we stop. Otherwise, we modify the costs in such a way that $F$ is “eliminated”, that is, $F$ and $F^{*}$ share the same cost with respect to the modified cost function – hence the name Newton-type. The algorithm is presented as Algorithm 1.

It remains to prove correctness and the running time of the algorithm.

Theorem 4.

Algorithm 1 determines an optimal deviation vector, if exists, for the minimum-cost inverse optimization problem $\big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ using $O(n)$ calls to the oracle $\mathcal{O}$ .

Proof.

We discuss the time complexity and the correctness of the algorithm separately.

Time complexity. We show that the algorithm terminates after at most $n$ iterations of the while loop. To see this, observe that if $F^{*}$ is not a minimum $c_{i}$ -cost member of $\mathcal{F}$ for some $i$ , then either $S_{i+1}\subsetneq S_{i}$ by the definition of $\delta_{i+1}$ , or the algorithm declares the problem to be infeasible. As the size of the set $S_{i}$ can decrease at most $|S|=n$ times, the statement follows.

Correctness. By the above, the algorithm terminates after a finite number of iterations. Observe that if the algorithm returns Infeasible, then it correctly recognizes the problem to be infeasible by Lemma 3.

Assume now that the algorithm terminates with returning a deviation vector $p_{[\delta_{i}|\ell,u|w]}$ whose feasibility follows from the fact that the while loop ended. If $F^{*}$ is a minimum $c_{0}$ -cost member of $\mathcal{F}$ , then we are clearly done. Otherwise, there exists an index $q$ such that $F^{*}$ is a minimum $c_{q+1}$ -cost member of $\mathcal{F}$ . Suppose to the contrary that $p_{[\delta_{q+1}|\ell,u|w]}$ is not optimal. Since the problem is feasible, by Lemma 1, there exists $\delta<\delta_{q+1}$ such that the deviation vector $p_{[\delta|\ell,u|w]}$ is optimal. Note that $p_{[\delta_{q}|\ell,u|w]}$ is not a feasible deviation vector since $(c-p_{[\delta_{q}|\ell,u|w]})(F^{*})>(c-p_{[\delta_{q}|\ell,u|w]})(F_{q})$ . By Lemma 2, we get $\delta_{q}<\delta<\delta_{q+1}$ . However, by Lemma 2, we know that $\delta=w(s)$ for some $s\in S$ , contradicting the definition of $\delta_{q+1}$ . ∎

Note that the Algorithm 1 runs in strongly polynomial time assuming that $\mathcal{O}$ can be realized by a strongly polynomial-time algorithm.

2.4 Multiple cost functions

Consider now an instance $\big{(}S,\mathcal{F},F^{*},\{c^{j}\}_{j\in[k]},\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ of the problem with multiple cost functions. By Lemma 1, for each $j\in[k]$ , there exists $\delta^{j}\geq 0$ such that $p_{[\delta^{j}|\ell,u|w]}$ is an optimal deviation vector for the problem $\big{(}S,\mathcal{F},F^{*},c^{j},\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ . Let $\delta\coloneqq\max\big{\{}\delta^{j}\mid j\in[k]\big{\}}$ . By Lemma 2, $p_{[\delta|\ell,u|w]}$ is a feasible deviation vector for the problem $\big{(}S,\mathcal{F},F^{*},c^{j},\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}$ for $j\in[k]$ , and it is clearly optimal. Therefore, we get the following.

Corollary 5.

The minimum-cost inverse optimization problem $\big{(}S,\mathcal{F},F^{*},\{c^{j}\}_{j\in[k]},\ell,u,\allowbreak\mathrm{H}_{\infty,w}(\cdot)\big{)}$ with multiple cost functions can be solved using $O(k\cdot n)$ calls to the oracle $\mathcal{O}$ .

3 Weighted $\operatorname{\ell_{\infty}}$ -norm objective

Next we consider the weighted $\operatorname{\ell_{\infty}}$ -norm objective. Similarly to the case of the weighted bottleneck Hamming distance, we first prove that there exists an optimal deviation vector of a special form in Section 3.1. We characterize the feasibility of the problem in Section 3.2. Then we present an algorithm for the case of a single cost function in the constrained setting in Section 3.3. We explain how to extend the algorithm for multiple cost functions in Section 3.4. Finally, we give a min-max characterization of the weighted $\operatorname{\ell_{\infty}}$ -norm of an optimal deviation vector in the unconstrained setting with multiple cost functions in Section 3.5.

3.1 Optimal deviation vectors

Consider an instance $(S,\mathcal{F},F^{*},c,\ell,u,\|\cdot\|_{\infty,w})$ of the constrained minimum-cost inverse optimization problem under the weighted $\operatorname{\ell_{\infty}}$ -norm objective, where $w\in\mathbb{R}^{S}_{+}$ is a positive weight function. For any $\delta\geq 0$ , let $p_{[\delta|\ell,u|w]}\colon S\to\mathbb{R}$ be defined as

[TABLE]

We simply write $p_{[\delta||w]}$ when $\ell\equiv-\infty$ and $u\equiv+\infty$ . The following technical lemma shows that there exists an optimal deviation vector of special form.

Lemma 6.

Let $\left(S,\mathcal{F},F^{*},c,\ell,u,\|\cdot\|_{\infty,w}\right)$ be a feasible minimum-cost inverse optimization problem and let $p$ be an optimal deviation vector. Then $p_{[\delta|\ell,u|w]}$ is also an optimal deviation vector, where $\delta\coloneqq\max\left\{w(s)\cdot|p(s)|\bigm{|}s\in S\right\}$ .

Proof.

The lower and upper bounds $\ell\leq p_{[\delta|\ell,u|w]}\leq u$ hold by definition, hence (b) is satisfied.

Now we show that (a) holds. The assumption $\ell\leq p\leq u$ and the definition of $\delta$ imply that $-\delta/w(s)\leq p(s)\leq u(s)$ and $\ell(s)\leq p(s)\leq\delta/w(s)$ hold for every $s\in S$ . Let $F\in\mathcal{F}$ be an arbitrary solution. Then

[TABLE]

where the last inequality holds by the feasibility of $p$ .

Finally, to see that (c) holds for $p_{[\delta|\ell,u|w]}$ , observe that $\|p_{[\delta|\ell,u|w]}\|_{\infty,w}\leq\delta=\|p\|_{\infty,w}$ . This concludes the proof of the lemma. ∎

By Lemma 6, it suffices to look for the optimal deviation vector among vectors of special form. Furthermore, we get the following useful property of deviation vectors of such form.

Lemma 7.

Let $\left(S,\mathcal{F},F^{*},c,\ell,u,\|\cdot\|_{\infty,w}\right)$ be a feasible minimum-cost inverse optimization problem and let $\delta\geq 0$ be such that $p_{[\delta|\ell,u|w]}$ is a feasible deviation vector. Then for any $\delta^{\prime}\geq\delta$ , the deviation vector $p_{[\delta^{\prime}|\ell,u|w]}$ is also feasible.

Proof.

Let $F\in\mathcal{F}$ be an arbitrary solution. Then

[TABLE]

concluding the proof of the lemma. ∎

3.2 Characterizing feasibility

We give a necessary and sufficient condition for the feasibility of the minimum-cost inverse optimization problem $\left(S,\mathcal{F},F^{*},c,\ell,u,\|\cdot\|_{\infty,w}\right)$ .

Lemma 8.

Let $\left(S,\mathcal{F},F^{*},c,\ell,u,\|\cdot\|_{\infty,w}\right)$ be a minimum-cost inverse optimization problem. For any $F\in\mathcal{F}$ , define

[TABLE]

and let

[TABLE]

Then the problem is feasible if and only if $p_{[m|\ell,u|w]}$ is a feasible deviation vector for

[TABLE]

Proof.

Clearly, if $p_{[m|\ell,u|w]}$ is feasible, then so is the problem.

To see the other direction, suppose to the contrary that $p_{[m|\ell,u|w]}$ is not feasible, but there exists a feasible deviation vector $p$ . Then there exists $F\in\mathcal{F}$ such that

[TABLE]

If $\{s\in F^{*}\setminus F\mid u(s)=+\infty\}\cup\{s\in F\setminus F^{*}\mid\ell(s)=-\infty\}=\emptyset$ , then we obtain

[TABLE]

where the last inequality holds since $p$ is feasible, leading to a contradiction.

If ${\left\{s\in F^{*}\setminus F\mid u(s)=+\infty\right\}}\allowbreak\cup\left\{s\in F\setminus F^{*}\mid\ell(s)=-\infty\right\}\neq\emptyset$ , then we obtain

[TABLE]

which contradicts the definition of $m$ . ∎

3.3 Algorithm

We turn to the description of the algorithm and its analysis, which is similar to that of Algorithm 1. The algorithm is presented as Algorithm 2.

For proving the correctness and the running time of the algorithm, we need the following lemmas.

Lemma 9.

If $F^{*}$ is not a minimum $c_{i}$ -cost member of $\mathcal{F}$ for some $i$ , then either $\delta_{i+1}>0$ and $S_{i+1}\subseteq S_{i}$ , or Algorithm 2 declares the problem to be infeasible.

Proof.

The statement follows from the definition of $\delta_{i+1}$ and from Lemma 9. ∎

Lemma 10.

If $F^{*}$ is not a minimum $c_{i}$ -cost member of $\mathcal{F}$ for some $i$ , then $c_{i+1}(F^{*})=c_{i+1}(F_{i})$ , or $S_{i+1}\subsetneq S_{i}$ , or Algorithm 2 declares the problem to be infeasible.

Proof.

Let $i$ be an index such that $F^{*}$ is not a minimum $c_{i}$ -cost member of $\mathcal{F}$ , and assume that Algorithm 2 does not declare the problem to be infeasible in the $i$ th step. Then $S_{i+1}\subseteq S_{i}$ holds by Lemma 9. If $S_{i+1}\subsetneq S_{i}$ , then we are done, hence consider the case $S_{i+1}=S_{i}$ . Then

[TABLE]

hence we get

[TABLE]

concluding the proof of the lemma. ∎

Lemma 11.

If $F^{*}$ is not a minimum $c_{i}$ -cost member of $\mathcal{F}$ for some $i$ , then

[TABLE]

or $S_{i+1}\subsetneq S_{i}$ , or Algorithm 2 declares the problem to be infeasible.

Proof.

Let $i$ be an index such that $F^{*}$ is not a minimum $c_{i}$ -cost member of $\mathcal{F}$ , and assume that Algorithm 2 does not declare the problem to be infeasible in the $i$ -th step and that $S_{i+1}=S_{i}$ . Then $c_{i+1}(F^{*})=c_{i+1}(F_{i})$ hold by Lemma 10, respectively. Thus we get

[TABLE]

Since $\delta_{i+1}>0$ by Lemma 9 and $c_{i}(F_{i})-c_{i}(F_{i+1})\leq 0$ by the optimality of $F_{i}$ with respect to $c_{i}$ , the statement follows. ∎

With the help of Lemmas 9–11, we are ready to prove the main result of this section.

Theorem 12.

Algorithm 2 determines an optimal deviation vector, if exists, for the minimum-cost inverse optimization problem $(S,\mathcal{F},F^{*},c,\ell,u,\|\cdot\|_{\infty,w})$ using $O(n\cdot\|w\|_{-1})$ calls to the $\mathcal{O}$ .

Proof.

We discuss the time complexity and the correctness of the algorithm separately.

Time complexity. Recall that $w\in\mathbb{R}^{S}_{+}$ is scaled so that $\frac{1}{w}(X)$ is an integer for each $X\subseteq S$ . Between two iterations of the while loop, the size of the set $S_{i}$ or the value of $\frac{1}{w}\big{(}(F_{i}\setminus F^{*})\cap S_{i}\big{)}-\frac{1}{w}\big{(}(F_{i}\cap F^{*})\cap S_{i}\big{)}$ strictly decreases by Lemma 11. The size of $S_{i}$ can decrease at most $n$ times. Between two iterations where the size of $S_{i}$ decreases, the value of $\frac{1}{w}\big{(}(F_{i}\setminus F^{*})\cap S_{i}\big{)}-\frac{1}{w}\big{(}(F_{i}\cap F^{*})\cap S_{i}\big{)}$ can decrease at most $2\|w\|_{-1}$ times. Hence the total number of iterations is $O(n\cdot\|w\|_{-1})$ .

Correctness. By the above, the procedure terminates after a finite number of iterations. First, we show that if the the algorithm returns Infeasible, then it correctly recognizes the problem to be infeasible. To see this, assume that the algorithm terminated in the $i$ th step and declared the problem to be infeasible. Then $(F^{*}\triangle F_{i})\cap S_{i}=\emptyset$ , so by the definitions of $S_{i}$ and $m$ as in Lemma 8, we obtain

[TABLE]

hence the problem is infeasible by Lemma 8.

Assume now that the algorithm terminates with returning a deviation vector whose feasibility follows from the fact that the while loop ended. If $F^{*}$ is a minimum $c_{0}$ -cost member of $\mathcal{F}$ , then we are clearly done. Otherwise, there exists an index $q$ such that $F^{*}$ is a minimum $c_{q+1}$ -cost member of $\mathcal{F}$ . Suppose to the contrary that $p_{[d_{q+1}|\ell,u|w]}$ is not optimal. By Lemma 6, there exists $\delta<d_{q+1}$ such that the deviation vector $p_{[\delta|\ell,u|w]}$ is optimal. Thus, by Lemma 10, we get

[TABLE]

a contradiction. This concludes the proof of the theorem. ∎

Note that the Algorithm 2 runs in strongly polynomial time assuming that $\mathcal{O}$ can be realized by a strongly polynomial-time algorithm.

3.4 Multiple cost functions

Consider now an instance $\big{(}S,\mathcal{F},F^{*},\{c^{j}\}_{j\in[k]},\ell,u,\|\cdot\|_{\infty,w}\big{)}$ of the problem with multiple cost functions. By Lemma 6, for each $j\in[k]$ , there exists $\delta^{j}\geq 0$ such that $p_{[\delta^{j}|\ell,u|w]}$ is an optimal deviation vector for the problem $\big{(}S,\mathcal{F},F^{*},c^{j},\ell,u,\|\cdot\|_{\infty,w}\big{)}$ . Let $\delta\coloneqq\max\big{\{}\delta^{j}\mid j\in[k]\big{\}}$ . By Lemma 7, $p_{[\delta|\ell,u|w]}$ is a feasible deviation vector for the problem $(S,\mathcal{F},F^{*},c^{j},\ell,u,\|\cdot\|_{\infty,w})$ for $j\in[k]$ , and it is clearly optimal. Therefore, we get the following.

Corollary 13.

The minimum-cost inverse optimization problem $\big{(}S,\,\mathcal{F},\,F^{*},\,\{c^{\,j}\}_{j\,\in\,[k]},\,\ell,\,u,\allowbreak{\|\cdot\|_{\infty,w}}\big{)}$ with multiple cost functions can be solved using $O(k\cdot n\cdot\|w\|_{-1})$ calls to oracle $\mathcal{O}$ .

3.5 Min-max theorem

With the help of Lemmas 6 and 7, we provide a min-max characterization for the weighted infinity norm of an optimal deviation vector in the unconstrained setting, even for the case of multiple cost functions. Recall that we use the notation $\frac{1}{w}(X)\coloneqq\sum\left\{\frac{1}{w(s)}\,\middle|\,s\in X\right\}$ .

Theorem 14.

Let $\big{(}S,\mathcal{F},F^{*},\{c^{j}\}_{j\in[k]},-\infty,+\infty,\|\cdot\|_{\infty,w}\big{)}$ be a feasible minimum-cost inverse optimization problem with multiple cost functions. Then

[TABLE]

Proof.

By Lemma 6, for each $j\in[k]$ there exists $\delta^{j}\geq 0$ such that $p_{[\delta^{j}||w]}$ is an optimal deviation vector for the problem $(S,\mathcal{F},F^{*},c^{j},-\infty,+\infty,\|\cdot\|_{\infty,w})$ . Our goal is to show that $p_{[\delta||w]}$ is an optimal deviation vector for the multiple-cost variant, where $\delta\coloneqq\max\big{\{}\delta^{j}\bigm{|}j\in[k]\big{\}}$ .

Let $p$ be an optimal deviation vector. For ease of discussion, let us define

[TABLE]

The intuition behind the definition of this value is as follows: if the $c^{j}$ -cost of a set $F$ is smaller than that of $F^{*}$ , then the weighted $\operatorname{\ell_{\infty}}$ -norm of a feasible deviation vector is clearly lower bounded by the fraction appearing in the expression.

If $F^{*}$ is a minimum $c^{j}$ -cost member of $\mathcal{F}$ for each $j\in[k]$ , then we are clearly done. Otherwise, $\delta,d>0$ holds, and it suffices to show $\delta=d$ . Let $j\in[k]$ and $F\in\mathcal{F}$ , $F\neq F^{*}$ be arbitrary. Since $\delta\geq\delta^{j}$ , Lemma 7 implies that $p_{[\delta||w]}$ is feasible, thus

[TABLE]

This implies

[TABLE]

hence $\delta\geq d$ . To prove $\delta\leq d$ , it is enough to show that $p_{[d||w]}$ is a feasible deviation vector. For any $j\in[k]$ and for any $F\in\mathcal{F}$ , $F\neq F^{*}$ , we have

[TABLE]

which means that $p_{[d||w]}$ is indeed feasible. ∎

4 Conclusions

In this paper, we considered general minimum-cost inverse optimization problems in the constrained setting, i.e. with lower and upper bounds on the coordinates of the deviation vector. We provided simple, purely combinatorial algorithms for the weighted bottleneck Hamming distance and $\operatorname{\ell_{\infty}}$ -norm objectives. The algorithms follow a scheme that resembles Newton’s algorithm, and find an optimal deviation vector in strongly polynomial when the bottleneck Hamming distance, and in pseudo-polynomial time when the $\operatorname{\ell_{\infty}}$ -norm is considered. For both objectives, we extended the results extend to inverse optimization problems with multiple cost functions.

Despite the extensive literature on inverse optimization, only few results are known when the desired deviation vector $p$ is required to be integral, see e.g. [1, 11, 12]. If $\ell$ , $u$ and $c$ are integral vectors, then the deviation vector $p_{[\delta|\ell,u|w]}$ defined in the bottleneck Hamming distance case is integral independently from the choice of $\delta$ , therefore the fractional and integral optimums coincide. For the unweighted $\operatorname{\ell_{\infty}}$ -norm objective, i.e. when in addition $w\equiv 1$ holds, the deviation vector $p_{[\delta|\ell,u|w]}$ might not be integral as we assign value $\delta$ to some of its coordinates. However, Lemmas 6 and 7 together imply that if $p_{[\delta|\ell,u|w]}$ is an optimal fractional deviation vector, then $p_{[\lceil\delta\rceil|\ell,u|w]}$ is an optimal integral deviation vector.

Though the proposed algorithm is capable of solving minimum cost inverse optimization problems in a very general setting, it naturally has its limitations. In particular, it is not suitable for handling deeper connections between the coordinates of the cost function. For example, if the underlying optimization problem is a minimum cost $s$ - $t$ path problem in a directed graph with conservative arc-costs $c$ , then it is not clear how to implement the algorithm as to maintain conservativeness throughout.

In an accompanying paper [5], we consider minimum-cost inverse optimization problems under the weighted span objective, and provide a min-max characterization for the weighted span of an optimal deviation vector in the unconstrained setting, as well as an efficient algorithm for the constrained setting.

Acknowledgement.

The work was supported by the Lendület Programme of the Hungarian Academy of Sciences – grant number LP2021-1/2021 and by the Hungarian National Research, Development and Innovation Office – NKFIH, grant number FK128673.

Bibliography30

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Ahmadian, U. Bhaskar, L. Sanità, and C. Swamy. Algorithms for inverse optimization problems. In 26th Annual European Symposium on Algorithms (ESA 2018) , volume 112 of Leibniz International Proceedings in Informatics, LIP Ics . Schloss Dagstuhl–Leibniz-Zentrum für Informatik, 2018.
2[2] R. K. Ahuja and J. B. Orlin. Inverse optimization. Operations Research , 49(5):771–783, 2001.
3[3] M. Aman, H. Hassanpour, and J. Tayyeb. Inverse matroid optimization problem under the weighted Hamming distances. Bulletin of the Transilvania University of Brasov, Series III: Mathematics, Informatics, Physics , 9(58):85–98, 2016.
4[4] K. Bérczi, L. M. Mendoza-Cadena, and K. Varga. Inverse optimization problems with multiple weight functions. Discrete Applied Mathematics , 327:134–147, 2023.
5[5] K. Bérczi, L. M. Mendoza-Cadena, and K. Varga. Newton-type algorithms for inverse optimization II: weighted span objective. ar Xiv preprint ar Xiv:2302.13414 , 2023.
6[6] D. Burton and P. L. Toint. On an instance of the inverse shortest paths problem. Mathematical Programming , 53:45–61, 1992.
7[7] T. C. Y. Chan, M. Eberg, K. Forster, C. Holloway, L. Ieraci, Y. Shalaby, and N. Yousefi. An inverse optimization approach to measuring clinical pathway concordance. Management Science , 68(3):1882–1903, 2021.
8[8] M. Demange and J. Monnot. An introduction to inverse combinatorial problems. In Paradigms of Combinatorial Optimization: Problems and New Approaches , pages 547–586. John Wiley & Sons, Inc., second edition, 2014.