Newton-type algorithms for inverse optimization I: weighted bottleneck Hamming distance and $\ell_\infty$-norm objectives
Krist\'of B\'erczi, Lydia Mirabel Mendoza-Cadena, Kitti Varga

TL;DR
This paper develops efficient algorithms for inverse optimization problems focusing on weighted bottleneck Hamming distance and weighted _-norm objectives, providing combinatorial and min-max solutions with polynomial complexity.
Contribution
It introduces new polynomial-time algorithms for inverse optimization with specific distance measures, extending to multiple cost functions.
Findings
Purely combinatorial algorithm for weighted bottleneck Hamming distance
Min-max characterization and pseudo-polynomial algorithm for weighted _-norm
Extension methods for multiple cost functions
Abstract
In minimum-cost inverse optimization problems, we are given a feasible solution to an underlying optimization problem together with a linear cost function, and the goal is to modify the costs by a small deviation vector so that the input solution becomes optimal. The difference between the new and the original cost functions can be measured in several ways. In this paper, we focus on two objectives: the weighted bottleneck Hamming distance and the weighted -norm. We consider a general model in which the coordinates of the deviation vector are required to fall within given lower and upper bounds. For the weighted bottleneck Hamming distance objective, we present a simple, purely combinatorial algorithm that determines an optimal deviation vector in strongly polynomial time. For the weighted -norm objective, we give a min-max characterization for the optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Matrix Theory and Algorithms
Newton-type algorithms for inverse optimization I:
weighted bottleneck Hamming distance and -norm objectives
Kristóf Bérczi
MTA-ELTE Momentum Matroid Optimization Research Group and ELKH-ELTE Egerváry Research Group, Department of Operations Research, Eötvös Loránd University, Budapest, Hungary. Email: [email protected], [email protected], [email protected].
Lydia Mirabel Mendoza-Cadena
MTA-ELTE Momentum Matroid Optimization Research Group and ELKH-ELTE Egerváry Research Group, Department of Operations Research, Eötvös Loránd University, Budapest, Hungary. Email: [email protected], [email protected], [email protected].
Kitti Varga
MTA-ELTE Momentum Matroid Optimization Research Group and ELKH-ELTE Egerváry Research Group, Department of Operations Research, Eötvös Loránd University, Budapest, Hungary. Email: [email protected], [email protected], [email protected].
Abstract
In minimum-cost inverse optimization problems, we are given a feasible solution to an underlying optimization problem together with a linear cost function, and the goal is to modify the costs by a small deviation vector so that the input solution becomes optimal.
The difference between the new and the original cost functions can be measured in several ways. In this paper, we focus on two objectives: the weighted bottleneck Hamming distance and the weighted -norm. We consider a general model in which the coordinates of the deviation vector are required to fall within given lower and upper bounds. For the weighted bottleneck Hamming distance objective, we present a simple, purely combinatorial algorithm that determines an optimal deviation vector in strongly polynomial time. For the weighted -norm objective, we give a min-max characterization for the optimal solution, and provide a pseudo-polynomial algorithm for finding an optimal deviation vector that runs in strongly polynomial time in the case of unit weights. For both objectives, we assume that an algorithm with the same time complexity for solving the underlying combinatorial optimization problem is available.
For both objectives, we also show how to extend the results to inverse optimization problems with multiple cost functions.
Keywords: Algorithm, Bottleneck Hamming distance, Infinite norm, Inverse optimization, Min-max theorem
1 Introduction
Inverse optimization problems have long been the focus of research due to their wide applicability in both theory and practice. The roots of inverse optimization go back to the work of Burton and Toit [6] who studied the inverse shortest paths problem, that is, the problem of recovering the edge costs given some information about the shortest paths in the graph. Since their pioneering work, countless of applications and extensions emerged; we refer the interested reader to [23] for the basics and to [14, 8] for surveys.
In a classical optimization problem, we are given a set of feasible solutions together with a linear cost function, and the goal is to find a feasible solution that minimizes or maximizes the cost. In contrast, in an inverse optimization problem we are also given a fixed feasible solution, and the goal is to modify the costs ‘as little as possible’ so that the input solution becomes optimal. There are various ways to measure the deviation of the new cost function from the original one, and, as one would expect, the choice of the objective greatly affects the complexity of the problem. In order to avoid confusion, we refer to the solutions of the inverse optimization problem and those of the underlying combinatorial optimization problem as feasible deviation vectors and solutions, respectively.
In the past decades, inverse optimization problems found numerous applications. As an example, let us briefly describe the Pathway concordance problem [7]. A clinical pathway describes a standardized sequence of steps for managing a clinical process in the delivery of care for a specific disease, with the aim of optimizing the outcome on a patient or population-level. These processes are determined by multidisciplinary medical experts, and have been shown to efficiently improve e.g. patient survival and satisfaction, wait times, and cost of care. However, patients’ journeys through the healthcare system can differ significantly from the recommended pathways, which raises the problem of measuring the concordance of patient-traversed pathways against the recommended ones. The problem can be modeled by a directed graph whose vertices correspond to activities that the patient can undertake, and the arcs indicate that a patient went from one activity to another. The ‘cost’ of a patient undertaking or missing certain activities and traversing arcs can be modeled by arc costs. The goal is to determine arc costs such that the reference pathways are optimal, that is, they are shortest paths between the corresponding start and end vertices. Then, assuming such arc costs are available, the journey of any patient can be scored based on the cost of the associated directed walk through the network.
The present work is the first member of a series of papers. Our general goal is to give min-max characterizations and simple algorithms for inverse optimization problems under various objectives. Here we set up the basics of a general framework that uses a Newton-type approach for finding an optimal deviation vector, and derive algorithms for the weighted bottleneck Hamming distance and weighted -norm objectives that follow the proposed scheme.
In the second part [5], we focus on a novel objective called the weighted span that aims at finding a ‘balanced’ or ‘fair’ deviation vector, and we propose an analogous algorithm for that objective. Nevertheless, the analysis of the algorithm there is much more involved due to the different nature of the span compared to the -norm.
Previous work.
Inverse optimization under the weighted bottleneck Hamming distance objective111This objective is sometimes called weighted bottleneck-type Hamming distance objective in the literature. has been of great interest recently. One of the earliest results is due to Duin and Volgenant [10] who considered the inverse minimum spanning tree, the inverse shortest path tree and the linear assignment problems. Liu and Yao [18] gave a strongly polynomial algorithm for the weighted inverse maximum perfect matching problem. An algorithm with an improved running time based on a binary search technique was later given by Tayyebi [24], and an analogous algorithm for the inverse matroid problem was given by Aman, Hassanpour and Tayyebi [3]. Mohaghegh and Baroughi Bonab [20] showed that the inverse min-max spanning -arborescence problem is solvable in strongly polynomial time. Guan, He, Pardalos and Zhang [13] presented a mathematical model for the inverse max+sum spanning tree problem, together with a method to check feasibility and a binary search algorithm for solving it. Karimi, Aman and Dolati [16] studied the inverse shortest - path problem and provided an LP-based algorithm which can be applied for some inverse multiobjective problem as well. Tayyebi and Aman [25] considered a general inverse linear programming problem, and proposed an algorithm that is based on a binary search technique. As an application, they specialized the method for solving the corresponding inverse minimum-cost flow problem in strongly polynomial time. Nguyen and Hung [21] studied the so-called inverse connected -median problem under the unweighted bottleneck Hamming distance objective. In this problem, the goal is to modify vertex weights of a block graph at minimum total cost so that a predetermined set of connected vertices becomes a connected -median on the perturbed block graph. They formulated the problem as a quasiconvex univariate optimization problem, and developed a combinatorial algorithm that solves the problem in polynomial time. Jiang, Liu and Peng [15] presented a strongly polynomial algorithm for the inverse minimum flow problem. Dong, Li and Yang [9] addressed the partial inverse min-max spanning tree problem, and presented two algorithms to solve the problem in polynomial time.
Inverse problems under the -norm have been studied in various settings. Xiaoguang [26] considered the inverse optimization problem of submodular functions on digraphs, and gave an LP-based algorithm that solves most inverse network optimization problems in polynomial time. Zhang and Liu [29] suggested a method for solving a general inverse LP problem including upper and lower bound constraints. In a later paper [19], the same authors studied the inverse maximum-weight matching problem in non-bipartite graphs under the -norm objective. They showed that the problem can be formulated as a maximum-mean alternating cycle problem in an undirected network, and can be solved in polynomial time by a binary search algorithm and in strongly polynomial time by an ascending algorithm. Using LP descriptions, Ahuja and Orlin [2] proved that if an optimization problem can be modeled as an LP, then the same holds for the underlying inverse optimization problem under - or -norm objectives. Furthermore, if the optimization problem is polynomially solvable for linear cost functions, then the inverse counterparts with - and -norms are also polynomially solvable. In [30], Zhang and Liu proposed a model that generalizes numerous inverse combinatorial optimization problems when no bounds are given on the coordinates of the deviation vector. Yang and Zhang [27] presented strongly polynomial algorithms to solve the inverse min-max spanning tree and the inverse maximum capacity path problems when bounds are also given on the coordinates of the deviation vector. Lasserre [17] considered the inverse optimization problem associated with the polynomial program and a given current feasible solution, and provided a systematic numerical scheme to compute an inverse optimal solution. Ahmadian, Bhaskar, Sanità, and Swamy [1] studied integral inverse optimization problems from an approximation point of view. They obtained tight or nearly-tight approximation guarantees for various inverse optimization problems, and some of their results apply for -norm as well. Zhang, Guan, and Zhang [28] provided a mathematical model of the inverse spanning tree problem, gave a characterization of optimal solutions, and developed a strongly polynomial algorithm for determining an optimal deviation vector. Recently, the authors [4] introduced inverse optimization problems with multiple cost functions, and studied the inverse minimum-cost - path, -arborescence, and bipartite perfect matching problems.
Most papers on inverse optimization consider algorithmic aspects, and so they do not provide a min-max characterization for the optimum value in question. Recently, Frank and Murota [12] developed a general min-max formula for the minimum of an integer-valued separable discrete convex function, where the minimum is taken over the set of integral elements of a box total dual integral polyhedron. Their approach covers and even extends a wide class of inverse combinatorial optimization problems. Nevertheless, our problems do not fit in the box-TDI framework as neither the bottleneck Hamming distance nor the -norm is separable convex.
Problem definitions.
We denote the sets of real and positive real numbers by and , respectively. For a positive integer , we use . Let be a ground set of size . Given subsets , the symmetric difference of and is denoted by . For a weight function , the total sum of its values over is denoted by , where the sum over the empty set is always considered to be [math]. Furthermore, we define \frac{1}{w}(X)\coloneqq\sum\big{\{}\frac{1}{w(s)}\bigm{|}s\in X\big{\}}, and set . When the weights are rational numbers, then the values can be re-scaled as to satisfy being an integer for each . Throughout the paper, we assume that is given in such a form without explicitly mentioning it, implying that is a non-negative integer for every . By convention, we define and .
Let be a finite ground set, be a collection of feasible solutions for an underlying optimization problem, be an input solution, be a cost function, be a positive weight function, and and be lower and upper bounds, respectively, such that . We assume that an oracle is also available that determines an optimal solution of the underlying optimization problem for any cost function .
In the constrained minimum-cost inverse optimization problem under the weighted bottleneck Hamming distance objective \big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}, we seek a deviation vector such that
- (a)
is a minimum cost member of with respect to , 2. (b)
is within the bounds , and 3. (c)
is minimized.
In the constrained minimum-cost inverse optimization problem under weighted -norm objective , condition (c) modifies to
- (c’)
\|p\|_{\infty,w}\coloneqq\max\left\{w(s)\cdot|p(s)|\bigm{|}s\in S\right\}
Due to the lower and upper bounds and , it might happen that there exists no deviation vector satisfying the requirements. A deviation vector is called feasible if it satisfies conditions (a) and (b), and optimal if in addition it attains the minimum in (c) or (c’). We denote the problems by \big{(}S,\mathcal{F},F^{*},c,-\infty,+\infty,\mathrm{H}_{\infty,w}(\cdot)\big{)} and when no bounds are given on the coordinates of at all, and call these problems unconstrained.
As an extension, we also consider multiple underlying optimization problems at the same time. In this setting, instead of a single cost function, we are given cost functions together with an input solution , and our goal is to find a single deviation vector with such that has minimum cost with respect to for all . In other words, condition (a) modifies to
- (a’)
is a minimum cost member of with respect to for .
In case of multiple cost functions, we use instead of when denoting the problems.
Our results.
Our main results are simple, purely combinatorial algorithms that efficiently solve the above, general problems. For the weighted bottleneck Hamming distance objective, we present an algorithm that makes calls to the oracle . In particular, the algorithm runs in strongly polynomial time, assuming that a strongly polynomial algorithm for the underlying optimization problem is available.
For the weighted -norm objective, we give an algorithm for finding an optimal deviation vector that makes calls to the oracle . In particular, the algorithm runs in strongly polynomial time for unit weights if the oracle for the underlying optimization problem can be realized by a strongly polynomial algorithm. Furthermore, we provide a min-max characterization for the minimum size of an optimal deviation vector in the unconstrained setting, i.e. when and .
For both objectives, we show how to solve the problem when multiple cost functions are given instead of a single one.
The proposed algorithms do not rely on the standard techniques commonly used in the literature, i.e. binary search and LP-based methods. Instead, we suggest a Newton-type algorithm that iteratively updates the cost function, resembling the approach of Zhang and Liu [30] for the -norm objective. They showed that if the inverse optimization problem can be reformulated as a certain maximization problem using dominant sets, then Radzik’s method [22] provides a strongly polynomial algorithm for finding an optimal solution. In contrast, our algorithms apply to general inverse optimization problems. Furthermore, we consider the constrained setting in which the coordinates of the deviation vector are ought to fall within given lower and upper bounds, hence the cost function has to be updated carefully. For these reasons, Radzik’s method cannot be applied to get a strongly polynomial algorithm.
A high-level description of the algorithm is given by the following scheme.
- Choose minimizing the objective such that , set and .
- Let be an optimal solution of the underlying optimization problem with respect to .
- If , then is an optimal deviation vector and stop. Otherwise, find satisfying and , and minimizing the objective. If no such exists, then the problem is infeasible and stop. Otherwise set and go back to Step 2.
The rest of the paper is organized as follows. Section 2 presents a strongly polynomial algorithm for the weighted bottleneck Hamming distance objective, including the case of multiple cost functions. The weighted -norm objective is discussed in Section 3, where first we provide a min-max characterization for the weighted -norm of an optimal deviation vector in the unconstrained setting, then give an algorithm for the constrained setting, including the case of multiple cost functions.
2 Weighted bottleneck Hamming distance objective
As an illustration of our technique, we first consider the problem of minimizing the weighted bottleneck Hamming distance of the original and the modified cost functions. In Section 2.1, we show that there exists an optimal deviation vector having a restricted structure. We characterize the feasibility of the problem in Section 2.2. The algorithm for the case of a single cost function is presented in Section 2.3. We explain how to extend the algorithm for multiple cost functions in Section 2.4.
2.1 Optimal deviation vectors
Consider an instance \big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)} of the constrained minimum-cost inverse optimization problem under the weighted bottleneck Hamming distance objective, where is a positive weight function. For ease of discussion, we define
[TABLE]
Recall that denotes an algorithm that determines an optimal solution of the underlying optimization problem for any cost function . Observe that if runs in strongly polynomial time, then the value of can be determined in strongly polynomial time. For any , let be defined as
[TABLE]
The following lemma shows that there exists an optimal deviation vector of special form.
Lemma 1**.**
Let \big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)} be a feasible minimum-cost inverse optimization problem and let be an optimal deviation vector. Then is also an optimal deviation vector, where .
Proof.
The lower and upper bounds hold by definition, hence (b) is satisfied.
Now we show that (a) holds. Let be an arbitrary solution. Then
[TABLE]
If \big{\{}s\in F^{*}\setminus F\bigm{|}u(s)=+\infty,\ w(s)\leq\delta\big{\}}\cup\big{\{}s\in F\setminus F^{*}\bigm{|}\ell(s)=-\infty,\ w(s)\leq\delta\big{\}}=\emptyset, then
[TABLE]
Otherwise \big{\{}s\in F^{*}\setminus F\bigm{|}u(s)=+\infty,\ w(s)\leq\delta\big{\}}\cup\big{\{}s\in F\setminus F^{*}\bigm{|}\ell(s)=-\infty,\ w(s)\leq\delta\big{\}}\neq\emptyset. Note that, by the feasibility of and by the definition of , we have whenever . Thus we obtain
[TABLE]
where the last inequality holds by the definition of . Therefore, (a) is indeed satisfied.
Finally, to see (c), observe that , hence is also optimal. ∎
By Lemma 1, it suffices to look for the optimal deviation vector among vectors of special form. It turns out that the value of can be chosen from the values of the weight function .
Lemma 2**.**
Let \big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)} be a feasible minimum-cost inverse optimization problem and let be such that is a feasible deviation vector. Then the following hold.
- (i)
There exists with for which is also a feasible deviation vector. 2. (ii)
For any , the deviation vector is also feasible.
Proof.
To see (i), let be such an element for which holds. Then by definition, we have .
For (ii), let be arbitrary. Note that, by the feasibility of , we have whenever . Then
[TABLE]
concluding the proof of the lemma. ∎
2.2 Characterizing feasibility
We give a necessary and sufficient condition for the feasibility of the minimum-cost inverse optimization problem \big{(}S,\mathcal{F},F^{*},c,\ell,\allowbreak u,\mathrm{H}_{\infty,w}(\cdot)\big{)}.
Lemma 3**.**
Let \big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)} be a minimum-cost inverse optimization problem. Then the problem is feasible if and only if is a feasible deviation vector, where w_{\max}\coloneqq\max\left\{w(s)\bigm{|}s\in S\right\}.
Proof.
Clearly, if is feasible, then so is the problem.
To see the other direction, suppose to the contrary that is not feasible, but there exists a feasible deviation vector . If, in addition, is chosen to be optimal, then, by Lemma 1, the deviation vector is also optimal for . Obviously, holds. By Lemma 2, this implies the feasibility of , a contradiction. ∎
2.3 Algorithm
We turn to the description of the algorithm and its analysis. The high-level idea is as described in the introduction. In each iteration, we determine an optimal solution using the oracle as a black box. If the cost of equals that of , then we stop. Otherwise, we modify the costs in such a way that is “eliminated”, that is, and share the same cost with respect to the modified cost function – hence the name Newton-type. The algorithm is presented as Algorithm 1.
It remains to prove correctness and the running time of the algorithm.
Theorem 4**.**
Algorithm 1 determines an optimal deviation vector, if exists, for the minimum-cost inverse optimization problem \big{(}S,\mathcal{F},F^{*},c,\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)} using calls to the oracle .
Proof.
We discuss the time complexity and the correctness of the algorithm separately.
Time complexity. We show that the algorithm terminates after at most iterations of the while loop. To see this, observe that if is not a minimum -cost member of for some , then either by the definition of , or the algorithm declares the problem to be infeasible. As the size of the set can decrease at most times, the statement follows.
Correctness. By the above, the algorithm terminates after a finite number of iterations. Observe that if the algorithm returns Infeasible, then it correctly recognizes the problem to be infeasible by Lemma 3.
Assume now that the algorithm terminates with returning a deviation vector whose feasibility follows from the fact that the while loop ended. If is a minimum -cost member of , then we are clearly done. Otherwise, there exists an index such that is a minimum -cost member of . Suppose to the contrary that is not optimal. Since the problem is feasible, by Lemma 1, there exists such that the deviation vector is optimal. Note that is not a feasible deviation vector since . By Lemma 2, we get . However, by Lemma 2, we know that for some , contradicting the definition of . ∎
Note that the Algorithm 1 runs in strongly polynomial time assuming that can be realized by a strongly polynomial-time algorithm.
2.4 Multiple cost functions
Consider now an instance \big{(}S,\mathcal{F},F^{*},\{c^{j}\}_{j\in[k]},\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)} of the problem with multiple cost functions. By Lemma 1, for each , there exists such that is an optimal deviation vector for the problem \big{(}S,\mathcal{F},F^{*},c^{j},\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)}. Let \delta\coloneqq\max\big{\{}\delta^{j}\mid j\in[k]\big{\}}. By Lemma 2, is a feasible deviation vector for the problem \big{(}S,\mathcal{F},F^{*},c^{j},\ell,u,\mathrm{H}_{\infty,w}(\cdot)\big{)} for , and it is clearly optimal. Therefore, we get the following.
Corollary 5**.**
The minimum-cost inverse optimization problem \big{(}S,\mathcal{F},F^{*},\{c^{j}\}_{j\in[k]},\ell,u,\allowbreak\mathrm{H}_{\infty,w}(\cdot)\big{)} with multiple cost functions can be solved using calls to the oracle .
3 Weighted -norm objective
Next we consider the weighted -norm objective. Similarly to the case of the weighted bottleneck Hamming distance, we first prove that there exists an optimal deviation vector of a special form in Section 3.1. We characterize the feasibility of the problem in Section 3.2. Then we present an algorithm for the case of a single cost function in the constrained setting in Section 3.3. We explain how to extend the algorithm for multiple cost functions in Section 3.4. Finally, we give a min-max characterization of the weighted -norm of an optimal deviation vector in the unconstrained setting with multiple cost functions in Section 3.5.
3.1 Optimal deviation vectors
Consider an instance of the constrained minimum-cost inverse optimization problem under the weighted -norm objective, where is a positive weight function. For any , let be defined as
[TABLE]
We simply write when and . The following technical lemma shows that there exists an optimal deviation vector of special form.
Lemma 6**.**
Let be a feasible minimum-cost inverse optimization problem and let be an optimal deviation vector. Then is also an optimal deviation vector, where \delta\coloneqq\max\left\{w(s)\cdot|p(s)|\bigm{|}s\in S\right\}.
Proof.
The lower and upper bounds hold by definition, hence (b) is satisfied.
Now we show that (a) holds. The assumption and the definition of imply that and hold for every . Let be an arbitrary solution. Then
[TABLE]
where the last inequality holds by the feasibility of .
Finally, to see that (c) holds for , observe that . This concludes the proof of the lemma. ∎
By Lemma 6, it suffices to look for the optimal deviation vector among vectors of special form. Furthermore, we get the following useful property of deviation vectors of such form.
Lemma 7**.**
Let be a feasible minimum-cost inverse optimization problem and let be such that is a feasible deviation vector. Then for any , the deviation vector is also feasible.
Proof.
Let be an arbitrary solution. Then
[TABLE]
concluding the proof of the lemma. ∎
3.2 Characterizing feasibility
We give a necessary and sufficient condition for the feasibility of the minimum-cost inverse optimization problem .
Lemma 8**.**
Let be a minimum-cost inverse optimization problem. For any , define
[TABLE]
and let
[TABLE]
Then the problem is feasible if and only if is a feasible deviation vector for
[TABLE]
Proof.
Clearly, if is feasible, then so is the problem.
To see the other direction, suppose to the contrary that is not feasible, but there exists a feasible deviation vector . Then there exists such that
[TABLE]
If , then we obtain
[TABLE]
where the last inequality holds since is feasible, leading to a contradiction.
If , then we obtain
[TABLE]
which contradicts the definition of . ∎
3.3 Algorithm
We turn to the description of the algorithm and its analysis, which is similar to that of Algorithm 1. The algorithm is presented as Algorithm 2.
For proving the correctness and the running time of the algorithm, we need the following lemmas.
Lemma 9**.**
If is not a minimum -cost member of for some , then either and , or Algorithm 2 declares the problem to be infeasible.
Proof.
The statement follows from the definition of and from Lemma 9. ∎
Lemma 10**.**
If is not a minimum -cost member of for some , then , or , or Algorithm 2 declares the problem to be infeasible.
Proof.
Let be an index such that is not a minimum -cost member of , and assume that Algorithm 2 does not declare the problem to be infeasible in the th step. Then holds by Lemma 9. If , then we are done, hence consider the case . Then
[TABLE]
hence we get
[TABLE]
concluding the proof of the lemma. ∎
Lemma 11**.**
If is not a minimum -cost member of for some , then
[TABLE]
or , or Algorithm 2 declares the problem to be infeasible.
Proof.
Let be an index such that is not a minimum -cost member of , and assume that Algorithm 2 does not declare the problem to be infeasible in the -th step and that . Then hold by Lemma 10, respectively. Thus we get
[TABLE]
Since by Lemma 9 and by the optimality of with respect to , the statement follows. ∎
With the help of Lemmas 9–11, we are ready to prove the main result of this section.
Theorem 12**.**
Algorithm 2 determines an optimal deviation vector, if exists, for the minimum-cost inverse optimization problem using calls to the .
Proof.
We discuss the time complexity and the correctness of the algorithm separately.
Time complexity. Recall that is scaled so that is an integer for each . Between two iterations of the while loop, the size of the set or the value of \frac{1}{w}\big{(}(F_{i}\setminus F^{*})\cap S_{i}\big{)}-\frac{1}{w}\big{(}(F_{i}\cap F^{*})\cap S_{i}\big{)} strictly decreases by Lemma 11. The size of can decrease at most times. Between two iterations where the size of decreases, the value of \frac{1}{w}\big{(}(F_{i}\setminus F^{*})\cap S_{i}\big{)}-\frac{1}{w}\big{(}(F_{i}\cap F^{*})\cap S_{i}\big{)} can decrease at most times. Hence the total number of iterations is .
Correctness. By the above, the procedure terminates after a finite number of iterations. First, we show that if the the algorithm returns Infeasible, then it correctly recognizes the problem to be infeasible. To see this, assume that the algorithm terminated in the th step and declared the problem to be infeasible. Then , so by the definitions of and as in Lemma 8, we obtain
[TABLE]
hence the problem is infeasible by Lemma 8.
Assume now that the algorithm terminates with returning a deviation vector whose feasibility follows from the fact that the while loop ended. If is a minimum -cost member of , then we are clearly done. Otherwise, there exists an index such that is a minimum -cost member of . Suppose to the contrary that is not optimal. By Lemma 6, there exists such that the deviation vector is optimal. Thus, by Lemma 10, we get
[TABLE]
a contradiction. This concludes the proof of the theorem. ∎
Note that the Algorithm 2 runs in strongly polynomial time assuming that can be realized by a strongly polynomial-time algorithm.
3.4 Multiple cost functions
Consider now an instance \big{(}S,\mathcal{F},F^{*},\{c^{j}\}_{j\in[k]},\ell,u,\|\cdot\|_{\infty,w}\big{)} of the problem with multiple cost functions. By Lemma 6, for each , there exists such that is an optimal deviation vector for the problem \big{(}S,\mathcal{F},F^{*},c^{j},\ell,u,\|\cdot\|_{\infty,w}\big{)}. Let \delta\coloneqq\max\big{\{}\delta^{j}\mid j\in[k]\big{\}}. By Lemma 7, is a feasible deviation vector for the problem for , and it is clearly optimal. Therefore, we get the following.
Corollary 13**.**
The minimum-cost inverse optimization problem \big{(}S,\,\mathcal{F},\,F^{*},\,\{c^{\,j}\}_{j\,\in\,[k]},\,\ell,\,u,\allowbreak{\|\cdot\|_{\infty,w}}\big{)} with multiple cost functions can be solved using calls to oracle .
3.5 Min-max theorem
With the help of Lemmas 6 and 7, we provide a min-max characterization for the weighted infinity norm of an optimal deviation vector in the unconstrained setting, even for the case of multiple cost functions. Recall that we use the notation .
Theorem 14**.**
Let \big{(}S,\mathcal{F},F^{*},\{c^{j}\}_{j\in[k]},-\infty,+\infty,\|\cdot\|_{\infty,w}\big{)} be a feasible minimum-cost inverse optimization problem with multiple cost functions. Then
[TABLE]
Proof.
By Lemma 6, for each there exists such that is an optimal deviation vector for the problem . Our goal is to show that is an optimal deviation vector for the multiple-cost variant, where \delta\coloneqq\max\big{\{}\delta^{j}\bigm{|}j\in[k]\big{\}}.
Let be an optimal deviation vector. For ease of discussion, let us define
[TABLE]
The intuition behind the definition of this value is as follows: if the -cost of a set is smaller than that of , then the weighted -norm of a feasible deviation vector is clearly lower bounded by the fraction appearing in the expression.
If is a minimum -cost member of for each , then we are clearly done. Otherwise, holds, and it suffices to show . Let and , be arbitrary. Since , Lemma 7 implies that is feasible, thus
[TABLE]
This implies
[TABLE]
hence . To prove , it is enough to show that is a feasible deviation vector. For any and for any , , we have
[TABLE]
which means that is indeed feasible. ∎
4 Conclusions
In this paper, we considered general minimum-cost inverse optimization problems in the constrained setting, i.e. with lower and upper bounds on the coordinates of the deviation vector. We provided simple, purely combinatorial algorithms for the weighted bottleneck Hamming distance and -norm objectives. The algorithms follow a scheme that resembles Newton’s algorithm, and find an optimal deviation vector in strongly polynomial when the bottleneck Hamming distance, and in pseudo-polynomial time when the -norm is considered. For both objectives, we extended the results extend to inverse optimization problems with multiple cost functions.
Despite the extensive literature on inverse optimization, only few results are known when the desired deviation vector is required to be integral, see e.g. [1, 11, 12]. If , and are integral vectors, then the deviation vector defined in the bottleneck Hamming distance case is integral independently from the choice of , therefore the fractional and integral optimums coincide. For the unweighted -norm objective, i.e. when in addition holds, the deviation vector might not be integral as we assign value to some of its coordinates. However, Lemmas 6 and 7 together imply that if is an optimal fractional deviation vector, then is an optimal integral deviation vector.
Though the proposed algorithm is capable of solving minimum cost inverse optimization problems in a very general setting, it naturally has its limitations. In particular, it is not suitable for handling deeper connections between the coordinates of the cost function. For example, if the underlying optimization problem is a minimum cost - path problem in a directed graph with conservative arc-costs , then it is not clear how to implement the algorithm as to maintain conservativeness throughout.
In an accompanying paper [5], we consider minimum-cost inverse optimization problems under the weighted span objective, and provide a min-max characterization for the weighted span of an optimal deviation vector in the unconstrained setting, as well as an efficient algorithm for the constrained setting.
Acknowledgement.
The work was supported by the Lendület Programme of the Hungarian Academy of Sciences – grant number LP2021-1/2021 and by the Hungarian National Research, Development and Innovation Office – NKFIH, grant number FK128673.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Ahmadian, U. Bhaskar, L. Sanità, and C. Swamy. Algorithms for inverse optimization problems. In 26th Annual European Symposium on Algorithms (ESA 2018) , volume 112 of Leibniz International Proceedings in Informatics, LIP Ics . Schloss Dagstuhl–Leibniz-Zentrum für Informatik, 2018.
- 2[2] R. K. Ahuja and J. B. Orlin. Inverse optimization. Operations Research , 49(5):771–783, 2001.
- 3[3] M. Aman, H. Hassanpour, and J. Tayyeb. Inverse matroid optimization problem under the weighted Hamming distances. Bulletin of the Transilvania University of Brasov, Series III: Mathematics, Informatics, Physics , 9(58):85–98, 2016.
- 4[4] K. Bérczi, L. M. Mendoza-Cadena, and K. Varga. Inverse optimization problems with multiple weight functions. Discrete Applied Mathematics , 327:134–147, 2023.
- 5[5] K. Bérczi, L. M. Mendoza-Cadena, and K. Varga. Newton-type algorithms for inverse optimization II: weighted span objective. ar Xiv preprint ar Xiv:2302.13414 , 2023.
- 6[6] D. Burton and P. L. Toint. On an instance of the inverse shortest paths problem. Mathematical Programming , 53:45–61, 1992.
- 7[7] T. C. Y. Chan, M. Eberg, K. Forster, C. Holloway, L. Ieraci, Y. Shalaby, and N. Yousefi. An inverse optimization approach to measuring clinical pathway concordance. Management Science , 68(3):1882–1903, 2021.
- 8[8] M. Demange and J. Monnot. An introduction to inverse combinatorial problems. In Paradigms of Combinatorial Optimization: Problems and New Approaches , pages 547–586. John Wiley & Sons, Inc., second edition, 2014.
