Online Inverse Linear Optimization: Efficient Logarithmic-Regret Algorithm, Robustness to Suboptimality, and Lower Bound

Shinsaku Sakaue; Taira Tsuchiya; Han Bao; Taihei Oki

arXiv:2501.14349·cs.LG·May 23, 2025

Online Inverse Linear Optimization: Efficient Logarithmic-Regret Algorithm, Robustness to Suboptimality, and Lower Bound

Shinsaku Sakaue, Taira Tsuchiya, Han Bao, Taihei Oki

PDF

Open Access 1 Video

TL;DR

This paper introduces a highly efficient online inverse linear optimization algorithm with logarithmic regret, robustness to suboptimal actions, and a matching lower bound, significantly improving computational complexity over previous methods.

Contribution

It presents the first logarithmic-regret method with per-round complexity independent of T, using online Newton step and MetaGrad for robustness to suboptimality.

Findings

01

Achieves $O(n \\ln T)$ regret bound with per-round complexity independent of T.

02

Provides a regret bound for suboptimal actions: $O(n\\ln T + \\sqrt{\\Delta_T n\\ln T})$.

03

Establishes a lower bound of $\\Omega(n)$, showing the bound's tightness.

Abstract

In online inverse linear optimization, a learner observes time-varying sets of feasible actions and an agent's optimal actions, selected by solving linear optimization over the feasible actions. The learner sequentially makes predictions of the agent's true linear objective function, and their quality is measured by the regret, the cumulative gap between optimal objective values and those achieved by following the learner's predictions. A seminal work by B\"armann et al. (2017) obtained a regret bound of $O (T)$ , where $T$ is the time horizon. Subsequently, the regret bound has been improved to $O (n^{4} ln T)$ by Besbes et al. (2021, 2025) and to $O (n ln T)$ by Gollapudi et al. (2021), where $n$ is the dimension of the ambient space of objective vectors. However, these logarithmic-regret methods are highly inefficient when $T$ is large, as they need to maintain regions specified…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Online Inverse Linear Optimization: Efficient Logarithmic-Regret Algorithm, Robustness to Suboptimality, and Lower Bound· slideslive

Taxonomy

TopicsFace and Expression Recognition · Advanced Optimization Algorithms Research