Online Inverse Linear Optimization: Efficient Logarithmic-Regret Algorithm, Robustness to Suboptimality, and Lower Bound
Shinsaku Sakaue, Taira Tsuchiya, Han Bao, Taihei Oki

TL;DR
This paper introduces a highly efficient online inverse linear optimization algorithm with logarithmic regret, robustness to suboptimal actions, and a matching lower bound, significantly improving computational complexity over previous methods.
Contribution
It presents the first logarithmic-regret method with per-round complexity independent of T, using online Newton step and MetaGrad for robustness to suboptimality.
Findings
Achieves $O(n \\ln T)$ regret bound with per-round complexity independent of T.
Provides a regret bound for suboptimal actions: $O(n\\ln T + \\sqrt{\\Delta_T n\\ln T})$.
Establishes a lower bound of $\\Omega(n)$, showing the bound's tightness.
Abstract
In online inverse linear optimization, a learner observes time-varying sets of feasible actions and an agent's optimal actions, selected by solving linear optimization over the feasible actions. The learner sequentially makes predictions of the agent's true linear objective function, and their quality is measured by the regret, the cumulative gap between optimal objective values and those achieved by following the learner's predictions. A seminal work by B\"armann et al. (2017) obtained a regret bound of , where is the time horizon. Subsequently, the regret bound has been improved to by Besbes et al. (2021, 2025) and to by Gollapudi et al. (2021), where is the dimension of the ambient space of objective vectors. However, these logarithmic-regret methods are highly inefficient when is large, as they need to maintain regions specified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsFace and Expression Recognition · Advanced Optimization Algorithms Research
