Online Stackelberg Optimization via Nonlinear Control

William Brown; Christos Papadimitriou; Tim Roughgarden

arXiv:2406.18805·cs.LG·June 28, 2024

Online Stackelberg Optimization via Nonlinear Control

William Brown, Christos Papadimitriou, Tim Roughgarden

PDF

Open Access

TL;DR

This paper introduces a unified online control framework for Stackelberg optimization in adaptive agent interactions, achieving low regret and broad applicability to problems like pricing, recommendations, and game theory.

Contribution

It develops a tractable algorithmic framework for online nonlinear control with local controllability, extending to adversarial disturbances and bandit feedback scenarios.

Findings

01

Achieves $O( ootT)$ regret with known dynamics

02

Provides tight bounds under adversarial disturbances

03

Demonstrates applications in pricing, recommendations, and game theory

Abstract

In repeated interaction problems with adaptive agents, our objective often requires anticipating and optimizing over the space of possible agent responses. We show that many problems of this form can be cast as instances of online (nonlinear) control which satisfy \textit{local controllability}, with convex losses over a bounded state space which encodes agent behavior, and we introduce a unified algorithmic framework for tractable regret minimization in such cases. When the instance dynamics are known but otherwise arbitrary, we obtain oracle-efficient $O (T)$ regret by reduction to online convex optimization, which can be made computationally efficient if dynamics are locally \textit{action-linear}. In the presence of adversarial disturbances to the state, we give tight bounds in terms of either the cumulative or per-round disturbance magnitude (for \textit{strongly} or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems