Online Convex Optimization Perspective for Learning from Dynamically Revealed Preferences
Violet Xinying Chen, Fatma K{\i}l{\i}n\c{c}-Karzan

TL;DR
This paper introduces a new online convex optimization framework for learning agents' private utility functions from their actions, providing improved regret bounds and computational efficiency over existing methods.
Contribution
It develops a novel convex loss function and a unified OL framework that enhances regret performance and computational speed in learning from revealed preferences.
Findings
The new loss function bounds regret for other loss functions.
The framework supports various online convex optimization algorithms.
Empirical results show improved regret and faster solutions.
Abstract
We study the problem of online learning (OL) from revealed preferences: a learner wishes to learn a non-strategic agent's private utility function through observing the agent's utility-maximizing actions in a changing environment. We adopt an online inverse optimization setup, where the learner observes a stream of agent's actions in an online fashion and the learning performance is measured by regret associated with a loss function. We first characterize a special but broad class of agent's utility functions, then utilize this structure in designing a new convex loss function. We establish that the regret with respect to our new loss function also bounds the regret with respect to all other usual loss functions in the literature. This allows us to design a flexible OL framework that enables a unified treatment of loss functions and supports a variety of online convex optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Game Theory and Applications
