Watch and Learn: Optimizing from Revealed Preferences Feedback

Aaron Roth; Jonathan Ullman; Zhiwei Steven Wu

arXiv:1504.01033·cs.DS·November 19, 2015·5 cites

Open Access

TL;DR

This paper develops algorithms that let the leader in a Stackelberg game optimize his strategy using only the follower's revealed preferences, that is, the follower's observed best responses. The approach applies even when the follower's utility function is unknown and the resulting optimization problem is non-convex.

Contribution

It introduces an approach for solving certain Stackelberg games with unknown follower utilities from revealed preferences feedback, including settings where the optimization problem is non-convex.

Findings

01. Efficient algorithms for Stackelberg games with unknown follower utilities.

02. Applicable to profit maximization and tolling in congestion games.

03. Solves non-convex optimization problems using revealed preference feedback.

Abstract

A Stackelberg game is played between a leader and a follower. The leader first chooses an action, then the follower plays his best response. The goal of the leader is to pick the action that will maximize his payoff given the follower's best response. In this paper we present an approach to solving for the leader's optimal strategy in certain Stackelberg games where the follower's utility function (and thus the subsequent best response of the follower) is unknown. Stackelberg games capture, for example, the following interaction between a producer and a consumer. The producer chooses the prices of the goods he produces, and then a consumer chooses to buy a utility maximizing bundle of goods. The goal of the seller here is to set prices to maximize his profit: his revenue minus the production cost of the purchased bundle. It is quite natural that the seller in this example should not…
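
The producer and consumer interaction described in the abstract can be illustrated with a small sketch. This is not the paper's algorithm: it is a naive grid search over a single price, included only to show what revealed preferences feedback looks like. The follower's utility a*sqrt(x) - p*x, the linear production cost, the price range, and every function name below are assumptions made for illustration. The leader never reads the utility; he only observes the bundle the follower buys at each posted price.

```python
import math


def make_follower(a=4.0):
    """Hypothetical follower with hidden quasi-linear utility a*sqrt(x) - p*x."""
    def best_response(price):
        # Closed-form maximizer of a*sqrt(x) - price*x over x >= 0.
        return (a / (2.0 * price)) ** 2
    return best_response


def leader_profit(price, bundle, unit_cost):
    """Profit = revenue minus production cost of the purchased bundle."""
    return price * bundle - unit_cost * bundle


def search_price(best_response, unit_cost, lo, hi, steps=450):
    """Naive grid search: post a price, observe the purchase, keep the best.

    The leader treats best_response as a black box (assumed feedback model).
    """
    best_price, best_profit = lo, -math.inf
    for i in range(steps + 1):
        price = lo + (hi - lo) * i / steps
        bundle = best_response(price)  # revealed preferences feedback
        profit = leader_profit(price, bundle, unit_cost)
        if profit > best_profit:
            best_price, best_profit = price, profit
    return best_price, best_profit


follower = make_follower()
price, profit = search_price(follower, unit_cost=1.0, lo=0.5, hi=5.0)
print(f"price={price:.2f} profit={profit:.3f}")
```

Under these assumed forms the profit is (p - c) * a^2 / (4 p^2), which is maximized at p = 2c, so the search should settle near a price of 2 for unit cost 1. A grid search like this scales poorly with the number of goods, which is one reason more careful methods are of interest.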

Taxonomy

Topics: Advanced Bandit Algorithms Research · Optimization and Search Problems · Auction Theory and Applications