Learning Competitive Equilibria in Exchange Economies with Bandit Feedback
Wenshuo Guo, Kirthevasan Kandasamy, Joseph E Gonzalez, Michael I., Jordan, Ion Stoica

TL;DR
This paper introduces an online learning algorithm for exchange economies that learns agent utilities from stochastic feedback to approximate competitive equilibria, ensuring fair and efficient resource allocation.
Contribution
It proposes a novel randomized online learning mechanism that learns agent preferences and approximates competitive equilibria without prior utility knowledge.
Findings
Achieves sublinear loss in utility estimation
Effectively learns CE allocations through simulations
Demonstrates practical applicability of the method
Abstract
The sharing of scarce resources among multiple rational agents is one of the classical problems in economics. In exchange economies, which are used to model such situations, agents begin with an initial endowment of resources and exchange them in a way that is mutually beneficial until they reach a competitive equilibrium (CE). The allocations at a CE are Pareto efficient and fair. Consequently, they are used widely in designing mechanisms for fair division. However, computing CEs requires the knowledge of agent preferences which are unknown in several applications of interest. In this work, we explore a new online learning mechanism, which, on each round, allocates resources to the agents and collects stochastic feedback on their experience in using that allocation. Its goal is to learn the agent utilities via this feedback and imitate the allocations at a CE in the long run. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Advanced Bandit Algorithms Research · Experimental Behavioral Economics Studies
