Converging to Stability in Two-Sided Bandits: The Case of Unknown Preferences on Both Sides of a Matching Market
Gaurab Pokharel, Sanmay Das

TL;DR
This paper introduces algorithms for two-sided bandit matching markets where both sides have unknown preferences, demonstrating convergence to stable matchings despite increased uncertainty on both sides.
Contribution
It develops new algorithms that converge to stable matchings in settings where both sides' preferences are uncertain and initially unknown, extending previous work that assumed known preferences on one side.
Findings
Algorithms converge to stable matchings under unknown preferences.
Agents' beliefs are updated optimistically over time.
The approach handles more complex, realistic matching scenarios.
Abstract
We study the problem of repeated two-sided matching with uncertain preferences (two-sided bandits), and no explicit communication between agents. Recent work has developed algorithms that converge to stable matchings when one side (the proposers or agents) must learn their preferences, but the preferences of the other side (the proposees or arms) are common knowledge, and the matching mechanism uses simultaneous proposals at each round. We develop new algorithms that provably converge to stable matchings for two more challenging settings: one where the arm preferences are no longer common knowledge, and a second, more general one where the arms are also uncertain about their preferences. In our algorithms, agents start with optimistic beliefs about arms' preferences and update these preferences over time. The key insight is in how to combine these beliefs about arm preferences with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Advanced Bandit Algorithms Research · Experimental Behavioral Economics Studies
