Learning in Random Utility Models Via Online Decision Problems

Emerson Melo

arXiv:2506.16030·econ.TH·June 23, 2025

Learning in Random Utility Models Via Online Decision Problems

Emerson Melo

PDF

Open Access

TL;DR

This paper introduces a gradient-based online learning algorithm for Random Utility Models, proving its consistency and linking it to established online optimization methods, with applications in game theory and behavioral modeling.

Contribution

It develops a novel online decision-making algorithm for RUMs, demonstrating Hannan consistency and equivalence to FTRL, advancing understanding of learning dynamics in stochastic choice.

Findings

01

Proves Hannan consistency for a broad class of RUMs.

02

Shows the algorithm is equivalent to Follow-The-Regularized-Leader.

03

Applications include modeling recency bias and equilibria in games.

Abstract

This paper examines the Random Utility Model (RUM) in repeated stochastic choice settings where decision-makers lack full information about payoffs. We propose a gradient-based learning algorithm that embeds RUM into an online decision-making framework. Our analysis establishes Hannan consistency for a broad class of RUMs, meaning the average regret relative to the best fixed action in hindsight vanishes over time. We also show that our algorithm is equivalent to the Follow-The-Regularized-Leader (FTRL) method, offering an economically grounded approach to online optimization. Applications include modeling recency bias and characterizing coarse correlated equilibria in normal-form games

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications · Advanced Bandit Algorithms Research · Auction Theory and Applications