Learning in Random Utility Models Via Online Decision Problems

Emerson Melo

arXiv:2112.10993·econ.TH·August 16, 2022

Learning in Random Utility Models Via Online Decision Problems

Emerson Melo

PDF

Open Access

TL;DR

This paper introduces a gradient-based learning algorithm for Random Utility Models in repeated decision-making, demonstrating its consistency and equivalence to established machine learning algorithms, with applications to economic and game-theoretic scenarios.

Contribution

It develops a novel gradient-based learning method for RUMs, proves its Hannan consistency, and links it to the FTRL algorithm, providing an economic foundation for machine learning techniques.

Findings

01

Large class of RUMs are Hannan consistent

02

Gradient-based algorithm is equivalent to FTRL

03

Applications include recency bias and no-regret learning

Abstract

This paper studies the Random Utility Model (RUM) in a repeated stochastic choice situation, in which the decision maker is imperfectly informed about the payoffs of each available alternative. We develop a gradient-based learning algorithm by embedding the RUM into an online decision problem. We show that a large class of RUMs are Hannan consistent (\citet{Hahn1957}); that is, the average difference between the expected payoffs generated by a RUM and that of the best-fixed policy in hindsight goes to zero as the number of periods increase. In addition, we show that our gradient-based algorithm is equivalent to the Follow the Regularized Leader (FTRL) algorithm, which is widely used in the machine learning literature to model learning in repeated stochastic choice problems. Thus, we provide an economically grounded optimization framework to the FTRL algorithm. Finally, we apply our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications · Advanced Bandit Algorithms Research · Game Theory and Voting Systems