Boundedly Rational Meta-Learning in Sequential Consumer Choice
Mehrzad Khosravi, Max Kleiman-Weiner, Hema Yoganarasimhan

TL;DR
This paper investigates how consumers transfer knowledge across different decision contexts, proposing a boundedly rational meta-learning model that better explains observed behaviors than traditional Bayesian models.
Contribution
It introduces a class of boundedly rational meta-dynamic programming policies that approximate full Bayesian transfer with limited computational complexity.
Findings
Participants improve across routes, reducing pseudo-regret.
Low-D bounded meta-learning models fit human data better than full Bayesian models.
Consumers transfer brand knowledge using coarse, approximate representations.
Abstract
Many consumer decisions are repeated choices under uncertainty. Standard models capture these decisions using Bayesian learning and dynamic programming: consumers update beliefs from feedback and use those beliefs to guide future choices. In many markets, however, learning does not restart when consumers enter a new context: prior experience with a brand, product, or provider can shape beliefs in later, related decisions. We study this cross-context knowledge transfer, or meta-learning, in sequential choice. We design a hierarchical laboratory task in which participants repeatedly choose among airlines across routes and observe noisy binary outcomes. Reduced-form evidence shows that participants improve not only within routes, but also across routes: they choose better airlines earlier in later routes and reduce pseudo-regret. To identify the mechanism behind this transfer, we compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
