Modeling Choice via Self-Attention
Joohwan Ko, Andrew A. Li

TL;DR
This paper introduces a novel neural network-based choice model using self-attention, providing theoretical guarantees and demonstrating superior performance on real data compared to existing models.
Contribution
It is the first to integrate self-attention into choice modeling with both theoretical analysis and empirical validation, advancing the intersection of deep learning and operations management.
Findings
The attention-based model is a low-rank generalization of Halo-MNL.
Supports a nonconvex estimator with near-optimal sample complexity.
Outperforms existing models on real-world choice data.
Abstract
Models of choice are a fundamental input to many now-canonical optimization problems in the field of Operations Management, including assortment, inventory, and price optimization. Naturally, accurate estimation of these models from data is a critical step in the application of these optimization problems in practice. Concurrently, recent advancements in deep learning have sparked interest in integrating these techniques into choice modeling. However, there is a noticeable research gap at the intersection of deep learning and choice modeling, particularly with both theoretical and empirical foundations. Thus motivated, we first propose a choice model that is the first to successfully (both theoretically and practically) leverage a modern neural network architectural concept (self-attention). Theoretically, we show that our attention-based choice model is a low-rank generalization of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics
