Reinforcement Learning Based Computationally Efficient Conditional Choice Simulation Estimation of Dynamic Discrete Choice Models
Ahmed Khwaja, Sonal Srivastava

TL;DR
This paper introduces a reinforcement learning-based method for efficiently estimating dynamic discrete choice models in high-dimensional settings, combining scalability with interpretability for policy analysis.
Contribution
It develops a novel RL-based two-step CCS estimation approach that enhances computational efficiency and scalability of DDC models in big data contexts.
Findings
Method outperforms traditional CCS estimation in simulations
Maintains interpretability and transparency of structural models
Demonstrates effectiveness on marketing-related models
Abstract
Dynamic discrete choice (DDC) models have found widespread application in marketing. However, estimating these becomes challenging in "big data" settings with high-dimensional state-action spaces. To address this challenge, this paper develops a Reinforcement Learning (RL)-based two-step ("computationally light") Conditional Choice Simulation (CCS) estimation approach that combines the scalability of machine learning with the transparency, explainability, and interpretability of structural models, which is particularly valuable for counterfactual policy analysis. The method is premised on three insights: (1) the CCS ("forward simulation") approach is a special case of RL algorithms, (2) starting from an initial state-action pair, CCS updates the corresponding value function only after each simulation path has terminated, whereas RL algorithms may update for all the state-action pairs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConsumer Market Behavior and Pricing · Supply Chain and Inventory Management · Economic and Environmental Valuation
