From Individual Learning to Market Equilibrium: Correcting Structural and Parametric Biases in RL Simulations of Economic Models
Ruxin Chen, Zeqiang Zhang

TL;DR
This paper identifies biases in RL-based economic simulations and introduces a calibrated Mean-Field RL framework that aligns agent behavior with market equilibrium, improving the realism of economic modeling.
Contribution
It proposes a novel Mean-Field RL method that corrects structural and parametric biases, enabling agents to learn equilibrium-consistent policies in economic models.
Findings
RL agents initially learn non-equilibrium policies
Parametric bias affects intertemporal decision-making
The proposed method converges to equilibrium-aligned policies
Abstract
The application of Reinforcement Learning (RL) to economic modeling reveals a fundamental conflict between the assumptions of equilibrium theory and the emergent behavior of learning agents. While canonical economic models assume atomistic agents act as `takers' of aggregate market conditions, a naive single-agent RL simulation incentivizes the agent to become a `manipulator' of its environment. This paper first demonstrates this discrepancy within a search-and-matching model with concave production, showing that a standard RL agent learns a non-equilibrium, monopsonistic policy. Additionally, we identify a parametric bias arising from the mismatch between economic discounting and RL's treatment of intertemporal costs. To address both issues, we propose a calibrated Mean-Field Reinforcement Learning framework that embeds a representative agent in a fixed macroeconomic field and adjusts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEconomic theories and models · Economic Theory and Institutions · Complex Systems and Time Series Analysis
