Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs
Yufa Zhou, Shaobo Wang, Xingyu Dong, Xiangqi Jin, Yifang Chen, Yue Min, Kexin Yang, Xingzhang Ren, Dayiheng Liu, Linfeng Zhang

TL;DR
This paper demonstrates that post-training large language models with economic reasoning problems enhances their ability to perform structured multi-agent reasoning and strategic generalization, with implications for real-world economic applications.
Contribution
It introduces Recon, a 7B-parameter LLM fine-tuned on economic problems, showing improved reasoning and strategic generalization in multi-agent scenarios.
Findings
Enhanced economic reasoning performance on benchmarks
Improved multi-agent strategic reasoning capabilities
Post-training with domain-specific data boosts model alignment
Abstract
Directly training Large Language Models (LLMs) for Multi-Agent Systems (MAS) remains challenging due to intricate reward modeling, dynamic agent interactions, and demanding generalization requirements. This paper explores whether post-training techniques, specifically Supervised Fine-Tuning (SFT) and Reinforcement Learning with Verifiable Rewards (RLVR), can effectively to multi-agent scenarios. We use economic reasoning as a testbed, leveraging its strong foundations in mathematics and game theory, its demand for structured analytical reasoning, and its relevance to real-world applications such as market design, resource allocation, and policy analysis. We introduce (easoning like an omist), a 7B-parameter open-source LLM post-trained on a hand-curated dataset of 2,100 high-quality economic reasoning problems.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOrganizational Management and Leadership
MethodsShrink and Fine-Tune
