EvoNash-MARL: A Closed-Loop Multi-Agent Reinforcement Learning Framework for Medium-Horizon Equity Allocation
Chongliu Jia, Yi Luo, Sipeng Han, Pengwei Li, Jie Ding, Youshuang Hu, Yimiao Qian, and Qiya Wang

TL;DR
EvoNash-MARL is a novel closed-loop multi-agent reinforcement learning framework designed to enhance robustness and performance in medium- to long-horizon equity allocation under realistic trading constraints.
Contribution
It introduces a unified approach combining multi-agent policy populations, game-theoretic aggregation, and constraint-aware validation within a walk-forward design.
Findings
Achieves a 19.6% annualized return on out-of-sample data from 2014 to 2024.
Outperforms the SPY benchmark with an 11.7% return.
Demonstrates stable performance through 2026 under realistic market conditions.
Abstract
Medium- to long-horizon equity allocation is challenging due to weak predictive structure, non-stationary market regimes, and the degradation of signals under realistic trading constraints. Conventional approaches often rely on single predictors or loosely coupled pipelines, which limit robustness under distributional shift. This paper proposes EvoNash-MARL, a closed-loop framework that integrates reinforcement learning with population-based policy optimization and execution-aware selection to improve robustness in medium- to long-horizon allocation. The framework combines multi-agent policy populations, game-theoretic aggregation, and constraint-aware validation within a unified walk-forward design. Under a 120-window walk-forward protocol, the final configuration achieves the highest robust score among internal baselines. On out-of-sample data from 2014 to 2024, it delivers a 19.6%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
