EvoNash-MARL: A Closed-Loop Multi-Agent Reinforcement Learning Framework for Medium-Horizon Equity Allocation

Chongliu Jia; Yi Luo; Sipeng Han; Pengwei Li; Jie Ding; Youshuang Hu; Yimiao Qian; and Qiya Wang

arXiv:2604.10911·cs.AI·April 15, 2026

EvoNash-MARL: A Closed-Loop Multi-Agent Reinforcement Learning Framework for Medium-Horizon Equity Allocation

Chongliu Jia, Yi Luo, Sipeng Han, Pengwei Li, Jie Ding, Youshuang Hu, Yimiao Qian, and Qiya Wang

PDF

TL;DR

EvoNash-MARL is a novel closed-loop multi-agent reinforcement learning framework designed to enhance robustness and performance in medium- to long-horizon equity allocation under realistic trading constraints.

Contribution

It introduces a unified approach combining multi-agent policy populations, game-theoretic aggregation, and constraint-aware validation within a walk-forward design.

Findings

01

Achieves a 19.6% annualized return on out-of-sample data from 2014 to 2024.

02

Outperforms the SPY benchmark with an 11.7% return.

03

Demonstrates stable performance through 2026 under realistic market conditions.

Abstract

Medium- to long-horizon equity allocation is challenging due to weak predictive structure, non-stationary market regimes, and the degradation of signals under realistic trading constraints. Conventional approaches often rely on single predictors or loosely coupled pipelines, which limit robustness under distributional shift. This paper proposes EvoNash-MARL, a closed-loop framework that integrates reinforcement learning with population-based policy optimization and execution-aware selection to improve robustness in medium- to long-horizon allocation. The framework combines multi-agent policy populations, game-theoretic aggregation, and constraint-aware validation within a unified walk-forward design. Under a 120-window walk-forward protocol, the final configuration achieves the highest robust score among internal baselines. On out-of-sample data from 2014 to 2024, it delivers a 19.6%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.