AI-Trader: Benchmarking Autonomous Agents in Real-Time Financial Markets
Tianyu Fan, Yuhao Yang, Yangqin Jiang, Yifei Zhang, Yuxuan Chen, Chao Huang

TL;DR
AI-Trader introduces a comprehensive benchmark for evaluating autonomous LLM agents in real-time financial markets, revealing current limitations and guiding future improvements in AI-driven trading strategies.
Contribution
This work presents the first fully-automated, live, and data-uncontaminated benchmark for LLM agents in financial decision-making across multiple markets and trading granularities.
Findings
Most LLM agents show poor returns and weak risk management.
Risk control capability is crucial for cross-market robustness.
AI trading strategies perform better in highly liquid markets.
Abstract
Large Language Models (LLMs) have demonstrated remarkable potential as autonomous agents, approaching human-expert performance through advanced reasoning and tool orchestration. However, decision-making in fully dynamic and live environments remains highly challenging, requiring real-time information integration and adaptive responses. While existing efforts have explored live evaluation mechanisms in structured tasks, a critical gap remains in systematic benchmarking for real-world applications, particularly in finance where stringent requirements exist for live strategic responsiveness. To address this gap, we introduce AI-Trader, the first fully-automated, live, and data-uncontaminated evaluation benchmark for LLM agents in financial decision-making. AI-Trader spans three major financial markets: U.S. stocks, A-shares, and cryptocurrencies, with multiple trading granularities to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Complex Systems and Time Series Analysis
