TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?
Lewen Yan, Jilin Mei, Tianyi Zhou, Lige Huang, Jie Zhang, Dongrui Liu, Jing Shao

TL;DR
TradeTrap is a comprehensive framework for stress-testing LLM-based autonomous trading agents, revealing their vulnerabilities to small perturbations that can lead to significant financial risks.
Contribution
We introduce TradeTrap, a novel evaluation framework for systematically assessing the robustness of autonomous trading agents against system-level perturbations.
Findings
Small perturbations can cause extreme portfolio risks.
Autonomous agents are systematically misled under certain perturbations.
Robustness issues are prevalent across different agent types.
Abstract
LLM-based trading agents are increasingly deployed in real-world financial markets to perform autonomous analysis and execution. However, their reliability and robustness under adversarial or faulty conditions remain largely unexamined, despite operating in high-risk, irreversible financial environments. We propose TradeTrap, a unified evaluation framework for systematically stress-testing both adaptive and procedural autonomous trading agents. TradeTrap targets four core components of autonomous trading agents: market intelligence, strategy formulation, portfolio and ledger handling, and trade execution, and evaluates their robustness under controlled system-level perturbations. All evaluations are conducted in a closed-loop historical backtesting setting on real US equity market data with identical initial conditions, enabling fair and reproducible comparisons across agents and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Blockchain Technology Applications and Security
