LiveTradeBench: Seeking Real-World Alpha with Large Language Models
Haofei Yu, Fenghai Li, Jiaxuan You

TL;DR
LiveTradeBench provides a realistic environment for evaluating large language models in live trading scenarios, highlighting their decision-making under real-time market uncertainty and revealing gaps in static benchmark assessments.
Contribution
We introduce LiveTradeBench, a novel live trading benchmark for LLMs that incorporates real-time data, multi-asset management, and diverse markets to evaluate decision-making under uncertainty.
Findings
High LMArena scores do not guarantee better trading results.
Models exhibit diverse portfolio styles based on risk and reasoning.
Some LLMs effectively use live signals to adapt trading decisions.
Abstract
Large language models (LLMs) achieve strong performance across benchmarks--from knowledge quizzes and math reasoning to web-agent tasks--but these tests occur in static settings, lacking real dynamics and uncertainty. Consequently, they evaluate isolated reasoning or problem-solving rather than decision-making under uncertainty. To address this, we introduce LiveTradeBench, a live trading environment for evaluating LLM agents in realistic and evolving markets. LiveTradeBench follows three design principles: (i) Live data streaming of market prices and news, eliminating dependence on offline backtesting and preventing information leakage while capturing real-time uncertainty; (ii) a portfolio-management abstraction that extends control from single-asset actions to multi-asset allocation, integrating risk management and cross-asset reasoning; and (iii) multi-market evaluation across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Complex Systems and Time Series Analysis
