FutureX-Pro: Extending Future Prediction to High-Value Vertical Domains

Jiashuo Liu; Siyuan Chen; Zaiyuan Wang; Zhiyuan Zeng; Jiacheng Guo; Liang Hu; Lingyue Yin; Suozhi Huang; Wenxin Hao; Yang Yang; Zerui Cheng; Zixin Yao; Lingyue Yin; Haoxin Liu; Jiayi Cheng; Yuzhen Li; Zezhong Ma; Bingjie Wang; Bingsen Qiu; Xiao Liu; Zeyang Zhang; Zijian Liu; Jinpeng Wang; Mingren Yin; Tianci He; Yali Liao; Yixiao Tian; Zhenwei Zhu; Anqi Dai; Ge Zhang; Jingkai Liu; Kaiyuan Zhang; Wenlong Wu; Xiang Gao; Xinjie Chen; Zhixin Yao; Zhoufutu Wen; B. Aditya Prakash; Jose Blanchet; Mengdi Wang; Nian Si; Wenhao Huang

arXiv:2601.12259·cs.AI·January 21, 2026

FutureX-Pro: Extending Future Prediction to High-Value Vertical Domains

Jiashuo Liu, Siyuan Chen, Zaiyuan Wang, Zhiyuan Zeng, Jiacheng Guo, Liang Hu, Lingyue Yin, Suozhi Huang, Wenxin Hao, Yang Yang, Zerui Cheng, Zixin Yao, Lingyue Yin, Haoxin Liu, Jiayi Cheng, Yuzhen Li, Zezhong Ma, Bingjie Wang, Bingsen Qiu, Xiao Liu, Zeyang Zhang, Zijian Liu

PDF

Open Access

TL;DR

This paper introduces FutureX-Pro, a specialized framework extending future prediction capabilities of large language models to high-value domains like finance, retail, health, and disasters, assessing their readiness for industrial use.

Contribution

It extends the FutureX benchmark to high-stakes vertical domains, providing a live evaluation pipeline for assessing LLMs' domain-specific future prediction accuracy.

Findings

01

Performance gap identified between generalist reasoning and high-precision vertical applications.

02

Benchmarking reveals current SOTA LLMs need improvement for industrial deployment.

03

FutureX-Pro enables targeted evaluation of LLMs in critical sectors.

Abstract

Building upon FutureX, which established a live benchmark for general-purpose future prediction, this report introduces FutureX-Pro, including FutureX-Finance, FutureX-Retail, FutureX-PublicHealth, FutureX-NaturalDisaster, and FutureX-Search. These together form a specialized framework extending agentic future prediction to high-value vertical domains. While generalist agents demonstrate proficiency in open-domain search, their reliability in capital-intensive and safety-critical sectors remains under-explored. FutureX-Pro targets four economically and socially pivotal verticals: Finance, Retail, Public Health, and Natural Disaster. We benchmark agentic Large Language Models (LLMs) on entry-level yet foundational prediction tasks -- ranging from forecasting market indicators and supply chain demands to tracking epidemic trends and natural disasters. By adapting the contamination-free,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsForecasting Techniques and Applications · Topic Modeling · Misinformation and Its Impacts