Fino1: On the Transferability of Reasoning-Enhanced LLMs and Reinforcement Learning to Finance

Lingfei Qian; Weipeng Zhou; Yan Wang; Xueqing Peng; Han Yi; Yilun Zhao; Jimin Huang; Qianqian Xie; Jian-yun Nie

arXiv:2502.08127 · cs.CL · June 17, 2025 · 2 cites

Open Access · 1 Repo · 9 Models · 5 Datasets

TL;DR

This paper introduces FinCoT, a high-quality financial reasoning dataset, and Fin-o1, models trained on it, demonstrating improved reasoning in finance and providing a comprehensive benchmark for evaluating LLMs in financial contexts.

Contribution

The paper presents the first open financial chain-of-thought corpus, develops new financial reasoning models, and conducts the first empirical comparison of RL methods in finance.

Findings

01. Fin-o1 models outperform existing financial reasoning models.

02. GRPO-based RL yields reliable performance gains.

03. Standard reasoning models degrade on complex financial tasks.

Abstract

As the fundamental capability behind decision-making in finance, financial reasoning poses distinct challenges for LLMs. Although reinforcement learning (RL) has boosted generic reasoning, progress in finance is hindered by the absence of empirical studies on building effective financial chain-of-thought (CoT) corpora, systematic comparisons of different RL methods, and comprehensive benchmarks. To address these gaps, we introduce FinCoT, the first open high-fidelity CoT corpus for finance, distilled from seven QA datasets by a novel three-stage pipeline that incorporates domain supervision, iterative LLM refinement, and difficulty-aware filtering. Based on FinCoT, we develop Fin-o1, the first open financial reasoning models trained via supervised fine-tuning and GRPO-based RL. Our models outperform existing financial reasoning models and SOTA general models such as GPT-o1,…
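For readers unfamiliar with GRPO (Group Relative Policy Optimization), the core idea is that several answers are sampled for the same prompt and each answer's reward is normalized against the mean and standard deviation of its own group, so no separate value network is needed. The sketch below illustrates only that normalization step. It is not taken from the paper: the function name, the binary reward values, the `eps` constant, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the other samples in its group.

    `rewards` holds one scalar reward per sampled answer to a single
    prompt. `eps` (an assumed constant) guards against division by zero
    when every sample in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; some implementations use the sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four sampled answers to one financial question
# (1.0 = final answer judged correct, 0.0 = judged incorrect).
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Answers that beat their group's average receive a positive advantage and the rest a negative one. When all samples score the same, every advantage is zero and that prompt contributes no learning signal.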

Code & Models

Repositories

the-finai/fino1 (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Semantic Web and Ontologies · Artificial Intelligence in Law

Methods: Attention Is All You Need · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections · Layer Normalization · Label Smoothing · Residual Connection · Adam · Dropout