Fino1: On the Transferability of Reasoning-Enhanced LLMs and Reinforcement Learning to Finance
Lingfei Qian, Weipeng Zhou, Yan Wang, Xueqing Peng, Han Yi, Yilun Zhao, Jimin Huang, Qianqian Xie, Jian-yun Nie

TL;DR
This paper introduces FinCoT, a high-quality financial reasoning dataset, and Fin-o1, models trained on it, demonstrating improved reasoning in finance and providing a comprehensive benchmark for evaluating LLMs in financial contexts.
Contribution
The paper presents the first open financial chain-of-thought corpus, develops new financial reasoning models, and conducts the first empirical comparison of RL methods in finance.
Findings
Fin-o1 models outperform existing financial reasoning models.
GRPO-based RL yields reliable performance gains.
Standard reasoning models degrade on complex financial tasks.
Abstract
As the fundamental capability behind decision-making in finance, financial reasoning poses distinct challenges for LLMs. Although reinforcement learning (RL) have boosted generic reasoning, the progress in finance is hindered by the absence of empirical study of building effective financial chain-of-thought (CoT) corpus, a systematic comparison of different RL methods, and comprehensive benchmarks. To address these gaps, we introduce FinCoT, the first open high-fidelity CoT corpus for finance, distilled from seven QA datasets by a novel three-stage pipeline that incorporates domain supervision, iterative LLM refinement, and difficulty-aware filtering. Based on FinCoT, we develop Fin-o1, the first open financial reasoning models trained via supervised fine-tuning and GRPO-based RL. Our models outperform existing financial reasoning models and SOTA general models such as GPT-o1,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗TheFinAI/Fino1-8Bmodel· 143 dl· ♡ 35143 dl♡ 35
- 🤗SandLogicTechnologies/Fino1-8B-GGUFmodel· 30 dl30 dl
- 🤗TheFinAI/Fino1-14Bmodel· 11 dl11 dl
- 🤗ThinkTim21/FinPlan-1model· 10 dl· ♡ 210 dl♡ 2
- 🤗dinalad0/my-fino1-modelmodel· 2 dl2 dl
- 🤗TheFinAI/Fin-o1-8Bmodel· 225 dl· ♡ 11225 dl♡ 11
- 🤗TheFinAI/Fin-o1-14Bmodel· 10 dl· ♡ 610 dl♡ 6
- 🤗ryanfortin/community-blend-qwen3-8bmodel· 7 dl7 dl
- 🤗khazarai/Fino1-4Bmodel· 63 dl· ♡ 163 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Artificial Intelligence in Law
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections · Layer Normalization · Label Smoothing · Residual Connection · Adam · Dropout
