CFGPT: Chinese Financial Assistant with Large Language Model
Jiangtong Li, Yuxuan Bian, Guoxuan Wang, Yang Lei, Dawei Cheng, Zhijun, Ding, Changjun Jiang

TL;DR
This paper introduces CFGPT, a comprehensive Chinese financial language model framework that includes specialized datasets, a tailored LLM, and an application deployment system for real-world financial tasks.
Contribution
The work presents a new Chinese financial LLM framework with dedicated datasets, training procedures, and deployment tools, advancing AI capabilities in financial applications.
Findings
Developed CFData dataset with 584M documents and 141B tokens.
Trained CFLLM based on InternLM-7B for financial tasks.
Created CFAPP for deploying financial language models in real-world scenarios.
Abstract
Large language models (LLMs) have demonstrated great potential in natural language processing tasks within the financial domain. In this work, we present a Chinese Financial Generative Pre-trained Transformer framework, named CFGPT, which includes a dataset~(CFData) for pre-training and supervised fine-tuning, a financial LLM~(CFLLM) to adeptly manage financial texts, and a deployment framework~(CFAPP) designed to navigate real-world financial applications. The CFData comprising both a pre-training dataset and a supervised fine-tuning dataset, where the pre-training dataset collates Chinese financial data and analytics, alongside a smaller subset of general-purpose text with 584M documents and 141B tokens in total, and the supervised fine-tuning dataset is tailored for six distinct financial tasks, embodying various facets of financial analysis and decision-making with 1.5M instruction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
MethodsAttention Is All You Need · Softmax · Dense Connections · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Residual Connection · Adam · Linear Layer · Multi-Head Attention · Dropout
