Data-Centric Financial Large Language Models

Zhixuan Chu; Huaiyu Guo; Xinyuan Zhou; Yijia Wang; Fei Yu; Hong Chen,; Wanqing Xu; Xin Lu; Qing Cui; Longfei Li; Jun Zhou; Sheng Li

arXiv:2310.17784·cs.CL·November 15, 2023·5 cites

Data-Centric Financial Large Language Models

Zhixuan Chu, Huaiyu Guo, Xinyuan Zhou, Yijia Wang, Fei Yu, Hong Chen,, Wanqing Xu, Xin Lu, Qing Cui, Longfei Li, Jun Zhou, Sheng Li

PDF

Open Access

TL;DR

This paper introduces a data-centric approach for financial large language models (FLLMs) that preprocess and understand data better, using multitask fine-tuning and abductive augmentation reasoning to improve performance on financial tasks.

Contribution

It proposes a novel data-centric methodology with automatic data augmentation for financial LLMs, achieving state-of-the-art results in financial analysis and interpretation.

Findings

01

FLLM with AAR outperforms baseline models

02

Achieves state-of-the-art on financial analysis tasks

03

Open sources a new financial analysis benchmark

Abstract

Large language models (LLMs) show promise for natural language tasks but struggle when applied directly to complex domains like finance. LLMs have difficulty reasoning about and integrating all relevant information. We propose a data-centric approach to enable LLMs to better handle financial tasks. Our key insight is that rather than overloading the LLM with everything at once, it is more effective to preprocess and pre-understand the data. We create a financial LLM (FLLM) using multitask prompt-based finetuning to achieve data pre-processing and pre-understanding. However, labeled data is scarce for each task. To overcome manual annotation costs, we employ abductive augmentation reasoning (AAR) to automatically generate training data by modifying the pseudo labels from FLLM's own outputs. Experiments show our data-centric FLLM with AAR substantially outperforms baseline financial LLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Stock Market Forecasting Methods · Natural Language Processing Techniques