PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance
Qianqian Xie, Weiguang Han, Xiao Zhang, Yanzhao Lai, Min Peng,, Alejandro Lopez-Lira, Jimin Huang

TL;DR
PIXIU introduces the first comprehensive financial LLM, instruction dataset, and evaluation benchmark, advancing open-source financial AI by enabling better instruction tuning and performance assessment of language models in finance.
Contribution
The paper presents a new financial LLM called FinMA, a large-scale instruction dataset with 136K samples, and a standardized benchmark for evaluating financial language models.
Findings
FinMA effectively follows financial instructions and performs well on multiple tasks.
The benchmark reveals strengths and weaknesses of FinMA and other LLMs in financial tasks.
Open-sourcing facilitates future research in financial AI.
Abstract
Although large language models (LLMs) has shown great performance on natural language processing (NLP) in the financial domain, there are no publicly available financial tailtored LLMs, instruction tuning datasets, and evaluation benchmarks, which is critical for continually pushing forward the open-source development of financial artificial intelligence (AI). This paper introduces PIXIU, a comprehensive framework including the first financial LLM based on fine-tuning LLaMA with instruction data, the first instruction data with 136K data samples to support the fine-tuning, and an evaluation benchmark with 5 tasks and 9 datasets. We first construct the large-scale multi-task instruction data considering a variety of financial tasks, financial document types, and financial data modalities. We then propose a financial LLM called FinMA by fine-tuning LLaMA with the constructed dataset to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Topic Modeling · Explainable Artificial Intelligence (XAI)
