FinDABench: Benchmarking Financial Data Analysis Ability of Large Language Models
Shu Liu, Shangqing Zhao, Chenghao Jia, Xinlin Zhuang, Zhaoguang Long,, Jie Zhou, Aimin Zhou, Man Lan, Qingquan Wu, Chong Yang

TL;DR
FinDABench is a comprehensive benchmark designed to evaluate the financial data analysis capabilities of large language models across foundational, reasoning, and technical skills, aiming to advance LLMs in finance.
Contribution
The paper introduces FinDABench, the first specialized benchmark for assessing LLMs' financial data analysis abilities across multiple dimensions.
Findings
Evaluates LLMs' numerical calculation and sentiment assessment skills.
Assesses models' ability to understand financial texts and detect anomalies.
Examines models' technical skills in data analysis and visualization.
Abstract
Large Language Models (LLMs) have demonstrated impressive capabilities across a wide range of tasks. However, their proficiency and reliability in the specialized domain of financial data analysis, particularly focusing on data-driven thinking, remain uncertain. To bridge this gap, we introduce \texttt{FinDABench}, a comprehensive benchmark designed to evaluate the financial data analysis capabilities of LLMs within this context. \texttt{FinDABench} assesses LLMs across three dimensions: 1) \textbf{Foundational Ability}, evaluating the models' ability to perform financial numerical calculation and corporate sentiment risk assessment; 2) \textbf{Reasoning Ability}, determining the models' ability to quickly comprehend textual information and analyze abnormal financial reports; and 3) \textbf{Technical Skill}, examining the models' use of technical knowledge to address real-world data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Stock Market Forecasting Methods
MethodsFocus
