FinTruthQA: A Benchmark for AI-Driven Financial Disclosure Quality Assessment in Investor -- Firm Interactions

Peilin Zhou; Ziyue Xu; Xinyu Shi; Jiageng Wu; Yikang Jiang; Dading Chong; Wang Dong; Jun Chen; Bin Ke; Jie Yang

arXiv:2406.12009·cs.CL·April 10, 2026

FinTruthQA: A Benchmark for AI-Driven Financial Disclosure Quality Assessment in Investor -- Firm Interactions

Peilin Zhou, Ziyue Xu, Xinyu Shi, Jiageng Wu, Yikang Jiang, Dading Chong, Wang Dong, Jun Chen, Bin Ke, Jie Yang

PDF

1 Repo 2 Datasets

TL;DR

FinTruthQA is a new benchmark dataset for evaluating AI models' ability to assess the quality of financial disclosures in investor-firm interactions, highlighting current model strengths and weaknesses.

Contribution

This paper introduces FinTruthQA, the first benchmark for AI-driven financial disclosure quality assessment, with comprehensive annotations and evaluation of various models.

Findings

01

Models perform well on question identification and relevance (F1 > 95%)

02

Models are weaker on answer readability (F1 ~88%) and relevance (F1 ~80%)

03

Domain-adapted models outperform general-purpose models and LLM prompting

Abstract

Accurate and transparent financial information disclosure is essential for market efficiency, investor decision-making, and corporate governance. Chinese stock exchanges' investor interactive platforms provide a widely used channel through which listed firms respond to investor concerns, yet these responses are often limited or non-substantive, making disclosure quality difficult to assess at scale. To address this challenge, we introduce FinTruthQA, to our knowledge the first benchmark for AI-driven assessment of financial disclosure quality in investor-firm interactions. FinTruthQA comprises 6,000 real-world financial Q&A entries, each manually annotated based on four key evaluation criteria: question identification, question relevance, answer readability, and answer relevance. We benchmark statistical machine learning models, pre-trained language models and their fine-tuned variants,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bethxx99/FinTruthQA
github

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.