FinTruthQA: A Benchmark for AI-Driven Financial Disclosure Quality Assessment in Investor -- Firm Interactions
Peilin Zhou, Ziyue Xu, Xinyu Shi, Jiageng Wu, Yikang Jiang, Dading Chong, Wang Dong, Jun Chen, Bin Ke, Jie Yang

TL;DR
FinTruthQA is a new benchmark dataset for evaluating AI models' ability to assess the quality of financial disclosures in investor-firm interactions, highlighting current model strengths and weaknesses.
Contribution
This paper introduces FinTruthQA, the first benchmark for AI-driven financial disclosure quality assessment, with comprehensive annotations and evaluation of various models.
Findings
Models perform well on question identification and relevance (F1 > 95%)
Models are weaker on answer readability (F1 ~88%) and relevance (F1 ~80%)
Domain-adapted models outperform general-purpose models and LLM prompting
Abstract
Accurate and transparent financial information disclosure is essential for market efficiency, investor decision-making, and corporate governance. Chinese stock exchanges' investor interactive platforms provide a widely used channel through which listed firms respond to investor concerns, yet these responses are often limited or non-substantive, making disclosure quality difficult to assess at scale. To address this challenge, we introduce FinTruthQA, to our knowledge the first benchmark for AI-driven assessment of financial disclosure quality in investor-firm interactions. FinTruthQA comprises 6,000 real-world financial Q&A entries, each manually annotated based on four key evaluation criteria: question identification, question relevance, answer readability, and answer relevance. We benchmark statistical machine learning models, pre-trained language models and their fine-tuned variants,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
