Fin-RATE: A Real-world Financial Analytics and Tracking Evaluation Benchmark for LLMs on SEC Filings
Yidong Jiang, Junrong Chen, Eftychia Makri, Jialin Chen, Peiwen Li, Ali Maatouk, Leandros Tassiulas, Eliot Brenner, Bing Xiang, Rex Ying

TL;DR
Fin-RATE is a comprehensive benchmark for evaluating large language models on complex financial analysis tasks involving SEC filings, highlighting significant performance challenges and diagnostic insights across multiple reasoning dimensions.
Contribution
Introduces Fin-RATE, a novel benchmark that assesses LLMs on multi-faceted financial analysis tasks, addressing gaps in existing benchmarks by evaluating reasoning, retrieval, and cross-entity tracking.
Findings
LLMs' accuracy drops by up to 18.60% in complex tasks
Performance degradation linked to increased hallucinations and mismatches
Existing benchmarks do not quantify reasoning and factual errors effectively
Abstract
With the increasing deployment of Large Language Models (LLMs) in the finance domain, LLMs are increasingly expected to parse complex regulatory disclosures. However, existing benchmarks often focus on isolated details, failing to reflect the complexity of professional analysis that requires synthesizing information across multiple documents, reporting periods, and corporate entities. Furthermore, these benchmarks do not disentangle whether errors arise from retrieval failures, generation inaccuracies, domain-specific reasoning mistakes, or misinterpretation of the query or context, making it difficult to precisely diagnose performance bottlenecks. To bridge these gaps, we introduce Fin-RATE, a benchmark built on U.S. Securities and Exchange Commission (SEC) filings and mirroring financial analyst workflows through three pathways: detail-oriented reasoning within individual disclosures,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuditing, Earnings Management, Governance · Financial Reporting and XBRL · Financial Distress and Bankruptcy Prediction
