Agentic Retrieval-Augmented Generation for Financial Document Question Answering
Yang Shu, Yingmin Liu, Zequn Xie

TL;DR
The paper introduces FinAgent-RAG, an innovative retrieval-augmented generation framework for financial QA that enhances multi-step numerical reasoning through iterative retrieval, programmatic reasoning, and adaptive resource allocation.
Contribution
It presents a novel agentic RAG approach with domain-specific retrieval, code-based reasoning, and dynamic strategy routing, improving accuracy and reducing costs in financial document question answering.
Findings
Achieves over 74% accuracy on benchmark datasets.
Reduces API costs by 41.3% with adaptive strategies.
Outperforms existing baselines by 5.62--9.32 percentage points.
Abstract
Financial document question answering (QA) demands complex multi-step numerical reasoning over heterogeneous evidence--structured tables, textual narratives, and footnotes--scattered across corporate filings. Existing retrieval-augmented generation (RAG) approaches adopt a single-pass retrieve-then-generate paradigm that struggles with the compositional reasoning chains prevalent in financial analysis. We propose FinAgent-RAG, an agentic RAG framework that orchestrates iterative retrieval-reasoning loops with self-verification, specifically engineered for the precision requirements of financial numerical reasoning. The framework integrates three domain-specific innovations: (1) a Contrastive Financial Retriever trained with hard negative mining to distinguish semantically similar but numerically distinct financial passages, (2) a Program-of-Thought reasoning module that generates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
