Rethinking Retrieval: From Traditional Retrieval Augmented Generation to Agentic and Non-Vector Reasoning Systems in the Financial Domain for Large Language Models
Elias Lumer, Matt Melich, Olivia Zino, Elena Kim, Sara Dieter, Pradeep Honaganahalli Basavaraju, Vamse Kumar Subbiah, James A. Burke, Roberto Hernandez

TL;DR
This paper systematically compares vector-based and non-vector RAG architectures for financial document question answering, demonstrating that advanced RAG techniques enhance retrieval accuracy and answer quality with manageable latency and costs.
Contribution
It provides the first comprehensive evaluation of vector-based agentic RAG versus hierarchical systems in the financial domain, including the impact of advanced techniques like reranking and chunk retrieval.
Findings
Vector-based RAG outperforms hierarchical systems in retrieval metrics and answer quality.
Cross-encoder reranking significantly improves retrieval precision.
Advanced RAG techniques offer favorable cost-performance tradeoffs.
Abstract
Recent advancements in Retrieval-Augmented Generation (RAG) have enabled Large Language Models to answer financial questions using external knowledge bases of U.S. SEC filings, earnings reports, and regulatory documents. However, existing work lacks systematic comparison of vector-based and non-vector RAG architectures for financial documents, and the empirical impact of advanced RAG techniques on retrieval accuracy, answer quality, latency, and cost remain unclear. We present the first systematic evaluation comparing vector-based agentic RAG using hybrid search and metadata filtering against hierarchical node-based systems that traverse document structure without embeddings. We evaluate two enhancement techniques applied to the vector-based architecture, i) cross-encoder reranking for retrieval precision, and ii) small-to-big chunk retrieval for context completeness. Across 1,200 SEC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Stock Market Forecasting Methods
