Integrating Domain Knowledge for Financial QA: A Multi-Retriever RAG Approach with LLMs
Yukun Zhang, Stefan Elbl Droguett, Samyak Jain

TL;DR
This paper presents a multi-retriever RAG system combined with LLMs to improve financial numerical reasoning QA by integrating domain knowledge, achieving state-of-the-art results and surpassing previous models.
Contribution
The study introduces a multi-retriever RAG approach with domain-specific training that significantly enhances financial QA performance over existing models.
Findings
Domain-specific training with SecBERT improves neural symbolic model performance.
Prompt-based LLM achieves SOTA with >7% improvement but below human level.
External knowledge benefits outweigh hallucination risks in larger models.
Abstract
This research project addresses the errors of financial numerical reasoning Question Answering (QA) tasks due to the lack of domain knowledge in finance. Despite recent advances in Large Language Models (LLMs), financial numerical questions remain challenging because they require specific domain knowledge in finance and complex multi-step numeric reasoning. We implement a multi-retriever Retrieval Augmented Generators (RAG) system to retrieve both external domain knowledge and internal question contexts, and utilize the latest LLM to tackle these tasks. Through comprehensive ablation experiments and error analysis, we find that domain-specific training with the SecBERT encoder significantly contributes to our best neural symbolic model surpassing the FinQA paper's top model, which serves as our baseline. This suggests the potential superior performance of domain-specific training.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Stock Market Forecasting Methods · Advanced Text Analysis Techniques
