# Bayesian RAG: uncertainty-aware retrieval for reliable financial question answering

**Authors:** Lebede Ngartera, Saralees Nadarajah, Rodoumta Koina

PMC · DOI: 10.3389/frai.2025.1668172 · Frontiers in Artificial Intelligence · 2026-01-27

## TL;DR

Bayesian RAG improves financial question answering by incorporating uncertainty into retrieval, reducing errors and hallucinations.

## Contribution

Introduces Bayesian RAG, a probabilistic framework using Monte Carlo Dropout to quantify retrieval uncertainty in financial AI systems.

## Key findings

- Achieves 93.1% accuracy with significant gains in Precision@3 (+20.6%), MRR (+22.7%), and NDCG@10 (+25.4%) over BM25 baselines.
- Reduces hallucination by 27.8% and successfully extracts precise financial figures from 10-K reports.
- Enables uncertainty calibration improvements by 26.8% and offers modular integration for production AI systems.

## Abstract

Large language models excel at generating plausible responses but often produce factually incorrect answers in high-stakes financial analysis, leading to regulatory violations and financial losses, a critical challenge for deploying AI systems in production. Traditional Retrieval-Augmented Generation (RAG) systems rely on deterministic embeddings that cannot quantify retrieval uncertainty, resulting in overconfident but unreliable answers for complex financial queries. We introduce Bayesian RAG, a principled probabilistic framework that integrates epistemic uncertainty quantification directly into retrieval using Monte Carlo Dropout, bridging the gap between theoretical rigor and practical deployment. Our approach computes distributional embeddings for queries and documents, enabling a Bayesian scoring function Si = μi−λ·σi that balances semantic relevance against uncertainty. Comprehensive evaluation on Apple and Microsoft 2023 10-K reports demonstrates substantial improvements: 93.1% accuracy with significant gains in Precision@3 (+20.6%), MRR (+22.7%), and NDCG@10 (+25.4%) over BM25 baselines, plus 26.8% better uncertainty calibration. Critically, Bayesian RAG successfully extracts precise financial figures ($211.915B Microsoft, $383.285B Apple revenue) where traditional methods fail, reducing hallucination by 27.8%. Bayesian RAG advances uncertainty quantification in retrieval systems through principled Monte Carlo Dropout integration, establishing theoretical foundations for uncertainty-aware information retrieval. The modular design enables seamless integration with existing RAG pipelines, making it immediately deployable in production systems for risk-aware AI applications in finance, healthcare, and regulatory compliance.

## Full-text entities

- **Diseases:** hallucination (MESH:D006212)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12886353/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12886353/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12886353/full.md

---
Source: https://tomesphere.com/paper/PMC12886353