A Hybrid Retrieval and Reranking Framework for Evidence-Grounded Retrieval-Augmented Generation
Fariba Afrin Irany, Sampson Akwafuo

TL;DR
This paper introduces a hybrid retrieval and reranking framework for evidence-grounded question answering in biomedical and healthcare domains, enhancing the relevance and verification of generated responses.
Contribution
It presents a novel hybrid retrieval and reranking approach using cloud-based tools for biomedical document question answering, improving evidence relevance and factual verification.
Findings
Achieved 100% grounding accuracy on evaluated claims.
Retrieved and reranked 500 evidence chunks for 25 queries.
Generated evidence-grounded answers with high reliability.
Abstract
Retrieval-augmented generation (RAG) improves large language model reliability by grounding generated responses in external evidence. However, RAG performance depends on the relevance of retrieved passages, the quality of evidence ranking, and the ability to verify whether generated claims are supported by source documents. This study presents a hybrid retrieval and reranking framework for citation-aware RAG in biomedical and healthcare-related document question answering. The framework uses Amazon Bedrock Knowledge Bases for document ingestion, parsing, chunking, embedding generation, and evidence retrieval. Source PDF documents are stored in Amazon S3, embedded using Amazon Titan Text Embeddings V2, and indexed with Amazon OpenSearch Serverless. Hybrid retrieval first retrieves candidate evidence chunks, and Cohere reranking then prioritizes the most relevant passages before answer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
