Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature
Pouria Mortezaagha, Arya Rahgozar

TL;DR
This paper introduces GraLC-RAG, a graph-aware framework for biomedical literature retrieval that improves structural coverage across document sections, addressing limitations of traditional ranking metrics.
Contribution
It proposes a novel structure-aware chunking and retrieval method that enhances multi-section evidence retrieval in biomedical RAG systems.
Findings
Structure-aware methods retrieve from more sections than content-similarity methods.
KG-infused retrieval narrows answer quality gaps while increasing section diversity.
Standard metrics undervalue the importance of structural retrieval in biomedical documents.
Abstract
Retrieval-Augmented Generation (RAG) systems for biomedical literature are typically evaluated using ranking metrics like Mean Reciprocal Rank (MRR), which measure how well the system identifies the single most relevant chunk. We argue that for full-text scientific documents, this paradigm is incomplete: it rewards retrieval precision while ignoring retrieval breadth -- the ability to surface evidence from across a document's structural sections. We propose GraLC-RAG, a framework that unifies late chunking with graph-aware structural intelligence, introducing structure-aware chunk boundary detection, UMLS knowledge graph infusion, and graph-guided hybrid retrieval. We evaluate six strategies on 2,359 IMRaD-filtered PubMed Central articles using 2,033 cross-section questions and two metric families: standard ranking metrics (MRR, Recall@k) and structural coverage metrics (SecCov@k, CS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Information Retrieval and Search Behavior · Topic Modeling
