Retrieval-Augmented Generation Assistant for Anatomical Pathology Laboratories
Diogo Pires, Yuriy Perezhohin, Mauro Castelli

TL;DR
This paper develops and evaluates a Retrieval-Augmented Generation system tailored for Anatomical Pathology labs, improving access to protocols through domain-specific retrieval and chunking strategies, enhancing workflow and safety.
Contribution
It introduces a novel RAG assistant for AP labs, utilizing domain-specific embeddings and optimized retrieval strategies, demonstrating improved accuracy and relevance over baseline methods.
Findings
Recursive chunking and hybrid retrieval improve baseline performance.
Biomedical-specific embeddings (MedEmbed) enhance answer relevance and faithfulness.
Retrieving a single top chunk maximizes efficiency and accuracy.
Abstract
Accurate and efficient access to laboratory protocols is essential in Anatomical Pathology (AP), where up to 70% of medical decisions depend on laboratory diagnoses. However, static documentation such as printed manuals or PDFs is often outdated, fragmented, and difficult to search, creating risks of workflow errors and diagnostic delays. This study proposes and evaluates a Retrieval-Augmented Generation (RAG) assistant tailored to AP laboratories, designed to provide technicians with context-grounded answers to protocol-related queries. We curated a novel corpus of 99 AP protocols from a Portuguese healthcare institution and constructed 323 question-answer pairs for systematic evaluation. Ten experiments were conducted, varying chunking strategies, retrieval methods, and embedding models. Performance was assessed using the RAGAS framework (faithfulness, answer relevance, context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Artificial Intelligence in Healthcare and Education
