Path-RAG: Knowledge-Guided Key Region Retrieval for Open-ended Pathology Visual Question Answering
Awais Naeem, Tianhao Li, Huang-Ru Liao, Jiawei Xu, Aby M. Mathew,, Zehao Zhu, Zhen Tan, Ajay Kumar Jaiswal, Raffi A. Salibian, Ziniu Hu,, Tianlong Chen, Ying Ding

TL;DR
Path-RAG introduces a knowledge-guided retrieval framework using HistoCartography to enhance open-ended pathology visual question answering, significantly improving accuracy by leveraging domain-specific tissue and cell information.
Contribution
The paper presents a novel Path-RAG framework that incorporates domain knowledge retrieval via HistoCartography, addressing the limitations of deep learning in complex pathology image analysis.
Findings
Improves PathVQA-Open accuracy from 38% to 47%.
Achieves 28% gain on H&E-stained pathology images.
Enhances long-form question answering performance by over 30%.
Abstract
Accurate diagnosis and prognosis assisted by pathology images are essential for cancer treatment selection and planning. Despite the recent trend of adopting deep-learning approaches for analyzing complex pathology images, they fall short as they often overlook the domain-expert understanding of tissue structure and cell composition. In this work, we focus on a challenging Open-ended Pathology VQA (PathVQA-Open) task and propose a novel framework named Path-RAG, which leverages HistoCartography to retrieve relevant domain knowledge from pathology images and significantly improves performance on PathVQA-Open. Admitting the complexity of pathology image analysis, Path-RAG adopts a human-centered AI approach by retrieving domain knowledge using HistoCartography to select the relevant patches from pathology images. Our experiments suggest that domain guidance can significantly boost the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Multimodal Machine Learning Applications
MethodsFocus
