Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support
Shuying Chen, Sen Cui, Zhong Cao

TL;DR
This paper introduces Oph-Guid-RAG, a multimodal retrieval-augmented generation system for ophthalmic clinical decision support that improves accuracy and robustness by integrating visual evidence retrieval with controllable reasoning.
Contribution
The work presents a novel multimodal RAG system that directly retrieves guideline page images and employs a controllable retrieval framework for ophthalmology decision support.
Findings
Improves accuracy from 0.5956 to 0.6576 on challenging cases.
Achieves 30% improvement over GPT-5.2 in overall score.
Reranking, routing, and retrieval design are critical for performance.
Abstract
In this work, we propose Oph-Guid-RAG, a multimodal visual RAG system for ophthalmology clinical question answering and decision support. We treat each guideline page as an independent evidence unit and directly retrieve page images, preserving tables, flowcharts, and layout information. We further design a controllable retrieval framework with routing and filtering, which selectively introduces external evidence and reduces noise. The system integrates query decomposition, query rewriting, retrieval, reranking, and multimodal reasoning, and provides traceable outputs with guideline page references. We evaluate our method on HealthBench using a doctor-based scoring protocol. On the hard subset, our approach improves the overall score from 0.2969 to 0.3861 (+0.0892, +30.0%) compared to GPT-5.2, and achieves higher accuracy, improving from 0.5956 to 0.6576 (+0.0620, +10.4%). Compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Retinal Imaging and Analysis
