Reinforced Correlation Between Vision and Language for Precise Medical AI Assistant
Haonan Wang, Jiaji Mao, Lehan Wang, Qixiang Zhang, Marawan Elbatel, Yi, Qin, Huijun Hu, Baoxun Li, Wenhui Deng, Weifeng Qin, Hongrui Li, Jialin, Liang, Jun Shen, Xiaomeng Li

TL;DR
RCMed is a novel multimodal AI assistant that enhances vision-language alignment for precise medical diagnosis, achieving state-of-the-art results across diverse clinical tasks and modalities.
Contribution
The paper introduces RCMed, a full-stack medical AI system with a self-reinforcing correlation mechanism for improved multimodal alignment and generalization.
Findings
Achieves 23.5% improvement in cell segmentation accuracy.
Excels in 165 clinical tasks across 9 modalities.
Demonstrates strong generalization in external validation.
Abstract
Medical AI assistants support doctors in disease diagnosis, medical image analysis, and report generation. However, they still face significant challenges in clinical use, including limited accuracy with multimodal content and insufficient validation in real-world settings. We propose RCMed, a full-stack AI assistant that improves multimodal alignment in both input and output, enabling precise anatomical delineation, accurate localization, and reliable diagnosis through hierarchical vision-language grounding. A self-reinforcing correlation mechanism allows visual features to inform language context, while language semantics guide pixel-wise attention, forming a closed loop that refines both modalities. This correlation is enhanced by a color region description strategy, translating anatomical structures into semantically rich text to learn shape-location-text relationships across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis
