TL;DR
MedSynapse-V introduces a novel framework that enhances medical visual language models by simulating clinician-like diagnostic memory evolution, significantly improving diagnostic accuracy.
Contribution
It proposes a dynamic latent memory evolution mechanism with causal refinement and dual-branch learning, addressing cognitive misalignments in medical VLMs.
Findings
Outperforms existing state-of-the-art methods in diagnostic accuracy.
Effectively transfers external expertise into endogenous model parameters.
Demonstrates robustness across multiple medical datasets.
Abstract
High-precision medical diagnosis relies not only on static imaging features but also on the implicit diagnostic memory experts instantly invoke during image interpretation. We pinpoint a fundamental cognitive misalignment in medical VLMs caused by discrete tokenization, leading to quantization loss, long-range information dissipation, and missing case-adaptive expertise. To bridge this gap, we propose ours, a framework for latent diagnostic memory evolution that simulates the experiential invocation of clinicians by dynamically synthesizing implicit diagnostic memories within the model's hidden stream. Specifically, it begins with a Meta Query for Prior Memorization mechanism, where learnable probes retrieve structured priors from an anatomical prior encoder to generate condensed implicit memories. To ensure clinical fidelity, we introduce Causal Counterfactual Refinement (CCR), which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
