Learning to Trim: End-to-End Causal Graph Pruning with Dynamic Anatomical Feature Banks for Medical VQA
Zibo Xu, Qiang Li, Weizhi Nie, Yuting Su

TL;DR
The paper introduces a novel end-to-end causal pruning framework for Medical VQA that dynamically suppresses dataset biases, improving robustness and generalization.
Contribution
It proposes a learnable causal trimming method with a dynamic anatomical feature bank, enabling adaptive suppression of spurious correlations in medical VQA models.
Findings
LCT improves robustness across multiple datasets.
It outperforms existing debiasing strategies.
The dynamic feature bank captures dataset-level regularities effectively.
Abstract
Medical Visual Question Answering (MedVQA) models often exhibit limited generalization due to reliance on dataset-specific correlations, such as recurring anatomical patterns or question-type regularities, rather than genuine diagnostic evidence. Existing causal approaches are typically implemented as static adjustments or post-hoc corrections. To address this issue, we propose a Learnable Causal Trimming (LCT) framework that integrates causal pruning into end-to-end optimization. We introduce a Dynamic Anatomical Feature Bank (DAFB), updated via a momentum mechanism, to capture global prototypes of frequent anatomical and linguistic patterns, serving as an approximation of dataset-level regularities. We further design a differentiable trimming module that estimates the dependency between instance-level representations and the global feature bank. Features highly correlated with global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
