Med-SegLens: Latent-Level Model Diffing for Interpretable Medical Image Segmentation
Salma J. Ahmed, Emad A. Mohammed, Azam Asilian Bidgoli

TL;DR
Med-SegLens introduces a latent-level model diffing framework that interprets segmentation models, identifies dataset shifts, and enables targeted interventions to improve performance without retraining.
Contribution
This work presents Med-SegLens, a novel approach for decomposing segmentation model activations into interpretable latents, facilitating diagnosis and correction of failures across datasets.
Findings
Identifies stable shared representations across models and datasets.
Latent interventions can recover 70% of failure cases.
Improves Dice score from 39.4% to 74.2% after interventions.
Abstract
Modern segmentation models achieve strong predictive performance but remain largely opaque, limiting our ability to diagnose failures, understand dataset shift, or intervene in a principled manner. We introduce Med-SegLens, a model-diffing framework that decomposes segmentation model activations into interpretable latent features using sparse autoencoders trained on SegFormer and U-Net. Through cross-architecture and cross-dataset latent alignment across healthy, adult, pediatric, and sub-Saharan African glioma cohorts, we identify a stable backbone of shared representations, while dataset shift is driven by differential reliance on population-specific latents. We show that these latents act as causal bottlenecks for segmentation failures, and that targeted latent-level interventions can correct errors and improve cross-dataset adaption without retraining, recovering performance in 70%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Glioma Diagnosis and Treatment · AI in cancer detection
