MedSteer: Counterfactual Endoscopic Synthesis via Training-Free Activation Steering
Trong-Thang Pham, Loc Nguyen, Anh Nguyen, Hien Nguyen, Ngan Le

TL;DR
MedSteer introduces a training-free activation steering method for endoscopic image synthesis, enabling precise counterfactual generation with high fidelity and structural preservation, improving data augmentation for medical imaging tasks.
Contribution
It proposes a novel, training-free activation steering framework that generates counterfactual endoscopic images while maintaining structural integrity, outperforming existing inversion-based methods.
Findings
Achieves high concept flip rates of 0.800, 0.925, and 0.950 across experiments.
Significantly improves dye removal to 75% compared to baseline methods.
Enhances downstream polyp detection with higher AUC of 0.9755.
Abstract
Generative diffusion models are increasingly used for medical imaging data augmentation, but text prompting cannot produce causal training data. Re-prompting rerolls the entire generation trajectory, altering anatomy, texture, and background. Inversion-based editing methods introduce reconstruction error that causes structural drift. We propose MedSteer, a training-free activation-steering framework for endoscopic synthesis. MedSteer identifies a pathology vector for each contrastive prompt pair in the cross-attention layers of a diffusion transformer. At inference time, it steers image activations along this vector, generating counterfactual pairs from scratch where the only difference is the steered concept. All other structure is preserved by construction. We evaluate MedSteer across three experiments on Kvasir v3 and HyperKvasir. On counterfactual generation across three clinical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Computer Graphics and Visualization Techniques
