Feature-level Site Leakage Reduction for Cross-Hospital Chest X-ray Transfer via Self-Supervised Learning
Ayoub Louaye Bouaziz, Lokmane Chebouba

TL;DR
This paper investigates how to measure site leakage in cross-hospital chest X-ray models, revealing that self-supervised learning improves transfer performance and that adversarial methods may not reliably reduce leakage or improve transfer.
Contribution
It introduces a method to measure site leakage directly and analyzes how different transfer techniques affect leakage and model performance across hospitals.
Findings
Multi-site SSL significantly improves transfer accuracy.
Adversarial site confusion reduces leakage measurement but does not reliably improve transfer.
Measuring leakage alters the interpretation of transfer methods' effectiveness.
Abstract
Cross-hospital failure in chest X-ray models is often attributed to domain shift, yet most work assumes invariance without measuring it. This paper studies how to measure site leakage directly and how that measurement changes conclusions about transfer methods. We study multi-site self-supervised learning (SSL) and feature-level adversarial site confusion for cross-hospital transfer. We pretrain a ResNet-18 on NIH and CheXpert without pathology labels. We then freeze the encoder and train a linear pneumonia classifier on NIH only, evaluating transfer to RSNA. We quantify site leakage using a post hoc linear probe that predicts acquisition site from frozen backbone features and projection features . Across 3 random seeds, multi-site SSL improves RSNA AUC from 0.6736 0.0148 (ImageNet initialization) to 0.7804 0.0197. Adding adversarial site confusion on reduces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
