Feature-level Site Leakage Reduction for Cross-Hospital Chest X-ray Transfer via Self-Supervised Learning

Ayoub Louaye Bouaziz; Lokmane Chebouba

arXiv:2604.00263·eess.IV·April 2, 2026

Feature-level Site Leakage Reduction for Cross-Hospital Chest X-ray Transfer via Self-Supervised Learning

Ayoub Louaye Bouaziz, Lokmane Chebouba

PDF

TL;DR

This paper investigates how to measure site leakage in cross-hospital chest X-ray models, revealing that self-supervised learning improves transfer performance and that adversarial methods may not reliably reduce leakage or improve transfer.

Contribution

It introduces a method to measure site leakage directly and analyzes how different transfer techniques affect leakage and model performance across hospitals.

Findings

01

Multi-site SSL significantly improves transfer accuracy.

02

Adversarial site confusion reduces leakage measurement but does not reliably improve transfer.

03

Measuring leakage alters the interpretation of transfer methods' effectiveness.

Abstract

Cross-hospital failure in chest X-ray models is often attributed to domain shift, yet most work assumes invariance without measuring it. This paper studies how to measure site leakage directly and how that measurement changes conclusions about transfer methods. We study multi-site self-supervised learning (SSL) and feature-level adversarial site confusion for cross-hospital transfer. We pretrain a ResNet-18 on NIH and CheXpert without pathology labels. We then freeze the encoder and train a linear pneumonia classifier on NIH only, evaluating transfer to RSNA. We quantify site leakage using a post hoc linear probe that predicts acquisition site from frozen backbone features $f$ and projection features $z$ . Across 3 random seeds, multi-site SSL improves RSNA AUC from 0.6736 $\pm$ 0.0148 (ImageNet initialization) to 0.7804 $\pm$ 0.0197. Adding adversarial site confusion on $f$ reduces…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.