# MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation

**Authors:** Francisco Caetano, Christiaan Viviers, Peter H.N. De With, Fons van der Sommen

arXiv: 2508.21435 · 2026-04-08

## TL;DR

MedShift is a novel generative model that enables high-quality, unpaired cross-domain translation of X-ray images, effectively bridging synthetic and real data gaps for improved medical imaging analysis.

## Contribution

It introduces MedShift, a class-conditional Flow Matching model that learns a shared latent space for flexible, unpaired domain translation in medical imaging, with a new dataset for benchmarking.

## Key findings

- MedShift achieves strong performance despite smaller size compared to diffusion models.
- It supports flexible inference tuning for perceptual or structural fidelity.
- The model effectively bridges synthetic and real X-ray domain gaps.

## Abstract

Synthetic medical data offers a scalable solution for training robust models, but significant domain gaps limit its generalizability to real-world clinical settings. This paper addresses the challenge of cross-domain translation between synthetic and real X-ray images of the head, focusing on bridging discrepancies in attenuation behavior, noise characteristics, and soft tissue representation. We propose MedShift, a unified class-conditional generative model based on Flow Matching and Schrodinger Bridges, which enables high-fidelity, unpaired image translation across multiple domains. Unlike prior approaches that require domain-specific training or rely on paired data, MedShift learns a shared domain-agnostic latent space and supports seamless translation between any pair of domains seen during training. We introduce X-DigiSkull, a new dataset comprising aligned synthetic and real skull X-rays under varying radiation doses, to benchmark domain translation models. Experimental results demonstrate that, despite its smaller model size compared to diffusion-based approaches, MedShift offers strong performance and remains flexible at inference time, as it can be tuned to prioritize either perceptual fidelity or structural consistency, making it a scalable and generalizable solution for domain adaptation in medical imaging. The code and dataset are available at https://caetas.github.io/medshift.html

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2508.21435/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/2508.21435/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/2508.21435/full.md

---
Source: https://tomesphere.com/paper/2508.21435