Leveraging the Structure of Medical Data for Improved Representation Learning

Andrea Agostini; Sonia Laguna; Alain Ryser; Samuel Ruiperez-Campillo; Moritz Vandenhirtz; Nicolas Deperrois; Farhad Nooralahzadeh; Michael Krauthammer; Thomas M. Sutter; Julia E. Vogt

arXiv:2507.02987·cs.CV·July 25, 2025

Leveraging the Structure of Medical Data for Improved Representation Learning

Andrea Agostini, Sonia Laguna, Alain Ryser, Samuel Ruiperez-Campillo, Moritz Vandenhirtz, Nicolas Deperrois, Farhad Nooralahzadeh, Michael Krauthammer, Thomas M. Sutter, Julia E. Vogt

PDF

Open Access

TL;DR

This paper introduces a self-supervised learning framework that exploits the inherent multi-view structure of medical imaging data, specifically chest X-rays, to improve representation learning without textual labels.

Contribution

It presents a novel, structure-aware pretraining method for medical images that enhances data efficiency and domain-specific feature extraction.

Findings

01

Outperforms supervised methods on MIMIC-CXR

02

Requires no textual annotations for training

03

Leverages multi-view structure for better representations

Abstract

Building generalizable medical AI systems requires pretraining strategies that are data-efficient and domain-aware. Unlike internet-scale corpora, clinical datasets such as MIMIC-CXR offer limited image counts and scarce annotations, but exhibit rich internal structure through multi-view imaging. We propose a self-supervised framework that leverages the inherent structure of medical datasets. Specifically, we treat paired chest X-rays (i.e., frontal and lateral views) as natural positive pairs, learning to reconstruct each view from sparse patches while aligning their latent embeddings. Our method requires no textual supervision and produces informative representations. Evaluated on MIMIC-CXR, we show strong performance compared to supervised objectives and baselines being trained without leveraging structure. This work provides a lightweight, modality-agnostic blueprint for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare