Disentangling representations of retinal images with generative models

Sarah M\"uller; Lisa M. Koch; Hendrik P. A. Lensch; Philipp Berens

arXiv:2402.19186·cs.CV·June 24, 2025·1 cites

Disentangling representations of retinal images with generative models

Sarah M\"uller, Lisa M. Koch, Hendrik P. A. Lensch, Philipp Berens

PDF

Open Access 1 Repo

TL;DR

This paper introduces a generative model that disentangles patient attributes from camera effects in retinal fundus images, improving controllability and realism for ophthalmology AI applications.

Contribution

It proposes a novel disentanglement loss based on distance correlation to separate confounding factors in retinal images, enhancing interpretability and generation control.

Findings

01

Effective disentanglement of patient and camera attributes

02

Improved controllable and realistic image generation

03

Validated through qualitative and quantitative analyses

Abstract

Retinal fundus images play a crucial role in the early detection of eye diseases. However, the impact of technical factors on these images can pose challenges for reliable AI applications in ophthalmology. For example, large fundus cohorts are often confounded by factors like camera type, bearing the risk of learning shortcuts rather than the causal relationships behind the image generation process. Here, we introduce a population model for retinal fundus images that effectively disentangles patient attributes from camera effects, enabling controllable and highly realistic image generation. To achieve this, we propose a disentanglement loss based on distance correlation. Through qualitative and quantitative analyses, we show that our models encode desired information in disentangled subspaces and enable controllable image generation based on the learned subspaces, demonstrating the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

berenslab/disentangling-retinal-images
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms