Controllable retinal image synthesis using conditional StyleGAN and   latent space manipulation for improved diagnosis and grading of diabetic   retinopathy

Somayeh Pakdelmoez (1); Saba Omidikia (1); Seyyed Ali Seyyedsalehi; (1); Seyyede Zohreh Seyyedsalehi (2) ((1) Department of Biomedical; Engineering; Amirkabir University of Technology; Tehran; Iran; (2) Department; of Biomedical Engineering; Faculty of Health; Tehran Medical Sciences,; Islamic Azad University; Tehran; Iran)

arXiv:2409.07422·eess.IV·September 12, 2024

Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy

Somayeh Pakdelmoez (1), Saba Omidikia (1), Seyyed Ali Seyyedsalehi, (1), Seyyede Zohreh Seyyedsalehi (2) ((1) Department of Biomedical, Engineering, Amirkabir University of Technology, Tehran, Iran, (2) Department, of Biomedical Engineering, Faculty of Health

PDF

Open Access

TL;DR

This paper introduces a controllable, high-fidelity retinal image synthesis framework using conditional StyleGAN and latent space manipulation to enhance diabetic retinopathy detection and grading accuracy.

Contribution

It presents a novel method for generating diverse, controllable retinal images without auxiliary networks, improving dataset quality and classifier performance for diabetic retinopathy.

Findings

01

ResNet50 achieves over 98% accuracy with real data.

02

Synthetic images improve DR grading accuracy to 83.33%.

03

The generated images are highly realistic and enhance classifier robustness.

Abstract

Diabetic retinopathy (DR) is a consequence of diabetes mellitus characterized by vascular damage within the retinal tissue. Timely detection is paramount to mitigate the risk of vision loss. However, training robust grading models is hindered by a shortage of annotated data, particularly for severe cases. This paper proposes a framework for controllably generating high-fidelity and diverse DR fundus images, thereby improving classifier performance in DR grading and detection. We achieve comprehensive control over DR severity and visual features (optic disc, vessel structure, lesion areas) within generated images solely through a conditional StyleGAN, eliminating the need for feature masks or auxiliary networks. Specifically, leveraging the SeFa algorithm to identify meaningful semantics within the latent space, we manipulate the DR images generated conditionally on grades, further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis · Medical Image Segmentation Techniques · Image Retrieval and Classification Techniques

MethodsHuMan(Expedia)||How do I get a human at Expedia? · Adaptive Instance Normalization · Dense Connections · Feedforward Network · R1 Regularization · Convolution · StyleGAN · Focus