Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy
Somayeh Pakdelmoez (1), Saba Omidikia (1), Seyyed Ali Seyyedsalehi, (1), Seyyede Zohreh Seyyedsalehi (2) ((1) Department of Biomedical, Engineering, Amirkabir University of Technology, Tehran, Iran, (2) Department, of Biomedical Engineering, Faculty of Health

TL;DR
This paper introduces a controllable, high-fidelity retinal image synthesis framework using conditional StyleGAN and latent space manipulation to enhance diabetic retinopathy detection and grading accuracy.
Contribution
It presents a novel method for generating diverse, controllable retinal images without auxiliary networks, improving dataset quality and classifier performance for diabetic retinopathy.
Findings
ResNet50 achieves over 98% accuracy with real data.
Synthetic images improve DR grading accuracy to 83.33%.
The generated images are highly realistic and enhance classifier robustness.
Abstract
Diabetic retinopathy (DR) is a consequence of diabetes mellitus characterized by vascular damage within the retinal tissue. Timely detection is paramount to mitigate the risk of vision loss. However, training robust grading models is hindered by a shortage of annotated data, particularly for severe cases. This paper proposes a framework for controllably generating high-fidelity and diverse DR fundus images, thereby improving classifier performance in DR grading and detection. We achieve comprehensive control over DR severity and visual features (optic disc, vessel structure, lesion areas) within generated images solely through a conditional StyleGAN, eliminating the need for feature masks or auxiliary networks. Specifically, leveraging the SeFa algorithm to identify meaningful semantics within the latent space, we manipulate the DR images generated conditionally on grades, further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Medical Image Segmentation Techniques · Image Retrieval and Classification Techniques
MethodsHuMan(Expedia)||How do I get a human at Expedia? · Adaptive Instance Normalization · Dense Connections · Feedforward Network · R1 Regularization · Convolution · StyleGAN · Focus
