SynFundus-1M: A High-quality Million-scale Synthetic fundus images   Dataset with Fifteen Types of Annotation

Fangxin Shang; Jie Fu; Yehui Yang; Haifeng Huang; Junwei Liu; Lei Ma

arXiv:2312.00377·cs.CV·March 15, 2024·2 cites

SynFundus-1M: A High-quality Million-scale Synthetic fundus images Dataset with Fifteen Types of Annotation

Fangxin Shang, Jie Fu, Yehui Yang, Haifeng Huang, Junwei Liu, Lei Ma

PDF

Open Access 1 Repo

TL;DR

SynFundus-1M is the largest synthetic fundus image dataset with detailed annotations, generated by a diffusion model, enhancing medical imaging research and model training while maintaining high realism.

Contribution

This paper introduces SynFundus-1M, a large-scale synthetic fundus dataset with detailed annotations, created using a novel diffusion model trained on authentic images.

Findings

01

Synthetic images are indistinguishable from real images by experts.

02

Models pretrained on SynFundus-1M outperform those trained on other datasets.

03

SynFundus-1M accelerates convergence in downstream retinal disease diagnosis tasks.

Abstract

Large-scale public datasets with high-quality annotations are rarely available for intelligent medical imaging research, due to data privacy concerns and the cost of annotations. In this paper, we release SynFundus-1M, a high-quality synthetic dataset containing over one million fundus images in terms of \textbf{eleven disease types}. Furthermore, we deliberately assign four readability labels to the key regions of the fundus images. To the best of our knowledge, SynFundus-1M is currently the largest fundus dataset with the most sophisticated annotations. Leveraging over 1.3 million private authentic fundus images from various scenarios, we trained a powerful Denoising Diffusion Probabilistic Model, named SynFundus-Generator. The released SynFundus-1M are generated by SynFundus-Generator under predefined conditions. To demonstrate the value of SynFundus-1M, extensive experiments are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

parap1uie-s/synfundus-1m
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis · Digital Imaging for Blood Diseases · Retinal and Optic Conditions

MethodsMulti-Head Attention · Attention Is All You Need · Dropout · Linear Layer · Dense Connections · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Byte Pair Encoding · Adam