Classification of Large-Scale Fundus Image Data Sets: A Cloud-Computing Framework
Sohini Roychowdhury

TL;DR
This paper introduces a cloud-based framework for efficient classification of large fundus image datasets, optimizing feature selection to improve accuracy and reduce processing time in diabetic retinopathy detection.
Contribution
It proposes a generalized feature selection method combined with cloud computing to enhance classification accuracy and efficiency in large-scale fundus image analysis.
Findings
Achieved 90.1% accuracy in DR lesion classification with 792 seconds processing time.
Classified microaneurysms with 72% accuracy.
Classified minor blood vessels with 83.5% accuracy in 326 seconds.
Abstract
Large medical image data sets with high dimensionality require substantial amount of computation time for data creation and data processing. This paper presents a novel generalized method that finds optimal image-based feature sets that reduce computational time complexity while maximizing overall classification accuracy for detection of diabetic retinopathy (DR). First, region-based and pixel-based features are extracted from fundus images for classification of DR lesions and vessel-like structures. Next, feature ranking strategies are used to distinguish the optimal classification feature sets. DR lesion and vessel classification accuracies are computed using the boosted decision tree and decision forest classifiers in the Microsoft Azure Machine Learning Studio platform, respectively. For images from the DIARETDB1 data set, 40 of its highest-ranked features are used to classify four…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
