# Enhancing the ophthalmic AI assessment with a fundus image quality classifier using local and global attention mechanisms

**Authors:** Shengzhan Wang, Wenyue Shen, Zhiyuan Gao, Xiaoyu Jiang, Yaqi Wang, Yunxiang Li, Xiaoyu Ma, Wenhao Wang, Shuanghua Xin, Weina Ren, Kai Jin, Juan Ye

PMC · DOI: 10.3389/fmed.2024.1418048 · Frontiers in Medicine · 2024-08-07

## TL;DR

This paper introduces a new AI model that improves the quality assessment of fundus images for diagnosing eye diseases by combining local and global image features.

## Contribution

LGAANet is the first algorithm trained on a multi-source heterogeneous fundus dataset for fundus image quality assessment.

## Key findings

- LGAANet achieved high accuracy scores (0.947, 0.924, 0.947) on color fundus photography datasets for illumination, clarity, and contrast.
- The model also demonstrated strong performance on ultrawide-field datasets with accuracy scores of 0.889, 0.913, and 0.923.
- The use of attention mechanisms and saliency maps enhances the interpretability of the model.

## Abstract

The assessment of image quality (IQA) plays a pivotal role in the realm of image-based computer-aided diagnosis techniques, with fundus imaging standing as the primary method for the screening and diagnosis of ophthalmic diseases. Conventional studies on fundus IQA tend to rely on simplistic datasets for evaluation, predominantly focusing on either local or global information, rather than a synthesis of both. Moreover, the interpretability of these studies often lacks compelling evidence. In order to address these issues, this study introduces the Local and Global Attention Aggregated Deep Neural Network (LGAANet), an innovative approach that integrates both local and global information for enhanced analysis.

The LGAANet was developed and validated using a Multi-Source Heterogeneous Fundus (MSHF) database, encompassing a diverse collection of images. This dataset includes 802 color fundus photography (CFP) images (302 from portable cameras), and 500 ultrawide-field (UWF) images from 904 patients with diabetic retinopathy (DR) and glaucoma, as well as healthy individuals. The assessment of image quality was meticulously carried out by a trio of ophthalmologists, leveraging the human visual system as a benchmark. Furthermore, the model employs attention mechanisms and saliency maps to bolster its interpretability.

In testing with the CFP dataset, LGAANet demonstrated remarkable accuracy in three critical dimensions of image quality (illumination, clarity and contrast based on the characteristics of human visual system, and indicates the potential aspects to improve the image quality), recording scores of 0.947, 0.924, and 0.947, respectively. Similarly, when applied to the UWF dataset, the model achieved accuracies of 0.889, 0.913, and 0.923, respectively. These results underscore the efficacy of LGAANet in distinguishing between varying degrees of image quality with high precision.

To our knowledge, LGAANet represents the inaugural algorithm trained on an MSHF dataset specifically for fundus IQA, marking a significant milestone in the advancement of computer-aided diagnosis in ophthalmology. This research significantly contributes to the field, offering a novel methodology for the assessment and interpretation of fundus images in the detection and diagnosis of ocular diseases.

## Linked entities

- **Diseases:** diabetic retinopathy (MONDO:0005266), glaucoma (MONDO:0005041)

## Full-text entities

- **Diseases:** glaucoma (MESH:D005901), ocular diseases (MESH:D005128), ophthalmic diseases (MESH:C535922), DR (MESH:D003930)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11339790/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11339790/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/PMC11339790/full.md

---
Source: https://tomesphere.com/paper/PMC11339790