# Comparative analysis of novel preprocessing techniques and deep learning based multi modal feature fusion for diabetic retinopathy grading

**Authors:** B. C. Anupama, Sheela N. Rao, M. Bindu Malini, V. Vikram Athreya

PMC · DOI: 10.1038/s41598-025-31339-w · Scientific Reports · 2025-12-12

## TL;DR

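This paper introduces three fundus-image preprocessing techniques and fuses handcrafted (LBP, GLCM) features with ResNet-50 deep features to improve automated diabetic retinopathy grading. The best configuration, XGBoost with Adaptive Sigmoid Enhancement, reaches 96.39% accuracy.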

## Contribution

The study proposes three novel preprocessing methods (Adaptive Sigmoid Enhancement, LAB-ACE Image Enhancement, and Multi-channel Image Enhancement) and a multi-modal feature fusion framework for diabetic retinopathy grading.

## Key findings

- XGBoost with fused features and Adaptive Sigmoid Enhancement achieves 96.39% accuracy in DR grading.
- The proposed preprocessing techniques improve lesion visibility and reduce background noise effectively.
- Multi-modal fusion of handcrafted and deep features outperforms individual feature sets.
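The fusion of handcrafted and deep features can be illustrated with a minimal sketch. It assumes early fusion by concatenation with per-block standardization; the summary does not state the paper's exact fusion scheme, so the function names (`zscore`, `fuse_features`), the feature dimensions, and the random stand-in data are all hypothetical.

```python
import numpy as np

def zscore(features: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Standardize each feature column to zero mean and unit variance."""
    mean = features.mean(axis=0, keepdims=True)
    std = features.std(axis=0, keepdims=True)
    return (features - mean) / (std + eps)

def fuse_features(handcrafted: np.ndarray, deep: np.ndarray) -> np.ndarray:
    """Early fusion (assumed): standardize each block, then concatenate per sample."""
    if handcrafted.shape[0] != deep.shape[0]:
        raise ValueError("both feature sets must describe the same samples")
    return np.concatenate([zscore(handcrafted), zscore(deep)], axis=1)

# Random stand-ins; real inputs would be descriptors computed per fundus image.
rng = np.random.default_rng(0)
handcrafted = rng.normal(size=(8, 30))   # hypothetical LBP + GLCM descriptor length
deep = rng.normal(size=(8, 2048))        # hypothetical pooled ResNet-50 feature length
fused = fuse_features(handcrafted, deep)
print(fused.shape)
```

Standardizing each block before concatenation keeps the much longer deep-feature vector from dominating distance-based classifiers such as KNN or SVM. In a real experiment the mean and standard deviation would be estimated on the training split only and reused for the test split.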

## Abstract

Fundus images are crucial for the detection and monitoring of retinal diseases such as diabetic retinopathy (DR). However, issues such as uneven illumination, low contrast, and noise often degrade image quality, impacting the accuracy of automated grading systems. This study introduces three novel preprocessing techniques (Adaptive Sigmoid Enhancement, LAB-ACE Image Enhancement, and Multi-channel Image Enhancement) designed to address these challenges. Adaptive Sigmoid Enhancement adaptively adjusts local contrast to highlight subtle lesions; LAB-ACE operates in the LAB color space to selectively enhance the lightness channel while preserving color fidelity; and Multi-channel Image Enhancement applies targeted green-channel optimization combined with contrast stretching and channel recombination. These methods extend beyond conventional contrast enhancement and normalization by integrating multi-stage adaptive processing and color-channel-specific optimization to improve lesion visibility and vessel delineation while minimizing background noise. Following preprocessing, handcrafted features (LBP, GLCM) and deep features from a pre-trained ResNet-50 are fused in a multi-modal framework and evaluated using multiple classifiers, including SVM, KNN, Random Forest, and XGBoost. Results demonstrate that XGBoost with fused features and Adaptive Sigmoid Enhancement achieves the highest accuracy (96.39%), outperforming other combinations. The findings highlight the effectiveness of the proposed preprocessing strategies in enhancing DR grading performance, paving the way for improved computer-aided diagnosis systems.
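To make the idea of locally adaptive sigmoid contrast concrete, here is a minimal sketch. It is not the paper's Adaptive Sigmoid Enhancement, whose formula is not given in this summary. It assumes a single-channel image scaled to [0, 1] and a sigmoid whose midpoint follows the local mean; `box_mean`, `adaptive_sigmoid`, `radius`, and `gain` are hypothetical names and parameters.

```python
import numpy as np

def box_mean(image: np.ndarray, radius: int) -> np.ndarray:
    """Local mean over a (2*radius+1)^2 window, computed with an integral image."""
    k = 2 * radius + 1
    padded = np.pad(image, radius, mode="reflect")
    # Prepend a zero row and column so window sums reduce to four lookups.
    integral = np.pad(padded, ((1, 0), (1, 0))).cumsum(axis=0).cumsum(axis=1)
    window_sum = (integral[k:, k:] - integral[:-k, k:]
                  - integral[k:, :-k] + integral[:-k, :-k])
    return window_sum / (k * k)

def adaptive_sigmoid(image: np.ndarray, radius: int = 7, gain: float = 8.0) -> np.ndarray:
    """Assumed variant: sigmoid contrast curve centered on the local mean."""
    image = image.astype(np.float64)
    local_mean = box_mean(image, radius)
    # Pixels brighter than their neighborhood map above 0.5, darker ones below.
    return 1.0 / (1.0 + np.exp(-gain * (image - local_mean)))

# Synthetic example: a single bright "lesion" pixel on a uniform background.
img = np.full((32, 32), 0.2)
img[16, 16] = 0.8
out = adaptive_sigmoid(img)
```

Because the midpoint tracks the neighborhood average, slow illumination gradients are largely cancelled while small structures that differ from their surroundings are pushed toward the ends of the output range. A larger `gain` steepens the curve, and a larger `radius` makes the reference background smoother.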

## Linked entities

- **Diseases:** diabetic retinopathy (MONDO:0005266)

## Full-text entities

- **Diseases:** diabetic retinopathy (MESH:D003930)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12800199/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12800199/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12800199/full.md

---
Source: https://tomesphere.com/paper/PMC12800199