Decentralized LoRA augmented transformer with multi-scale feature learning for secured eye diagnosis

Md. Naimur Asif Borno; Md Sakib Hossain Shovon; MD Hanif Sikder; Iffat Firozy Rimi; Tahani Jaser Alahmadi; Mohammad Ali Moni

arXiv:2505.06982·cs.CV·May 18, 2026

Decentralized LoRA augmented transformer with multi-scale feature learning for secured eye diagnosis

Md. Naimur Asif Borno, Md Sakib Hossain Shovon, MD Hanif Sikder, Iffat Firozy Rimi, Tahani Jaser Alahmadi, Mohammad Ali Moni

PDF

TL;DR

This paper introduces a privacy-preserving, multi-scale transformer framework with LoRA and federated learning for improved ophthalmic disease diagnosis, demonstrating superior accuracy and interpretability on benchmark datasets.

Contribution

It presents a novel integrated framework combining multi-scale patch embedding, LoRA, knowledge distillation, and federated learning for secure and effective eye disease diagnosis.

Findings

01

Outperforms traditional CNNs and transformer models on OCTDL and Eye Disease Image Dataset.

02

Achieves higher AUC, F1 score, and precision across evaluations.

03

Provides interpretable Grad-CAM++ visualizations supporting clinical trust.

Abstract

Accurate and privacy-preserving diagnosis of ophthalmic diseases remains a critical challenge in medical imaging, particularly given the limitations of existing deep learning models in handling data imbalance, data privacy concerns, spatial feature diversity, and clinical interpretability. This paper proposes a novel Data efficient Image Transformer (DeiT) based framework that integrates context aware multiscale patch embedding, Low-Rank Adaptation (LoRA), knowledge distillation, and federated learning to address these challenges in a unified manner. The proposed model effectively captures both local and global retinal features by leveraging multi scale patch representations with local and global attention mechanisms. LoRA integration enhances computational efficiency by reducing the number of trainable parameters, while federated learning ensures secure, decentralized training without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.