Decentralized LoRA augmented transformer with multi-scale feature learning for secured eye diagnosis
Md. Naimur Asif Borno, Md Sakib Hossain Shovon, MD Hanif Sikder, Iffat Firozy Rimi, Tahani Jaser Alahmadi, Mohammad Ali Moni

TL;DR
This paper introduces a privacy-preserving, multi-scale transformer framework with LoRA and federated learning for improved ophthalmic disease diagnosis, demonstrating superior accuracy and interpretability on benchmark datasets.
Contribution
It presents a novel integrated framework combining multi-scale patch embedding, LoRA, knowledge distillation, and federated learning for secure and effective eye disease diagnosis.
Findings
Outperforms traditional CNNs and transformer models on OCTDL and Eye Disease Image Dataset.
Achieves higher AUC, F1 score, and precision across evaluations.
Provides interpretable Grad-CAM++ visualizations supporting clinical trust.
Abstract
Accurate and privacy-preserving diagnosis of ophthalmic diseases remains a critical challenge in medical imaging, particularly given the limitations of existing deep learning models in handling data imbalance, data privacy concerns, spatial feature diversity, and clinical interpretability. This paper proposes a novel Data efficient Image Transformer (DeiT) based framework that integrates context aware multiscale patch embedding, Low-Rank Adaptation (LoRA), knowledge distillation, and federated learning to address these challenges in a unified manner. The proposed model effectively captures both local and global retinal features by leveraging multi scale patch representations with local and global attention mechanisms. LoRA integration enhances computational efficiency by reducing the number of trainable parameters, while federated learning ensures secure, decentralized training without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
