Disentanglement and Assessment of Shortcuts in Ophthalmological Retinal Imaging Exams
Leonor Fernandes, Tiago Gon\c{c}alves, Jo\~ao Matos, Luis Filipe Nakayama, Jaime S. Cardoso

TL;DR
This study evaluates AI models for diabetic retinopathy detection in retinal images, focusing on fairness and bias mitigation through disentanglement, revealing complex effects on model performance and disparities across subgroups.
Contribution
It provides a comprehensive analysis of fairness issues in retinal imaging AI and assesses disentanglement as a bias mitigation technique across multiple models.
Findings
High accuracy in DR prediction (up to 94% AUROC).
Fairness disparities observed between subgroups, e.g., 10% AUROC gap.
Disentanglement effects vary, improving some models while degrading others.
Abstract
Diabetic retinopathy (DR) is a leading cause of vision loss in working-age adults. While screening reduces the risk of blindness, traditional imaging is often costly and inaccessible. Artificial intelligence (AI) algorithms present a scalable diagnostic solution, but concerns regarding fairness and generalization persist. This work evaluates the fairness and performance of image-trained models in DR prediction, as well as the impact of disentanglement as a bias mitigation technique, using the diverse mBRSET fundus dataset. Three models, ConvNeXt V2, DINOv2, and Swin V2, were trained on macula images to predict DR and sensitive attributes (SAs) (e.g., age and gender/sex). Fairness was assessed between subgroups of SAs, and disentanglement was applied to reduce bias. All models achieved high DR prediction performance in diagnosing (up to 94% AUROC) and could reasonably predict age and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPharmacovigilance and Adverse Drug Reactions · Artificial Intelligence in Healthcare and Education · Academic integrity and plagiarism
