Fairness-Aware Fine-Tuning of Vision-Language Models for Medical Glaucoma Diagnosis

Zijian Gu; Yuxi Liu; Zhenhao Zhang; Song Wang

arXiv:2512.03477·cs.CV·March 11, 2026

Fairness-Aware Fine-Tuning of Vision-Language Models for Medical Glaucoma Diagnosis

Zijian Gu, Yuxi Liu, Zhenhao Zhang, Song Wang

PDF

Open Access

TL;DR

This paper presents a fairness-aware fine-tuning method for vision-language models in medical glaucoma diagnosis, significantly reducing demographic disparities while maintaining high accuracy with minimal additional parameters.

Contribution

It introduces a differentiable MaxAccGap loss and three fine-tuning methods that improve fairness in medical VLMs with parameter-efficient adaptation.

Findings

01

GR-LoRA reduces accuracy disparities by 69%

02

Maintains 53.15% overall accuracy

03

Requires only 0.24% trainable parameters

Abstract

Vision-language models achieve expert-level performance on medical imaging tasks but exhibit significant diagnostic accuracy disparities across demographic groups. We introduce fairness-aware Low-Rank Adaptation for medical VLMs, combining parameter efficiency with explicit fairness optimization. Our key algorithmic contribution is a differentiable MaxAccGap loss that enables end-to-end optimization of accuracy parity across demographic groups. We propose three methods: FR-LoRA integrates MaxAccGap regularization into the training objective, GR-LoRA applies inverse frequency weighting to balance gradient contributions, and Hybrid-LoRA combines both mechanisms. Evaluated on 10,000 glaucoma fundus images, GR-LoRA reduces diagnostic accuracy disparities by 69% while maintaining 53.15% overall accuracy. Ablation studies reveal that strong regularization strength achieves optimal fairness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis · Multimodal Machine Learning Applications · Retinal Diseases and Treatments