TL;DR
FairEnc is a novel vision-language model designed to improve fairness in glaucoma detection by jointly debiasing visual and textual modalities across multiple demographic attributes, ensuring equitable healthcare outcomes.
Contribution
The paper introduces FairEnc, a pretraining method that mitigates biases in vision-language models for medical diagnosis, using synthetic data generation and dual-level fairness strategies.
Findings
Reduces demographic disparity as measured by DPD and DEOdds.
Maintains strong diagnostic performance in zero-shot and linear probing evaluations.
Generalizes fairness under distribution shifts across datasets.
Abstract
Automated glaucoma detection is critical for preventing irreversible vision loss and reducing the burden on healthcare systems. However, ensuring fairness across diverse patient populations remains a significant challenge. In this paper, we propose FairEnc, a fair pretraining method for vision-language models (VLMs) that enables simultaneous debiasing across multiple sensitive attributes. FairEnc jointly mitigates biases in both textual and visual modalities with respect to multiple sensitive attributes, including race, gender, ethnicity, and language. Specifically, for the textual encoder, we leverage a large language model to generate synthetic clinical descriptions with varied sensitive attributes while preserving disease semantics, and employ a contrastive alignment objective to encourage demographic-invariant representations. For the visual encoder, we propose a dual-level fairness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
