Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts

Xuyang Wu; Yuan Wang; Hsin-Tai Wu; Zhiqiang Tao; Yi Fang

arXiv:2406.17974·cs.CL·September 23, 2025

Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts

Xuyang Wu, Yuan Wang, Hsin-Tai Wu, Zhiqiang Tao, Yi Fang

PDF

Open Access 1 Repo

TL;DR

This paper evaluates demographic fairness in large vision-language models, revealing persistent biases and proposing a multi-modal Chain-of-thought strategy to mitigate unfairness across diverse demographic groups.

Contribution

It introduces a comprehensive fairness evaluation framework for LVLMs and proposes a novel multi-modal Chain-of-thought method for addressing biases.

Findings

01

LVLMs exhibit significant fairness disparities across demographic attributes.

02

Zero-shot prompting reveals persistent bias issues in LVLMs.

03

Proposed Chain-of-thought strategy improves fairness and transparency.

Abstract

Large vision-language models (LVLMs) have recently achieved significant progress, demonstrating strong capabilities in open-world visual understanding. However, it is not yet clear how LVLMs address demographic biases in real life, especially the disparities across attributes such as gender, skin tone, age and race. In this paper, We empirically investigate \emph{visual fairness} in several mainstream LVLMs by auditing their performance disparities across demographic attributes using public fairness benchmark datasets (e.g., FACET, UTKFace). Our fairness evaluation framework employs direct and single-choice question prompt on visual question-answering/classification tasks. Despite advancements in visual understanding, our zero-shot prompting results show that both open-source and closed-source LVLMs continue to exhibit fairness issues across different prompts and demographic groups.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

elviswxy/LVLM_fairness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQualitative Comparative Analysis Research · Social and Intergroup Psychology