Evaluating Fairness in Large Vision-Language Models Across Diverse Demographic Attributes and Prompts
Xuyang Wu, Yuan Wang, Hsin-Tai Wu, Zhiqiang Tao, Yi Fang

TL;DR
This paper evaluates demographic fairness in large vision-language models, revealing persistent biases and proposing a multi-modal Chain-of-thought strategy to mitigate unfairness across diverse demographic groups.
Contribution
It introduces a comprehensive fairness evaluation framework for LVLMs and proposes a novel multi-modal Chain-of-thought method for addressing biases.
Findings
LVLMs exhibit significant fairness disparities across demographic attributes.
Zero-shot prompting reveals persistent bias issues in LVLMs.
Proposed Chain-of-thought strategy improves fairness and transparency.
Abstract
Large vision-language models (LVLMs) have recently achieved significant progress, demonstrating strong capabilities in open-world visual understanding. However, it is not yet clear how LVLMs address demographic biases in real life, especially the disparities across attributes such as gender, skin tone, age and race. In this paper, We empirically investigate \emph{visual fairness} in several mainstream LVLMs by auditing their performance disparities across demographic attributes using public fairness benchmark datasets (e.g., FACET, UTKFace). Our fairness evaluation framework employs direct and single-choice question prompt on visual question-answering/classification tasks. Despite advancements in visual understanding, our zero-shot prompting results show that both open-source and closed-source LVLMs continue to exhibit fairness issues across different prompts and demographic groups.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQualitative Comparative Analysis Research · Social and Intergroup Psychology
