$\texttt{ModSCAN}$: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities
Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang, Zhang

TL;DR
This paper introduces ModSCAN, a framework for measuring stereotypical biases in large vision-language models across vision and language modalities, revealing significant biases and potential mitigation strategies.
Contribution
The paper presents ModSCAN, a novel measurement framework for detecting stereotypical biases in LVLMs from both visual and textual data modalities.
Findings
LVLMs exhibit significant stereotypical biases, especially in CogVLM.
Biases are likely due to training data and pre-trained model inherent biases.
Prompt prefixes can effectively reduce stereotypical biases.
Abstract
Large vision-language models (LVLMs) have been rapidly developed and widely used in various fields, but the (potential) stereotypical bias in the model is largely unexplored. In this study, we present a pioneering measurement framework, , to the stereotypical bias within LVLMs from both vision and language alities. examines stereotypical biases with respect to two typical stereotypical attributes (gender and race) across three kinds of scenarios: occupations, descriptors, and persona traits. Our findings suggest that 1) the currently popular LVLMs show significant stereotype biases, with CogVLM emerging as the most biased model; 2) these stereotypical biases may stem from the inherent biases in the training dataset and pre-trained models; 3) the utilization of specific prompt prefixes (from both vision and language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Natural Language Processing Techniques
