$\texttt{ModSCAN}$: Measuring Stereotypical Bias in Large   Vision-Language Models from Vision and Language Modalities

Yukun Jiang; Zheng Li; Xinyue Shen; Yugeng Liu; Michael Backes; Yang; Zhang

arXiv:2410.06967·cs.CR·October 10, 2024

$\texttt{ModSCAN}$: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities

Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang, Zhang

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces ModSCAN, a framework for measuring stereotypical biases in large vision-language models across vision and language modalities, revealing significant biases and potential mitigation strategies.

Contribution

The paper presents ModSCAN, a novel measurement framework for detecting stereotypical biases in LVLMs from both visual and textual data modalities.

Findings

01

LVLMs exhibit significant stereotypical biases, especially in CogVLM.

02

Biases are likely due to training data and pre-trained model inherent biases.

03

Prompt prefixes can effectively reduce stereotypical biases.

Abstract

Large vision-language models (LVLMs) have been rapidly developed and widely used in various fields, but the (potential) stereotypical bias in the model is largely unexplored. In this study, we present a pioneering measurement framework, $ModSCAN$ , to $\underline{S C A N}$ the stereotypical bias within LVLMs from both vision and language $\underline{M o d}$ alities. $ModSCAN$ examines stereotypical biases with respect to two typical stereotypical attributes (gender and race) across three kinds of scenarios: occupations, descriptors, and persona traits. Our findings suggest that 1) the currently popular LVLMs show significant stereotype biases, with CogVLM emerging as the most biased model; 2) these stereotypical biases may stem from the inherent biases in the training dataset and pre-trained models; 3) the utilization of specific prompt prefixes (from both vision and language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TrustAIRLab/ModSCAN
noneOfficial

Datasets

A5hbr1ng3r/ModSCAN
dataset· 11k dl
11k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques