Survey of Social Bias in Vision-Language Models
Nayeon Lee, Yejin Bang, Holy Lovenia, Samuel Cahyawijaya, Wenliang, Dai, Pascale Fung

TL;DR
This survey reviews social bias issues in vision-language models, highlighting their similarities with NLP and CV biases, and offers guidelines for mitigation to promote fairer AI systems.
Contribution
It provides a comprehensive overview of social bias in pre-trained vision-language models and compares it with NLP and CV biases, offering insights and mitigation strategies.
Findings
VL models are susceptible to social bias similar to NLP and CV.
Current understanding of bias in VL models is limited compared to NLP and CV.
Guidelines are proposed for addressing and reducing social bias in multimodal models.
Abstract
In recent years, the rapid advancement of machine learning (ML) models, particularly transformer-based pre-trained models, has revolutionized Natural Language Processing (NLP) and Computer Vision (CV) fields. However, researchers have discovered that these models can inadvertently capture and reinforce social biases present in their training datasets, leading to potential social harms, such as uneven resource allocation and unfair representation of specific social groups. Addressing these biases and ensuring fairness in artificial intelligence (AI) systems has become a critical concern in the ML community. The recent introduction of pre-trained vision-and-language (VL) models in the emerging multimodal field demands attention to the potential social biases present in these models as well. Although VL models are susceptible to social bias, there is a limited understanding compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
