Uncovering Cultural Representation Disparities in Vision-Language Models

Ram Mohan Rao Kadiyala; Siddhant Gupta; Jebish Purbey; Srishti Yadav; Suman Debnath; Alejandro Salamanca; Desmond Elliott

arXiv:2505.14729·cs.CV·August 1, 2025

Uncovering Cultural Representation Disparities in Vision-Language Models

Ram Mohan Rao Kadiyala, Siddhant Gupta, Jebish Purbey, Srishti Yadav, Suman Debnath, Alejandro Salamanca, Desmond Elliott

PDF

1 Datasets

TL;DR

This paper evaluates cultural biases in vision-language models by testing their accuracy on a country identification task across diverse datasets and prompting strategies, revealing significant disparities influenced by training data biases.

Contribution

It introduces a comprehensive evaluation of cultural biases in VLMs using the Country211 dataset and various prompting methods, highlighting how data distribution affects model fairness.

Findings

01

VLMs show significant accuracy disparities across countries.

02

Prompting strategies influence model bias and performance.

03

Training data biases impact model generalization across cultures.

Abstract

Vision-Language Models (VLMs) have demonstrated impressive capabilities across a range of tasks, yet concerns about their potential biases exist. This work investigates the extent to which prominent VLMs exhibit cultural biases by evaluating their performance on an image-based country identification task at a country level. Utilizing the geographically diverse Country211 dataset, we probe several large vision language models (VLMs) under various prompting strategies: open-ended questions, multiple-choice questions (MCQs) including challenging setups like multilingual and adversarial settings. Our analysis aims to uncover disparities in model accuracy across different countries and question formats, providing insights into how training data distribution and evaluation methodologies might influence cultural biases in VLMs. The findings highlight significant variations in performance,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Biases/CulturalBiases-2025
dataset· 9 dl
9 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.