TL;DR
This paper examines geo-economic biases in chart-to-text generation by large Vision-Language Models, revealing a tendency to produce more positive summaries for high-income countries and evaluating partial mitigation strategies.
Contribution
It provides a large-scale evaluation of geo-economic biases in VLM-generated chart summaries and explores prompt-based debiasing techniques, highlighting the need for more effective solutions.
Findings
VLMs favor high-income countries with positive descriptions
Prompt-based debiasing techniques are only partially effective
Bias varies across different models like GPT-4o-mini and Gemini-1.5-Flash
Abstract
Charts are very common for exploring data and communicating insights, but extracting key takeaways from charts and articulating them in natural language can be challenging. The chart-to-text task aims to automate this process by generating textual summaries of charts. While with the rapid advancement of large Vision-Language Models (VLMs), we have witnessed great progress in this domain, little to no attention has been given to potential biases in their outputs. This paper investigates how VLMs can amplify geo-economic biases when generating chart summaries, potentially causing societal harm. Specifically, we conduct a large-scale evaluation of geo-economic biases in VLM-generated chart summaries across 6,000 chart-country pairs from six widely used proprietary and open-source models to understand how a country's economic status influences the sentiment of generated summaries. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
