Uplifting Lower-Income Data: Strategies for Socioeconomic Perspective Shifts in Large Multi-modal Models

Joan Nwatu; Oana Ignat; Rada Mihalcea

arXiv:2407.02623·cs.CY·October 15, 2024

TL;DR

This paper proposes prompting strategies that add geographic and socioeconomic attributes to the prompt, improving the performance of Large Multi-modal Models (LMMs) on underrepresented, lower-income data and mitigating the effects of unequal representation in training data.

Contribution

It introduces prompting techniques that use socioeconomic and geographic information to improve model fairness and performance on underrepresented, lower-income data.

Findings

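01 Prompting with geographic and socioeconomic attributes improves model performance on lower-income data.

02 These prompts favor retrieving topic appearances commonly found in data from low-income households across different countries.

03 The analyses identify the contexts where these strategies yield the most improvements.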

Abstract

Recent work has demonstrated that the unequal representation of cultures and socioeconomic groups in training data leads to biased Large Multi-modal (LMM) models. To improve LMM model performance on underrepresented data, we propose and evaluate several prompting strategies using non-English, geographic, and socioeconomic attributes. We show that these geographic and socioeconomic integrated prompts favor retrieving topic appearances commonly found in data from low-income households across different countries, leading to improved LMM model performance on lower-income data. Our analyses identify and highlight contexts where these strategies yield the most improvements.
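
As a rough illustration of what attribute-integrated prompting could look like, the sketch below builds text prompts for a topic with optional geographic and socioeconomic attributes. The template wording, the function names `build_prompt` and `prompt_variants`, and the example values are all hypothetical; they are not taken from the paper or its repository and only show the general idea of adding attributes to a prompt.

```python
from typing import Dict, Optional


def build_prompt(topic: str,
                 country: Optional[str] = None,
                 income: Optional[str] = None) -> str:
    """Compose a text prompt for a topic, optionally adding attributes.

    The template is an assumption made for illustration, not the
    wording used by the paper's authors.
    """
    prompt = f"This is a photo of a {topic}"
    if income is not None:
        # Socioeconomic attribute, e.g. "low-income"
        prompt += f" in a {income} household"
    if country is not None:
        # Geographic attribute, e.g. a country name
        prompt += f" in {country}"
    return prompt + "."


def prompt_variants(topic: str, country: str, income: str) -> Dict[str, str]:
    """Return a default prompt plus three attribute-integrated variants."""
    return {
        "default": build_prompt(topic),
        "geographic": build_prompt(topic, country=country),
        "socioeconomic": build_prompt(topic, income=income),
        "combined": build_prompt(topic, country=country, income=income),
    }


if __name__ == "__main__":
    for name, text in prompt_variants("stove", "Nigeria", "low-income").items():
        print(f"{name}: {text}")
```

Each variant could then be passed as the text query to a multi-modal model, so that retrieval or classification results can be compared between the default prompt and the attribute-integrated ones.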

Code & Models

Repositories

anniejoan/uplifting-lower-income-data (Official)

Videos

Uplifting Lower-Income Data: Strategies for Socioeconomic Perspective Shifts in Large Multi-modal Models · Underline

Taxonomy

Topics: E-Government and Public Services