Women also Snowboard: Overcoming Bias in Captioning Models (Extended   Abstract)

Lisa Anne Hendricks; Kaylee Burns; Kate Saenko; Trevor Darrell; Anna; Rohrbach

arXiv:1807.00517·cs.CV·July 3, 2018·1 cites

Women also Snowboard: Overcoming Bias in Captioning Models (Extended Abstract)

Lisa Anne Hendricks, Kaylee Burns, Kate Saenko, Trevor Darrell, Anna, Rohrbach

PDF

Open Access

TL;DR

This paper introduces the Equalizer model to reduce gender bias in image captioning, ensuring fairer gender predictions by focusing on visual evidence rather than contextual cues, and demonstrates improved accuracy and fairness.

Contribution

The paper proposes the Equalizer model with novel loss functions to mitigate gender bias in captioning models, promoting fairer and more accurate gender-specific descriptions.

Findings

01

Lower error rate in gendered image captions.

02

More accurate gender predictions matching ground truth ratios.

03

Reduced reliance on contextual cues for gender prediction.

Abstract

Most machine learning methods are known to capture and exploit biases of the training data. While some biases are beneficial for learning, others are harmful. Specifically, image captioning models tend to exaggerate biases present in training data. This can lead to incorrect captions in domains where unbiased captions are desired, or required, due to over reliance on the learned prior and image context. We investigate generation of gender specific caption words (e.g. man, woman) based on the person's appearance or the image context. We introduce a new Equalizer model that ensures equal gender probability when gender evidence is occluded in a scene and confident predictions when gender evidence is present. The resulting model is forced to look at a person rather than use contextual cues to make a gender specific prediction. The losses that comprise our model, the Appearance Confusion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Video Analysis and Summarization · Human Pose and Action Recognition