Computer Vision and Conflicting Values: Describing People with Automated   Alt Text

Margot Hanley; Solon Barocas; Karen Levy; Shiri Azenkot; Helen; Nissenbaum

arXiv:2105.12754·cs.CY·May 28, 2021

Computer Vision and Conflicting Values: Describing People with Automated Alt Text

Margot Hanley, Solon Barocas, Karen Levy, Shiri Azenkot, Helen, Nissenbaum

PDF

TL;DR

This paper examines the ethical issues of using computer vision for automated alt text generation, comparing corporate policies and museum practices, and highlights the complex normative dilemmas involved.

Contribution

It provides an analytic framework contrasting corporate and museum approaches to alt text, revealing the ethical tensions in automated image description.

Findings

01

Facebook's policies on identity in alt text are cautious and selective.

02

Museum practices favor manual, context-aware descriptions of cultural artifacts.

03

Automated alt text raises complex ethical and normative dilemmas.

Abstract

Scholars have recently drawn attention to a range of controversial issues posed by the use of computer vision for automatically generating descriptions of people in images. Despite these concerns, automated image description has become an important tool to ensure equitable access to information for blind and low vision people. In this paper, we investigate the ethical dilemmas faced by companies that have adopted the use of computer vision for producing alt text: textual descriptions of images for blind and low vision people, We use Facebook's automatic alt text tool as our primary case study. First, we analyze the policies that Facebook has adopted with respect to identity categories, such as race, gender, age, etc., and the company's decisions about whether to present these terms in alt text. We then describe an alternative -- and manual -- approach practiced in the museum community,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.