Computer Vision and Conflicting Values: Describing People with Automated Alt Text
Margot Hanley, Solon Barocas, Karen Levy, Shiri Azenkot, Helen, Nissenbaum

TL;DR
This paper examines the ethical issues of using computer vision for automated alt text generation, comparing corporate policies and museum practices, and highlights the complex normative dilemmas involved.
Contribution
It provides an analytic framework contrasting corporate and museum approaches to alt text, revealing the ethical tensions in automated image description.
Findings
Facebook's policies on identity in alt text are cautious and selective.
Museum practices favor manual, context-aware descriptions of cultural artifacts.
Automated alt text raises complex ethical and normative dilemmas.
Abstract
Scholars have recently drawn attention to a range of controversial issues posed by the use of computer vision for automatically generating descriptions of people in images. Despite these concerns, automated image description has become an important tool to ensure equitable access to information for blind and low vision people. In this paper, we investigate the ethical dilemmas faced by companies that have adopted the use of computer vision for producing alt text: textual descriptions of images for blind and low vision people, We use Facebook's automatic alt text tool as our primary case study. First, we analyze the policies that Facebook has adopted with respect to identity categories, such as race, gender, age, etc., and the company's decisions about whether to present these terms in alt text. We then describe an alternative -- and manual -- approach practiced in the museum community,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
