Impacts of Racial Bias in Historical Training Data for News AI
Rahul Bhargava, Malene Hornstrup Jespersen, Emily Boardman Ndulue, Vivica Dsouza

TL;DR
This paper examines racial biases embedded in AI models trained on historical news data, revealing how such biases can influence model outputs and impact journalism practices.
Contribution
It provides a detailed analysis of racial bias in a news corpus-trained AI, highlighting the limitations of current explainable AI methods in detecting modern bias.
Findings
The 'blacks' label acts as a partial racism detector.
The classifier performs poorly on modern anti-Asian and Black Lives Matter stories.
Biases in training data can lead to unintended model outputs.
Abstract
AI technologies have rapidly moved into business and research applications that involve large text corpora, including computational journalism research and newsroom settings. These models, trained on extant data from various sources, can be conceptualized as historical artifacts that encode decades-old attitudes and stereotypes. This paper investigates one such example trained on the broadly-used New York Times Annotated Corpus to create a multi-label classifier. Our use in research settings surfaced the concerning "blacks" thematic topic label. Through quantitative and qualitative means we investigate this label's use in the training corpus, what concepts it might be encoding in the trained classifier, and how those concepts impact our model use. Via the application of explainable AI methods, we find that the "blacks" label operates partially as a general "racism detector" across some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Media Influence and Politics · Digital Humanities and Scholarship
