Dehumanizing Machines: Mitigating Anthropomorphic Behaviors in Text Generation Systems
Myra Cheng, Su Lin Blodgett, Alicia DeVrio, Lisa Egede, Alexandra Olteanu

TL;DR
This paper investigates how to reduce anthropomorphic behaviors in text generation systems to prevent harmful outcomes, providing empirical data and a conceptual framework for effective interventions.
Contribution
It offers an empirical inventory of interventions and a theoretical framework to characterize and evaluate methods for mitigating anthropomorphism in AI outputs.
Findings
Compiled an intervention inventory from literature and crowdsourcing.
Developed a conceptual framework for intervention types.
Provides a basis for evaluating intervention effectiveness.
Abstract
As text generation systems' outputs are increasingly anthropomorphic -- perceived as human-like -- scholars have also increasingly raised concerns about how such outputs can lead to harmful outcomes, such as users over-relying or developing emotional dependence on these systems. How to intervene on such system outputs to mitigate anthropomorphic behaviors and their attendant harmful outcomes, however, remains understudied. With this work, we aim to provide empirical and theoretical grounding for developing such interventions. To do so, we compile an inventory of interventions grounded both in prior literature and a crowdsourcing study where participants edited system outputs to make them less human-like. Drawing on this inventory, we also develop a conceptual framework to help characterize the landscape of possible interventions, articulate distinctions between different types of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Educational Games and Gamification · AI in Service Interactions
