Harms of Gender Exclusivity and Challenges in Non-Binary Representation in Language Technologies
Sunipa Dev, Masoud Monajatipoor, Anaelia Ovalle, Arjun, Subramonian, Jeff M Phillips, Kai-Wei Chang

TL;DR
This paper explores how treating gender as binary in language technologies perpetuates harms against non-binary individuals, highlighting biases in models and datasets, and emphasizing the need for inclusive representations.
Contribution
It provides a detailed analysis of gender complexity, surveys non-binary individuals on harms, and examines how current language models encode and perpetuate gender biases.
Findings
Current models reinforce binary gender stereotypes
Non-binary harms are perpetuated by dataset biases
Addressing gender complexity can improve fairness in language tech
Abstract
Gender is widely discussed in the context of language tasks and when examining the stereotypes propagated by language models. However, current discussions primarily treat gender as binary, which can perpetuate harms such as the cyclical erasure of non-binary gender identities. These harms are driven by model and dataset biases, which are consequences of the non-recognition and lack of understanding of non-binary genders in society. In this paper, we explain the complexity of gender and language around it, and survey non-binary persons to understand harms associated with the treatment of gender as binary in English language technologies. We also detail how current language representations (e.g., GloVe, BERT) capture and perpetuate these harms and related challenges that need to be acknowledged and addressed for representations to equitably encode gender information.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Hate Speech and Cyberbullying Detection · Text Readability and Simplification
MethodsGloVe Embeddings
