Auditing Gender Analyzers on Text Data
Siddharth D Jaiswal, Ankit Kumar Verma, Animesh Mukherjee

TL;DR
This paper audits existing gender analyzers and ChatGPT for biases against non-binary individuals, revealing significant inaccuracies and societal biases, and demonstrates that fine-tuned BERT models on inclusive datasets can mitigate these biases effectively.
Contribution
The study provides a comprehensive bias audit of popular gender analyzers and ChatGPT, and introduces a fine-tuned BERT approach that improves accuracy and reduces societal bias.
Findings
Existing tools are ~50% accurate and propagate societal biases.
Fine-tuned BERT achieves ~77% overall accuracy and 90% for non-binary classification.
ChatGPT shows 58% accuracy with zero-shot prompts on bias detection.
Abstract
AI models have become extremely popular and accessible to the general public. However, they are continuously under the scanner due to their demonstrable biases toward various sections of the society like people of color and non-binary people. In this study, we audit three existing gender analyzers -- uClassify, Readable and HackerFactor, for biases against non-binary individuals. These tools are designed to predict only the cisgender binary labels, which leads to discrimination against non-binary members of the society. We curate two datasets -- Reddit comments (660k) and, Tumblr posts (2.05M) and our experimental evaluation shows that the tools are highly inaccurate with the overall accuracy being ~50% on all platforms. Predictions for non-binary comments on all platforms are mostly female, thus propagating the societal bias that non-binary individuals are effeminate. To address this,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Gender Studies in Language
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Weight Decay · Softmax · Linear Warmup With Linear Decay · WordPiece · Adam
