Bias, Accuracy, and Trust: Gender-Diverse Perspectives on Large Language Models
Aimen Gaba, Emily Wall, Tejas Ramkumar Babu, Yuriy Brun, Kyle Hall, Cindy Xiong Bearfield

TL;DR
This study explores how gender-diverse users perceive bias, accuracy, and trust in ChatGPT, revealing gendered response patterns and emphasizing the importance of inclusive AI development.
Contribution
It provides novel insights into gender-specific perceptions of LLMs and suggests practical improvements for more inclusive and trustworthy AI systems.
Findings
Gendered prompts elicit more identity-specific responses.
Non-binary participants experience more stereotypical portrayals.
Trust varies by gender, with men trusting performance more.
Abstract
Large language models (LLMs) are becoming increasingly ubiquitous in our daily lives, but numerous concerns about bias in LLMs exist. This study examines how gender-diverse populations perceive bias, accuracy, and trustworthiness in LLMs, specifically ChatGPT. Through 25 in-depth interviews with non-binary/transgender, male, and female participants, we investigate how gendered and neutral prompts influence model responses and how users evaluate these responses. Our findings reveal that gendered prompts elicit more identity-specific responses, with non-binary participants particularly susceptible to condescending and stereotypical portrayals. Perceived accuracy was consistent across gender groups, with errors most noted in technical topics and creative tasks. Trustworthiness varied by gender, with men showing higher trust, especially in performance, and non-binary participants…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · AI in Service Interactions · Ethics and Social Impacts of AI
