Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang, Yi Zhang, Geetanjali Bihani, Julia Rayz

TL;DR
This study examines how large language models exhibit gender stereotypes in occupation-related decision making, revealing biases similar to humans and highlighting limitations of current debiasing methods.
Contribution
It introduces a framework to quantify gender stereotypes in LLMs' behavior using multi-round question answering and evaluates three prominent models.
Findings
All tested models show gender stereotypes similar to human biases.
GPT-3.5-turbo and Llama2-70b-chat exhibit distinct bias preferences.
Current alignment methods may be insufficient and could introduce new biases.
Abstract
With the impressive performance in various downstream tasks, large language models (LLMs) have been widely integrated into production pipelines, like recruitment and recommendation systems. A known issue of models trained on natural language data is the presence of human biases, which can impact the fairness of the system. This paper investigates LLMs' behavior with respect to gender stereotypes, in the context of occupation decision making. Our framework is designed to investigate and quantify the presence of gender stereotypes in LLMs' behavior via multi-round question answering. Inspired by prior works, we construct a dataset by leveraging a standard occupation classification knowledge base released by authoritative agencies. We tested three LLMs (RoBERTa-large, GPT-3.5-turbo, and Llama2-70b-chat) and found that all models exhibit gender stereotypes analogous to human biases, but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultilingual Education and Policy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Cosine Annealing · Dropout · Linear Warmup With Cosine Annealing · Residual Connection · Byte Pair Encoding · Adam · Softmax · Attention Is All You Need
