GPT detectors are biased against non-native English writers

Weixin Liang; Mert Yuksekgonul; Yining Mao; Eric Wu; James Zou

arXiv:2304.02819·cs.CL·July 13, 2023·37 cites

GPT detectors are biased against non-native English writers

Weixin Liang, Mert Yuksekgonul, Yining Mao, Eric Wu, James Zou

PDF

Open Access 2 Repos 1 Datasets

TL;DR

This study reveals that GPT detectors are biased against non-native English writers, often misclassifying their work as AI-generated, and shows that simple prompts can bypass these detectors, raising ethical concerns.

Contribution

The paper provides empirical evidence of bias in GPT detectors against non-native speakers and demonstrates how prompting strategies can circumvent these detectors.

Findings

01

Detectors misclassify non-native English writing as AI-generated.

02

Native English writing is correctly identified by detectors.

03

Prompting strategies can bypass GPT detectors effectively.

Abstract

The rapid adoption of generative language models has brought about substantial advancements in digital communication, while simultaneously raising concerns regarding the potential misuse of AI-generated content. Although numerous detection methods have been proposed to differentiate between AI and human-generated content, the fairness and robustness of these detectors remain underexplored. In this study, we evaluate the performance of several widely-used GPT detectors using writing samples from native and non-native English writers. Our findings reveal that these detectors consistently misclassify non-native English writing samples as AI-generated, whereas native writing samples are accurately identified. Furthermore, we demonstrate that simple prompting strategies can not only mitigate this bias but also effectively bypass GPT detectors, suggesting that GPT detectors may…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

WxWx/ChatGPT-Detector-Bias
dataset· 10 dl
10 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Ethics and Social Impacts of AI

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Dense Connections · Attention Dropout · Weight Decay · Adam · Softmax · Linear Layer