Different Time, Different Language: Revisiting the Bias Against Non-Native Speakers in GPT Detectors

Adnan Al Ali; Jind\v{r}ich Helcl; Jind\v{r}ich Libovick\'y

arXiv:2602.05769·cs.CL·February 6, 2026

Different Time, Different Language: Revisiting the Bias Against Non-Native Speakers in GPT Detectors

Adnan Al Ali, Jind\v{r}ich Helcl, Jind\v{r}ich Libovick\'y

PDF

Open Access 1 Video

TL;DR

This paper investigates whether GPT detectors are biased against non-native Czech speakers and finds that modern detectors do not rely on perplexity and are not systematically biased.

Contribution

The study revisits the bias against non-native speakers in GPT detectors, showing that in Czech, detectors operate effectively without relying on perplexity and lack systematic bias.

Findings

01

Perplexity of non-native Czech texts is not lower than native texts.

02

Detectors from three families show no systematic bias against non-native speakers.

03

Contemporary detectors do not rely on perplexity for detection.

Abstract

LLM-based assistants have been widely popularised after the release of ChatGPT. Concerns have been raised about their misuse in academia, given the difficulty of distinguishing between human-written and generated text. To combat this, automated techniques have been developed and shown to be effective, to some extent. However, prior work suggests that these methods often falsely flag essays from non-native speakers as generated, due to their low perplexity extracted from an LLM, which is supposedly a key feature of the detectors. We revisit these statements two years later, specifically in the Czech language setting. We show that the perplexity of texts from non-native speakers of Czech is not lower than that of native speakers. We further examine detectors from three separate families and find no systematic bias against non-native speakers. Finally, we demonstrate that contemporary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Different Time, Different Language: Revisiting the Bias Against Non-Native Speakers in GPT Detectors· underline

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Hate Speech and Cyberbullying Detection · Misinformation and Its Impacts