Exploring the Limitations of Detecting Machine-Generated Text

Jad Doughman; Osama Mohammed Afzal; Hawau Olamide Toyin; Shady; Shehata; Preslav Nakov; Zeerak Talat

arXiv:2406.11073·cs.CL·December 13, 2024

Exploring the Limitations of Detecting Machine-Generated Text

Jad Doughman, Osama Mohammed Afzal, Hawau Olamide Toyin, Shady, Shehata, Preslav Nakov, Zeerak Talat

PDF

Open Access

TL;DR

This paper investigates how stylistic differences and text complexity affect the performance of machine-generated text detectors, revealing significant sensitivity and potential unreliability in current detection methods.

Contribution

It provides an empirical analysis of detection performance across various writing styles and complexities, highlighting the need to consider stylistic factors in future detection systems.

Findings

01

Classifiers are highly sensitive to stylistic changes.

02

Detection performance degrades to random levels with certain styles.

03

Easy-to-read texts are more likely to be misclassified.

Abstract

Recent improvements in the quality of the generations by large language models have spurred research into identifying machine-generated text. Such work often presents high-performing detectors. However, humans and machines can produce text in different styles and domains, yet the performance impact of such on machine generated text detection systems remains unclear. In this paper, we audit the classification performance for detecting machine-generated text by evaluating on texts with varying writing styles. We find that classifiers are highly sensitive to stylistic changes and differences in text complexity, and in some cases degrade entirely to random classifiers. We further find that detection systems are particularly susceptible to misclassify easy-to-read texts while they have high performance for complex texts, leading to concerns about the reliability of detection systems. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Law, AI, and Intellectual Property · Digital and Cyber Forensics