Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated   Student Essay Detection

Xinlin Peng; Ying Zhou; Ben He; Le Sun; Yingfei Sun

arXiv:2402.00412·cs.CL·February 2, 2024·2 cites

Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection

Xinlin Peng, Ying Zhou, Ben He, Le Sun, Yingfei Sun

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the vulnerability of current AI-generated student essay detectors against adversarial attacks, revealing their ease of circumvention and emphasizing the need for more robust detection methods.

Contribution

It introduces AIG-ASAP, a new dataset for testing essay detectors, and demonstrates that existing detectors are easily bypassed using simple adversarial perturbations.

Findings

01

Existing detectors can be circumvented with simple word and sentence substitutions.

02

Adversarial attacks maintain essay quality while evading detection.

03

Robust detection methods are urgently needed in education.

Abstract

Large language models (LLMs) have exhibited remarkable capabilities in text generation tasks. However, the utilization of these models carries inherent risks, including but not limited to plagiarism, the dissemination of fake news, and issues in educational exercises. Although several detectors have been proposed to address these concerns, their effectiveness against adversarial perturbations, specifically in the context of student essay writing, remains largely unexplored. This paper aims to bridge this gap by constructing AIG-ASAP, an AI-generated student essay dataset, employing a range of text perturbation methods that are expected to generate high-quality essays while evading detection. Through empirical experiments, we assess the performance of current AIGC detectors on the AIG-ASAP dataset. The results reveal that the existing detectors can be easily circumvented using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xinlinpeng/aig-asap
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques · Adversarial Robustness in Machine Learning