ArguGPT: evaluating, understanding and identifying argumentative essays generated by GPT models
Yikang Liu, Ziyin Zhang, Wanyang Zhang, Shisen Yue, Xiaojing Zhao,, Xinyuan Cheng, Yiwen Zhang, Hai Hu

TL;DR
This paper introduces ArguGPT, a comprehensive corpus of GPT-generated argumentative essays, analyzes their linguistic features, and develops detection models achieving over 90% accuracy, aiding educators in identifying AI-generated content.
Contribution
It provides the first extensive analysis of GPT-generated argumentative essays and develops effective detection tools using machine learning models.
Findings
Instructors' detection accuracy improves from 61% to 67% after minimal training.
GPT essays have more complex syntax but less lexical complexity than human essays.
RoBERTa-based detectors achieve over 90% accuracy in identifying AI-generated essays.
Abstract
AI generated content (AIGC) presents considerable challenge to educators around the world. Instructors need to be able to detect such text generated by large language models, either with the naked eye or with the help of some tools. There is also growing need to understand the lexical, syntactic and stylistic features of AIGC. To address these challenges in English language teaching, we first present ArguGPT, a balanced corpus of 4,038 argumentative essays generated by 7 GPT models in response to essay prompts from three sources: (1) in-class or homework exercises, (2) TOEFL and (3) GRE writing tasks. Machine-generated texts are paired with roughly equal number of human-written essays with three score levels matched in essay prompts. We then hire English instructors to distinguish machine essays from human ones. Results show that when first exposed to machine-generated essays, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Test · Linear Warmup With Linear Decay · WordPiece · Residual Connection · Cosine Annealing · Softmax · Linear Layer · Byte Pair Encoding
