Assessing GPTZero's Accuracy in Identifying AI vs. Human-Written Essays

Selin Dik; Osman Erdem; Mehmet Dik

arXiv:2506.23517·cs.AI·July 1, 2025

Assessing GPTZero's Accuracy in Identifying AI vs. Human-Written Essays

Selin Dik, Osman Erdem, Mehmet Dik

PDF

Open Access

TL;DR

This study evaluates GPTZero's effectiveness in distinguishing AI-generated essays from human-written ones across different lengths, finding high accuracy for AI texts but limited reliability for human texts, cautioning educators on sole reliance.

Contribution

It provides an empirical assessment of GPTZero's detection accuracy across essay lengths, highlighting its strengths and limitations in real-world educational settings.

Findings

01

GPTZero detects AI essays with 91-100% accuracy

02

False positives occur with some human essays

03

Detection reliability varies with essay length

Abstract

As the use of AI tools by students has become more prevalent, instructors have started using AI detection tools like GPTZero and QuillBot to detect AI written text. However, the reliability of these detectors remains uncertain. In our study, we focused mostly on the success rate of GPTZero, the most-used AI detector, in identifying AI-generated texts based on different lengths of randomly submitted essays: short (40-100 word count), medium (100-350 word count), and long (350-800 word count). We gathered a data set consisting of twenty-eight AI-generated papers and fifty human-written papers. With this randomized essay data, papers were individually plugged into GPTZero and measured for percentage of AI generation and confidence. A vast majority of the AI-generated papers were detected accurately (ranging from 91-100% AI believed generation), while the human generated essays fluctuated;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Academic integrity and plagiarism · Intelligent Tutoring Systems and Adaptive Learning