DetectGPT-SC: Improving Detection of Text Generated by Large Language   Models through Self-Consistency with Masked Predictions

Rongsheng Wang; Qi Li; Sihong Xie

arXiv:2310.14479·cs.CL·October 24, 2023·1 cites

DetectGPT-SC: Improving Detection of Text Generated by Large Language Models through Self-Consistency with Masked Predictions

Rongsheng Wang, Qi Li, Sihong Xie

PDF

Open Access

TL;DR

DetectGPT-SC introduces a novel method leveraging large language models' self-consistency in masked text predictions to improve detection of AI-generated texts, outperforming existing detectors across various tasks.

Contribution

The paper proposes a new detection approach based on self-consistency with masked predictions, exploiting LLMs' reasoning ability to distinguish AI-generated texts from human-written ones.

Findings

01

DetectGPT-SC outperforms current state-of-the-art detectors.

02

Self-consistency with masked predictions effectively identifies AI-generated texts.

03

The method works across different mask schemes and prompts.

Abstract

General large language models (LLMs) such as ChatGPT have shown remarkable success, but it has also raised concerns among people about the misuse of AI-generated texts. Therefore, an important question is how to detect whether the texts are generated by ChatGPT or by humans. Existing detectors are built on the assumption that there is a distribution gap between human-generated and AI-generated texts. These gaps are typically identified using statistical information or classifiers. In contrast to prior research methods, we find that large language models such as ChatGPT exhibit strong self-consistency in text generation and continuation. Self-consistency capitalizes on the intuition that AI-generated texts can still be reasoned with by large language models using the same logical reasoning when portions of the texts are masked, which differs from human-generated texts. Using this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification