Robust Detection of LLM-Generated Text: A Comparative Analysis

Yongye Su; Yuqing Wu

arXiv:2411.06248·cs.CL·November 12, 2024

Robust Detection of LLM-Generated Text: A Comparative Analysis

Yongye Su, Yuqing Wu

PDF

Open Access

TL;DR

This paper compares various methods for detecting LLM-generated text, emphasizing model generalization, adversarial robustness, and evaluation accuracy to address misuse and misinformation.

Contribution

It provides a comprehensive comparison of traditional ML, transformer-based, and LLM-based detection methods for identifying AI-generated text.

Findings

01

Transformer-based models outperform traditional ML in detection accuracy.

02

Adversarial attacks significantly reduce detection effectiveness.

03

Model generalization remains a key challenge for robust detection.

Abstract

The ability of large language models to generate complex texts allows them to be widely integrated into many aspects of life, and their output can quickly fill all network resources. As the impact of LLMs grows, it becomes increasingly important to develop powerful detectors for the generated text. This detector is essential to prevent the potential misuse of these technologies and to protect areas such as social media from the negative effects of false content generated by LLMS. The main goal of LLM-generated text detection is to determine whether text is generated by an LLM, which is a basic binary classification task. In our work, we mainly use three different classification methods based on open source datasets: traditional machine learning techniques such as logistic regression, k-means clustering, Gaussian Naive Bayes, support vector machines, and methods based on converters such…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Linear Warmup With Linear Decay · WordPiece · Dense Connections · Layer Normalization · Adam · Attention Dropout