Robust Detection of LLM-Generated Text: A Comparative Analysis
Yongye Su, Yuqing Wu

TL;DR
This paper compares various methods for detecting LLM-generated text, emphasizing model generalization, adversarial robustness, and evaluation accuracy to address misuse and misinformation.
Contribution
It provides a comprehensive comparison of traditional ML, transformer-based, and LLM-based detection methods for identifying AI-generated text.
Findings
Transformer-based models outperform traditional ML in detection accuracy.
Adversarial attacks significantly reduce detection effectiveness.
Model generalization remains a key challenge for robust detection.
Abstract
The ability of large language models to generate complex texts allows them to be widely integrated into many aspects of life, and their output can quickly fill all network resources. As the impact of LLMs grows, it becomes increasingly important to develop powerful detectors for the generated text. This detector is essential to prevent the potential misuse of these technologies and to protect areas such as social media from the negative effects of false content generated by LLMS. The main goal of LLM-generated text detection is to determine whether text is generated by an LLM, which is a basic binary classification task. In our work, we mainly use three different classification methods based on open source datasets: traditional machine learning techniques such as logistic regression, k-means clustering, Gaussian Naive Bayes, support vector machines, and methods based on converters such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Linear Warmup With Linear Decay · WordPiece · Dense Connections · Layer Normalization · Adam · Attention Dropout
