Assaying on the Robustness of Zero-Shot Machine-Generated Text Detectors

Yi-Fan Zhang; Zhang Zhang; Liang Wang; Tieniu Tan; Rong; Jin

arXiv:2312.12918·cs.CL·December 22, 2023·2 cites

Assaying on the Robustness of Zero-Shot Machine-Generated Text Detectors

Yi-Fan Zhang, Zhang Zhang, Liang Wang, Tieniu Tan, Rong, Jin

PDF

Open Access 1 Repo

TL;DR

This paper investigates the robustness of zero-shot machine-generated text detectors across different topics and LLMs, highlighting their limitations and the impact of topic shifts on detection performance.

Contribution

It provides empirical insights into how topic variability affects zero-shot detectors and explores the robustness of various advanced LLMs in this context.

Findings

01

Detection performance correlates with topics.

02

Topic shifts significantly impact detection accuracy.

03

Robustness varies across different LLMs.

Abstract

To combat the potential misuse of Natural Language Generation (NLG) technology, a variety of algorithms have been developed for the detection of AI-generated texts. Traditionally, this task is treated as a binary classification problem. Although supervised learning has demonstrated promising results, acquiring labeled data for detection purposes poses real-world challenges and the risk of overfitting. In an effort to address these issues, we delve into the realm of zero-shot machine-generated text detection. Existing zero-shot detectors, typically designed for specific tasks or topics, often assume uniform testing scenarios, limiting their practicality. In our research, we explore various advanced Large Language Models (LLMs) and their specialized variants, contributing to this field in several ways. In empirical studies, we uncover a significant correlation between topics and detection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yfzhang114/robustness-detection
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research