Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction

Xiaowei Zhu; Yubing Ren; Yanan Cao; Xixun Lin; Fang Fang; Yangxi Li

arXiv:2505.05084·cs.CL·May 15, 2025

Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction

Xiaowei Zhu, Yubing Ren, Yanan Cao, Xixun Lin, Fang Fang, Yangxi Li

PDF

Open Access 1 Video

TL;DR

This paper introduces a zero-shot detection framework using multiscaled conformal prediction to reliably bound false positives in machine-generated text detection, improving accuracy and robustness.

Contribution

It proposes a novel MCP framework that constrains false positive rates while enhancing detection performance, along with a new diverse dataset, RealDet.

Findings

01

MCP effectively constrains false positive rates.

02

Detection performance is significantly improved.

03

Robustness against adversarial attacks is increased.

Abstract

The rapid advancement of large language models has raised significant concerns regarding their potential misuse by malicious actors. As a result, developing effective detectors to mitigate these risks has become a critical priority. However, most existing detection methods focus excessively on detection accuracy, often neglecting the societal risks posed by high false positive rates (FPRs). This paper addresses this issue by leveraging Conformal Prediction (CP), which effectively constrains the upper bound of FPRs. While directly applying CP constrains FPRs, it also leads to a significant reduction in detection performance. To overcome this trade-off, this paper proposes a Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction (MCP), which both enforces the FPR constraint and improves detection performance. This paper also introduces RealDet, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction· underline

Taxonomy

TopicsTopic Modeling · Handwritten Text Recognition Techniques

MethodsFocus