Glimpse: Enabling White-Box Methods to Use Proprietary Models for   Zero-Shot LLM-Generated Text Detection

Guangsheng Bao; Yanbin Zhao; Juncai He; Yue Zhang

arXiv:2412.11506·cs.CL·February 20, 2025

Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection

Guangsheng Bao, Yanbin Zhao, Juncai He, Yue Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

Glimpse introduces a method to enable white-box detection techniques to utilize proprietary LLMs for zero-shot detection of generated text by estimating full probability distributions from limited API access.

Contribution

The paper presents Glimpse, a novel approach that allows white-box detection methods to leverage proprietary models despite limited API information.

Findings

01

Glimpse achieves about 0.95 AUROC with GPT-3.5 on recent models.

02

It improves detection performance by 51% over open-source baselines.

03

Proprietary LLMs can effectively detect their own generated outputs.

Abstract

Advanced large language models (LLMs) can generate text almost indistinguishable from human-written text, highlighting the importance of LLM-generated text detection. However, current zero-shot techniques face challenges as white-box methods are restricted to use weaker open-source LLMs, and black-box methods are limited by partial observation from stronger proprietary LLMs. It seems impossible to enable white-box methods to use proprietary models because API-level access to the models neither provides full predictive distributions nor inner embeddings. To traverse the divide, we propose **Glimpse**, a probability distribution estimation approach, predicting the full distributions from partial observations. Despite the simplicity of Glimpse, we successfully extend white-box methods like Entropy, Rank, Log-Rank, and Fast-DetectGPT to latest proprietary models. Experiments show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

baoguangsheng/glimpse
pytorchOfficial

Videos

Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection· slideslive

Taxonomy

TopicsNatural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Dense Connections · Byte Pair Encoding · Multi-Head Attention · Cosine Annealing · Residual Connection