Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection
Guangsheng Bao, Yanbin Zhao, Juncai He, Yue Zhang

TL;DR
Glimpse introduces a method to enable white-box detection techniques to utilize proprietary LLMs for zero-shot detection of generated text by estimating full probability distributions from limited API access.
Contribution
The paper presents Glimpse, a novel approach that allows white-box detection methods to leverage proprietary models despite limited API information.
Findings
Glimpse achieves about 0.95 AUROC with GPT-3.5 on recent models.
It improves detection performance by 51% over open-source baselines.
Proprietary LLMs can effectively detect their own generated outputs.
Abstract
Advanced large language models (LLMs) can generate text almost indistinguishable from human-written text, highlighting the importance of LLM-generated text detection. However, current zero-shot techniques face challenges as white-box methods are restricted to use weaker open-source LLMs, and black-box methods are limited by partial observation from stronger proprietary LLMs. It seems impossible to enable white-box methods to use proprietary models because API-level access to the models neither provides full predictive distributions nor inner embeddings. To traverse the divide, we propose **Glimpse**, a probability distribution estimation approach, predicting the full distributions from partial observations. Despite the simplicity of Glimpse, we successfully extend white-box methods like Entropy, Rank, Log-Rank, and Fast-DetectGPT to latest proprietary models. Experiments show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Dense Connections · Byte Pair Encoding · Multi-Head Attention · Cosine Annealing · Residual Connection
