Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

Hongyi Zhou; Jin Zhu; Kai Ye; Ying Yang; Erhan Xu; Chengchun Shi

arXiv:2601.21895·cs.CL·March 3, 2026

Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

Hongyi Zhou, Jin Zhu, Kai Ye, Ying Yang, Erhan Xu, Chengchun Shi

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a novel adaptive distance learning algorithm for detecting LLM-generated text, demonstrating superior performance over existing methods across various models and settings.

Contribution

It presents a geometric analysis of rewrite-based detection and develops an adaptive distance learning approach that improves detection accuracy.

Findings

01

Achieves up to 75.4% relative improvement over baselines

02

Demonstrates generalization across multiple LLMs like GPT, Claude, and Gemini

03

Provides a publicly available Python implementation

Abstract

Modern large language models (LLMs) such as GPT, Claude, and Gemini have transformed the way we learn, work, and communicate. Yet, their ability to produce highly human-like text raises serious concerns about misinformation and academic integrity, making it an urgent need for reliable algorithms to detect LLM-generated content. In this paper, we start by presenting a geometric approach to demystify rewrite-based detection algorithms, revealing their underlying rationale and demonstrating their generalization ability. Building on this insight, we introduce a novel rewrite-based detection algorithm that adaptively learns the distance between the original and rewritten text. Theoretically, we demonstrate that employing an adaptively learned distance function is more effective for detection than using a fixed distance. Empirically, we conduct extensive experiments with over 100 settings,…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 4

Strengths

1. The paper presents a well-structured categorization of existing methods for detecting human-written and LLM-generated text, help building a clear roadmap and contextualizing its own research direction. 2. It introduces a novel perspective centered on maximizing the discrepancy between human-written and machine-generated text, which is supported by well-motivated theoretical assumptions. 3. The method's effectiveness is validated through extensive experiments, which include comparisons again

Weaknesses

1. The paper mentions that "adaptively learn a distance function" to enhance detection performance, but the implementation of this adaptive learning process is not clearly described. 2. The pre-trained LLMs used in the experiments are not explicitly identified, making it difficult to assess the experimental setup with confidence. Given the importance of model selection in evaluating the contribution of the work, such implementation details should be unambiguously stated. 3. There is an appare

Reviewer 02Rating 8Confidence 2

Strengths

- Introduces a clear geometric interpretation of rewrite-based detectors, offering theoretical insight that previous empirical works lacked. - The learned-distance formulation bridges theory and implementation elegantly, and the optimization is compatible with LoRA-style fine-tuning for scalability. - Evaluated on > 100 settings (24 datasets, 7 LLMs, 3 prompt types, 2 attacks) with consistent superiority over 11 baselines. - Addresses the realistic "unseen-prompt" condition that undermines most

Weaknesses

- The Hilbert-space and projection assumptions (Assumptions 1–3) are strong, but empirical verification of these geometric hypotheses is limited. - The authors mention small declines on certain datasets but do not analyze why the learned distance struggles there. - Because the detector is trained using LLM-generated corpora, it may implicitly learn stylistic or semantic regularities specific to those generation distributions. Discussing how well the method generalizes beyond the seven tested gen

Reviewer 03Rating 4Confidence 3

Strengths

1. The paper is well organized and easy to follow 2. Important and timely topic 3. prompt robustness is very important for these trained llm-text classifiers.

Weaknesses

1. My major concern lies in the geometric assumptions underpinning the theory, which are elegant but often unrealistic in practice. The framework assumes that LLM-generated text is a linear projection of human-written text onto an “LLM subspace” (Assumption 2) and that rewriting behaves equivalently on human and LLM-like inputs (Assumption 3). However, real-world text generation is highly nonlinear and context-dependent, and rewriting can amplify stylistic or semantic differences depending on pr

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Misinformation and Its Impacts · Handwritten Text Recognition Techniques