Multi-Level Contextual Token Relation Modeling for Machine-Generated Text Detection

Chenwang Wu; Yiuming Cheung; Bo Han; Shuhai Zhang; Defu Lian

arXiv:2605.16107·cs.CL·May 18, 2026

Multi-Level Contextual Token Relation Modeling for Machine-Generated Text Detection

Chenwang Wu, Yiuming Cheung, Bo Han, Shuhai Zhang, Defu Lian

PDF

TL;DR

This paper introduces a multi-level contextual token relation modeling framework to improve the detection of machine-generated texts by addressing biases in token-level scores through local and global relation modeling.

Contribution

It presents a novel multi-level relation modeling approach that combines local calibration and global rule-support reasoning for more accurate MGT detection.

Findings

01

Significant performance improvements across multiple real-world scenarios.

02

Effective in cross-LLM and cross-domain detection tasks.

03

Low computational overhead compared to existing methods.

Abstract

Machine-generated texts (MGTs) pose risks such as disinformation and phishing, underscoring the need for reliable detection. Metric-based methods, which extract statistically distinguishable features of MGTs, are often more practical than complex model-based methods that are prone to overfitting. Given their diverse designs, we first place representative metric-based methods within a unified framework, enabling a clear assessment of their advantages and limitations. Our analysis identifies a core challenge across these methods: the token-level detection score is easily biased by the inherent randomness of the MGTs generation process. Then, we theoretically derive the multi-hop transitions of the token-level detection score and explore their local and global relations. Based on these findings, we propose a multi-level contextual token relation modeling framework for MGT detection.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.