Who Wrote This Line? Evaluating the Detection of LLM-Generated Classical Chinese Poetry

Jiang Li; Tian Lan; Shanshan Wang; Dongxing Zhang; Dianqing Lin; Guanglai Gao; Derek F. Wong; Xiangdong Su

arXiv:2604.10101·cs.CL·April 17, 2026

Who Wrote This Line? Evaluating the Detection of LLM-Generated Classical Chinese Poetry

Jiang Li, Tian Lan, Shanshan Wang, Dongxing Zhang, Dianqing Lin, Guanglai Gao, Derek F. Wong, Xiangdong Su

PDF

1 Repo

TL;DR

This paper introduces ChangAn, a large benchmark dataset for detecting AI-generated classical Chinese poetry, revealing current detectors' limitations and emphasizing the need for specialized evaluation tools.

Contribution

The paper presents ChangAn, a comprehensive benchmark dataset for LLM-generated classical Chinese poetry detection, and systematically evaluates existing detectors' performance on this challenging task.

Findings

01

Current detectors are unreliable for classical Chinese poetry detection.

02

The dataset contains over 30,000 poems, including human and AI-generated works.

03

Evaluation shows significant performance gaps in existing detection methods.

Abstract

The rapid development of large language models (LLMs) has extended text generation tasks into the literary domain. However, AI-generated literary creations has raised increasingly prominent issues of creative authenticity and ethics in literary world, making the detection of LLM-generated literary texts essential and urgent. While previous works have made significant progress in detecting AI-generated text, it has yet to address classical Chinese poetry. Due to the unique linguistic features of classical Chinese poetry, such as strict metrical regularity, a shared system of poetic imagery, and flexible syntax, distinguishing whether a poem is authored by AI presents a substantial challenge. To address these issues, we introduce ChangAn, a benchmark for detecting LLM-generated classical Chinese poetry that containing total 30,664 poems, 10,276 are human-written poems and 20,388 poems are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

VelikayaScarlet/ChangAn
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.