LM$^2$otifs : An Explainable Framework for Machine-Generated Texts Detection
Xu Zheng, Zhuomin Chen, Esteban Schafir, Sipeng Chen, Hojat Allah Salehi, Haifeng Chen, Farhad Shirani, Wei Cheng, Dongsheng Luo

TL;DR
LM$^2$otifs introduces an explainable graph neural network framework that effectively detects machine-generated texts while providing interpretable linguistic motifs, addressing the explainability gap in existing detection methods.
Contribution
It proposes a novel GNN-based framework with explainability for MGT detection, combining graph representations of text with motif extraction for interpretability.
Findings
Comparable detection performance to existing methods
Effective extraction of linguistically meaningful motifs
Distinct linguistic fingerprints identified in machine-generated texts
Abstract
The impressive ability of large language models to generate natural text across various tasks has led to critical challenges in authorship authentication. Although numerous detection methods have been developed to differentiate between machine-generated texts (MGT) and human-generated texts (HGT), the explainability of these methods remains a significant gap. Traditional explainability techniques often fall short in capturing the complex word relationships that distinguish HGT from MGT. To address this limitation, we present LMotifs, a novel explainable framework for MGT detection. Inspired by probabilistic graphical models, we provide a theoretical rationale for the effectiveness. LMotifs utilizes eXplainable Graph Neural Networks to achieve both accurate detection and interpretability. The LMotifs pipeline operates in three key stages: first, it transforms text into graphs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Topic Modeling · Advanced Graph Neural Networks
