Leveraging Explainable AI for LLM Text Attribution: Differentiating   Human-Written and Multiple LLMs-Generated Text

Ayat Najjar; Huthaifa I. Ashqar; Omar Darwish; and Eman Hammad

arXiv:2501.03212·cs.CL·January 7, 2025·2 cites

Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text

Ayat Najjar, Huthaifa I. Ashqar, Omar Darwish, and Eman Hammad

PDF

Open Access

TL;DR

This paper presents machine learning and explainable AI techniques to accurately distinguish between human-written text and content generated by various LLMs, aiding in plagiarism detection and content attribution.

Contribution

It introduces a novel ML-based approach with XAI for multi-class attribution of texts to specific LLMs, outperforming existing tools like GPTZero.

Findings

01

High classification accuracy of 98.5% for multi-class attribution

02

XAI reveals key stylistic features for source differentiation

03

Model outperforms GPTZero in recognizing AI-generated texts

Abstract

The development of Generative AI Large Language Models (LLMs) raised the alarm regarding identifying content produced through generative AI or humans. In one case, issues arise when students heavily rely on such tools in a manner that can affect the development of their writing or coding skills. Other issues of plagiarism also apply. This study aims to support efforts to detect and identify textual content generated using LLM tools. We hypothesize that LLMs-generated text is detectable by machine learning (ML), and investigate ML models that can recognize and differentiate texts generated by multiple LLMs tools. We leverage several ML and Deep Learning (DL) algorithms such as Random Forest (RF), and Recurrent Neural Networks (RNN), and utilized Explainable Artificial Intelligence (XAI) to understand the important features in attribution. Our method is divided into 1) binary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Natural Language Processing Techniques

MethodsLLaMA