Neural Authorship Attribution: Stylometric Analysis on Large Language   Models

Tharindu Kumarage; Huan Liu

arXiv:2308.07305·cs.CL·August 15, 2023·2 cites

Neural Authorship Attribution: Stylometric Analysis on Large Language Models

Tharindu Kumarage, Huan Liu

PDF

Open Access 1 Repo

TL;DR

This paper investigates neural authorship attribution for AI-generated texts from large language models, analyzing stylometric features to distinguish between proprietary and open-source models, and providing empirical insights for forensic applications.

Contribution

It introduces a comprehensive empirical analysis of LLM writing signatures using stylometric features, enhancing understanding of neural authorship attribution for both proprietary and open-source models.

Findings

01

Stylometric features can effectively differentiate LLMs

02

Distinct signatures exist between proprietary and open-source models

03

Results support improved forensic detection of AI-generated text

Abstract

Large language models (LLMs) such as GPT-4, PaLM, and Llama have significantly propelled the generation of AI-crafted text. With rising concerns about their potential misuse, there is a pressing need for AI-generated-text forensics. Neural authorship attribution is a forensic effort, seeking to trace AI-generated text back to its originating LLM. The LLM landscape can be divided into two primary categories: proprietary and open-source. In this work, we delve into these emerging categories of LLMs, focusing on the nuances of neural authorship attribution. To enrich our understanding, we carry out an empirical analysis of LLM writing signatures, highlighting the contrasts between proprietary and open-source models, and scrutinizing variations within each group. By integrating stylometric features across lexical, syntactic, and structural aspects of language, we explore their potential to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TSKumarage/Neural-Authorship-Attribution
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Artificial Intelligence in Healthcare and Education · Hate Speech and Cyberbullying Detection

MethodsMulti-Head Attention · Attention Is All You Need · Adam · Softmax · Label Smoothing · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Layer Normalization · Linear Layer · Residual Connection