I Know Which LLM Wrote Your Code Last Summer: LLM generated Code Stylometry for Authorship Attribution

Tamas Bisztray, Bilel Cherif, Richard A. Dubniczky, Nils Gruschka, Bertalan Borsos, Mohamed Amine Ferrag, Attila Kovacs, Vasileios Mavroeidis, and Norbert Tihanyi

arXiv:2506.17323 · cs.LG · June 24, 2025

TL;DR

This paper introduces a new model and benchmark for identifying which large language model generated a given C program, achieving high accuracy in authorship attribution among multiple LLMs.

Contribution

We present CodeT5-Authorship, a novel encoder-only transformer model, and LLM-AuthorBench, a comprehensive benchmark for LLM authorship attribution in C code.

Findings

01. Achieved 97.56% accuracy in binary classification between two closely related LLMs.

02. Achieved 95.40% accuracy in multi-class attribution among five LLMs.

03. Outperformed traditional ML classifiers and other transformer models.

Abstract

Detecting AI-generated code, deepfakes, and other synthetic content is an emerging research challenge. As code generated by Large Language Models (LLMs) becomes more common, identifying the specific model behind each sample is increasingly important. This paper presents the first systematic study of LLM authorship attribution for C programs. We released CodeT5-Authorship, a novel model that uses only the encoder layers from the original CodeT5 encoder-decoder architecture, discarding the decoder to focus on classification. Our model's encoder output (first token) is passed through a two-layer classification head with GELU activation and dropout, producing a probability distribution over possible authors. To evaluate our approach, we introduce LLM-AuthorBench, a benchmark of 32,000 compilable C programs generated by eight state-of-the-art LLMs across diverse tasks. We compare our model…
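The classification head described in the abstract (first-token encoder output, two layers, GELU, dropout, probability distribution over authors) can be sketched roughly as follows. This is a minimal standard-library illustration, not the authors' implementation: the layer sizes, the random weights, the placement of dropout after the GELU, and the stand-in `first_token` vector are all assumptions made for the example, and the names `classify` and `init_layer` are hypothetical.

```python
import math
import random

def gelu(x):
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))

def linear(vec, weights, bias):
    """Dense layer; `weights` holds one row per output unit."""
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, bias)]

def dropout(vec, p, training, rng):
    """Inverted dropout: active only in training, identity at inference."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    if not training or p == 0.0:
        return list(vec)
    return [0.0 if rng.random() < p else v / (1.0 - p) for v in vec]

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def init_layer(n_out, n_in, rng):
    """Random weights and zero bias (illustrative initialisation only)."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out

def classify(first_token, params, p_drop=0.1, training=False, rng=None):
    """Two-layer head: linear -> GELU -> dropout -> linear -> softmax."""
    rng = rng or random.Random(0)
    (w1, b1), (w2, b2) = params
    hidden = [gelu(h) for h in linear(first_token, w1, b1)]
    hidden = dropout(hidden, p_drop, training, rng)  # assumed placement
    return softmax(linear(hidden, w2, b2))

# Assumed toy sizes; the real encoder width is much larger.
HIDDEN, INNER, NUM_AUTHORS = 16, 8, 5
rng = random.Random(42)
params = (init_layer(INNER, HIDDEN, rng), init_layer(NUM_AUTHORS, INNER, rng))

# Stand-in for the encoder's first-token output on one C program.
first_token = [rng.gauss(0.0, 1.0) for _ in range(HIDDEN)]

probs = classify(first_token, params)
predicted_author = max(range(NUM_AUTHORS), key=probs.__getitem__)
print(probs, predicted_author)
```

In a real setup the `first_token` vector would come from the CodeT5 encoder and the weights would be learned by fine-tuning; the sketch only shows how a vector of that kind is turned into a distribution over candidate authors.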

Taxonomy

Methods: Gated Linear Unit · Absolute Position Encodings · Byte Pair Encoding · Label Smoothing · Transformer · Attention Dropout · Dropout · Softmax · Dense Connections