Interpretable Stylistic Variation in Human and LLM Writing Across Genres, Models, and Decoding Strategies
Swati Rallapalli, Shannon Gallagher, Ronald Yurko, Tyler Brooks, Chuck Loughin, Michele Sezgin, Violet Turri

TL;DR
This study analyzes stylistic differences between human and machine-generated text across genres and models, revealing that genre and model choice significantly influence style more than decoding strategies.
Contribution
It provides a large-scale analysis of stylistic variation in LLM outputs, highlighting the robustness of linguistic features and the influence of genre and model over decoding strategies.
Findings
Genre has a stronger influence on style than the source.
Chat variants of models cluster together stylistically.
Model choice impacts style more than decoding strategies.
Abstract
Large Language Models (LLMs) are now capable of generating highly fluent, human-like text. They enable many applications, but also raise concerns such as large scale spam, phishing, or academic misuse. While much work has focused on detecting LLM-generated text, only limited work has gone into understanding the stylistic differences between human-written and machine-generated text. In this work, we perform a large scale analysis of stylistic variation across human-written text and outputs from 11 LLMs spanning 8 different genres and 4 decoding strategies using Douglas Biber's set of lexicogrammatical and functional features. Our findings reveal insights that can guide intentional LLM usage. First, key linguistic differentiators of LLM-generated text seem robust to generation conditions (e.g., prompt settings to nudge them to generate human-like text, or availability of human-written…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
