Decoding AI and Human Authorship: Nuances Revealed Through NLP and   Statistical Analysis

Mayowa Akinwande; Oluwaseyi Adeliyi; Toyyibat Yussuph

arXiv:2408.00769·cs.DL·August 5, 2024

Decoding AI and Human Authorship: Nuances Revealed Through NLP and Statistical Analysis

Mayowa Akinwande, Oluwaseyi Adeliyi, Toyyibat Yussuph

PDF

TL;DR

This study analyzes linguistic differences between AI and human texts using statistical methods on a large dataset, revealing key distinctions in vocabulary, creativity, and novelty to better understand AI's language capabilities.

Contribution

It provides a detailed statistical comparison of AI and human writing, highlighting new insights into linguistic traits, creativity, and biases in AI-generated texts.

Findings

01

Humans have higher vocabulary diversity than AI.

02

AI-generated texts show slightly higher novelty.

03

Humans produce longer but shorter words on average.

Abstract

This research explores the nuanced differences in texts produced by AI and those written by humans, aiming to elucidate how language is expressed differently by AI and humans. Through comprehensive statistical data analysis, the study investigates various linguistic traits, patterns of creativity, and potential biases inherent in human-written and AI- generated texts. The significance of this research lies in its contribution to understanding AI's creative capabilities and its impact on literature, communication, and societal frameworks. By examining a meticulously curated dataset comprising 500K essays spanning diverse topics and genres, generated by LLMs, or written by humans, the study uncovers the deeper layers of linguistic expression and provides insights into the cognitive processes underlying both AI and human-driven textual compositions. The analysis revealed that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.