Generative AI & Fictionality: How Novels Power Large Language Models
Edwin Roland, Richard Jean So

TL;DR
This paper investigates how novels and fiction influence large language models like BERT, revealing that fiction's unique attributes shape AI outputs and cultural responses, highlighting the importance of training data sources.
Contribution
It provides an analysis of fiction's role in shaping LLMs and introduces a new perspective on how training data influences AI-generated culture.
Findings
Fictional texts contribute unique attributes to LLMs.
Fiction influences social and communicative qualities of AI outputs.
Training data sources significantly impact AI's cultural and social responses.
Abstract
Generative models, like the one in ChatGPT, are powered by their training data. The models are simply next-word predictors, based on patterns learned from vast amounts of pre-existing text. Since the first generation of GPT, it is striking that the most popular datasets have included substantial collections of novels. For the engineers and research scientists who build these models, there is a common belief that the language in fiction is rich enough to cover all manner of social and communicative phenomena, yet the belief has gone mostly unexamined. How does fiction shape the outputs of generative AI? Specifically, what are novels' effects relative to other forms of text, such as newspapers, Reddit, and Wikipedia? Since the 1970s, literature scholars such as Catherine Gallagher and James Phelan have developed robust and insightful accounts of how fiction operates as a form of discourse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Artificial Intelligence in Healthcare and Education · Digital Humanities and Scholarship
