Generative Large Language Models (gLLMs) in Content Analysis: A Practical Guide for Communication Research

Daria Kravets-Meinke; Hannah Schmid-Petri; Sonja Niemann; Ute Schmid

arXiv:2510.24337·cs.AI·October 29, 2025

Generative Large Language Models (gLLMs) in Content Analysis: A Practical Guide for Communication Research

Daria Kravets-Meinke, Hannah Schmid-Petri, Sonja Niemann, Ute Schmid

PDF

TL;DR

This paper explores how generative large language models like ChatGPT are transforming content analysis in communication research by offering faster, cost-effective, and more nuanced coding capabilities, while also addressing methodological challenges.

Contribution

It provides a comprehensive guide on integrating gLLMs into content analysis, highlighting best practices and addressing key challenges to improve research validity and reliability.

Findings

01

gLLMs outperform traditional coding methods in speed and cost

02

gLLMs can decode implicit meanings and contextual information

03

The paper offers a best-practice framework for using gLLMs in research

Abstract

Generative Large Language Models (gLLMs), such as ChatGPT, are increasingly being used in communication research for content analysis. Studies show that gLLMs can outperform both crowd workers and trained coders, such as research assistants, on various coding tasks relevant to communication science, often at a fraction of the time and cost. Additionally, gLLMs can decode implicit meanings and contextual information, be instructed using natural language, deployed with only basic programming skills, and require little to no annotated data beyond a validation dataset - constituting a paradigm shift in automated content analysis. Despite their potential, the integration of gLLMs into the methodological toolkit of communication research remains underdeveloped. In gLLM-assisted quantitative content analysis, researchers must address at least seven critical challenges that impact result…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.