Computational frame analysis revisited: On LLMs for studying news coverage
Sharaj Kunjar, Alyssa Hasegawa Smith, Tyler R Mckenzie, Rushali Mohbe, Samuel V Scarpino, Brooke Foucault Welles

TL;DR
This study evaluates the effectiveness of large language models in media frame analysis, comparing them to traditional methods and manual coding, and offers guidance on their appropriate use in research workflows.
Contribution
It provides a systematic comparison of LLMs with traditional and manual methods for media frame analysis using a novel dataset from the US Mpox epidemic coverage.
Findings
LLMs are outperformed by manual coders and smaller models in frame analysis.
Human validation remains essential for model selection.
Different approaches are suitable for different analytical tasks.
Abstract
Computational approaches have previously shown various promises and pitfalls when it comes to the reliable identification of media frames. Generative LLMs like GPT and Claude are increasingly being used as content analytical tools, but how effective are they for frame analysis? We address this question by systematically evaluating them against their computational predecessors: bag-of-words models and encoder-only transformers; and traditional manual coding procedures. Our analysis rests on a novel gold standard dataset that we inductively and iteratively developed through the study, investigating six months of news coverage of the US Mpox epidemic of 2022. While we discover some potential applications for generative LLMs, we demonstrate that they were consistently outperformed by manual coders, and in some instances, by smaller language models. Some form of human validation was always…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection
