Can Large Language Models Transform Computational Social Science?
Caleb Ziems, William Held, Omar Shaikh, Jiaao Chen, Zhehao Zhang, Diyi, Yang

TL;DR
This paper explores how Large Language Models can be integrated into Computational Social Science workflows, demonstrating their potential for classification and explanation tasks, and providing best practices and evaluation methods.
Contribution
It offers a comprehensive evaluation pipeline, prompting best practices, and insights into the capabilities of 13 LLMs on social science benchmarks, highlighting their augmentative potential.
Findings
LLMs achieve fair agreement with humans on classification tasks.
LLMs produce high-quality explanations surpassing crowdworker references.
LLMs can serve as zero-shot annotators and creative generators in CSS.
Abstract
Large Language Models (LLMs) are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and political ideology, then LLMs could augment the Computational Social Science (CSS) pipeline in important ways. This work provides a road map for using LLMs as CSS tools. Towards this end, we contribute a set of prompting best practices and an extensive evaluation pipeline to measure the zero-shot performance of 13 language models on 25 representative English CSS benchmarks. On taxonomic labeling tasks (classification), LLMs fail to outperform the best fine-tuned models but still achieve fair levels of agreement with humans. On free-form coding tasks (generation), LLMs produce explanations that often exceed the quality of crowdworkers' gold references. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Natural Language Processing Techniques
Methodsfail
