Can Large Language Models Transform Computational Social Science?

Caleb Ziems; William Held; Omar Shaikh; Jiaao Chen; Zhehao Zhang; Diyi; Yang

arXiv:2305.03514·cs.CL·February 27, 2024·61 cites

Can Large Language Models Transform Computational Social Science?

Caleb Ziems, William Held, Omar Shaikh, Jiaao Chen, Zhehao Zhang, Diyi, Yang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores how Large Language Models can be integrated into Computational Social Science workflows, demonstrating their potential for classification and explanation tasks, and providing best practices and evaluation methods.

Contribution

It offers a comprehensive evaluation pipeline, prompting best practices, and insights into the capabilities of 13 LLMs on social science benchmarks, highlighting their augmentative potential.

Findings

01

LLMs achieve fair agreement with humans on classification tasks.

02

LLMs produce high-quality explanations surpassing crowdworker references.

03

LLMs can serve as zero-shot annotators and creative generators in CSS.

Abstract

Large Language Models (LLMs) are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and political ideology, then LLMs could augment the Computational Social Science (CSS) pipeline in important ways. This work provides a road map for using LLMs as CSS tools. Towards this end, we contribute a set of prompting best practices and an extensive evaluation pipeline to measure the zero-shot performance of 13 language models on 25 representative English CSS benchmarks. On taxonomic labeling tasks (classification), LLMs fail to outperform the best fine-tuned models but still achieve fair levels of agreement with humans. On free-form coding tasks (generation), LLMs produce explanations that often exceed the quality of crowdworkers' gold references. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

salt-nlp/llms_for_css
pytorchOfficial

Videos

Can Large Language Models Transform Computational Social Science?· underline

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Natural Language Processing Techniques

Methodsfail