SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention

Chengshuai Zhao; Zhen Tan; Chau-Wai Wong; Xinyan Zhao; Tianlong Chen; Huan Liu

arXiv:2502.10937·cs.AI·February 12, 2026

SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention

Chengshuai Zhao, Zhen Tan, Chau-Wai Wong, Xinyan Zhao, Tianlong Chen, Huan Liu

PDF

Open Access 1 Repo 1 Video

TL;DR

SCALE introduces a multi-agent framework utilizing large language models and human intervention to automate and improve complex content analysis in social science, mimicking human coding, discussion, and codebook evolution.

Contribution

This paper presents SCALE, a novel multi-agent system that simulates human-like content analysis processes with LLMs and human input, advancing automation in social science research.

Findings

01

SCALE achieves performance comparable to human annotators.

02

The framework effectively models collaborative discussion and codebook evolution.

03

Incorporating human intervention enhances analysis accuracy.

Abstract

Content analysis breaks down complex and unstructured texts into theory-informed numerical categories. Particularly, in social science, this process usually relies on multiple rounds of manual annotation, domain expert discussion, and rule-based refinement. In this paper, we introduce SCALE, a novel multi-agent framework that effectively $\underline{S}$ imulates $\underline{C}$ ontent $\underline{A}$ nalysis via $\underline{L}$ arge language model (LLM) ag $\underline{E}$ nts. SCALE imitates key phases of content analysis, including text coding, collaborative discussion, and dynamic codebook evolution, capturing the reflective depth and adaptive discussions of human researchers. Furthermore, by integrating diverse modes of human intervention, SCALE is augmented with expert input to further enhance its performance. Extensive evaluations on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ChengshuaiZhao0/SCALE
noneOfficial

Videos

SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention· underline

Taxonomy

TopicsComputational and Text Analysis Methods