Redefining Research Crowdsourcing: Incorporating Human Feedback with LLM-Powered Digital Twins

Amanda Chan; Catherine Di; Joseph Rupertus; Gary Smith; Varun Nagaraj Rao; Manoel Horta Ribeiro; Andr\'es Monroy-Hern\'andez

arXiv:2505.24004·cs.HC·June 2, 2025

Redefining Research Crowdsourcing: Incorporating Human Feedback with LLM-Powered Digital Twins

Amanda Chan, Catherine Di, Joseph Rupertus, Gary Smith, Varun Nagaraj Rao, Manoel Horta Ribeiro, Andr\'es Monroy-Hern\'andez

PDF

TL;DR

This paper introduces a hybrid digital twin framework that models crowd workers to improve research data quality and worker engagement amidst AI automation challenges.

Contribution

It proposes a novel use of personalized AI digital twins to emulate human worker behavior, maintaining authenticity while enhancing productivity.

Findings

01

Digital twins can improve productivity and reduce decision fatigue.

02

The system maintains response quality comparable to human responses.

03

Transparency and ethical considerations are crucial for adoption.

Abstract

Crowd work platforms like Amazon Mechanical Turk and Prolific are vital for research, yet workers' growing use of generative AI tools poses challenges. Researchers face compromised data validity as AI responses replace authentic human behavior, while workers risk diminished roles as AI automates tasks. To address this, we propose a hybrid framework using digital twins, personalized AI models that emulate workers' behaviors and preferences while keeping humans in the loop. We evaluate our system with an experiment (n=88 crowd workers) and in-depth interviews with crowd workers (n=5) and social science researchers (n=4). Our results suggest that digital twins may enhance productivity and reduce decision fatigue while maintaining response quality. Both researchers and workers emphasized the importance of transparency, ethical data use, and worker agency. By automating repetitive tasks and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.