Is Crowdsourcing a Puppet Show? Detecting a New Type of Fraud in Online Platforms
Shengqian Wang, Israt Jahan Jui, Julie Thorpe

TL;DR
This paper reveals a significant presence of puppeteer-controlled accounts on Amazon Mechanical Turk, threatening data integrity, and proposes strategies for detecting and mitigating this new form of fraud in crowdsourcing.
Contribution
It identifies and quantifies the prevalence of puppeteer-controlled accounts on MTurk and discusses potential detection methods to improve data quality.
Findings
33% to 56.4% of accounts are likely puppets
Puppeteers can bypass standard attention checks
Detection strategies can help ensure data integrity
Abstract
Crowdsourcing platforms such as Amazon Mechanical Turk (MTurk) are important tools for researchers seeking to conduct studies with a broad, global participant base. Despite their popularity and demonstrated utility, we present evidence that suggests the integrity of data collected through Amazon MTurk is being threatened by the presence of puppeteers, apparently human workers controlling multiple puppet accounts that are capable of bypassing standard attention checks. If left undetected, puppeteers and their puppets can undermine the integrity of data collected on these platforms. This paper investigates data from two Amazon MTurk studies, finding that a substantial proportion of accounts (33% to 56.4%) are likely puppets. Our findings highlight the importance of adopting multifaceted strategies to ensure data integrity on crowdsourcing platforms. With the goal of detecting this type of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Open Source Software Innovations · Spam and Phishing Detection
