Are We All in a Truman Show? Spotting Instagram Crowdturfing through   Self-Training

Pier Paolo Tricomi; Sousan Tarahomi; Christian Cattai; Francesco; Martini; Mauro Conti

arXiv:2206.12904·cs.SI·April 6, 2023

Are We All in a Truman Show? Spotting Instagram Crowdturfing through Self-Training

Pier Paolo Tricomi, Sousan Tarahomi, Christian Cattai, Francesco, Martini, Mauro Conti

PDF

Open Access

TL;DR

This paper introduces the first Instagram crowdturfing detection method using semi-supervised learning, achieving high accuracy and revealing significant artificial engagement among mega-influencers, highlighting the challenge of content-based detection.

Contribution

It presents a novel semi-supervised approach for detecting crowdturfing on Instagram, leveraging profile features and outperforming supervised methods in identifying fake engagement.

Findings

01

Achieved 95% F1-score in detecting crowdturfing profiles.

02

Discovered over 20% of engagement on mega-influencers is artificial.

03

Content analysis alone is insufficient for detecting crowdturfing activities.

Abstract

Influencer Marketing generated $16 billion in 2022. Usually, the more popular influencers are paid more for their collaborations. Thus, many services were created to boost profiles' popularity metrics through bots or fake accounts. However, real people recently started participating in such boosting activities using their real accounts for monetary rewards, generating ungenuine content that is extremely difficult to detect. To date, no works have attempted to detect this new phenomenon, known as crowdturfing (CT), on Instagram. In this work, we propose the first Instagram CT engagement detector. Our algorithm leverages profiles' characteristics through semi-supervised learning to spot accounts involved in CT activities. Compared to the supervised approaches used so far to identify fake accounts, semi-supervised models can exploit huge quantities of unlabeled data to increase…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Hate Speech and Cyberbullying Detection · FinTech, Crowdfunding, Digital Finance