A Hub-and-Spoke Model for Content-Moderation-at-Scale on an   Information-Sharing Platform

Gregory Coppola

arXiv:2010.10581·cs.IR·October 22, 2020

A Hub-and-Spoke Model for Content-Moderation-at-Scale on an Information-Sharing Platform

Gregory Coppola

PDF

Open Access

TL;DR

This paper introduces a scalable content moderation model for large information-sharing platforms, leveraging a hub-and-spoke system where user ratings inform a machine learning algorithm to reduce moderation costs.

Contribution

The paper proposes a novel hub-and-spoke content moderation framework that effectively utilizes user ratings and a core editorial team to improve scalability and cost-efficiency.

Findings

01

Achieved significant cost reduction in content moderation.

02

Implemented a prototype algorithm in open-source code.

03

Demonstrated effectiveness of user ratings as features.

Abstract

One of the most expensive parts of maintaining a modern information-sharing platform (e.g., web search, social network) is the task of content-moderation-at-scale. Content moderation is the binary task of determining whether or not a given user-created message meets the editorial team's content guidelines for the site. The challenge is that the number of messages to check scales with the number of users, which is much larger than the number of moderator-employees working for the given platform. We show how content moderation can be achieved significantly more cheaply than before, in the special case where all messages are public, by effectively platformizing the task of content moderation. Our approach is to use a hub-and-spoke model. The hub is the core editorial team delegated by the management of the given platform. The spokes are the individual users. The ratings of the editorial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Spam and Phishing Detection · Text and Document Classification Technologies