Leveraging Large-scale Multimedia Datasets to Refine Content Moderation   Models

Ioannis Sarridis; Christos Koutlis; Olga Papadopoulou; and Symeon; Papadopoulos

arXiv:2212.00668·cs.CV·December 2, 2022

Leveraging Large-scale Multimedia Datasets to Refine Content Moderation Models

Ioannis Sarridis, Christos Koutlis, Olga Papadopoulou, and Symeon, Papadopoulos

PDF

Open Access

TL;DR

This paper presents CM-Refinery, a framework that uses large-scale multimedia datasets to automatically enhance content moderation models by generating hard examples, reducing human annotation effort, and improving accuracy on NSFW and disturbing content detection tasks.

Contribution

The paper introduces CM-Refinery, a novel method leveraging large datasets and diversity criteria to automatically extend training data, improving content moderation models with minimal human involvement.

Findings

01

Achieved 1.32% and 1.94% accuracy improvements on benchmark datasets.

02

Automatically annotated 92.54% of disturbing content data, reducing human effort.

03

Enhanced model generalization through diversity-based data collection.

Abstract

The sheer volume of online user-generated content has rendered content moderation technologies essential in order to protect digital platform audiences from content that may cause anxiety, worry, or concern. Despite the efforts towards developing automated solutions to tackle this problem, creating accurate models remains challenging due to the lack of adequate task-specific training data. The fact that manually annotating such data is a highly demanding procedure that could severely affect the annotators' emotional well-being is directly related to the latter limitation. In this paper, we propose the CM-Refinery framework that leverages large-scale multimedia datasets to automatically extend initial training datasets with hard examples that can refine content moderation models, while significantly reducing the involvement of human annotators. We apply our method on two model adaptation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Sentiment Analysis and Opinion Mining · Topic Modeling