Leveraging Large-scale Multimedia Datasets to Refine Content Moderation Models
Ioannis Sarridis, Christos Koutlis, Olga Papadopoulou, and Symeon, Papadopoulos

TL;DR
This paper presents CM-Refinery, a framework that uses large-scale multimedia datasets to automatically enhance content moderation models by generating hard examples, reducing human annotation effort, and improving accuracy on NSFW and disturbing content detection tasks.
Contribution
The paper introduces CM-Refinery, a novel method leveraging large datasets and diversity criteria to automatically extend training data, improving content moderation models with minimal human involvement.
Findings
Achieved 1.32% and 1.94% accuracy improvements on benchmark datasets.
Automatically annotated 92.54% of disturbing content data, reducing human effort.
Enhanced model generalization through diversity-based data collection.
Abstract
The sheer volume of online user-generated content has rendered content moderation technologies essential in order to protect digital platform audiences from content that may cause anxiety, worry, or concern. Despite the efforts towards developing automated solutions to tackle this problem, creating accurate models remains challenging due to the lack of adequate task-specific training data. The fact that manually annotating such data is a highly demanding procedure that could severely affect the annotators' emotional well-being is directly related to the latter limitation. In this paper, we propose the CM-Refinery framework that leverages large-scale multimedia datasets to automatically extend initial training datasets with hard examples that can refine content moderation models, while significantly reducing the involvement of human annotators. We apply our method on two model adaptation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Sentiment Analysis and Opinion Mining · Topic Modeling
