Hate, Obscenity, and Insults: Measuring the Exposure of Children to Inappropriate Comments in YouTube
Sultan Alshamrani, Ahmed Abusnaina, Mohammed Abuhamad, Daehun Nyang,, David Mohaisen

TL;DR
This study analyzes the exposure of children to inappropriate comments on YouTube, revealing that over 11% of comments on children's videos are toxic, emphasizing the need for better moderation.
Contribution
The paper introduces a large-scale dataset and machine learning classifiers to detect inappropriate comments targeting children on YouTube.
Findings
11% of comments on children's videos are toxic
High accuracy ensemble classifiers for detecting inappropriate comments
Large dataset of approximately four million records
Abstract
Social media has become an essential part of the daily routines of children and adolescents. Moreover, enormous efforts have been made to ensure the psychological and emotional well-being of young users as well as their safety when interacting with various social media platforms. In this paper, we investigate the exposure of those users to inappropriate comments posted on YouTube videos targeting this demographic. We collected a large-scale dataset of approximately four million records and studied the presence of five age-inappropriate categories and the amount of exposure to each category. Using natural language processing and machine learning techniques, we constructed ensemble classifiers that achieved high accuracy in detecting inappropriate comments. Our results show a large percentage of worrisome comments with inappropriate content: we found 11% of the comments on children's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Spam and Phishing Detection · Advanced Malware Detection Techniques
