Content Rating Classification for Fan Fiction
Yu Qiao, James Pope

TL;DR
This paper explores automatic content rating classification for fan fiction using NLP techniques, highlighting challenges with multi-class accuracy and the impact of self-annotation quality.
Contribution
It introduces NLP-based methods for fan fiction content rating and analyzes the limitations of current self-annotated datasets.
Findings
Binary classification outperforms multi-class methods.
Self-annotation errors limit classification accuracy.
Traditional and deep learning methods show poor multi-class results.
Abstract
Content ratings can enable audiences to determine the suitability of various media products. With the recent advent of fan fiction, the critical issue of fan fiction content ratings has emerged. Whether fan fiction content ratings are done voluntarily or required by regulation, there is the need to automate the content rating classification. The problem is to take fan fiction text and determine the appropriate content rating. Methods for other domains, such as online books, have been attempted though none have been applied to fan fiction. We propose natural language processing techniques, including traditional and deep learning methods, to automatically determine the content rating. We show that these methods produce poor accuracy results for multi-classification. We then demonstrate that treating the problem as a binary classification problem produces better accuracy. Finally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Topic Modeling · Digital Games and Media
MethodsNone
