Content Rating Classification for Fan Fiction

Yu Qiao; James Pope

arXiv:2212.12496·cs.CL·December 26, 2022

Content Rating Classification for Fan Fiction

Yu Qiao, James Pope

PDF

Open Access

TL;DR

This paper explores automatic content rating classification for fan fiction using NLP techniques, highlighting challenges with multi-class accuracy and the impact of self-annotation quality.

Contribution

It introduces NLP-based methods for fan fiction content rating and analyzes the limitations of current self-annotated datasets.

Findings

01

Binary classification outperforms multi-class methods.

02

Self-annotation errors limit classification accuracy.

03

Traditional and deep learning methods show poor multi-class results.

Abstract

Content ratings can enable audiences to determine the suitability of various media products. With the recent advent of fan fiction, the critical issue of fan fiction content ratings has emerged. Whether fan fiction content ratings are done voluntarily or required by regulation, there is the need to automate the content rating classification. The problem is to take fan fiction text and determine the appropriate content rating. Methods for other domains, such as online books, have been attempted though none have been applied to fan fiction. We propose natural language processing techniques, including traditional and deep learning methods, to automatically determine the content rating. We show that these methods produce poor accuracy results for multi-classification. We then demonstrate that treating the problem as a binary classification problem produces better accuracy. Finally, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Topic Modeling · Digital Games and Media

MethodsNone