Predicting IMDb Rating of TV Series with Deep Learning: The Case of Arrow
Anna Luiza Gomes, Get\'ulio Vianna, Tatiana Escovedo, Marcos, Kalinowski

TL;DR
This paper demonstrates how deep learning models, specifically Catboost, can effectively predict IMDb ratings of TV series episodes like Arrow using features such as themes, reviews, and viewership, aiding decision-making in TV production.
Contribution
It introduces a predictive modeling approach for IMDb ratings based on episode features, including thematic analysis via LDA, which is a novel application for TV series evaluation.
Findings
Catboost achieved the best prediction performance.
IMDb ratings could be predicted with an RMSE of 0.55.
Thematic features influenced episode ratings significantly.
Abstract
Context: The number of TV series offered nowadays is very high. Due to its large amount, many series are canceled due to a lack of originality that generates a low audience. Problem: Having a decision support system that can show why some shows are a huge success or not would facilitate the choices of renewing or starting a show. Solution: We studied the case of the series Arrow broadcasted by CW Network and used descriptive and predictive modeling techniques to predict the IMDb rating. We assumed that the theme of the episode would affect its evaluation by users, so the dataset is composed only by the director of the episode, the number of reviews that episode got, the percentual of each theme extracted by the Latent Dirichlet Allocation (LDA) model of an episode, the number of viewers from Wikipedia and the rating from IMDb. The LDA model is a generative probabilistic model of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest · Linear Discriminant Analysis
