MPST: A Corpus of Movie Plot Synopses with Tags
Sudipta Kar, Suraj Maharjan, A. Pastor L\'opez-Monroy, Thamar, Solorio

TL;DR
This paper introduces MPST, a comprehensive dataset of 14,000 movie plot synopses with around 70 detailed tags, aiming to facilitate automatic movie tagging and analysis of narrative and emotional content.
Contribution
We created a fine-grained, multi-label movie tag corpus and explored its potential for automatic tag inference from plot summaries.
Findings
Tags correlate with movie genres and emotional flow.
The corpus enables analysis of narrative characteristics.
Feasibility of automatic tag prediction demonstrated.
Abstract
Social tagging of movies reveals a wide range of heterogeneous information about movies, like the genre, plot structure, soundtracks, metadata, visual and emotional experiences. Such information can be valuable in building automatic systems to create tags for movies. Automatic tagging systems can help recommendation engines to improve the retrieval of similar movies as well as help viewers to know what to expect from a movie in advance. In this paper, we set out to the task of collecting a corpus of movie plot synopses and tags. We describe a methodology that enabled us to build a fine-grained set of around 70 tags exposing heterogeneous characteristics of movie plots and the multi-label associations of these tags with some 14K movie plot synopses. We investigate how these tags correlate with movies and the flow of emotions throughout different types of movies. Finally, we use this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Video Analysis and Summarization · Image Retrieval and Classification Techniques
