IMDB Spoiler Dataset

Rishabh Misra

arXiv:2212.06034·cs.CL·December 13, 2022

IMDB Spoiler Dataset

Rishabh Misra

PDF

Open Access

TL;DR

This paper introduces a high-quality dataset of movie reviews containing spoilers, aiming to facilitate research on automatic spoiler detection in user-generated media reviews.

Contribution

It provides a new, curated dataset specifically designed for developing and evaluating spoiler detection methods in movie reviews.

Findings

01

Dataset enables effective training of spoiler detection models

02

Preliminary experiments show promising results in identifying spoilers

03

Dataset covers diverse genres and spoiler types

Abstract

User-generated reviews are often our first point of contact when we consider watching a movie or a TV show. However, beyond telling us the qualitative aspects of the media we want to consume, reviews may inevitably contain undesired revelatory information (i.e. 'spoilers') such as the surprising fate of a character in a movie, or the identity of a murderer in a crime-suspense movie, etc. In this paper, we present a high-quality movie-review based spoiler dataset to tackle the problem of spoiler detection and describe various research questions it can answer.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis