PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas
Chen Li, Xutan Peng, Teng Wang, Yixiao Ge, Mengyang Liu, Xuyuan Xu,, Yexin Wang, Ying Shan

TL;DR
PTVD is a comprehensive, large-scale multimodal dataset based on TV dramas, designed to improve plot coherence understanding, with extensive annotations, comments, and a baseline model for multimodal learning research.
Contribution
This paper introduces PTVD, the first plot-oriented multimodal TV drama dataset, including long-form video, detailed annotations, and a strong baseline model for multimodal tasks.
Findings
PTVD enables new multimodal research in TV dramas.
Extensive experiments validate the dataset's usefulness.
Counter-intuitive insights from cognitive-inspired tasks.
Abstract
Art forms such as movies and television (TV) dramas are reflections of the real world, which have attracted much attention from the multimodal learning community recently. However, existing corpora in this domain share three limitations: (1) annotated in a scene-oriented fashion, they ignore the coherence within plots; (2) their text lacks empathy and seldom mentions situational context; (3) their video clips fail to cover long-form relationship due to short duration. To address these fundamental issues, using 1,106 TV drama episodes and 24,875 informative plot-focused sentences written by professionals, with the help of 449 human annotators, we constructed PTVD, the first plot-oriented multimodal dataset in the TV domain. It is also the first non-English dataset of its kind. Additionally, PTVD contains more than 26 million bullet screen comments (BSCs), powering large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
Methodsfail
