Movie Genre Classification by Language Augmentation and Shot Sampling

Zhongping Zhang; Yiwen Gu; Bryan A. Plummer; Xin Miao; Jiayi Liu,; Huayan Wang

arXiv:2203.13281·cs.CV·November 9, 2023·1 cites

Movie Genre Classification by Language Augmentation and Shot Sampling

Zhongping Zhang, Yiwen Gu, Bryan A. Plummer, Xin Miao, Jiayi Liu,, Huayan Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Movie-CLIP, a novel approach for movie genre classification that leverages language elements and shot sampling to improve accuracy and efficiency, also extending to scene boundary detection.

Contribution

The paper proposes a new method combining language augmentation and shot sampling for more accurate and efficient movie genre classification and scene boundary detection.

Findings

01

Achieved 6-9% improvement in mean Average Precision over baselines.

02

Extended the approach to scene boundary detection with 1.1% AP improvement.

03

Demonstrated effectiveness on MovieNet and Condensed Movies datasets.

Abstract

Video-based movie genre classification has garnered considerable attention due to its various applications in recommendation systems. Prior work has typically addressed this task by adapting models from traditional video classification tasks, such as action recognition or event detection. However, these models often neglect language elements (e.g., narrations or conversations) present in videos, which can implicitly convey high-level semantics of movie genres, like storylines or background context. Additionally, existing approaches are primarily designed to encode the entire content of the input video, leading to inefficiencies in predicting movie genres. Movie genre prediction may require only a few shots to accurately determine the genres, rendering a comprehensive understanding of the entire video unnecessary. To address these challenges, we propose a Movie genre Classification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhongping-zhang/movie-clip
pytorchOfficial

Videos

Movie Genre Classification by Language Augmentation and Shot Sampling· youtube

Taxonomy

TopicsVideo Analysis and Summarization · Music and Audio Processing · Human Pose and Action Recognition