Unraveling Movie Genres through Cross-Attention Fusion of Bi-Modal Synergy of Poster
Utsav Kumar Nareti, Chandranath Adak, Soumi Chattopadhyay, Pichao Wang

TL;DR
This paper proposes a novel multi-modal framework that combines visual and textual features from movie posters using cross-attention to improve multilabel movie genre classification, demonstrating superior performance over existing models.
Contribution
The study introduces a cross-attention fusion module for integrating visual and textual poster features, advancing movie genre classification methods.
Findings
Model outperforms several contemporary architectures.
Effective use of OCR-extracted text and visual features.
Promising results on IMDb poster dataset.
Abstract
Movie posters are not just decorative; they are meticulously designed to capture the essence of a movie, such as its genre, storyline, and tone/vibe. For decades, movie posters have graced cinema walls, billboards, and now our digital screens as a form of digital posters. Movie genre classification plays a pivotal role in film marketing, audience engagement, and recommendation systems. Previous explorations into movie genre classification have been mostly examined in plot summaries, subtitles, trailers and movie scenes. Movie posters provide a pre-release tantalizing glimpse into a film's key aspects, which can ignite public interest. In this paper, we presented the framework that exploits movie posters from a visual and textual perspective to address the multilabel movie genre classification problem. Firstly, we extracted text from movie posters using an OCR and retrieved the relevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Generative Adversarial Networks and Image Synthesis
MethodsSoftmax · Attention Is All You Need
