Character-focused Video Thumbnail Retrieval

Shervin Ardeshir; Nagendra Kamath; Hossein Taghavi

arXiv:2204.06563·cs.CV·April 15, 2022

Character-focused Video Thumbnail Retrieval

Shervin Ardeshir, Nagendra Kamath, Hossein Taghavi

PDF

Open Access

TL;DR

This paper proposes a method for retrieving character-focused video frames as thumbnails by evaluating facial expressions and character prominence and interactions using CNNs and face clustering.

Contribution

It introduces a novel approach combining facial expression analysis and character importance modeling for improved thumbnail selection.

Findings

01

Effective facial expression classifier for thumbnail relevance

02

Character prominence and interaction graph improves frame ranking

03

Method outperforms baseline thumbnail retrieval techniques

Abstract

We explore retrieving character-focused video frames as candidates for being video thumbnails. To evaluate each frame of the video based on the character(s) present in it, characters (faces) are evaluated in two aspects: Facial-expression: We train a CNN model to measure whether a face has an acceptable facial expression for being in a video thumbnail. This model is trained to distinguish faces extracted from artworks/thumbnails, from faces extracted from random frames of videos. Prominence and interactions: Character(s) in the thumbnail should be important character(s) in the video, to prevent the algorithm from suggesting non-representative frames as candidates. We use face clustering to identify the characters in the video, and form a graph in which the prominence (frequency of appearance) of the character(s), and their interactions (co-occurrence) are captured. We use this graph to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Image Retrieval and Classification Techniques · Generative Adversarial Networks and Image Synthesis