Bag of Genres for Video Retrieval

Leonardo A. Duarte; Ot\'avio A. B. Penatti; and Jurandy Almeida

arXiv:1506.00051·cs.CV·December 29, 2020

Bag of Genres for Video Retrieval

Leonardo A. Duarte, Ot\'avio A. B. Penatti, and Jurandy Almeida

PDF

TL;DR

The paper introduces the Bag of Genres, a compact video representation based on genre classification, improving video retrieval by capturing multiple concepts within videos more effectively.

Contribution

It proposes a novel Bag of Genres representation using a genre classifier-based visual dictionary, enhancing video retrieval with a more compact and effective feature.

Findings

01

Achieves comparable or superior results to state-of-the-art methods.

02

Provides a more compact representation than existing features.

03

Effective for both video genre and event retrieval.

Abstract

Often, videos are composed of multiple concepts or even genres. For instance, news videos may contain sports, action, nature, etc. Therefore, encoding the distribution of such concepts/genres in a compact and effective representation is a challenging task. In this sense, we propose the Bag of Genres representation, which is based on a visual dictionary defined by a genre classifier. Each visual word corresponds to a region in the classification space. The Bag of Genres video vector contains a summary of the activations of each genre in the video content. We evaluate the proposed method for video genre retrieval using the dataset of MediaEval Tagging Task of 2012 and for video event retrieval using the EVVE dataset. Results show that the proposed method achieves results comparable or superior to state-of-the-art methods, with the advantage of providing a much more compact representation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.