Content-based Video Indexing and Retrieval Using Corr-LDA

Rahul Radhakrishnan Iyer; Sanjeel Parekh; Vikas Mohandoss; Anush; Ramsurat; Bhiksha Raj; Rita Singh

arXiv:1602.08581·cs.IR·November 21, 2019·20 cites

Content-based Video Indexing and Retrieval Using Corr-LDA

Rahul Radhakrishnan Iyer, Sanjeel Parekh, Vikas Mohandoss, Anush, Ramsurat, Bhiksha Raj, Rita Singh

PDF

Open Access

TL;DR

This paper introduces a content-based video retrieval method using corr-LDA, which auto-annotates videos with textual descriptors and leverages semantic relations for improved accuracy, demonstrated through audio-based experiments.

Contribution

The paper presents a novel corr-LDA framework for content-based video indexing that enhances retrieval accuracy by modeling semantic relations between video content and text.

Findings

01

Corr-LDA effectively auto-annotates videos with textual descriptors.

02

The method improves retrieval accuracy over SVM-based approaches.

03

Audio-only analysis shows promising results for content-based video retrieval.

Abstract

Existing video indexing and retrieval methods on popular web-based multimedia sharing websites are based on user-provided sparse tagging. This paper proposes a very specific way of searching for video clips, based on the content of the video. We present our work on Content-based Video Indexing and Retrieval using the Correspondence-Latent Dirichlet Allocation (corr-LDA) probabilistic framework. This is a model that provides for auto-annotation of videos in a database with textual descriptors, and brings the added benefit of utilizing the semantic relations between the content of the video and text. We use the concept-level matching provided by corr-LDA to build correspondences between text and multimedia, with the objective of retrieving content with increased accuracy. In our experiments, we employ only the audio components of the individual recordings and compare our results with an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization · Music and Audio Processing · Advanced Image and Video Retrieval Techniques