Content-based Video Indexing and Retrieval Using Corr-LDA
Rahul Radhakrishnan Iyer, Sanjeel Parekh, Vikas Mohandoss, Anush, Ramsurat, Bhiksha Raj, Rita Singh

TL;DR
This paper introduces a content-based video retrieval method using corr-LDA, which auto-annotates videos with textual descriptors and leverages semantic relations for improved accuracy, demonstrated through audio-based experiments.
Contribution
The paper presents a novel corr-LDA framework for content-based video indexing that enhances retrieval accuracy by modeling semantic relations between video content and text.
Findings
Corr-LDA effectively auto-annotates videos with textual descriptors.
The method improves retrieval accuracy over SVM-based approaches.
Audio-only analysis shows promising results for content-based video retrieval.
Abstract
Existing video indexing and retrieval methods on popular web-based multimedia sharing websites are based on user-provided sparse tagging. This paper proposes a very specific way of searching for video clips, based on the content of the video. We present our work on Content-based Video Indexing and Retrieval using the Correspondence-Latent Dirichlet Allocation (corr-LDA) probabilistic framework. This is a model that provides for auto-annotation of videos in a database with textual descriptors, and brings the added benefit of utilizing the semantic relations between the content of the video and text. We use the concept-level matching provided by corr-LDA to build correspondences between text and multimedia, with the objective of retrieving content with increased accuracy. In our experiments, we employ only the audio components of the individual recordings and compare our results with an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Music and Audio Processing · Advanced Image and Video Retrieval Techniques
