TagBook: A Semantic Video Representation without Supervision for Event   Detection

Masoud Mazloom; Xirong Li; Cees G. M. Snoek

arXiv:1510.02899·cs.CV·April 26, 2016

TagBook: A Semantic Video Representation without Supervision for Event Detection

Masoud Mazloom, Xirong Li, Cees G. M. Snoek

PDF

Open Access

TL;DR

TagBook introduces a novel, supervision-free semantic video representation using social tags for event detection, outperforming supervised methods in few- and zero-example scenarios.

Contribution

It proposes a new tag propagation-based video representation that does not require training concept detectors, enabling effective event detection with minimal or no labeled data.

Findings

01

Outperforms state-of-the-art supervised methods in zero- and few-example detection

02

Effective on multiple datasets including TRECVID and Columbia Video Dataset

03

Simple algorithm with competitive performance

Abstract

We consider the problem of event detection in video for scenarios where only few, or even zero examples are available for training. For this challenging setting, the prevailing solutions in the literature rely on a semantic video representation obtained from thousands of pre-trained concept detectors. Different from existing work, we propose a new semantic video representation that is based on freely available social tagged videos only, without the need for training any intermediate concept detectors. We introduce a simple algorithm that propagates tags from a video's nearest neighbors, similar in spirit to the ones used for image retrieval, but redesign it for video event detection by including video source set refinement and varying the video tag assignment. We call our approach TagBook and study its construction, descriptiveness and detection performance on the TRECVID 2013 and 2014…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization