Detecting Temporally Consistent Objects in Videos through Object Class   Label Propagation

Subarna Tripathi; Serge Belongie; Youngbae Hwang; Truong Nguyen

arXiv:1601.05447·cs.CV·January 22, 2016·1 cites

Detecting Temporally Consistent Objects in Videos through Object Class Label Propagation

Subarna Tripathi, Serge Belongie, Youngbae Hwang, Truong Nguyen

PDF

Open Access

TL;DR

This paper introduces OVERLAP, a novel method for efficient, temporally consistent object detection in videos using label propagation and clustering of object proposals, achieving state-of-the-art results.

Contribution

It presents a new VOP generation method and a streaming clustering approach that reduces computation by classifying only a small subset of proposals.

Findings

01

Achieves state-of-the-art detection performance on Youtube-Objects dataset.

02

Reduces computational load by classifying fewer proposals per frame.

03

Demonstrates effective temporal consistency through label propagation.

Abstract

Object proposals for detecting moving or static video objects need to address issues such as speed, memory complexity and temporal consistency. We propose an efficient Video Object Proposal (VOP) generation method and show its efficacy in learning a better video object detector. A deep-learning based video object detector learned using the proposed VOP achieves state-of-the-art detection performance on the Youtube-Objects dataset. We further propose a clustering of VOPs which can efficiently be used for detecting objects in video in a streaming fashion. As opposed to applying per-frame convolutional neural network (CNN) based object detection, our proposed method called Objects in Video Enabler thRough LAbel Propagation (OVERLAP) needs to classify only a small fraction of all candidate proposals in every video frame through streaming clustering of object proposals and class-label…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Visual Attention and Saliency Detection