Seq-NMS for Video Object Detection

Wei Han; Pooya Khorrami; Tom Le Paine; Prajit Ramachandran; Mohammad; Babaeizadeh; Honghui Shi; Jianan Li; Shuicheng Yan; Thomas S. Huang

arXiv:1602.08465·cs.CV·August 24, 2016·154 cites

Seq-NMS for Video Object Detection

Wei Han, Pooya Khorrami, Tom Le Paine, Prajit Ramachandran, Mohammad, Babaeizadeh, Honghui Shi, Jianan Li, Shuicheng Yan, Thomas S. Huang

PDF

Open Access 1 Repo

TL;DR

Seq-NMS enhances video object detection by leveraging high-scoring detections from neighboring frames to improve weaker detections, leading to superior results compared to single-image methods and achieving top placement in ILSVRC2015.

Contribution

The paper introduces Seq-NMS, a novel post-processing method that improves video object detection by utilizing temporal information from adjacent frames.

Findings

01

Outperforms state-of-the-art single-image detection methods

02

Achieved 3rd place in ILSVRC2015 VID task

03

Demonstrates the effectiveness of temporal score boosting

Abstract

Video object detection is challenging because objects that are easily detected in one frame may be difficult to detect in another frame within the same clip. Recently, there have been major advances for doing object detection in a single image. These methods typically contain three phases: (i) object proposal generation (ii) object classification and (iii) post-processing. We propose a modification of the post-processing phase that uses high-scoring object detections from nearby frames to boost scores of weaker detections within the same clip. We show that our method obtains superior results to state-of-the-art single image object detection techniques. Our method placed 3rd in the video object detection (VID) task of the ImageNet Large Scale Visual Recognition Challenge 2015 (ILSVRC2015).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tmoopenn/seq-nms
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning