Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection   in Images and Videos

Xingyu Chen; Junzhi Yu; Shihan Kong; Zhengxing Wu; and Li Wen

arXiv:1807.08638·cs.CV·March 16, 2020·1 cites

Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection in Images and Videos

Xingyu Chen, Junzhi Yu, Shihan Kong, Zhengxing Wu, and Li Wen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a dual refinement detection framework with anchor-offset mechanisms for real-time, accurate object detection in images and videos, achieving state-of-the-art results on multiple datasets.

Contribution

It proposes a novel anchor-offset detection method and dual refinement networks for static and temporal detection, enhancing accuracy and speed in real-world scenes.

Findings

01

Achieves 84.4% mAP on VOC 2007

02

Attains 69.4% mAP on VID 2017

03

Runs in real-time with high accuracy

Abstract

Object detection has been vigorously investigated for years but fast accurate detection for real-world scenes remains a very challenging problem. Overcoming drawbacks of single-stage detectors, we take aim at precisely detecting objects for static and temporal scenes in real time. Firstly, as a dual refinement mechanism, a novel anchor-offset detection is designed, which includes an anchor refinement, a feature location refinement, and a deformable detection head. This new detection mode is able to simultaneously perform two-step regression and capture accurate object features. Based on the anchor-offset detection, a dual refinement network (DRNet) is developed for high-performance static detection, where a multi-deformable head is further designed to leverage contextual information for describing objects. As for temporal detection in videos, temporal refinement networks (TRNet) and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SeanChenxy/TDRN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Image Enhancement Techniques · Multimodal Machine Learning Applications