Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection in Images and Videos
Xingyu Chen, Junzhi Yu, Shihan Kong, Zhengxing Wu, and Li Wen

TL;DR
This paper introduces a dual refinement detection framework with anchor-offset mechanisms for real-time, accurate object detection in images and videos, achieving state-of-the-art results on multiple datasets.
Contribution
It proposes a novel anchor-offset detection method and dual refinement networks for static and temporal detection, enhancing accuracy and speed in real-world scenes.
Findings
Achieves 84.4% mAP on VOC 2007
Attains 69.4% mAP on VID 2017
Runs in real-time with high accuracy
Abstract
Object detection has been vigorously investigated for years but fast accurate detection for real-world scenes remains a very challenging problem. Overcoming drawbacks of single-stage detectors, we take aim at precisely detecting objects for static and temporal scenes in real time. Firstly, as a dual refinement mechanism, a novel anchor-offset detection is designed, which includes an anchor refinement, a feature location refinement, and a deformable detection head. This new detection mode is able to simultaneously perform two-step regression and capture accurate object features. Based on the anchor-offset detection, a dual refinement network (DRNet) is developed for high-performance static detection, where a multi-deformable head is further designed to leverage contextual information for describing objects. As for temporal detection in videos, temporal refinement networks (TRNet) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image Enhancement Techniques · Multimodal Machine Learning Applications
