Recurrent Scale Approximation for Object Detection in CNN
Yu Liu, Hongyang Li, Junjie Yan, Fangyin Wei, Xiaogang Wang, Xiaoou, Tang

TL;DR
This paper introduces a recurrent scale approximation method that reduces the computational cost of multi-scale object detection in CNNs by approximating feature maps at various scales through a recursive mechanism, improving efficiency and accuracy.
Contribution
It proposes a novel RSA technique with a scale-forecast network and landmark retracing network, enabling end-to-end training and superior face detection performance.
Findings
Outperforms state-of-the-art face detection methods
Achieves comparable results in generic proposal generation
Reduces computational cost significantly
Abstract
Since convolutional neural network (CNN) lacks an inherent mechanism to handle large scale variations, we always need to compute feature maps multiple times for multi-scale object detection, which has the bottleneck of computational cost in practice. To address this, we devise a recurrent scale approximation (RSA) to compute feature map once only, and only through this map can we approximate the rest maps on other levels. At the core of RSA is the recursive rolling out mechanism: given an initial map at a particular scale, it generates the prediction at a smaller scale that is half the size of input. To further increase efficiency and accuracy, we (a): design a scale-forecast network to globally predict potential scales in the image since there is no need to compute maps on all levels of the pyramid. (b): propose a landmark retracing network (LRN) to trace back locations of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Face recognition and analysis · Advanced Image and Video Retrieval Techniques
