Toward Scale-Invariance and Position-Sensitive Region Proposal Networks

Hsueh-Fu Lu; Xiaofei Du; Ping-Lin Chang

arXiv:1807.09528·cs.CV·July 26, 2018

Toward Scale-Invariance and Position-Sensitive Region Proposal Networks

Hsueh-Fu Lu, Xiaofei Du, Ping-Lin Chang

PDF

Open Access

TL;DR

This paper introduces a novel object proposal network that enhances scale-invariance and position sensitivity, significantly improving average recall on standard datasets while maintaining real-time performance.

Contribution

The proposed network architecture combines translation-invariance and scale-invariance with large receptive fields, offering a simple yet effective approach for high-quality object proposals.

Findings

01

Improves AR at 1,000 proposals by 35% on PASCAL VOC

02

Achieves 45% AR improvement on COCO dataset

03

Runs at 44.8 ms inference time for 640x640 images

Abstract

Accurately localising object proposals is an important precondition for high detection rate for the state-of-the-art object detection frameworks. The accuracy of an object detection method has been shown highly related to the average recall (AR) of the proposals. In this work, we propose an advanced object proposal network in favour of translation-invariance for objectness classification, translation-variance for bounding box regression, large effective receptive fields for capturing global context and scale-invariance for dealing with a range of object sizes from extremely small to large. The design of the network architecture aims to be simple while being effective and with real time performance. Without bells and whistles the proposed object proposal network significantly improves the AR at 1,000 proposals by $35%$ and $45%$ on PASCAL VOC and COCO dataset respectively and has a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning