SAN: Learning Relationship between Convolutional Features for   Multi-Scale Object Detection

Yonghyun Kim; Bong-Nam Kang; Daijin Kim

arXiv:1808.04974·cs.CV·August 16, 2018·5 cites

SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection

Yonghyun Kim, Bong-Nam Kang, Daijin Kim

PDF

Open Access

TL;DR

This paper introduces a Scale Aware Network (SAN) that maps convolutional features from different scales onto a scale-invariant subspace, improving CNN-based object detection robustness to scale variations.

Contribution

The paper proposes SAN, a novel method that normalizes features across scales by focusing on channel relationships, enhancing detection accuracy in multi-scale scenarios.

Findings

01

SAN reduces feature differences across scales

02

Improves detection accuracy on VOC PASCAL and MS COCO datasets

03

Adds minimal computational overhead

Abstract

Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space can be completely different according to a scale and this difference makes it hard for the classifier to learn samples. We propose a Scale Aware Network (SAN) that maps the convolutional features from the different scales onto a scale-invariant subspace to make CNN-based detection methods more robust to the scale variation, and also construct a unique learning method which considers purely the relationship between channels without the spatial information for the efficient learning of SAN. To show the validity of our method, we visualize how convolutional features change according to the scale through a channel activation matrix and experimentally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques