SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection
Yonghyun Kim, Bong-Nam Kang, Daijin Kim

TL;DR
This paper introduces a Scale Aware Network (SAN) that maps convolutional features from different scales onto a scale-invariant subspace, improving CNN-based object detection robustness to scale variations.
Contribution
The paper proposes SAN, a novel method that normalizes features across scales by focusing on channel relationships, enhancing detection accuracy in multi-scale scenarios.
Findings
SAN reduces feature differences across scales
Improves detection accuracy on VOC PASCAL and MS COCO datasets
Adds minimal computational overhead
Abstract
Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space can be completely different according to a scale and this difference makes it hard for the classifier to learn samples. We propose a Scale Aware Network (SAN) that maps the convolutional features from the different scales onto a scale-invariant subspace to make CNN-based detection methods more robust to the scale variation, and also construct a unique learning method which considers purely the relationship between channels without the spatial information for the efficient learning of SAN. To show the validity of our method, we visualize how convolutional features change according to the scale through a channel activation matrix and experimentally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques
