Scene recognition with CNNs: objects, scales and dataset bias

Luis Herranz; Shuqiang Jiang; Xiangyang Li

arXiv:1801.06867·cs.CV·January 23, 2018


TL;DR

This paper investigates how scale-induced dataset bias affects scene recognition with CNNs and proposes scale-specific CNNs to effectively combine scene and object knowledge, significantly improving accuracy.

Contribution

The paper introduces scale-specific CNNs to address scale-induced dataset bias, and shows that combining scene-centric (Places) and object-centric (ImageNet) CNNs at multiple scales improves scene recognition.

Findings

01. Scale-specific CNNs outperform single-scale models.

02. Multi-scale combinations improve recognition accuracy.

03. Accuracy on the SUN397 dataset reaches up to 70.17%.

Abstract

Since scenes are composed in part of objects, accurate recognition of scenes requires knowledge about both scenes and objects. In this paper we address two related problems: 1) scale-induced dataset bias in multi-scale convolutional neural network (CNN) architectures, and 2) how to effectively combine scene-centric and object-centric knowledge (i.e., Places and ImageNet) in CNNs. An earlier attempt, Hybrid-CNN, showed that incorporating ImageNet did not help much. Here we propose an alternative method that takes the scale into account, resulting in significant recognition gains. By analyzing the response of ImageNet-CNNs and Places-CNNs at different scales, we find that the two operate in different scale ranges, so using the same network for all the scales induces dataset bias and limits performance. Thus, adapting the feature extractor to each particular scale (i.e. scale-specific…
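The idea of pairing each scale with its own feature extractor can be illustrated with a minimal sketch. This is not the authors' implementation. The two extractors below are hypothetical stand-ins (fixed random projections) for pretrained Places and ImageNet CNNs. The patch sizes, the max pooling within each scale, the concatenation across scales, and the assignment of the scene-like extractor to the coarsest scale are all assumptions made for illustration.

```python
import numpy as np

def extract_patches(image, patch_size, stride):
    """Return square patches of side patch_size sampled on a regular grid."""
    h, w = image.shape[:2]
    patches = []
    for y in range(0, h - patch_size + 1, stride):
        for x in range(0, w - patch_size + 1, stride):
            patches.append(image[y:y + patch_size, x:x + patch_size])
    return patches

def resize_nearest(patch, out_size):
    """Nearest-neighbour resize of a square patch (keeps the sketch NumPy-only)."""
    idx = np.arange(out_size) * patch.shape[0] // out_size
    return patch[idx][:, idx]

def make_stub_extractor(seed, in_size=32, dim=16):
    """Hypothetical stand-in for a pretrained CNN: random projection plus ReLU."""
    rng = np.random.default_rng(seed)
    weights = rng.standard_normal((in_size * in_size * 3, dim))

    def extractor(patch):
        x = resize_nearest(patch, in_size).astype(np.float64).ravel()
        return np.maximum(x @ weights, 0.0)

    return extractor

def multiscale_descriptor(image, scale_to_extractor, stride_ratio=0.5):
    """Pool features per scale with that scale's extractor, then concatenate."""
    parts = []
    for patch_size in sorted(scale_to_extractor):
        extractor = scale_to_extractor[patch_size]
        stride = max(1, int(patch_size * stride_ratio))
        feats = [extractor(p) for p in extract_patches(image, patch_size, stride)]
        parts.append(np.max(feats, axis=0))  # max pooling over patches (assumed)
    return np.concatenate(parts)

# Assumed assignment: scene-like extractor for the whole image,
# object-like extractor for the smaller patches.
places_like = make_stub_extractor(seed=1)
imagenet_like = make_stub_extractor(seed=0)
scale_to_extractor = {128: places_like, 64: imagenet_like, 32: imagenet_like}

image = np.random.default_rng(42).random((128, 128, 3))
descriptor = multiscale_descriptor(image, scale_to_extractor)
print(descriptor.shape)  # (48,)
```

In a real pipeline the stubs would be replaced by actual networks trained on Places and ImageNet, and the resulting descriptor would be fed to a classifier. Which network suits which scale would need to be determined empirically rather than fixed as it is here.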
