Semantic-Aware Scene Recognition

Alejandro L\'opez-Cifuentes; Marcos Escudero-Vi\~nolo; Jes\'us; Besc\'os; \'Alvaro Garc\'ia-Mart\'in

arXiv:1909.02410·cs.CV·February 28, 2020

Semantic-Aware Scene Recognition

Alejandro L\'opez-Cifuentes, Marcos Escudero-Vi\~nolo, Jes\'us, Besc\'os, \'Alvaro Garc\'ia-Mart\'in

PDF

1 Repo

TL;DR

This paper introduces a novel multi-modal CNN that integrates image and semantic context information via an attention mechanism to improve scene recognition accuracy and efficiency.

Contribution

It presents an end-to-end multi-modal CNN with an attention module that leverages semantic segmentation for better scene disambiguation, outperforming state-of-the-art methods.

Findings

01

Outperforms existing methods on four datasets

02

Reduces network parameters significantly

03

Enhances scene disambiguation through semantic gating

Abstract

Scene recognition is currently one of the top-challenging research fields in computer vision. This may be due to the ambiguity between classes: images of several scene classes may share similar objects, which causes confusion among them. The problem is aggravated when images of a particular scene class are notably different. Convolutional Neural Networks (CNNs) have significantly boosted performance in scene recognition, albeit it is still far below from other recognition tasks (e.g., object or image recognition). In this paper, we describe a novel approach for scene recognition based on an end-to-end multi-modal CNN that combines image and context information by means of an attention module. Context information, in the shape of semantic segmentation, is used to gate features extracted from the RGB image by leveraging on information encoded in the semantic representation: the set of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vpulab/Semantic-Aware-Scene-Recognition
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.