Scene Recognition with Objectness, Attribute and Category Learning
Ji Zhang, Jean-Paul Ainam, Li-hui Zhao, Wenai Song, and Xin Wang

TL;DR
This paper introduces a multi-task learning approach that leverages object attributes and category information to improve scene recognition, reducing annotation effort and enhancing discriminative power.
Contribution
It proposes the MASR network for joint scene and attribute recognition with a partially supervised annotation strategy, reducing labeling costs.
Findings
MASR achieves competitive accuracy on large-scale datasets.
Attribute and scene recognition mutually benefit each other.
The annotation strategy significantly reduces labeling effort.
Abstract
Scene classification has established itself as a challenging research problem. Compared to images of individual objects, scene images could be much more semantically complex and abstract. Their difference mainly lies in the level of granularity of recognition. Yet, image recognition serves as a key pillar for the good performance of scene recognition as the knowledge attained from object images can be used for accurate recognition of scenes. The existing scene recognition methods only take the category label of the scene into consideration. However, we find that the contextual information that contains detailed local descriptions are also beneficial in allowing the scene recognition model to be more discriminative. In this paper, we aim to improve scene recognition using attribute and category label information encoded in objects. Based on the complementarity of attribute and category…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
