SRRM: Semantic Region Relation Model for Indoor Scene Recognition
Chuanxin Song, Xin Ma

TL;DR
The paper introduces SRRM, a novel model that leverages semantic region relationships for improved indoor scene recognition, combining it with PlacesCNN to outperform state-of-the-art methods on multiple datasets.
Contribution
It proposes the SRRM model that directly models semantic region relationships and combines it with PlacesCNN for enhanced indoor scene recognition.
Findings
CSRRM outperforms SOTA methods on MIT Indoor 67
Significant improvement on reduced Places365 dataset
Effective exploitation of semantic relationships without retraining
Abstract
Despite the remarkable success of convolutional neural networks in various computer vision tasks, recognizing indoor scenes still presents a significant challenge due to their complex composition. Consequently, effectively leveraging semantic information in the scene has been a key issue in advancing indoor scene recognition. Unfortunately, the accuracy of semantic segmentation has limited the effectiveness of existing approaches for leveraging semantic information. As a result, many of these approaches remain at the stage of auxiliary labeling or co-occurrence statistics, with few exploring the contextual relationships between the semantic elements directly within the scene. In this paper, we propose the Semantic Region Relationship Model (SRRM), which starts directly from the semantic information inside the scene. Specifically, SRRM adopts an adaptive and efficient approach to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
