Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification
Ricardo Pereira, Lu\'is Garrote, Tiago Barros, Ana Lopes, and Urbano, J. Nunes

TL;DR
This paper introduces a novel deep learning approach that combines object detection, semantic segmentation, and shape features to improve indoor scene classification, achieving state-of-the-art results on benchmark datasets.
Contribution
It proposes a new multi-branch network integrating semantic and object-based features with shape characterization for enhanced indoor scene classification.
Findings
Achieved state-of-the-art accuracy on SUN RGB-D dataset.
Demonstrated effectiveness of combining semantic segmentation with object detection.
Improved discrimination between similar indoor scene categories.
Abstract
Indoor scenes are usually characterized by scattered objects and their relationships, which turns the indoor scene classification task into a challenging computer vision task. Despite the significant performance boost in classification tasks achieved in recent years, provided by the use of deep-learning-based methods, limitations such as inter-category ambiguity and intra-category variation have been holding back their performance. To overcome such issues, gathering semantic information has been shown to be a promising source of information towards a more complete and discriminative feature representation of indoor scenes. Therefore, the work described in this paper uses both semantic information, obtained from object detection, and semantic segmentation techniques. While object detection techniques provide the 2D location of objects allowing to obtain spatial distributions between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Remote Sensing and LiDAR Applications
