Semantic-embedded Similarity Prototype for Scene Recognition
Chuanxin Song, Hanbo Wu, Xin Ma, Yibin Li

TL;DR
This paper introduces a semantic knowledge-based similarity prototype for scene recognition that improves accuracy without increasing computational costs, making it suitable for edge devices.
Contribution
It proposes a simple, plug-and-play semantic similarity prototype using class-level representations to enhance scene recognition performance efficiently.
Findings
Improves scene recognition accuracy across multiple benchmarks.
Does not add computational overhead during deployment.
Supports training with Gradient Label Softening and Contrastive Loss.
Abstract
Due to the high inter-class similarity caused by the complex composition and the co-existing objects across scenes, numerous studies have explored object semantic knowledge within scenes to improve scene recognition. However, a resulting challenge emerges as object information extraction techniques require heavy computational costs, thereby burdening the network considerably. This limitation often renders object-assisted approaches incompatible with edge devices in practical deployment. In contrast, this paper proposes a semantic knowledge-based similarity prototype, which can help the scene recognition network achieve superior accuracy without increasing the computational cost in practice. It is simple and can be plug-and-played into existing pipelines. More specifically, a statistical strategy is introduced to depict semantic knowledge in scenes as class-level semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Image Retrieval and Classification Techniques
