Tag-based Semantic Features for Scene Image Classification
Chiranjibi Sitaula, Yong Xiang, Anish Basnet, Sunil Aryal, Xuequan Lu

TL;DR
This paper introduces a novel semantic feature extraction method for scene image classification that leverages web-based annotations of similar images, resulting in improved accuracy with lower feature dimensions.
Contribution
The paper proposes a new two-step semantic feature extraction approach using web annotations, enhancing classification performance over traditional methods.
Findings
Outperforms vision-based and tag-based features in accuracy
Achieves comparable results to deep learning features
Uses lower-dimensional features for efficient classification
Abstract
The existing image feature extraction methods are primarily based on the content and structure information of images, and rarely consider the contextual semantic information. Regarding some types of images such as scenes and objects, the annotations and descriptions of them available on the web may provide reliable contextual semantic information for feature extraction. In this paper, we introduce novel semantic features of an image based on the annotations and descriptions of its similar images available on the web. Specifically, we propose a new method which consists of two consecutive steps to extract our semantic features. For each image in the training set, we initially search the top most similar images from the internet and extract their annotations/descriptions (e.g., tags or keywords). The annotation information is employed to design a filter bank for each image category…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
