Tag-based Semantic Features for Scene Image Classification

Chiranjibi Sitaula; Yong Xiang; Anish Basnet; Sunil Aryal; Xuequan Lu

arXiv:1909.09999·cs.CV·January 23, 2020

Tag-based Semantic Features for Scene Image Classification

Chiranjibi Sitaula, Yong Xiang, Anish Basnet, Sunil Aryal, Xuequan Lu

PDF

TL;DR

This paper introduces a novel semantic feature extraction method for scene image classification that leverages web-based annotations of similar images, resulting in improved accuracy with lower feature dimensions.

Contribution

The paper proposes a new two-step semantic feature extraction approach using web annotations, enhancing classification performance over traditional methods.

Findings

01

Outperforms vision-based and tag-based features in accuracy

02

Achieves comparable results to deep learning features

03

Uses lower-dimensional features for efficient classification

Abstract

The existing image feature extraction methods are primarily based on the content and structure information of images, and rarely consider the contextual semantic information. Regarding some types of images such as scenes and objects, the annotations and descriptions of them available on the web may provide reliable contextual semantic information for feature extraction. In this paper, we introduce novel semantic features of an image based on the annotations and descriptions of its similar images available on the web. Specifically, we propose a new method which consists of two consecutive steps to extract our semantic features. For each image in the training set, we initially search the top $k$ most similar images from the internet and extract their annotations/descriptions (e.g., tags or keywords). The annotation information is employed to design a filter bank for each image category…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.