Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors
Lingqiao Liu, Chunhua Shen, Lei Wang, Anton van den Hengel, Chao Wang

TL;DR
This paper introduces a novel Fisher vector encoding method based on sparse coding for high dimensional local features, significantly improving image classification performance over traditional GMM-based methods.
Contribution
It proposes a new model that replaces GMM with a subspace-based Gaussian, enabling sparse coding for high dimensional features, and demonstrates its effectiveness with CNN descriptors.
Findings
Outperforms traditional GMM Fisher vectors in accuracy
Achieves state-of-the-art results in various image classification tasks
Efficiently models high dimensional features with sparse coding
Abstract
Deriving from the gradient vector of a generative model of local features, Fisher vector coding (FVC) has been identified as an effective coding method for image classification. Most, if not all, % FVC implementations employ the Gaussian mixture model (GMM) to characterize the generation process of local features. This choice has shown to be sufficient for traditional low dimensional local features, e.g., SIFT; and typically, good performance can be achieved with only a few hundred Gaussian distributions. However, the same number of Gaussians is insufficient to model the feature space spanned by higher dimensional local features, which have become popular recently. In order to improve the modeling capacity for high dimensional features, it turns out to be inefficient and computationally impractical to simply increase the number of Gaussians. In this paper, we propose a model in which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Video Surveillance and Tracking Methods
