Visual Word Selection without Re-Coding and Re-Pooling

Fatih Cakir; Stan Sclaroff

arXiv:1407.6174·cs.CV·July 24, 2014

Visual Word Selection without Re-Coding and Re-Pooling

Fatih Cakir, Stan Sclaroff

PDF

Open Access

TL;DR

This paper introduces a novel method for selecting a subset of visual words from a large codebook in Bag-of-Words models, eliminating the need for re-coding and re-pooling, thus reducing computational costs in computer vision tasks.

Contribution

It proposes formulations for hard and soft coding schemes that enable efficient subset selection without re-coding or re-pooling, improving computational efficiency in BoW-based applications.

Findings

01

Effective subset selection demonstrated on 15 Scenes benchmark.

02

Reduces computational complexity compared to brute-force methods.

03

Applicable to both hard and soft coding schemes.

Abstract

The Bag-of-Words (BoW) representation is widely used in computer vision. The size of the codebook impacts the time and space complexity of the applications that use BoW. Thus, given a training set for a particular computer vision task, a key problem is pruning a large codebook to select only a subset of visual words. Evaluating possible selections of words to be included in the pruned codebook can be computationally prohibitive; in a brute-force scheme, evaluating each pruned codebook requires re-coding of all features extracted from training images to words in the candidate codebook and then re-pooling the words to obtain a representation of each image, e.g., histogram of visual word frequencies. In this paper, a method is proposed that selects and evaluates a subset of words from an initially large codebook, without the need for re-coding or re-pooling. Formulations are proposed for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications