Are visual dictionaries generalizable?
Otavio A. B. Penatti, Eduardo Valle, Ricardo da S. Torres

TL;DR
This paper investigates whether visual dictionaries for image classification and retrieval can be generalized across datasets, showing that small or different datasets can produce effective representations if they cover diverse low-level features.
Contribution
It demonstrates that visual dictionaries are not dataset-specific and can be generated from small or different datasets without losing effectiveness, simplifying large-scale image processing.
Findings
Dictionaries based on small datasets perform comparably to those from full datasets.
Dictionaries from different datasets still capture low-level feature diversity effectively.
Using diverse subsets reduces the need for extensive data collection.
Abstract
Mid-level features based on visual dictionaries are today a cornerstone of systems for classification and retrieval of images. Those state-of-the-art representations depend crucially on the choice of a codebook (visual dictionary), which is usually derived from the dataset. In general-purpose, dynamic image collections (e.g., the Web), one cannot have the entire collection in order to extract a representative dictionary. However, based on the hypothesis that the dictionary reflects only the diversity of low-level appearances and does not capture semantics, we argue that a dictionary based on a small subset of the data, or even on an entirely different dataset, is able to produce a good representation, provided that the chosen images span a diverse enough portion of the low-level feature space. Our experiments confirm that hypothesis, opening the opportunity to greatly alleviate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Video Analysis and Summarization
