Image Reconstruction from Bag-of-Visual-Words
Hiroharu Kato, Tatsuya Harada

TL;DR
This paper presents a novel method for reconstructing original images from Bag-of-Visual-Words, overcoming the lack of spatial information, and demonstrates its effectiveness across various object categories and applications.
Contribution
It introduces an evaluation function and parameter estimation method to enable image reconstruction from BoVW, a task not previously achieved.
Findings
Successfully reconstructed images of 101 object categories
Analyzed object classifiers using reconstructed images
Generated novel images from BoVW features
Abstract
The objective of this work is to reconstruct an original image from Bag-of-Visual-Words (BoVW). Image reconstruction from features can be a means of identifying the characteristics of features. Additionally, it enables us to generate novel images via features. Although BoVW is the de facto standard feature for image recognition and retrieval, successful image reconstruction from BoVW has not been reported yet. What complicates this task is that BoVW lacks the spatial information for including visual words. As described in this paper, to estimate an original arrangement, we propose an evaluation function that incorporates the naturalness of local adjacency and the global position, with a method to obtain related parameters using an external image database. To evaluate the performance of our method, we reconstruct images of objects of 101 kinds. Additionally, we apply our method to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Image Processing Techniques and Applications
