Unsupervised Hashing with Semantic Concept Mining
Rong-Cheng Tu, Xian-Ling Mao, Kevin Qinghong Lin, Chengfei, Cai, Weize Qin, Hongfa Wang, Wei Wei, Heyan Huang

TL;DR
This paper introduces UHSCM, an unsupervised hashing method that leverages a vision-language model to mine semantic concepts, improving image retrieval performance by constructing a high-quality semantic similarity matrix.
Contribution
The paper proposes a novel unsupervised hashing approach that uses a VLP model for semantic concept mining to enhance similarity measurement.
Findings
Outperforms state-of-the-art methods on three benchmark datasets.
Effectively mines semantic concepts using a vision-language pretraining model.
Improves image retrieval accuracy with semantic-aware hashing.
Abstract
Recently, to improve the unsupervised image retrieval performance, plenty of unsupervised hashing methods have been proposed by designing a semantic similarity matrix, which is based on the similarities between image features extracted by a pre-trained CNN model. However, most of these methods tend to ignore high-level abstract semantic concepts contained in images. Intuitively, concepts play an important role in calculating the similarity among images. In real-world scenarios, each image is associated with some concepts, and the similarity between two images will be larger if they share more identical concepts. Inspired by the above intuition, in this work, we propose a novel Unsupervised Hashing with Semantic Concept Mining, called UHSCM, which leverages a VLP model to construct a high-quality similarity matrix. Specifically, a set of randomly chosen concepts is first collected. Then,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications
