Cross-modal Zero-shot Hashing by Label Attributes Embedding
Runmin Wang, Guoxian Yu, Lei Liu, Lizhen Cui, Carlotta Domeniconi,, Xiangliang Zhang

TL;DR
This paper introduces LAEH, a novel zero-shot cross-modal hashing method that embeds label attributes into a shared space to improve retrieval performance across unseen classes.
Contribution
LAEH effectively integrates label attribute embeddings with feature learning, bridging the information gap in zero-shot cross-modal hashing.
Findings
LAEH outperforms existing zero-shot and cross-modal hashing methods.
The approach reduces the semantic gap between modalities.
Experimental results demonstrate improved retrieval accuracy.
Abstract
Cross-modal hashing (CMH) is one of the most promising methods in cross-modal approximate nearest neighbor search. Most CMH solutions ideally assume the labels of training and testing set are identical. However, the assumption is often violated, causing a zero-shot CMH problem. Recent efforts to address this issue focus on transferring knowledge from the seen classes to the unseen ones using label attributes. However, the attributes are isolated from the features of multi-modal data. To reduce the information gap, we introduce an approach called LAEH (Label Attributes Embedding for zero-shot cross-modal Hashing). LAEH first gets the initial semantic attribute vectors of labels by word2vec model and then uses a transformation network to transform them into a common subspace. Next, it leverages the hash vectors and the feature similarity matrix to guide the feature extraction network of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
