TL;DR
This paper introduces a lightweight contrastive distilled hashing method for online cross-modal retrieval, effectively bridging offline and online hashing through similarity matrix approximation and knowledge distillation, achieving superior performance.
Contribution
The proposed LCDH method innovatively combines contrastive learning, attention-enhanced feature fusion, and lightweight models to improve online cross-modal hashing performance.
Findings
Outperforms state-of-the-art methods on three datasets
Enhances online hashing accuracy via similarity matrix approximation
Efficiently transfers knowledge from offline to online models
Abstract
Deep online cross-modal hashing has gained much attention from researchers recently, as its promising applications with low storage requirement, fast retrieval efficiency and cross modality adaptive, etc. However, there still exists some technical hurdles that hinder its applications, e.g., 1) how to extract the coexistent semantic relevance of cross-modal data, 2) how to achieve competitive performance when handling the real time data streams, 3) how to transfer the knowledge learned from offline to online training in a lightweight manner. To address these problems, this paper proposes a lightweight contrastive distilled hashing (LCDH) for cross-modal retrieval, by innovatively bridging the offline and online cross-modal hashing by similarity matrix approximation in a knowledge distillation framework. Specifically, in the teacher network, LCDH first extracts the cross-modal features by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
MethodsSoftmax · Attention Is All You Need · Knowledge Distillation
