TL;DR
This paper introduces MTFH, a flexible cross-modal hashing framework that encodes heterogeneous data with varying hash lengths, improving semantic relevance and retrieval performance across diverse scenarios.
Contribution
It proposes a novel matrix tri-factorization hashing method that supports unequal hash lengths and unpaired data, enhancing cross-modal retrieval flexibility and effectiveness.
Findings
Outperforms existing methods on benchmark datasets.
Effectively handles unpaired and varying hash length data.
Achieves superior semantic correlation in hash codes.
Abstract
Hashing has recently sparked a great revolution in cross-modal retrieval because of its low storage cost and high query speed. Recent cross-modal hashing methods often learn unified or equal-length hash codes to represent the multi-modal data and make them intuitively comparable. However, such unified or equal-length hash representations could inherently sacrifice their representation scalability because the data from different modalities may not have one-to-one correspondence and could be encoded more efficiently by different hash codes of unequal lengths. To mitigate these problems, this paper exploits a related and relatively unexplored problem: encode the heterogeneous data with varying hash lengths and generalize the cross-modal retrieval in various challenging scenarios. To this end, a generalized and flexible cross-modal hashing framework, termed Matrix Tri-Factorization Hashing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
