Asymmetric Scalable Cross-modal Hashing
Wenyun Li, Chi-Man Pun

TL;DR
This paper introduces ASCMH, a novel scalable cross-modal hashing method that efficiently learns binary codes using asymmetric optimization and collective matrix factorization, improving large-scale multimedia retrieval accuracy and efficiency.
Contribution
It proposes a new asymmetric scalable hashing framework with collective matrix factorization and orthogonal label constraints, reducing computational complexity and enhancing retrieval performance.
Findings
Outperforms state-of-the-art methods on benchmark datasets
Reduces computational complexity for large-scale data
Achieves higher retrieval accuracy and efficiency
Abstract
Cross-modal hashing is a successful method to solve large-scale multimedia retrieval issue. A lot of matrix factorization-based hashing methods are proposed. However, the existing methods still struggle with a few problems, such as how to generate the binary codes efficiently rather than directly relax them to continuity. In addition, most of the existing methods choose to use an similarity matrix for optimization, which makes the memory and computation unaffordable. In this paper we propose a novel Asymmetric Scalable Cross-Modal Hashing (ASCMH) to address these issues. It firstly introduces a collective matrix factorization to learn a common latent space from the kernelized features of different modalities, and then transforms the similarity matrix optimization to a distance-distance difference problem minimization with the help of semantic labels and common latent space.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Video Analysis and Summarization
