Asymmetric Scalable Cross-modal Hashing

Wenyun Li; Chi-Man Pun

arXiv:2207.12650·cs.CV·July 27, 2022

Asymmetric Scalable Cross-modal Hashing

Wenyun Li, Chi-Man Pun

PDF

Open Access

TL;DR

This paper introduces ASCMH, a novel scalable cross-modal hashing method that efficiently learns binary codes using asymmetric optimization and collective matrix factorization, improving large-scale multimedia retrieval accuracy and efficiency.

Contribution

It proposes a new asymmetric scalable hashing framework with collective matrix factorization and orthogonal label constraints, reducing computational complexity and enhancing retrieval performance.

Findings

01

Outperforms state-of-the-art methods on benchmark datasets

02

Reduces computational complexity for large-scale data

03

Achieves higher retrieval accuracy and efficiency

Abstract

Cross-modal hashing is a successful method to solve large-scale multimedia retrieval issue. A lot of matrix factorization-based hashing methods are proposed. However, the existing methods still struggle with a few problems, such as how to generate the binary codes efficiently rather than directly relax them to continuity. In addition, most of the existing methods choose to use an $n \times n$ similarity matrix for optimization, which makes the memory and computation unaffordable. In this paper we propose a novel Asymmetric Scalable Cross-Modal Hashing (ASCMH) to address these issues. It firstly introduces a collective matrix factorization to learn a common latent space from the kernelized features of different modalities, and then transforms the similarity matrix optimization to a distance-distance difference problem minimization with the help of semantic labels and common latent space.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Video Analysis and Summarization