Central Similarity Multi-View Hashing for Multimedia Retrieval
Jian Zhu, Wen Cheng, Yu Cui, Chang Tang, Yuyang Dai, Yong Li, Lingfang, Zeng

TL;DR
This paper introduces CSMVH, a novel multi-view hashing method that leverages central similarity learning and gate-based fusion to significantly improve multimedia retrieval accuracy over existing approaches.
Contribution
The paper proposes a new multi-view hashing framework using central similarity and gate-based fusion, addressing local and global similarity issues in multimedia retrieval.
Findings
CSMVH outperforms state-of-the-art methods by up to 11.41% mAP.
Gate-based fusion is more effective than traditional weighted sum or concatenation.
Empirical results on MS COCO and NUS-WIDE datasets demonstrate significant accuracy improvements.
Abstract
Hash representation learning of multi-view heterogeneous data is the key to improving the accuracy of multimedia retrieval. However, existing methods utilize local similarity and fall short of deeply fusing the multi-view features, resulting in poor retrieval accuracy. Current methods only use local similarity to train their model. These methods ignore global similarity. Furthermore, most recent works fuse the multi-view features via a weighted sum or concatenation. We contend that these fusion methods are insufficient for capturing the interaction between various views. We present a novel Central Similarity Multi-View Hashing (CSMVH) method to address the mentioned problems. Central similarity learning is used for solving the local similarity problem, which can utilize the global similarity between the hash center and samples. We present copious empirical data demonstrating the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Multimodal Machine Learning Applications
