Learning Decorrelated Hashing Codes for Multimodal Retrieval
Dayong Tian

TL;DR
This paper introduces a minimum correlation regularization technique for multimodal hashing that decorrelates binary codes, improving retrieval performance especially with longer hash codes in large-scale social network data.
Contribution
It proposes a novel decorrelation regularization method for multimodal hashing that enhances retrieval accuracy by reducing code correlation, particularly effective with longer hash codes.
Findings
Performance improves with longer hash codes.
The method outperforms existing approaches in experiments.
Decorrelation leads to more efficient retrieval.
Abstract
In social networks, heterogeneous multimedia data correlate to each other, such as videos and their corresponding tags in YouTube and image-text pairs in Facebook. Nearest neighbor retrieval across multiple modalities on large data sets becomes a hot yet challenging problem. Hashing is expected to be an efficient solution, since it represents data as binary codes. As the bit-wise XOR operations can be fast handled, the retrieval time is greatly reduced. Few existing multimodal hashing methods consider the correlation among hashing bits. The correlation has negative impact on hashing codes. When the hashing code length becomes longer, the retrieval performance improvement becomes slower. In this paper, we propose a minimum correlation regularization (MCR) for multimodal hashing. First, the sigmoid function is used to embed the data matrices. Then, the MCR is applied on the output of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Image Retrieval and Classification Techniques
