Distribution-Consistency-Guided Multi-modal Hashing
Jin-Yu Liu, Xian-Ling Mao, Tian-Yi Che, Rong-Cheng Tu

TL;DR
This paper introduces a novel multi-modal hashing method that leverages distribution consistency patterns to identify and correct noisy labels, significantly improving retrieval performance in noisy label scenarios.
Contribution
The paper proposes DCGMH, a distribution-consistency-guided approach that filters and reconstructs noisy labels in multi-modal hashing, which is a novel strategy for handling label noise.
Findings
Outperforms state-of-the-art methods on three datasets
Effectively filters and corrects noisy labels using distribution consistency
Enhances multi-modal retrieval accuracy with noisy labels
Abstract
Multi-modal hashing methods have gained popularity due to their fast speed and low storage requirements. Among them, the supervised methods demonstrate better performance by utilizing labels as supervisory signals compared with unsupervised methods. Currently, for almost all supervised multi-modal hashing methods, there is a hidden assumption that training sets have no noisy labels. However, labels are often annotated incorrectly due to manual labeling in real-world scenarios, which will greatly harm the retrieval performance. To address this issue, we first discover a significant distribution consistency pattern through experiments, i.e., the 1-0 distribution of the presence or absence of each category in the label is consistent with the high-low distribution of similarity scores of the hash codes relative to category centers. Then, inspired by this pattern, we propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Analysis and Summarization · Algorithms and Data Compression
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
