Distribution-Consistency-Guided Multi-modal Hashing

Jin-Yu Liu; Xian-Ling Mao; Tian-Yi Che; Rong-Cheng Tu

arXiv:2412.11216·cs.CV·December 20, 2024

Distribution-Consistency-Guided Multi-modal Hashing

Jin-Yu Liu, Xian-Ling Mao, Tian-Yi Che, Rong-Cheng Tu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel multi-modal hashing method that leverages distribution consistency patterns to identify and correct noisy labels, significantly improving retrieval performance in noisy label scenarios.

Contribution

The paper proposes DCGMH, a distribution-consistency-guided approach that filters and reconstructs noisy labels in multi-modal hashing, which is a novel strategy for handling label noise.

Findings

01

Outperforms state-of-the-art methods on three datasets

02

Effectively filters and corrects noisy labels using distribution consistency

03

Enhances multi-modal retrieval accuracy with noisy labels

Abstract

Multi-modal hashing methods have gained popularity due to their fast speed and low storage requirements. Among them, the supervised methods demonstrate better performance by utilizing labels as supervisory signals compared with unsupervised methods. Currently, for almost all supervised multi-modal hashing methods, there is a hidden assumption that training sets have no noisy labels. However, labels are often annotated incorrectly due to manual labeling in real-world scenarios, which will greatly harm the retrieval performance. To address this issue, we first discover a significant distribution consistency pattern through experiments, i.e., the 1-0 distribution of the presence or absence of each category in the label is consistent with the high-low distribution of similarity scores of the hash codes relative to category centers. Then, inspired by this pattern, we propose a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liujinyu1229/dcgmh
pytorchOfficial

Videos

Distribution-Consistency-Guided Multi-modal Hashing· underline

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Video Analysis and Summarization · Algorithms and Data Compression

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings