Neighbor-aware Instance Refining with Noisy Labels for Cross-Modal Retrieval
Yizhi Liu, Ruitao Pu, Shilin Xu, Yingke Chen, Quan-Hui Liu, Yuan Sun

TL;DR
This paper introduces NIRNL, a novel framework for cross-modal retrieval that effectively handles noisy labels by neighbor-aware instance refining, improving robustness and performance on benchmark datasets.
Contribution
The paper proposes a new robust learning framework, NIRNL, combining CMP and neighbor-aware instance refining to better utilize data and mitigate noise in cross-modal retrieval.
Findings
Achieves state-of-the-art results on benchmark datasets.
Demonstrates robustness under high noise rates.
Effectively partitions data into pure, hard, and noisy subsets.
Abstract
In recent years, Cross-Modal Retrieval (CMR) has made significant progress in the field of multi-modal analysis. However, since it is time-consuming and labor-intensive to collect large-scale and well-annotated data, the annotation of multi-modal data inevitably contains some noise. This will degrade the retrieval performance of the model. To tackle the problem, numerous robust CMR methods have been developed, including robust learning paradigms, label calibration strategies, and instance selection mechanisms. Unfortunately, they often fail to simultaneously satisfy model performance ceilings, calibration reliability, and data utilization rate. To overcome the limitations, we propose a novel robust cross-modal learning framework, namely Neighbor-aware Instance Refining with Noisy Labels (NIRNL). Specifically, we first propose Cross-modal Margin Preserving (CMP) to adjust the relative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Image Retrieval and Classification Techniques
