An Unsupervised Cross-Modal Hashing Method Robust to Noisy Training Image-Text Correspondences in Remote Sensing
Georgii Mikriukov, Mahdyar Ravanbakhsh, Beg\"um Demir

TL;DR
This paper introduces an unsupervised cross-modal hashing method called CHNR that is designed to be robust against noisy image-text pairs in remote sensing data, improving retrieval accuracy.
Contribution
The paper presents a novel unsupervised hashing approach with a noise detection module and a two-phase training process to handle noisy multi-modal data in remote sensing.
Findings
CHNR outperforms existing methods in noisy settings
Effective noise detection improves retrieval accuracy
Robustness to noisy data is demonstrated through experiments
Abstract
The development of accurate and scalable cross-modal image-text retrieval methods, where queries from one modality (e.g., text) can be matched to archive entries from another (e.g., remote sensing image) has attracted great attention in remote sensing (RS). Most of the existing methods assume that a reliable multi-modal training set with accurately matched text-image pairs is existing. However, this assumption may not always hold since the multi-modal training sets may include noisy pairs (i.e., textual descriptions/captions associated to training images can be noisy), distorting the learning process of the retrieval methods. To address this problem, we propose a novel unsupervised cross-modal hashing method robust to the noisy image-text correspondences (CHNR). CHNR consists of three modules: 1) feature extraction module, which extracts feature representations of image-text pairs; 2)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Image Retrieval and Classification Techniques
