Deep Unsupervised Contrastive Hashing for Large-Scale Cross-Modal Text-Image Retrieval in Remote Sensing
Georgii Mikriukov, Mahdyar Ravanbakhsh, Beg\"um Demir

TL;DR
This paper introduces DUCH, a novel deep unsupervised contrastive hashing method for large-scale cross-modal text-image retrieval in remote sensing, addressing label dependency and efficiency issues of existing systems.
Contribution
The paper proposes a new unsupervised deep contrastive hashing framework with a multi-objective loss for efficient cross-modal retrieval in remote sensing.
Findings
Outperforms state-of-the-art unsupervised methods on RS benchmarks
Effective in preserving intra- and inter-modal similarities
Enables fast, memory-efficient retrieval in large-scale RS data
Abstract
Due to the availability of large-scale multi-modal data (e.g., satellite images acquired by different sensors, text sentences, etc) archives, the development of cross-modal retrieval systems that can search and retrieve semantically relevant data across different modalities based on a query in any modality has attracted great attention in RS. In this paper, we focus our attention on cross-modal text-image retrieval, where queries from one modality (e.g., text) can be matched to archive entries from another (e.g., image). Most of the existing cross-modal text-image retrieval systems require a high number of labeled training samples and also do not allow fast and memory-efficient retrieval due to their intrinsic characteristics. These issues limit the applicability of the existing cross-modal retrieval systems for large-scale applications in RS. To address this problem, in this paper we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Image Retrieval and Classification Techniques
