TL;DR
This paper introduces a novel triplet sampling method for deep metric learning in multi-label remote sensing image retrieval, improving training efficiency and convergence speed by selecting the most representative triplets.
Contribution
The proposed method effectively selects diverse, relevant, and informative triplets for multi-label RS CBIR, reducing computational complexity and enhancing learning speed.
Findings
Reduces training computational complexity without performance loss.
Speeds up convergence of deep metric learning models.
Improves retrieval accuracy on benchmark datasets.
Abstract
Learning the similarity between remote sensing (RS) images forms the foundation for content-based RS image retrieval (CBIR). Recently, deep metric learning approaches that map the semantic similarity of images into an embedding (metric) space have been found very popular in RS. A common approach for learning the metric space relies on the selection of triplets of similar (positive) and dissimilar (negative) images to a reference image called as an anchor. Choosing triplets is a difficult task particularly for multi-label RS CBIR, where each training image is annotated by multiple class labels. To address this problem, in this paper we propose a novel triplet sampling method in the framework of deep neural networks (DNNs) defined for multi-label RS CBIR problems. The proposed method selects a small set of the most representative and informative triplets based on two main steps. In the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
