TL;DR
This paper introduces the Attention-based Triplet Hashing (ATH) network, an end-to-end deep learning framework that improves medical image retrieval accuracy by preserving classification, ROI, and small-sample information in hash codes.
Contribution
The paper proposes a novel ATH network with spatial-attention and triplet cross-entropy loss, enhancing small-sample ranking performance in medical image retrieval.
Findings
ATH outperforms existing deep hashing methods in medical datasets.
The spatial-attention module effectively focuses on ROI information.
Triplet cross-entropy loss improves classification and hash code discriminability.
Abstract
Deep hashing methods have been shown to be the most efficient approximate nearest neighbor search techniques for large-scale image retrieval. However, existing deep hashing methods have a poor small-sample ranking performance for case-based medical image retrieval. The top-ranked images in the returned query results may be as a different class than the query image. This ranking problem is caused by classification, regions of interest (ROI), and small-sample information loss in the hashing space. To address the ranking problem, we propose an end-to-end framework, called Attention-based Triplet Hashing (ATH) network, to learn low-dimensional hash codes that preserve the classification, ROI, and small-sample information. We embed a spatial-attention module into the network structure of our ATH to focus on ROI information. The spatial-attention module aggregates the spatial information of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
