SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval
Yang Shen, Xuhao Sun, Xiu-Shen Wei, Qing-Yuan Jiang, Jian Yang

TL;DR
SEMICON introduces a novel attention and channel transformation approach to learn binary hash codes, significantly improving large-scale fine-grained image retrieval accuracy by localizing discriminative regions and exploiting part correlations.
Contribution
The paper proposes SEMICON, a new method combining suppression-enhancing mask attention and interactive channel transformation for effective fine-grained image hashing.
Findings
Outperforms existing methods on five benchmark datasets.
Effectively localizes discriminative regions for fine-grained features.
Improves retrieval accuracy through part correlation modeling.
Abstract
In this paper, we propose Suppression-Enhancing Mask based attention and Interactive Channel transformatiON (SEMICON) to learn binary hash codes for dealing with large-scale fine-grained image retrieval tasks. In SEMICON, we first develop a suppression-enhancing mask (SEM) based attention to dynamically localize discriminative image regions. More importantly, different from existing attention mechanism simply erasing previous discriminative regions, our SEM is developed to restrain such regions and then discover other complementary regions by considering the relation between activated regions in a stage-by-stage fashion. In each stage, the interactive channel transformation (ICON) module is afterwards designed to exploit correlations across channels of attended activation tensors. Since channels could generally correspond to the parts of fine-grained objects, the part correlation can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
