Supervised Hashing based on Energy Minimization
Zihao Hu, Xiyi Luo, Hongtao Lu, and Yong Yu

TL;DR
This paper introduces a novel energy minimization approach for supervised hashing, transforming complex formulations into linear systems with closed-form solutions, leading to improved retrieval performance.
Contribution
It proposes a new energy-based framework for supervised hashing, using linear approximations to solve consistency equations efficiently, enhancing existing methods like KSH and SPLH.
Findings
EM-KSH outperforms traditional KSH on multiple datasets.
EM-SPLH demonstrates superior accuracy compared to SPLH.
The proposed methods achieve better retrieval speed and storage efficiency.
Abstract
Recently, supervised hashing methods have attracted much attention since they can optimize retrieval speed and storage cost while preserving semantic information. Because hashing codes learning is NP-hard, many methods resort to some form of relaxation technique. But the performance of these methods can easily deteriorate due to the relaxation. Luckily, many supervised hashing formulations can be viewed as energy functions, hence solving hashing codes is equivalent to learning marginals in the corresponding conditional random field (CRF). By minimizing the KL divergence between a fully factorized distribution and the Gibbs distribution of this CRF, a set of consistency equations can be obtained, but updating them in parallel may not yield a local optimum since the variational lower bound is not guaranteed to increase. In this paper, we use a linear approximation of the sigmoid function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Advanced Neural Network Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Conditional Random Field
