# Unsupervised Rank-Preserving Hashing for Large-Scale Image Retrieval

**Authors:** Svebor Karaman, Xudong Lin, Xuefeng Hu, Shih-Fu Chang

arXiv: 1903.01545 · 2019-03-06

## TL;DR

This paper introduces an unsupervised hashing technique that preserves ranking for large-scale image retrieval, enabling efficient storage and computation while maintaining high retrieval accuracy through a neural network model and optional feature reconstruction.

## Contribution

The paper presents a novel neural network-based unsupervised hashing method that directly optimizes ranking preservation and incorporates a feature decoder for improved retrieval performance.

## Key findings

- Outperforms state-of-the-art unsupervised hashing methods on large-scale datasets.
- Reconstruction boosts search accuracy with minimal additional cost.
- Efficient graph-based search with re-ranking enhances retrieval results.

## Abstract

We propose an unsupervised hashing method which aims to produce binary codes that preserve the ranking induced by a real-valued representation. Such compact hash codes enable the complete elimination of real-valued feature storage and allow for significant reduction of the computation complexity and storage cost of large-scale image retrieval applications. Specifically, we learn a neural network-based model, which transforms the input representation into a binary representation. We formalize the training objective of the network in an intuitive and effective way, considering each training sample as a query and aiming to obtain the same retrieval results using the produced hash codes as those obtained with the original features. This training formulation directly optimizes the hashing model for the target usage of the hash codes it produces. We further explore the addition of a decoder trained to obtain an approximated reconstruction of the original features. At test time, we retrieved the most promising database samples with an efficient graph-based search procedure using only our hash codes and perform re-ranking using the reconstructed features, thus without needing to access the original features at all. Experiments conducted on multiple publicly available large-scale datasets show that our method consistently outperforms all compared state-of-the-art unsupervised hashing methods and that the reconstruction procedure can effectively boost the search accuracy with a minimal constant additional cost.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.01545/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1903.01545/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/1903.01545/full.md

---
Source: https://tomesphere.com/paper/1903.01545