# Deep Spherical Quantization for Image Search

**Authors:** Sepehr Eghbali, Ladan Tahvildari

arXiv: 1906.02865 · 2019-06-10

## TL;DR

Deep Spherical Quantization (DSQ) is a novel deep learning method that produces compact, discriminative binary codes for efficient large-scale image retrieval by combining low-dimensional embedding, hypersphere normalization, and multi-codebook quantization.

## Contribution

The paper introduces DSQ, a new supervised quantization technique on a hypersphere, with an extension for sparse codebooks, improving image retrieval performance.

## Key findings

- DSQ outperforms state-of-the-art methods on three image retrieval benchmarks.
- Normalization on the unit hypersphere improves codebook learning.
- Sparse extension enhances efficiency without sacrificing accuracy.

## Abstract

Hashing methods, which encode high-dimensional images with compact discrete codes, have been widely applied to enhance large-scale image retrieval. In this paper, we put forward Deep Spherical Quantization (DSQ), a novel method to make deep convolutional neural networks generate supervised and compact binary codes for efficient image search. Our approach simultaneously learns a mapping that transforms the input images into a low-dimensional discriminative space, and quantizes the transformed data points using multi-codebook quantization. To eliminate the negative effect of norm variance on codebook learning, we force the network to L_2 normalize the extracted features and then quantize the resulting vectors using a new supervised quantization technique specifically designed for points lying on a unit hypersphere. Furthermore, we introduce an easy-to-implement extension of our quantization technique that enforces sparsity on the codebooks. Extensive experiments demonstrate that DSQ and its sparse variant can generate semantically separable compact binary codes outperforming many state-of-the-art image retrieval methods on three benchmarks.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.02865/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1906.02865/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/1906.02865/full.md

---
Source: https://tomesphere.com/paper/1906.02865