# Beyond Product Quantization: Deep Progressive Quantization for Image   Retrieval

**Authors:** Lianli Gao, Xiaosu Zhu, Jingkuan Song, Zhou Zhao, Heng Tao Shen

arXiv: 1906.06698 · 2020-12-08

## TL;DR

This paper introduces Deep Progressive Quantization (DPQ), a novel neural network-based approach for image retrieval that learns quantization codes sequentially, enabling efficient training for multiple code lengths and outperforming existing methods.

## Contribution

The paper proposes a deep progressive quantization model that learns quantization codes sequentially, allowing simultaneous training for various code lengths and improving retrieval performance.

## Key findings

- DPQ outperforms state-of-the-art image retrieval methods.
- DPQ requires less computation time due to single training for multiple code lengths.
- Ablation studies confirm the effectiveness of each component.

## Abstract

Product Quantization (PQ) has long been a mainstream for generating an exponentially large codebook at very low memory/time cost. Despite its success, PQ is still tricky for the decomposition of high-dimensional vector space, and the retraining of model is usually unavoidable when the code length changes. In this work, we propose a deep progressive quantization (DPQ) model, as an alternative to PQ, for large scale image retrieval. DPQ learns the quantization codes sequentially and approximates the original feature space progressively. Therefore, we can train the quantization codes with different code lengths simultaneously. Specifically, we first utilize the label information for guiding the learning of visual features, and then apply several quantization blocks to progressively approach the visual features. Each quantization block is designed to be a layer of a convolutional neural network, and the whole framework can be trained in an end-to-end manner. Experimental results on the benchmark datasets show that our model significantly outperforms the state-of-the-art for image retrieval. Our model is trained once for different code lengths and therefore requires less computation time. Additional ablation study demonstrates the effect of each component of our proposed model. Our code is released at https://github.com/cfm-uestc/DPQ.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.06698/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1906.06698/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1906.06698/full.md

---
Source: https://tomesphere.com/paper/1906.06698