Givens Coordinate Descent Methods for Rotation Matrix Learning in Trainable Embedding Indexes
Yunjiang Jiang, Han Zhang, Yiming Qiu, Yun Xiao, Bo Long, Wen-Yun Yang

TL;DR
This paper introduces Givens coordinate descent algorithms for learning rotation matrices in embedding indexes, enabling efficient, stable, and parallelizable end-to-end training for improved product quantization in ANN search.
Contribution
It proposes a novel family of Givens coordinate descent algorithms based on Lie group theory for rotation matrix learning, outperforming SVD in speed and stability.
Findings
Algorithms are highly parallelizable and faster on GPUs.
Givens methods converge more stably than SVD.
Significant improvements in end-to-end product quantization performance.
Abstract
Product quantization (PQ) coupled with a space rotation, is widely used in modern approximate nearest neighbor (ANN) search systems to significantly compress the disk storage for embeddings and speed up the inner product computation. Existing rotation learning methods, however, minimize quantization distortion for fixed embeddings, which are not applicable to an end-to-end training scenario where embeddings are updated constantly. In this paper, based on geometric intuitions from Lie group theory, in particular the special orthogonal group , we propose a family of block Givens coordinate descent algorithms to learn rotation matrix that are provably convergent on any convex objectives. Compared to the state-of-the-art SVD method, the Givens algorithms are much more parallelizable, reducing runtime by orders of magnitude on modern GPUs, and converge more stably according to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Face and Expression Recognition · Domain Adaptation and Few-Shot Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
