Dimension Reduction for Efficient Dense Retrieval via Conditional   Autoencoder

Zhenghao Liu; Han Zhang; Chenyan Xiong; Zhiyuan Liu; Yu Gu; Xiaohua Li

arXiv:2205.03284·cs.IR·October 25, 2022·1 cites

Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder

Zhenghao Liu, Han Zhang, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Xiaohua Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Conditional Autoencoder (ConAE) to compress high-dimensional dense retrieval embeddings, reducing storage and latency while maintaining ranking effectiveness, thus improving retrieval efficiency.

Contribution

The novel ConAE method effectively compresses dense retrieval embeddings with minimal performance loss, simplifying the model with only one linear layer.

Findings

01

ConAE achieves comparable ranking performance to the original model.

02

ConAE reduces embedding size, improving retrieval efficiency.

03

ConAE alleviates embedding redundancy with a simple linear layer.

Abstract

Dense retrievers encode queries and documents and map them in an embedding space using pre-trained language models. These embeddings need to be high-dimensional to fit training signals and guarantee the retrieval effectiveness of dense retrievers. However, these high-dimensional embeddings lead to larger index storage and higher retrieval latency. To reduce the embedding dimensions of dense retrieval, this paper proposes a Conditional Autoencoder (ConAE) to compress the high-dimensional embeddings to maintain the same embedding distribution and better recover the ranking features. Our experiments show that ConAE is effective in compressing embeddings by achieving comparable ranking performance with its teacher model and making the retrieval system more efficient. Our further analyses show that ConAE can alleviate the redundancy of the embeddings of dense retrieval with only one linear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neuir/conae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Speech Recognition and Synthesis