A Gradient Accumulation Method for Dense Retriever under Memory Constraint
Jaehee Kim, Yukyung Lee, Pilsung Kang

TL;DR
This paper introduces ContAccum, a memory-efficient and stable training method for dense retrievers that leverages a dual memory bank, enabling effective training with smaller batch sizes and outperforming existing methods.
Contribution
The paper proposes ContAccum, a novel dual memory bank approach that improves stability and efficiency in training dense retrievers under memory constraints.
Findings
ContAccum surpasses existing memory reduction methods in performance.
It enables stable training with smaller batch sizes.
Experimental results confirm improved training stability.
Abstract
InfoNCE loss is commonly used to train dense retriever in information retrieval tasks. It is well known that a large batch is essential to stable and effective training with InfoNCE loss, which requires significant hardware resources. Due to the dependency of large batch, dense retriever has bottleneck of application and research. Recently, memory reduction methods have been broadly adopted to resolve the hardware bottleneck by decomposing forward and backward or using a memory bank. However, current methods still suffer from slow and unstable training. To address these issues, we propose Contrastive Accumulation (ContAccum), a stable and efficient memory reduction method for dense retriever trains that uses a dual memory bank structure to leverage previously generated query and passage representations. Experiments on widely used five information retrieval datasets indicate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
MethodsInfoNCE
