Curriculum Learning for Dense Retrieval Distillation

Hansi Zeng; Hamed Zamani; Vishwa Vinay

arXiv:2204.13679 · cs.IR · April 29, 2022

TL;DR

This paper introduces CL-DRD, a curriculum learning framework that gradually increases the difficulty of the knowledge distillation data given to a dense retrieval model, and reports improvements on three datasets.

Contribution

The paper proposes a curriculum learning approach to knowledge distillation for dense retrieval that controls the difficulty of the training data produced by the re-ranking (teacher) model.

Findings

01 Improves dense retrieval performance on three datasets.

02 Effectively increases training data difficulty over iterations.

03 Enhances state-of-the-art models with a simple implementation.

Abstract

Recent work has shown that more effective dense retrieval models can be obtained by distilling ranking knowledge from an existing base re-ranking model. In this paper, we propose a generic curriculum learning based optimization framework called CL-DRD that controls the difficulty level of training data produced by the re-ranking (teacher) model. CL-DRD iteratively optimizes the dense retrieval (student) model by increasing the difficulty of the knowledge distillation data made available to it. In more detail, we initially provide the student model coarse-grained preference pairs between documents in the teacher's ranking and progressively move towards finer-grained pairwise document ordering requirements. In our experiments, we apply a simple implementation of the CL-DRD framework to enhance two state-of-the-art dense retrieval models. Experiments on three public passage retrieval…
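
The coarse-to-fine progression described in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that difficulty is controlled by splitting the teacher's ranked list into contiguous groups and asking the student to order documents only across groups; increasing the number of groups then yields finer-grained pairwise requirements. The function names (`split_into_groups`, `preference_pairs`), the near-equal group sizes, and the toy document IDs are hypothetical and are not taken from the paper or its repository.

```python
from itertools import combinations
from typing import List, Tuple


def split_into_groups(ranking: List[str], num_groups: int) -> List[List[str]]:
    """Split a teacher ranking (best first) into contiguous groups of near-equal size.

    Assumption for illustration: the grouping scheme is not taken from the paper.
    """
    num_groups = max(1, min(num_groups, len(ranking)))
    base, extra = divmod(len(ranking), num_groups)
    groups, start = [], 0
    for g in range(num_groups):
        size = base + (1 if g < extra else 0)  # spread the remainder over the first groups
        groups.append(ranking[start:start + size])
        start += size
    return groups


def preference_pairs(ranking: List[str], num_groups: int) -> List[Tuple[str, str]]:
    """Return (preferred, less_preferred) pairs across groups only.

    Documents inside the same group are left unordered, so fewer groups
    means coarser (easier) supervision for the student.
    """
    groups = split_into_groups(ranking, num_groups)
    pairs = []
    for hi, lo in combinations(range(len(groups)), 2):  # hi is always the better-ranked group
        for preferred in groups[hi]:
            for other in groups[lo]:
                pairs.append((preferred, other))
    return pairs


# Toy curriculum: the same teacher ranking, seen at two difficulty levels.
teacher_ranking = ["d1", "d2", "d3", "d4"]
for num_groups in (2, 4):
    print(num_groups, preference_pairs(teacher_ranking, num_groups))
```

With two groups the student is not asked to order `d1` against `d2`; with four groups every pair in the teacher's ranking becomes a training constraint.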

Code & Models

Repositories

hansizeng/cl-drd
PyTorch · Official

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Topic Modeling

Methods: Knowledge Distillation · Balanced Selection