Curriculum Learning for Dense Retrieval Distillation
Hansi Zeng, Hamed Zamani, Vishwa Vinay

TL;DR
This paper introduces CL-DRD, a curriculum learning framework that gradually increases the difficulty of distillation data to improve dense retrieval models, showing effectiveness on multiple datasets.
Contribution
It proposes a novel curriculum learning approach for distillation in dense retrieval, controlling data difficulty to enhance model training.
Findings
Improves dense retrieval performance on three datasets.
Effectively increases training data difficulty over iterations.
Enhances state-of-the-art models with simple implementation.
Abstract
Recent work has shown that more effective dense retrieval models can be obtained by distilling ranking knowledge from an existing base re-ranking model. In this paper, we propose a generic curriculum learning based optimization framework called CL-DRD that controls the difficulty level of training data produced by the re-ranking (teacher) model. CL-DRD iteratively optimizes the dense retrieval (student) model by increasing the difficulty of the knowledge distillation data made available to it. In more detail, we initially provide the student model coarse-grained preference pairs between documents in the teacher's ranking and progressively move towards finer-grained pairwise document ordering requirements. In our experiments, we apply a simple implementation of the CL-DRD framework to enhance two state-of-the-art dense retrieval models. Experiments on three public passage retrieval…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Topic Modeling
MethodsKnowledge Distillation · Balanced Selection
