Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion

Minghan Li; Guodong Zhou

arXiv:2603.13776·cs.IR·March 17, 2026

Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion

Minghan Li, Guodong Zhou

PDF

Open Access

TL;DR

This paper introduces a retrieval-feedback-driven distillation framework that transfers retrieval-friendly query expansion behavior from large teacher models to smaller, more efficient student models, maintaining high retrieval performance while reducing inference costs.

Contribution

The paper presents a novel framework combining retrieval-feedback-driven distillation and preference alignment to create compact models with strong retrieval capabilities, applicable across multilingual settings.

Findings

01

The distilled Qwen3-4B model achieves about 97% of the teacher model's performance on DL19.

02

The approach effectively reduces inference costs while maintaining retrieval effectiveness.

03

Demonstrates strong practicality in both English and Chinese retrieval benchmarks.

Abstract

Large language models have recently enabled a generative paradigm for query expansion, but their high inference cost makes direct deployment difficult in practical retrieval systems. To address this issue, a retrieval-feedback-driven distillation and preference-alignment framework is proposed to transfer retrieval-friendly expansion behavior from a strong teacher model to a compact student model. Rather than relying on few-shot exemplars at inference time, the framework first leverages two complementary types of teacher-generated expansions, produced under zero-shot and few-shot prompting conditions, as supervision signals for distillation and as candidate pools for preference construction. A retrieval-metric-driven strategy is then introduced to automatically form chosen/rejected expansion pairs according to nDCG@10 differences, and Direct Preference Optimization is applied to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Retrieval and Search Behavior · Topic Modeling · Advanced Graph Neural Networks