Disentangling Questions from Query Generation for Task-Adaptive Retrieval
Yoonsang Lee, Minsoo Kim, Seung-won Hwang

TL;DR
This paper introduces EGG, a query generator that captures high-level search intents for task-adaptive retrieval, outperforming larger models by explicitly modeling search intent, especially on underexplored tasks.
Contribution
Proposes EGG, a novel query generator that models search intent as a compilation, improving task adaptation with a significantly smaller model.
Findings
EGG outperforms baselines on four tasks with underexplored intents.
Explicit instruction of search intent improves query generation.
EGG uses 47 times fewer parameters than previous state-of-the-art models.
Abstract
This paper studies the problem of information retrieval, to adapt to unseen tasks. Existing work generates synthetic queries from domain-specific documents to jointly train the retriever. However, the conventional query generator assumes the query as a question, thus failing to accommodate general search intents. A more lenient approach incorporates task-adaptive elements, such as few-shot learning with an 137B LLM. In this paper, we challenge a trend equating query and question, and instead conceptualize query generation task as a "compilation" of high-level intent into task-adaptive query. Specifically, we propose EGG, a query generator that better adapts to wide search intents expressed in the BeIR benchmark. Our method outperforms baselines and existing models on four tasks with underexplored intents, while utilizing a query generator 47 times smaller than the previous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Intelligent Tutoring Systems and Adaptive Learning
