Boosting LLM-based Relevance Modeling with Distribution-Aware Robust Learning
Hong Liu, Saisai Gong, Yixin Ji, Kaixin Wu, Jia Xu, Jinjie Gu

TL;DR
This paper introduces DaRL, a novel framework that enhances LLM-based relevance modeling by improving fine-grained discrimination and robustness to data distribution shifts, specifically applied to Alipay Search.
Contribution
The paper proposes a distribution-aware robust learning framework with a new loss function, sample augmentation, and multi-stage fine-tuning to improve relevance modeling performance and generalization.
Findings
DaRL improves relevance discrimination across fine-grained relevance levels.
DaRL enhances model robustness to distribution shifts in real-world scenarios.
DaRL achieves better online performance in Alipay Search.
Abstract
With the rapid advancement of pre-trained large language models (LLMs), recent endeavors have leveraged the capabilities of LLMs in relevance modeling, resulting in enhanced performance. This is usually done through the process of fine-tuning LLMs on specifically annotated datasets to determine the relevance between queries and items. However, there are two limitations when LLMs are naively employed for relevance modeling through fine-tuning and inference. First, it is not inherently efficient for performing nuanced tasks beyond simple yes or no answers, such as assessing search relevance. It may therefore tend to be overconfident and struggle to distinguish fine-grained degrees of relevance (e.g., strong relevance, weak relevance, irrelevance) used in search engines. Second, it exhibits significant performance degradation when confronted with data distribution shift in real-world…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsADaptive gradient method with the OPTimal convergence rate · Sparse Evolutionary Training
