Automated Query-Product Relevance Labeling using Large Language Models   for E-commerce Search

Jayant Sachdev; Sean D Rosario; Abhijeet Phatak; He Wen; Swati Kirti,; Chittaranjan Tripathy

arXiv:2502.15990·cs.IR·February 27, 2025

Automated Query-Product Relevance Labeling using Large Language Models for E-commerce Search

Jayant Sachdev, Sean D Rosario, Abhijeet Phatak, He Wen, Swati Kirti,, Chittaranjan Tripathy

PDF

TL;DR

This paper demonstrates that Large Language Models can automate query-product relevance labeling in e-commerce, achieving near-human accuracy more efficiently and at scale, thus improving search quality and reducing costs.

Contribution

The study applies prompt engineering techniques to LLMs for large-scale relevance labeling, showing their potential to replace traditional human annotation in e-commerce search.

Findings

01

LLMs can approach human-level accuracy in relevance labeling

02

Prompt engineering enhances LLM performance in this task

03

The approach reduces time and cost compared to human labeling

Abstract

Accurate query-product relevance labeling is indispensable to generate ground truth dataset for search ranking in e-commerce. Traditional approaches for annotating query-product pairs rely on human-based labeling services, which is expensive, time-consuming and prone to errors. In this work, we explore the application of Large Language Models (LLMs) to automate query-product relevance labeling for large-scale e-commerce search. We use several publicly available and proprietary LLMs for this task, and conducted experiments on two open-source datasets and an in-house e-commerce search dataset. Using prompt engineering techniques such as Chain-of-Thought (CoT) prompting, In-context Learning (ICL), and Retrieval Augmented Generation (RAG) with Maximum Marginal Relevance (MMR), we show that LLM's performance has the potential to approach human-level accuracy on this task in a fraction of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.