Improving E-commerce Search with Category-Aligned Retrieval

Rauf Aliev

arXiv:2510.21711·cs.IR·October 28, 2025

Improving E-commerce Search with Category-Aligned Retrieval

Rauf Aliev

PDF

TL;DR

This paper introduces a Category-Aligned Retrieval System (CARS) that enhances e-commerce search relevance by predicting product categories from user queries and boosting within those categories, demonstrating significant improvements in category prediction accuracy.

Contribution

The paper presents a novel method for creating trainable category prototypes from query embeddings and evaluates its effectiveness with different embedding models, highlighting the importance of adaptive integration strategies.

Findings

01

OpenAI's embedding model increased category prediction accuracy from 43.8% to 83.2%.

02

Naive category boosting can negatively impact search relevance metrics.

03

Dataset ambiguities and system sensitivity affect the effectiveness of category-based boosting.

Abstract

Traditional e-commerce search systems often struggle with the semantic gap between user queries and product catalogs. In this paper, we propose a Category-Aligned Retrieval System (CARS) that improves search relevance by first predicting the product category from a user's query and then boosting products within that category. We introduce a novel method for creating "Trainable Category Prototypes" from query embeddings. We evaluate this method with two models: a lightweight all-MiniLM-L6-v2 and OpenAI's text-embedding-ada-002. Our offline evaluation shows this method is highly effective, with the OpenAI model increasing Top-3 category prediction accuracy from a zero-shot baseline of 43.8% to 83.2% after training. The end-to-end simulation, however, highlights the limitations of blindly applying category boosts in a complex retrieval pipeline: while accuracy is high, naive integration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.