Improving Product Search Relevance with EAR-MP: A Solution for the CIKM 2025 AnalytiCup

JaeEun Lim; Soomin Kim; Jaeyong Seo; Iori Ono; Qimu Ran; Jae-woong Lee

arXiv:2510.23018·cs.IR·November 3, 2025

Improving Product Search Relevance with EAR-MP: A Solution for the CIKM 2025 AnalytiCup

JaeEun Lim, Soomin Kim, Jaeyong Seo, Iori Ono, Qimu Ran, Jae-woong Lee

PDF

TL;DR

This paper presents EAR-MP, a multilingual e-commerce search solution for the CIKM 2025 AnalytiCup, focusing on improving query relevance through data normalization, advanced training techniques, and task-specific enhancements, achieving high F1 scores.

Contribution

The paper introduces a comprehensive approach combining data normalization, model improvements, and task-specific strategies for multilingual relevance in e-commerce search.

Findings

01

Achieved F1 score of 0.8796 on Query-Category relevance

02

Attained F1 score of 0.8744 on Query-Item relevance

03

Demonstrated the effectiveness of systematic data preprocessing and tailored training.

Abstract

Multilingual e-commerce search is challenging due to linguistic diversity and the noise inherent in user-generated queries. This paper documents the solution employed by our team (EAR-MP) for the CIKM 2025 AnalytiCup, which addresses two core tasks: Query-Category (QC) relevance and Query-Item (QI) relevance. Our approach first normalizes the multilingual dataset by translating all text into English, then mitigates noise through extensive data cleaning and normalization. For model training, we build on DeBERTa-v3-large and improve performance with label smoothing, self-distillation, and dropout. In addition, we introduce task-specific upgrades, including hierarchical token injection for QC and a hybrid scoring mechanism for QI. Under constrained compute, our method achieves competitive results, attaining an F1 score of 0.8796 on QC and 0.8744 on QI. These findings underscore the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.