TPDR: A Novel Two-Step Transformer-based Product and Class Description Match and Retrieval Method
Washington Cunha, Celso Fran\c{c}a, Leonardo Rocha, Marcos Andr\'e, Gon\c{c}alves

TL;DR
This paper introduces TPDR, a two-step Transformer-based method for matching and retrieving product descriptions in complex, noisy, and cross-language scenarios, significantly improving accuracy over baseline approaches.
Contribution
The paper presents a novel two-step Transformer-based approach combining semantic encoding and syntactic re-ranking for product description matching, addressing noise and language variability.
Findings
Achieved 71% top-5 accuracy in product retrieval
Improved retrieval effectiveness up to 3.7 times over baselines
Successfully handled noisy, short, and cross-language descriptions
Abstract
There is a niche of companies responsible for intermediating the purchase of large batches of varied products for other companies, for which the main challenge is to perform product description standardization, i.e., matching an item described by a client with a product described in a catalog. The problem is complex since the client's product description may be: (1) potentially noisy; (2) short and uninformative (e.g., missing information about model and size); and (3) cross-language. In this paper, we formalize this problem as a ranking task: given an initial client product specification (query), return the most appropriate standardized descriptions (response). In this paper, we propose TPDR, a two-step Transformer-based Product and Class Description Retrieval method that is able to explore the semantic correspondence between IS and SD, by exploiting attention mechanisms and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies
MethodsNone · Contrastive Learning
