Learning variant product relationship and variation attributes from e-commerce website structures
Pedro Herrero-Vidal, You-Lin Chen, Cris Liu, Prithviraj Sen, Lichao, Wang

TL;DR
This paper presents VARM, a strategy combining encoding and generative AI models to identify and analyze variant product relationships in e-commerce catalogs, capturing both product matches and attribute variations.
Contribution
It introduces a novel approach for variant product relationship detection that integrates encoding and generative AI models, addressing limitations of traditional entity resolution.
Findings
Outperforms existing methods in variant product matching
Effectively extracts variation and common attributes among products
Validated on real e-commerce data from a leading retailer
Abstract
We introduce VARM, variant relationship matcher strategy, to identify pairs of variant products in e-commerce catalogs. Traditional definitions of entity resolution are concerned with whether product mentions refer to the same underlying product. However, this fails to capture product relationships that are critical for e-commerce applications, such as having similar, but not identical, products listed on the same webpage or share reviews. Here, we formulate a new type of entity resolution in variant product relationships to capture these similar e-commerce product links. In contrast with the traditional definition, the new definition requires both identifying if two products are variant matches of each other and what are the attributes that vary between them. To satisfy these two requirements, we developed a strategy that leverages the strengths of both encoding and generative AI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConsumer Market Behavior and Pricing · Advanced Text Analysis Techniques · Web Data Mining and Analysis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · WordPiece · Attention Dropout · Linear Layer · Weight Decay · Linear Warmup With Linear Decay · Dropout · Byte Pair Encoding · BERT
