Ranking Policy Learning via Marketplace Expected Value Estimation From   Observational Data

Ehsan Ebrahimzadeh; Nikhil Monga; Hang Gao; Alex Cozzi; Abraham; Bagherjeiran

arXiv:2410.04568·cs.IR·October 8, 2024

Ranking Policy Learning via Marketplace Expected Value Estimation From Observational Data

Ehsan Ebrahimzadeh, Nikhil Monga, Hang Gao, Alex Cozzi, Abraham, Bagherjeiran

PDF

Open Access

TL;DR

This paper introduces a framework for learning ranking policies in e-commerce marketplaces by estimating expected marketplace value from observational data, optimizing user interactions and economic outcomes.

Contribution

It formulates a novel expected reward optimization approach for ranking policies using observational data, accounting for heterogeneity and distribution shifts in user sessions.

Findings

01

Empirical results demonstrate the effectiveness of the proposed ranking policy framework.

02

The approach captures the trade-offs in ranking decisions based on context value distribution.

03

The method improves marketplace utility by optimizing expected user interactions.

Abstract

We develop a decision making framework to cast the problem of learning a ranking policy for search or recommendation engines in a two-sided e-commerce marketplace as an expected reward optimization problem using observational data. As a value allocation mechanism, the ranking policy allocates retrieved items to the designated slots so as to maximize the user utility from the slotted items, at any given stage of the shopping journey. The objective of this allocation can in turn be defined with respect to the underlying probabilistic user browsing model as the expected number of interaction events on presented items matching the user intent, given the ranking context. Through recognizing the effect of ranking as an intervention action to inform users' interactions with slotted items and the corresponding economic value of the interaction events for the marketplace, we formulate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsForecasting Techniques and Applications