Measuring Fairness in Large-Scale Recommendation Systems with Missing   Labels

Yulong Dong; Kun Jin; Xinghai Hu; Yang Liu

arXiv:2406.05247·cs.IR·June 11, 2024·1 cites

Measuring Fairness in Large-Scale Recommendation Systems with Missing Labels

Yulong Dong, Kun Jin, Xinghai Hu, Yang Liu

PDF

Open Access

TL;DR

This paper introduces a novel method using randomized traffic to accurately measure fairness in large-scale recommendation systems with missing labels, supported by theoretical bounds and empirical validation on real data.

Contribution

It is the first to highlight the importance of random traffic in fairness measurement, propose a new estimation method, and provide a fairness dataset from TikTok.

Findings

01

The proposed method accurately estimates fairness metrics with theoretical error bounds.

02

Empirical results validate the effectiveness of the method on synthetic and real TikTok data.

03

The study emphasizes the importance of randomized data collection for fairness in recommendation systems.

Abstract

In large-scale recommendation systems, the vast array of items makes it infeasible to obtain accurate user preferences for each product, resulting in a common issue of missing labels. Typically, only items previously recommended to users have associated ground truth data. Although there is extensive research on fairness concerning fully observed user-item interactions, the challenge of fairness in scenarios with missing labels remains underexplored. Previous methods often treat these samples missing labels as negative, which can significantly deviate from the ground truth fairness metrics. Our study addresses this gap by proposing a novel method employing a small randomized traffic to estimate fairness metrics accurately. We present theoretical bounds for the estimation error of our fairness metric and support our findings with empirical evidence on real data. Our numerical experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConsumer Market Behavior and Pricing