An Unbiased Data Collection and Content Exploitation/Exploration Strategy for Personalization
Liangjie Hong, Adnan Boz

TL;DR
This paper proposes a novel unbiased data collection framework for personalization systems using Thompson sampling, addressing bias issues caused by user interaction feedback and improving data quality for downstream tasks.
Contribution
It introduces a new Thompson sampling approach for Bernoulli ranked-list, effectively balancing user experience with unbiased data collection, and validates it through real-world testing.
Findings
The proposed method reduces bias in user interaction data.
It outperforms traditional algorithms in real bucket tests.
The framework enhances downstream personalization tasks.
Abstract
One of missions for personalization systems and recommender systems is to show content items according to users' personal interests. In order to achieve such goal, these systems are learning user interests over time and trying to present content items tailoring to user profiles. Recommending items according to users' preferences has been investigated extensively in the past few years, mainly thanks for the popularity of Netflix competition. In a real setting, users may be attracted by a subset of those items and interact with them, only leaving partial feedbacks to the system to learn in the next cycle, which leads to significant biases into systems and hence results in a situation where user engagement metrics cannot be improved over time. The problem is not just for one component of the system. The data collected from users is usually used in many different tasks, including learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Recommender Systems and Techniques
