Partial Bandit and Semi-Bandit: Making the Most Out of Scarce Users'   Feedback

Alexandre Letard; Tassadit Amghar; Olivier Camp; Nicolas Gutowski

arXiv:2009.07518·cs.LG·September 17, 2020

Partial Bandit and Semi-Bandit: Making the Most Out of Scarce Users' Feedback

Alexandre Letard, Tassadit Amghar, Olivier Camp, Nicolas Gutowski

PDF

Open Access

TL;DR

This paper introduces a novel combinatorial bandit approach that reduces the need for explicit user feedback in recommender systems, achieving comparable accuracy with significantly less user input.

Contribution

The paper proposes a new method for incorporating user feedback in COM-MAB algorithms that minimizes explicit feedback requirements while maintaining high accuracy.

Findings

01

Achieves similar accuracy with only 20% user feedback

02

Reduces explicit feedback needs in recommender systems

03

Maintains learning efficiency comparable to state-of-the-art methods

Abstract

Recent works on Multi-Armed Bandits (MAB) and Combinatorial Multi-Armed Bandits (COM-MAB) show good results on a global accuracy metric. This can be achieved, in the case of recommender systems, with personalization. However, with a combinatorial online learning approach, personalization implies a large amount of user feedbacks. Such feedbacks can be hard to acquire when users need to be directly and frequently solicited. For a number of fields of activities undergoing the digitization of their business, online learning is unavoidable. Thus, a number of approaches allowing implicit user feedback retrieval have been implemented. Nevertheless, this implicit feedback can be misleading or inefficient for the agent's learning. Herein, we propose a novel approach reducing the number of explicit feedbacks required by Combinatorial Multi Armed bandit (COM-MAB) algorithms while providing similar…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Machine Learning and Algorithms