The Nah Bandit: Modeling User Non-compliance in Recommendation Systems
Tianyue Zhou, Jung-Hoon Cho, Cathy Wu

TL;DR
This paper introduces the Nah Bandit model, accounting for user non-compliance in physical-world recommendation systems, and proposes the EWC algorithm that accelerates preference learning by leveraging non-compliance feedback.
Contribution
It models user non-compliance in recommendation systems and proposes the EWC algorithm, a hierarchical method that improves learning efficiency and recommendation accuracy.
Findings
EWC achieves a regret bound of O(N√T log K + NT).
EWC outperforms supervised learning and traditional bandit algorithms.
Effective use of non-compliance feedback accelerates preference learning.
Abstract
Recommendation systems now pervade the digital world, ranging from advertising to entertainment. However, it remains challenging to implement effective recommendation systems in the physical world, such as in mobility or health. This work focuses on a key challenge: in the physical world, it is often easy for the user to opt out of taking any recommendation if they are not to her liking, and to fall back to her baseline behavior. It is thus crucial in cyber-physical recommendation systems to operate with an interaction model that is aware of such user behavior, lest the user abandon the recommendations altogether. This paper thus introduces the Nah Bandit, a tongue-in-cheek reference to describe a Bandit problem where users can say `nah' to the recommendation and opt for their preferred option instead. As such, this problem lies in between a typical bandit setup and supervised learning.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing · Smart Grid Energy Management
MethodsAttentive Walk-Aggregating Graph Neural Network · Elastic Weight Consolidation · OPT
