G-UBS: Towards Robust Understanding of Implicit Feedback via Group-Aware User Behavior Simulation
Boyu Chen, Siran Chen, Zhengrong Yue, Kainan Yan, Chenyun Yu, Beibei Kong, Cheng Lei, Chengxiang Zhuo, Zang Li, Yali Wang

TL;DR
This paper introduces G-UBS, a group-aware simulation framework that improves the robustness of implicit user feedback interpretation in recommendation systems by leveraging group context and reinforcement learning.
Contribution
It proposes a novel group-aware paradigm with a clustering-based group profile generation and reinforcement learning guided by group profiles, enhancing implicit feedback analysis.
Findings
G-UBS outperforms mainstream models with 4.0% higher play rate.
Achieves 14.9% higher reasoning accuracy on the IF-VR benchmark.
First multi-modal implicit feedback benchmark for video recommendation.
Abstract
User feedback is critical for refining recommendation systems, yet explicit feedback (e.g., likes or dislikes) remains scarce in practice. As a more feasible alternative, inferring user preferences from massive implicit feedback has shown great potential (e.g., a user quickly skipping a recommended video usually indicates disinterest). Unfortunately, implicit feedback is often noisy: a user might skip a video due to accidental clicks or other reasons, rather than disliking it. Such noise can easily misjudge user interests, thereby undermining recommendation performance. To address this issue, we propose a novel Group-aware User Behavior Simulation (G-UBS) paradigm, which leverages contextual guidance from relevant user groups, enabling robust and in-depth interpretation of implicit feedback for individual users. Specifically, G-UBS operates via two key agents. First, the User Group…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Reinforcement Learning in Robotics · Human-Automation Interaction and Safety
