Towards End-to-End Alignment of User Satisfaction via Questionnaire in Video Recommendation
Na Li, Jiaqi Yu, Minzhi Xie, Tiantian He, Xiaoxiao Xu, Zixiu Wang, Lantao Hu, Yongqi Liu, Han Li, Kaiqiao Zhan, Kun Gai

TL;DR
This paper introduces EASQ, a novel framework for real-time alignment of video recommendation models with true user satisfaction using sparse questionnaire feedback, improving user satisfaction metrics and business outcomes.
Contribution
EASQ employs a multi-task architecture with LoRA modules and a DPO-based objective to effectively incorporate sparse satisfaction signals into online recommendation models.
Findings
EASQ improves user satisfaction metrics in offline and online tests.
EASQ achieves stable business gains in production deployment.
The framework effectively balances sparse questionnaire signals with dense behavioral data.
Abstract
Short-video recommender systems typically optimize ranking models using dense user behavioral signals, such as clicks and watch time. However, these signals are only indirect proxies of user satisfaction and often suffer from noise and bias. Recently, explicit satisfaction feedback collected through questionnaires has emerged as a high-quality direct alignment supervision, but is extremely sparse and easily overwhelmed by abundant behavioral data, making it difficult to incorporate into online recommendation models. To address these challenges, we propose a novel framework which is towards End-to-End Alignment of user Satisfaction via Questionaire, named EASQ, to enable real-time alignment of ranking models with true user satisfaction. Specifically, we first construct an independent parameter pathway for sparse questionnaire signals by combining a multi-task architecture and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Explainable Artificial Intelligence (XAI)
