Generative Regression Based Watch Time Prediction for Short-Video Recommendation
Hongxu Ma, Kai Tian, Tao Zhang, Xuefeng Zhang, Han Zhou, Chunjie Chen,, Han Li, Jihong Guan, Shuigeng Zhou

TL;DR
This paper introduces a novel generative regression framework for short-video watch time prediction, reformulating the task as sequence generation to improve accuracy and flexibility over traditional methods.
Contribution
The paper proposes a generative regression approach with structural discretization and curriculum learning, addressing limitations of existing ordinal regression methods in watch time prediction.
Findings
Outperforms state-of-the-art methods on multiple datasets
Achieves significant improvements in prediction accuracy
Validated through online A/B testing in a real-world app
Abstract
Watch time prediction (WTP) has emerged as a pivotal task in short video recommendation systems, designed to quantify user engagement through continuous interaction modeling. Predicting users' watch times on videos often encounters fundamental challenges, including wide value ranges and imbalanced data distributions, which can lead to significant estimation bias when directly applying regression techniques. Recent studies have attempted to address these issues by converting the continuous watch time estimation into an ordinal regression task. While these methods demonstrate partial effectiveness, they exhibit notable limitations: (1) the discretization process frequently relies on bucket partitioning, inherently reducing prediction flexibility and accuracy and (2) the interdependencies among different partition intervals remain underutilized, missing opportunities for effective error…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Image and Video Quality Assessment
MethodsMixup
