TL;DR
This paper employs deep reinforcement learning to predict music skipping behavior, demonstrating that user behavior features are most influential, and highlights a temporal data leakage issue in real-world datasets.
Contribution
It introduces a DRL-based approach for sequentially predicting music skips and provides a detailed analysis of feature importance and data leakage issues.
Findings
User behavior features are most predictive of skips.
Content and contextual features have limited impact.
Temporal data leakage affects model evaluation.
Abstract
Music recommender systems are an integral part of our daily life. Recent research has seen a significant effort around black-box recommender based approaches such as Deep Reinforcement Learning (DRL). These advances have led, together with the increasing concerns around users' data collection and privacy, to a strong interest in building responsible recommender systems. A key element of a successful music recommender system is modelling how users interact with streamed content. By first understanding these interactions, insights can be drawn to enable the construction of more transparent and responsible systems. An example of these interactions is skipping behaviour, a signal that can measure users' satisfaction, dissatisfaction, or lack of interest. In this paper, we study the utility of users' historical data for the task of sequentially predicting users' skipping behaviour. To this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
