Debiased Cross-modal Matching for Content-based Micro-video Background Music Recommendation
Jinng Yi, Zhenzhong Chen

TL;DR
This paper introduces a Debiased Cross-Modal matching model that reduces selection bias in micro-video background music recommendation by using a teacher-student network, knowledge transfer, and confounder adjustment, improving recommendation accuracy.
Contribution
It proposes a novel debiased cross-modal matching framework employing teacher-student networks, KL-based knowledge transfer, and backdoor adjustment to mitigate bias in music recommendation systems.
Findings
Effective in reducing selection bias on TT-150k-genre dataset
Improves recommendation accuracy over baseline models
Demonstrates robustness to confounders and biases
Abstract
Micro-video background music recommendation is a complicated task where the matching degree between videos and uploader-selected background music is a major issue. However, the selection of the user-generated content (UGC) is biased caused by knowledge limitations and historical preferences among music of each uploader. In this paper, we propose a Debiased Cross-Modal (DebCM) matching model to alleviate the influence of such selection bias. Specifically, we design a teacher-student network to utilize the matching of segments of music videos, which is professional-generated content (PGC) with specialized music-matching techniques, to better alleviate the bias caused by insufficient knowledge of users. The PGC data is captured by a teacher network to guide the matching of uploader-selected UGC data of the student network by KL-based knowledge transfer. In addition, uploaders' personal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Recommender Systems and Techniques · Neuroscience and Music Perception
