Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval
Shanti Stewart, Gouthaman KV, Lie Lu, Andrea Fanelli

TL;DR
This paper introduces a semi-supervised contrastive learning framework that improves video-to-music retrieval by combining self-supervised and label-supervised objectives, allowing controllable and effective cross-modal retrieval.
Contribution
It presents a novel joint embedding approach that integrates self-supervised and supervised contrastive learning for cross-modal video and music retrieval.
Findings
Effective retrieval performance demonstrated on multiple tasks.
Framework allows adjustable focus on self-supervised vs. label information.
Generalizable to various music annotations like emotion and instrument.
Abstract
Content creators often use music to enhance their videos, from soundtracks in movies to background music in video blogs and social media content. However, identifying the best music for a video can be a difficult and time-consuming task. To address this challenge, we propose a novel framework for automatically retrieving a matching music clip for a given video, and vice versa. Our approach leverages annotated music labels, as well as the inherent artistic correspondence between visual and music elements. Distinct from previous cross-modal music retrieval works, our method combines both self-supervised and supervised training objectives. We use self-supervised and label-supervised contrastive learning to train a joint embedding space between music and video. We show the effectiveness of our approach by using music genre labels for the supervised training component, and our framework can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Diverse Musicological Studies
