Solos: A Dataset for Audio-Visual Music Analysis

Juan F. Montesinos; Olga Slizovskaia; Gloria Haro

arXiv:2006.07931·eess.AS·August 10, 2020

Solos: A Dataset for Audio-Visual Music Analysis

Juan F. Montesinos, Olga Slizovskaia, Gloria Haro

PDF

1 Repo

TL;DR

The paper introduces Solos, a high-quality, instrument-diverse audio-visual dataset from YouTube performances, designed to advance machine learning tasks like source separation, localization, and cross-modal analysis.

Contribution

It provides a clean, manually verified dataset covering all instruments in URMP, enabling improved training and evaluation of audio-visual machine learning models.

Findings

01

Models trained on Solos perform well on URMP source separation tasks.

02

Solos dataset is cleaner and more comprehensive than previous datasets.

03

The dataset supports multiple audio-visual analysis tasks.

Abstract

In this paper, we present a new dataset of music performance videos which can be used for training machine learning methods for multiple tasks such as audio-visual blind source separation and localization, cross-modal correspondences, cross-modal generation and, in general, any audio-visual self-supervised task. These videos, gathered from YouTube, consist of solo musical performances of 13 different instruments. Compared to previously proposed audio-visual datasets, Solos is cleaner since a big amount of its recordings are auditions and manually checked recordings, ensuring there is no background noise nor effects added in the video post-processing. Besides, it is, up to the best of our knowledge, the only dataset that contains the whole set of instruments present in the URMP\cite{URPM} dataset, a high-quality dataset of 44 audio-visual recordings of multi-instrument classical music…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JuanFMontesinos/Solos
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.