Contrastive Learning with Positive-Negative Frame Mask for Music   Representation

Dong Yao; Zhou Zhao; Shengyu Zhang; Jieming Zhu; Yudong Zhu; Rui; Zhang; Xiuqiang He

arXiv:2203.09129·cs.SD·April 5, 2022

Contrastive Learning with Positive-Negative Frame Mask for Music Representation

Dong Yao, Zhou Zhao, Shengyu Zhang, Jieming Zhu, Yudong Zhu, Rui, Zhang, Xiuqiang He

PDF

TL;DR

This paper introduces PEMR, a contrastive learning method that uses frame masking to improve music representation by focusing on essential audio components, enhancing downstream task performance.

Contribution

It proposes a novel frame masking technique with a transformer-based mask generator for contrastive learning in music representation, addressing noise and inessential elements.

Findings

01

Improves music classification accuracy

02

Enhances cover song identification performance

03

Demonstrates strong generalization across datasets

Abstract

Self-supervised learning, especially contrastive learning, has made an outstanding contribution to the development of many deep learning research fields. Recently, researchers in the acoustic signal processing field noticed its success and leveraged contrastive learning for better music representation. Typically, existing approaches maximize the similarity between two distorted audio segments sampled from the same music. In other words, they ensure a semantic agreement at the music level. However, those coarse-grained methods neglect some inessential or noisy elements at the frame level, which may be detrimental to the model to learn the effective representation of music. Towards this end, this paper proposes a novel Positive-nEgative frame mask for Music Representation based on the contrastive learning framework, abbreviated as PEMR. Concretely, PEMR incorporates a Positive-Negative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContrastive Learning