Knowledge Distillation for Singing Voice Detection

Soumava Paul; Gurunath Reddy M; K Sreenivasa Rao; Partha Pratim Das

arXiv:2011.04297·cs.SD·August 23, 2021

Knowledge Distillation for Singing Voice Detection

Soumava Paul, Gurunath Reddy M, K Sreenivasa Rao, Partha Pratim Das

PDF

1 Repo

TL;DR

This paper explores using knowledge distillation to compress large neural networks for singing voice detection, enabling deployment on resource-limited devices while maintaining high accuracy.

Contribution

It introduces the application of knowledge distillation techniques to compress SVD models, addressing deployment challenges in music information retrieval.

Findings

01

Knowledge distillation effectively reduces model size.

02

Compressed models retain high detection accuracy.

03

Ensemble distillation improves performance further.

Abstract

Singing Voice Detection (SVD) has been an active area of research in music information retrieval (MIR). Currently, two deep neural network-based methods, one based on CNN and the other on RNN, exist in literature that learn optimized features for the voice detection (VD) task and achieve state-of-the-art performance on common datasets. Both these models have a huge number of parameters (1.4M for CNN and 65.7K for RNN) and hence not suitable for deployment on devices like smartphones or embedded sensors with limited capacity in terms of memory and computation power. The most popular method to address this issue is known as knowledge distillation in deep learning literature (in addition to model compression) where a large pre-trained network known as the teacher is used to train a smaller student network. Given the wide applications of SVD in music information retrieval, to the best of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mvp18/KD-SVD
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation