PrimaDNN': A Characteristics-aware DNN Customization for Singing Technique Detection
Yuya Yamamoto, Juhan Nam, Hiroko Terasawa

TL;DR
PrimaDNN is a novel deep learning model that improves singing technique detection by incorporating characteristics-aware features like auxiliary pitch and multi-resolution spectrograms, achieving state-of-the-art results.
Contribution
The paper introduces PrimaDNN, a CRNN model with data-characteristics-oriented enhancements, including specialized input features and convolution modules, for improved singing technique detection.
Findings
PrimaDNN outperforms conventional methods with 44.9% macro-F measure.
Component contributions vary depending on singing technique type.
Characteristics-aware features enhance detection performance.
Abstract
Professional vocalists modulate their voice timbre or pitch to make their vocal performance more expressive. Such fluctuations are called singing techniques. Automatic detection of singing techniques from audio tracks can be beneficial to understand how each singer expresses the performance, yet it can also be difficult due to the wide variety of the singing techniques. A deep neural network (DNN) model can handle such variety; however, there might be a possibility that considering the characteristics of the data improves the performance of singing technique detection. In this paper, we propose PrimaDNN, a CRNN model with a characteristics-oriented improvement. The features of the model are: 1) input feature representation based on auxiliary pitch information and multi-resolution mel spectrograms, 2) Convolution module based on the Squeeze-and-excitation (SENet) and the Instance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Diverse Musicological Studies · Music Technology and Sound Studies
MethodsConvolution
