Learning Sparse Analytic Filters for Piano Transcription
Frank Cwitkowitz, Mojtaba Heydari, Zhiyao Duan

TL;DR
This paper explores learning sparse, analytic filterbanks for improved piano transcription by modifying feature extraction with unconstrained complex filters, Hilbert transforms, and variational dropout, leading to better audio feature representations.
Contribution
It introduces a novel filterbank learning module with analytic and sparse filters for piano transcription, enhancing feature extraction in deep learning models.
Findings
Filterbank learning improves transcription accuracy.
Analytic and sparse filters provide better audio representations.
Visualization shows meaningful filterbank structures.
Abstract
In recent years, filterbank learning has become an increasingly popular strategy for various audio-related machine learning tasks. This is partly due to its ability to discover task-specific audio characteristics which can be leveraged in downstream processing. It is also a natural extension of the nearly ubiquitous deep learning methods employed to tackle a diverse array of audio applications. In this work, several variations of a frontend filterbank learning module are investigated for piano transcription, a challenging low-level music information retrieval task. We build upon a standard piano transcription model, modifying only the feature extraction stage. The filterbank module is designed such that its complex filters are unconstrained 1D convolutional kernels with long receptive fields. Additional variations employ the Hilbert transform to render the filters intrinsically analytic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Speech Recognition and Synthesis
MethodsVariational Dropout · Dropout
