Sines, Transient, Noise Neural Modeling of Piano Notes
Riccardo Simionato, Stefano Fasciani

TL;DR
This paper presents a differentiable spectral modeling synthesizer for piano notes that decomposes sounds into sines, transients, and noise, enabling efficient and perceptually accurate emulation of piano sounds from recordings.
Contribution
It introduces a novel, trainable spectral decomposition approach for piano synthesis, combining physics-guided sinusoidal modeling with deep learning for transients and noise.
Findings
Model accurately reproduces spectral energy distribution.
Efficient in computation and memory usage.
Perceptual tests show good single note and chord emulation.
Abstract
This paper introduces a novel method for emulating piano sounds. We propose to exploit the sines, transient, and noise decomposition to design a differentiable spectral modeling synthesizer replicating piano notes. Three sub-modules learn these components from piano recordings and generate the corresponding harmonic, transient, and noise signals. Splitting the emulation into three independently trainable models reduces the modeling tasks' complexity. The quasi-harmonic content is produced using a differentiable sinusoidal model guided by physics-derived formulas, whose parameters are automatically estimated from audio recordings. The noise sub-module uses a learnable time-varying filter, and the transients are generated using a deep convolutional network. From singular notes, we emulate the coupling between different keys in trichords with a convolutional-based network. Results show the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies
