HyperSound: Generating Implicit Neural Representations of Audio Signals   with Hypernetworks

Filip Szatkowski; Karol J. Piczak; Przemys{\l}aw Spurek; Jacek Tabor,; Tomasz Trzci\'nski

arXiv:2211.01839·cs.SD·January 26, 2024·5 cites

HyperSound: Generating Implicit Neural Representations of Audio Signals with Hypernetworks

Filip Szatkowski, Karol J. Piczak, Przemys{\l}aw Spurek, Jacek Tabor,, Tomasz Trzci\'nski

PDF

Open Access

TL;DR

HyperSound introduces a meta-learning hypernetwork approach to generate implicit neural representations for audio signals, enabling high-quality reconstruction of unseen sounds without training separate models for each sample.

Contribution

It is the first to adapt hypernetwork-based INRs to audio, allowing efficient, sample-independent audio representation and reconstruction.

Findings

01

Achieves sound wave reconstruction quality comparable to state-of-the-art models.

02

Eliminates need for training separate models per audio sample.

03

Demonstrates effectiveness on various audio signals.

Abstract

Implicit neural representations (INRs) are a rapidly growing research field, which provides alternative ways to represent multimedia signals. Recent applications of INRs include image super-resolution, compression of high-dimensional signals, or 3D rendering. However, these solutions usually focus on visual data, and adapting them to the audio domain is not trivial. Moreover, it requires a separately trained model for every data sample. To address this limitation, we propose HyperSound, a meta-learning method leveraging hypernetworks to produce INRs for audio signals unseen at training time. We show that our approach can reconstruct sound waves with quality comparable to other state-of-the-art models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHearing Loss and Rehabilitation · Speech and Audio Processing · Image and Signal Denoising Methods