A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis
Esteban Guti\'errez, Frederic Font, Xavier Serra, Lonce Wyse

TL;DR
This paper introduces TexStat, a new statistical loss function for analyzing and synthesizing texture sounds, along with a lightweight synthesizer and a generative model, demonstrating improved perceptual quality and evaluation robustness.
Contribution
The paper presents TexStat, a novel, perceptually meaningful loss function for texture sound analysis and synthesis, and integrates it into a new differentiable synthesizer and generative model.
Findings
TexStat is perceptually meaningful and time-invariant.
The proposed tools outperform existing methods in texture sound synthesis.
Open-source code enables reproducibility and further research.
Abstract
In this work, we introduce TexStat, a novel loss function specifically designed for the analysis and synthesis of texture sounds characterized by stochastic structure and perceptual stationarity. Drawing inspiration from the statistical and perceptual framework of McDermott and Simoncelli, TexStat identifies similarities between signals belonging to the same texture category without relying on temporal structure. We also propose using TexStat as a validation metric alongside Frechet Audio Distances (FAD) to evaluate texture sound synthesis models. In addition to TexStat, we present TexEnv, an efficient, lightweight and differentiable texture sound synthesizer that generates audio by imposing amplitude envelopes on filtered noise. We further integrate these components into TexDSP, a DDSP-inspired generative model tailored for texture sounds. Through extensive experiments across various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks
