One or Two Components? The Scattering Transform Answers

Vincent Lostanlen; Alice Cohen-Hadria; Juan Pablo Bello

arXiv:2003.01037·cs.SD·June 26, 2020·1 cites

One or Two Components? The Scattering Transform Answers

Vincent Lostanlen, Alice Cohen-Hadria, Juan Pablo Bello

PDF

Open Access

TL;DR

This paper investigates how wavelet scattering networks can model multicomponent signals, providing criteria for component interference, visualizing similarity spaces, and analyzing the growth of scattering depth with bandwidth.

Contribution

It introduces a simple criterion for component interference, applies manifold learning to scattering coefficients, and generalizes the framework to multiple sine waves with theoretical analysis.

Findings

01

Renormalizing second-order nodes assesses component interference.

02

Manifold learning visualizes similarity spaces of signals.

03

Scattering depth grows logarithmically with bandwidth.

Abstract

With the aim of constructing a biologically plausible model of machine listening, we study the representation of a multicomponent stationary signal by a wavelet scattering network. First, we show that renormalizing second-order nodes by their first-order parents gives a simple numerical criterion to assess whether two neighboring components will interfere psychoacoustically. Secondly, we run a manifold learning algorithm (Isomap) on scattering coefficients to visualize the similarity space underlying parametric additive synthesis. Thirdly, we generalize the "one or two components" framework to three sine waves or more, and prove that the effective scattering depth of a Fourier series grows in logarithmic proportion to its bandwidth.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Digital Media Forensic Detection