Joint Time-Frequency Scattering
Joakim And\'en, Vincent Lostanlen, St\'ephane Mallat

TL;DR
The paper introduces a joint time-frequency scattering transform, a wavelet-based, time-shift invariant representation for time series that improves classification accuracy and characterizes signals in both time and frequency domains.
Contribution
It presents a novel joint time-frequency scattering transform computed via wavelets, enhancing signal representation for time-shift invariant tasks.
Findings
Achieves state-of-the-art results on audio classification tasks.
Outperforms traditional time scattering transforms.
Comparable accuracy to fully learned neural networks.
Abstract
In time series classification and regression, signals are typically mapped into some intermediate representation used for constructing models. Since the underlying task is often insensitive to time shifts, these representations are required to be time-shift invariant. We introduce the joint time-frequency scattering transform, a time-shift invariant representation which characterizes the multiscale energy distribution of a signal in time and frequency. It is computed through wavelet convolutions and modulus non-linearities and may therefore be implemented as a deep convolutional neural network whose filters are not learned but calculated from wavelets. We consider the progression from mel-spectrograms to time scattering and joint time-frequency scattering transforms, illustrating the relationship between increased discriminability and refinements of convolutional network architectures.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
