Phase Repair for Time-Domain Convolutional Neural Networks in Music Super-Resolution
Yenan Zhang, Guilly Kolkman, Hiroshi Watanabe

TL;DR
This paper identifies phase distortion as the cause of artifacts in time-domain CNNs for music super-resolution and introduces a phase repair method using a neural vocoder to significantly enhance perceptual audio quality.
Contribution
This work is the first to demonstrate that phase distortion causes the artifacts in TD-CNNs. It also proposes a neural vocoder-based phase repair method that improves audio quality across different models.
Findings
Phase distortion causes artifacts in TD-CNN outputs.
Neural vocoder-based phase repair improves perceptual quality.
The method is effective across various TD-CNN architectures.
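As a rough illustration of what "phase distortion" means in the findings above, the sketch below keeps the magnitude of every frequency bin of a short test signal and swaps in random phases. It is not the paper's TD-PR method and involves no neural vocoder: the helper names (`dft`, `idft`, `distort_phase`), the two-tone test signal, and the use of random phases as a stand-in for distortion are all assumptions made for illustration only.

```python
import cmath
import math
import random


def dft(x):
    """Naive discrete Fourier transform (O(n^2), fine for short signals)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT; returns complex samples."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def distort_phase(x, seed=0):
    """Hypothetical helper: keep each bin's magnitude, randomize its phase.

    Conjugate symmetry is preserved so the output is still a real signal.
    The DC bin (and the Nyquist bin when n is even) is left untouched.
    """
    rng = random.Random(seed)
    n = len(x)
    spec = dft(x)
    out = list(spec)
    for k in range(1, (n + 1) // 2):
        phase = rng.uniform(-math.pi, math.pi)
        out[k] = abs(spec[k]) * cmath.exp(1j * phase)
        out[n - k] = out[k].conjugate()
    return [v.real for v in idft(out)]


# Assumed test signal: two sinusoids sampled at 64 points.
n = 64
x = [
    math.sin(2 * math.pi * 3 * t / n) + 0.5 * math.sin(2 * math.pi * 7 * t / n)
    for t in range(n)
]
y = distort_phase(x)

# Magnitude spectra agree to numerical precision, yet the waveforms differ.
mag_error = max(abs(abs(a) - abs(b)) for a, b in zip(dft(x), dft(y)))
wave_diff = max(abs(a - b) for a, b in zip(x, y))
print(f"max magnitude difference: {mag_error:.2e}")
print(f"max waveform difference:  {wave_diff:.3f}")
```

The point of the sketch is only that two signals can share a magnitude spectrum while differing in the time domain, which is why a magnitude-only comparison can miss the kind of distortion the paper attributes the artifacts to.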
Abstract
Audio Super-Resolution (SR) is an important topic as low-resolution recordings are ubiquitous in daily life. In this paper, we focus on the music SR task, which is challenging due to the wide frequency response and dynamic range of music. Many models are designed in the time domain to jointly process the magnitude and phase of audio signals. However, prior works show that approaches using Time-Domain Convolutional Neural Networks (TD-CNNs) tend to produce annoying artifacts in their waveform outputs, and the cause of the artifacts is yet to be identified. To the best of our knowledge, this work is the first to demonstrate, via a subjective experiment, that the artifacts in TD-CNNs are caused by phase distortion. We further propose Time-Domain Phase Repair (TD-PR), which uses a neural vocoder pre-trained on wide-band data to repair the phase components in the waveform outputs of TD-CNNs. Although…
Taxonomy
Topics: Speech and Audio Processing · Acoustic Wave Phenomena Research · Ultrasonics and Acoustic Wave Propagation
