Blind Restoration of Real-World Audio by 1D Operational GANs
Turker Ince, Serkan Kiranyaz, Ozer Can Devecioglu, Muhammad Salman, Khan, Muhammad Chowdhury, and Moncef Gabbouj

TL;DR
This paper introduces a novel 1D Operational-GAN approach for blind restoration of real-world audio, effectively handling multiple artifact types and severities in a single model, outperforming baseline methods.
Contribution
The study presents the first application of 1D Op-GANs for direct time-domain blind audio restoration across diverse artifact types and severities.
Findings
Achieved over 7.2 dB SDR improvement on speech datasets.
Achieved over 4.9 dB SDR improvement on non-speech datasets.
Demonstrated robustness across various artifact combinations and severities.
Abstract
Objective: Despite numerous studies proposed for audio restoration in the literature, most of them focus on an isolated restoration problem such as denoising or dereverberation, ignoring other artifacts. Moreover, assuming a noisy or reverberant environment with limited number of fixed signal-to-distortion ratio (SDR) levels is a common practice. However, real-world audio is often corrupted by a blend of artifacts such as reverberation, sensor noise, and background audio mixture with varying types, severities, and duration. In this study, we propose a novel approach for blind restoration of real-world audio signals by Operational Generative Adversarial Networks (Op-GANs) with temporal and spectral objective metrics to enhance the quality of restored audio signal regardless of the type and severity of each artifact corrupting it. Methods: 1D Operational-GANs are used with generative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Image and Signal Denoising Methods · Music and Audio Processing
