Blind Restoration of Real-World Audio by 1D Operational GANs

Turker Ince; Serkan Kiranyaz; Ozer Can Devecioglu; Muhammad Salman; Khan; Muhammad Chowdhury; and Moncef Gabbouj

arXiv:2212.14618·cs.SD·January 23, 2023

Blind Restoration of Real-World Audio by 1D Operational GANs

Turker Ince, Serkan Kiranyaz, Ozer Can Devecioglu, Muhammad Salman, Khan, Muhammad Chowdhury, and Moncef Gabbouj

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel 1D Operational-GAN approach for blind restoration of real-world audio, effectively handling multiple artifact types and severities in a single model, outperforming baseline methods.

Contribution

The study presents the first application of 1D Op-GANs for direct time-domain blind audio restoration across diverse artifact types and severities.

Findings

01

Achieved over 7.2 dB SDR improvement on speech datasets.

02

Achieved over 4.9 dB SDR improvement on non-speech datasets.

03

Demonstrated robustness across various artifact combinations and severities.

Abstract

Objective: Despite numerous studies proposed for audio restoration in the literature, most of them focus on an isolated restoration problem such as denoising or dereverberation, ignoring other artifacts. Moreover, assuming a noisy or reverberant environment with limited number of fixed signal-to-distortion ratio (SDR) levels is a common practice. However, real-world audio is often corrupted by a blend of artifacts such as reverberation, sensor noise, and background audio mixture with varying types, severities, and duration. In this study, we propose a novel approach for blind restoration of real-world audio signals by Operational Generative Adversarial Networks (Op-GANs) with temporal and spectral objective metrics to enhance the quality of restored audio signal regardless of the type and severity of each artifact corrupting it. Methods: 1D Operational-GANs are used with generative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

inceturker/blind-restoration-of-real-world-audio-by-1d-operational-gans
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Image and Signal Denoising Methods · Music and Audio Processing