A Speech Enhancement Method Using Fast Fourier Transform and Convolutional Autoencoder

Pu-Yun Kow; Pu-Zhao Kow

arXiv:2501.01650·cs.SD·November 14, 2025

TL;DR

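This paper introduces FFT-ConvAE, a lightweight speech enhancement model that combines the fast Fourier transform with a convolutional autoencoder. The model placed second in the Helsinki Speech Challenge 2024, and the authors read the challenge results as evidence for the potential of neural-network-free approaches to speech reconstruction.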

Contribution

The paper presents FFT-ConvAE, a hybrid model that pairs the fast Fourier transform with a convolutional autoencoder to reconstruct speech from degraded measurements. The model placed second in the Helsinki Speech Challenge 2024.

Findings

1. Achieved second place in Helsinki Speech Challenge 2024
2. Demonstrated effectiveness of neural-network-free methods
3. Showed potential of FFT-ConvAE for speech reconstruction

Abstract

This paper addresses the reconstruction of audio signals from degraded measurements. We propose a lightweight model that combines the discrete Fourier transform with a Convolutional Autoencoder (FFT-ConvAE), which enabled our team to achieve second place in the Helsinki Speech Challenge 2024. Our results, together with those of other teams, demonstrate the potential of neural-network-free approaches for effective speech signal reconstruction.
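
The abstract does not describe the architecture, so the following is only a rough sketch of how a Fourier transform can be paired with a learned model. It assumes, without confirmation from the paper, that a frame is split into magnitude and phase, that only the magnitudes are passed through the model, and that the original phase is reused for resynthesis. The names `enhance_frame` and `magnitude_model` are hypothetical, and the identity function stands in for a trained convolutional autoencoder.

```python
import cmath
import math


def fft(x):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def ifft(X):
    """Inverse FFT using the identity ifft(X) = conj(fft(conj(X))) / n."""
    n = len(X)
    y = fft([v.conjugate() for v in X])
    return [v.conjugate() / n for v in y]


def enhance_frame(frame, magnitude_model):
    """Transform a frame, pass its magnitudes through a model, resynthesize.

    magnitude_model is a hypothetical stand-in for a trained autoencoder;
    processing magnitudes only is an assumption, not the paper's stated design.
    """
    spectrum = fft(frame)
    mags = [abs(c) for c in spectrum]
    phases = [cmath.phase(c) for c in spectrum]
    new_mags = magnitude_model(mags)
    rebuilt = [cmath.rect(m, p) for m, p in zip(new_mags, phases)]
    return [v.real for v in ifft(rebuilt)]


# With an identity model the round trip should reproduce the input frame.
frame = [math.sin(2 * math.pi * 3 * i / 64) for i in range(64)]
restored = enhance_frame(frame, lambda mags: mags)
max_error = max(abs(a - b) for a, b in zip(frame, restored))
```

Swapping the identity function for a trained model is the only change needed to turn the round trip into an enhancement step. In practice a library FFT would replace the hand-written one.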
