End to End Deep Neural Network Frequency Demodulation of Speech Signals

Dan Elbaz; Michael Zibulevsky

arXiv:1704.02046·cs.LG·October 10, 2017·1 cites

End to End Deep Neural Network Frequency Demodulation of Speech Signals

Dan Elbaz, Michael Zibulevsky

PDF

Open Access

TL;DR

This paper introduces an end-to-end deep neural network approach for frequency demodulation of speech signals in FM radio, leveraging prior speech information to improve robustness against noise and disturbances.

Contribution

It presents a novel SDR receiver that uses deep learning for FM demodulation, outperforming traditional methods especially at low SNR conditions.

Findings

01

High performance detection in noisy environments

02

Outperforms established methods in mean square error

03

Improves perceptual speech quality scores

Abstract

Frequency modulation (FM) is a form of radio broadcasting which is widely used nowadays and has been for almost a century. We suggest a software-defined-radio (SDR) receiver for FM demodulation that adopts an end-to-end learning based approach and utilizes the prior information of transmitted speech message in the demodulation process. The receiver detects and enhances speech from the in-phase and quadrature components of its base band version. The new system yields high performance detection for both acoustical disturbances, and communication channel noise and is foreseen to out-perform the established methods for low signal to noise ratio (SNR) conditions in both mean square error and in perceptual evaluation of speech quality score.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Ultrasonics and Acoustic Wave Propagation