A Robust Frame-based Nonlinear Prediction System for Automatic Speech Coding
Mahmood Yousefi-Azar, Farbod Razzazi

TL;DR
This paper introduces a neural-based speech coding system that uses a two-stage training process to accurately compress and reconstruct speech signals, demonstrating robustness across speakers and phonemes in both time and DCT domains.
Contribution
It presents a novel frame-based nonlinear predictive coding method utilizing neural networks with a two-stage training process for improved speech signal reconstruction.
Findings
Effective reconstruction of phonemes with good accuracy.
DCT domain training yields better performance for energy-rich frames.
System demonstrates robustness across different speakers and utterances.
Abstract
In this paper, we propose a neural-based coding scheme in which an artificial neural network is exploited to automatically compress and decompress speech signals by a trainable approach. Having a two-stage training phase, the system can be fully specified to each speech frame and have robust performance across different speakers and wide range of spoken utterances. Indeed, Frame-based nonlinear predictive coding (FNPC) would code a frame in the procedure of training to predict the frame samples. The motivating objective is to analyze the system behavior in regenerating not only the envelope of spectra, but also the spectra phase. This scheme has been evaluated in time and discrete cosine transform (DCT) domains and the output of predicted phonemes show the potentiality of the FNPC to reconstruct complicated signals. The experiments were conducted on three voiced plosive phonemes, b/d/g/…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Advanced Data Compression Techniques
