Extension spectrale d'un signal de parole de la bande t\'el\'ephonique \`a la bande AM
Jean-Marc Valin

TL;DR
This paper presents a system for bandwidth extension of telephone speech signals to wideband using independent high and low-frequency methods, employing neural networks and sinusoidal models to improve sound quality.
Contribution
It introduces a novel bandwidth extension system combining excitation-filter and sinusoidal models with neural networks for spectral and amplitude estimation.
Findings
Sound quality after extension exceeds narrowband speech
Significant listener variation in perceived quality
Excitation extension techniques are promising for speech coding
Abstract
This document proposes a bandwidth extension system producing a wideband signal from a narrowband speech signal. The extension is performed independently for high and low frequencies. High-frequency extension uses the excitation-filter model. Extension of the excitation is performed in the time domain using a non-linear function, while the spectral envelope is extended in the cepstral domain using a multi-layer perceptron. Low-band extension is based on the sinusoidal model. The amplitude of sinusoids is also estimated using a multi-layer perceptron. The results show that the sound quality after extension is higher than that of narrowband speech, with a significant variation across listeners. Some of the techniques, including excitation extension, are of interest in the field of speech coding. ----- Le pr\'esent m\'emoire propose un syst\`eme d'extension de la bande permettant de…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Data Compression Techniques · Acoustic Wave Phenomena Research
