A Bayesian Approach to Estimation of Speaker Normalization Parameters
Dhananjay Ram, Debasis Kundu, Rajesh M. Hegde

TL;DR
This paper introduces a Bayesian method using Gibbs sampling for vocal tract length normalization in speech recognition, improving speaker normalization accuracy and recognition performance over traditional models.
Contribution
It presents a novel Bayesian approach with hyperparameter estimation for VTLN, modeling vocal tract variation more effectively than previous linear models.
Findings
Bayesian VTLN improves recognition accuracy significantly.
The method outperforms traditional linear models in modeling vocal tract variation.
Experimental results show enhanced vowel and phrase recognition performance.
Abstract
In this work, a Bayesian approach to speaker normalization is proposed to compensate for the degradation in performance of a speaker independent speech recognition system. The speaker normalization method proposed herein uses the technique of vocal tract length normalization (VTLN). The VTLN parameters are estimated using a novel Bayesian approach which utilizes the Gibbs sampler, a special type of Markov Chain Monte Carlo method. Additionally the hyperparameters are estimated using maximum likelihood approach. This model is used assuming that human vocal tract can be modeled as a tube of uniform cross section. It captures the variation in length of the vocal tract of different speakers more effectively, than the linear model used in literature. The work has also investigated different methods like minimization of Mean Square Error (MSE) and Mean Absolute Error (MAE) for the estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Advanced Data Compression Techniques
