Bayesian Recurrent Units and the Forward-Backward Algorithm
Alexandre Bittar, Philip N. Garner

TL;DR
This paper introduces Bayesian recurrent units derived from Bayes's theorem, which can be integrated into neural networks to enhance speech recognition performance with minimal additional parameters.
Contribution
It presents a novel theoretical framework linking Bayesian recurrence with neural networks, inspired by hidden Markov models, and demonstrates practical benefits in speech recognition.
Findings
Improved speech recognition accuracy with Bayesian units
Low additional computational cost
Theoretical connection between Bayesian recurrence and HMMs
Abstract
Using Bayes's theorem, we derive a unit-wise recurrence as well as a backward recursion similar to the forward-backward algorithm. The resulting Bayesian recurrent units can be integrated as recurrent neural networks within deep learning frameworks, while retaining a probabilistic interpretation from the direct correspondence with hidden Markov models. Whilst the contribution is mainly theoretical, experiments on speech recognition indicate that adding the derived units at the end of state-of-the-art recurrent architectures can improve the performance at a very low cost in terms of trainable parameters.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and dialogue systems · Speech and Audio Processing
