Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition

Ridha Ejbali; Mourad Zaied; Chokri Ben Amar

arXiv:1211.2007 · cs.CV · November 12, 2012 · 1 cites

Open Access

TL;DR

This paper introduces MIMOWN, a new wavelet network architecture designed to improve speech recognition by effectively modeling acoustic units through multi-input multi-output capabilities.

Contribution

The paper presents MIMOWN, a novel wavelet network architecture that generalizes previous models and enhances acoustic unit modeling for speech recognition.

Findings

01

MIMOWN effectively models acoustic units in speech recognition.

02

The architecture overcomes limitations of previous wavelet networks.

03

The multi-input multi-output design allows the network to be trained on diverse examples of acoustic units.

Abstract

In this paper, we propose a novel wavelet network architecture, the Multi-input Multi-output Wavelet Network (MIMOWN), as a generalization of the earlier wavelet network architecture. This new prototype was applied to speech recognition, specifically to model the acoustic units of speech. The originality of our work lies in using MIMOWN to model acoustic units of speech. This approach was proposed to overcome the limitations of the earlier wavelet network model. The multi-input multi-output architecture allows the wavelet network to be trained on various examples of acoustic units.
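To make the multi-input multi-output idea concrete, here is a minimal sketch of a forward pass through a wavelet network with several inputs and several outputs. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the layer structure, so the sketch assumes the common form in which each hidden unit is a product of dilated and translated one-dimensional wavelets and each output is a weighted sum of hidden units. The Mexican hat wavelet stands in for the Beta wavelet named in the title, and the names `mexican_hat`, `hidden_activation`, and `mimo_forward` are hypothetical.

```python
import math


def mexican_hat(u):
    """Mexican hat mother wavelet (assumed stand-in for the Beta wavelet)."""
    return (1.0 - u * u) * math.exp(-0.5 * u * u)


def hidden_activation(x, translations, dilations):
    """One hidden unit: product of 1-D wavelets, one per input dimension."""
    act = 1.0
    for xi, t, s in zip(x, translations, dilations):
        act *= mexican_hat((xi - t) / s)
    return act


def mimo_forward(x, units, weights):
    """Map an input vector to an output vector.

    x       : list of input values (multi-input)
    units   : list of (translations, dilations) pairs, one per hidden wavelet
    weights : weights[k][j] connects hidden unit j to output k (multi-output)
    """
    hidden = [hidden_activation(x, t, s) for t, s in units]
    return [sum(w * h for w, h in zip(row, hidden)) for row in weights]


# Hypothetical toy network: 2 inputs, 2 hidden wavelets, 3 outputs.
units = [([0.0, 0.0], [1.0, 1.0]), ([1.0, -1.0], [2.0, 0.5])]
weights = [[1.0, 0.0], [0.5, -2.0], [0.0, 1.0]]
outputs = mimo_forward([0.0, 0.0], units, weights)
```

In this sketch all outputs share the same hidden wavelets and differ only in their output weights, which is one plausible way a single network could be fitted to several examples of an acoustic unit at once.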

Taxonomy

Topics: Speech and Audio Processing · Neural Networks and Applications · Ultrasonics and Acoustic Wave Propagation