TR01: Time-continuous Sparse Imputation

J. F. Gemmeke; B. Cranen

arXiv:0901.2416·cs.SD·January 19, 2009·1 cites

TR01: Time-continuous Sparse Imputation

J. F. Gemmeke, B. Cranen

PDF

Open Access

TL;DR

This paper introduces a novel time-continuous sparse imputation method for noise-robust speech recognition, leveraging large time-context information through a sliding window and sparse representations of reliable features.

Contribution

It presents a new approach that exploits large time-context for speech imputation, improving noise robustness over previous frame-by-frame methods.

Findings

01

Effective noise robustness demonstrated on AURORA-2 database.

02

Sparse representation approach improves speech feature estimation.

03

Potential for enhanced automatic speech recognition accuracy.

Abstract

An effective way to increase the noise robustness of automatic speech recognition is to label noisy speech features as either reliable or unreliable (missing) prior to decoding, and to replace the missing ones by clean speech estimates. We present a novel method to obtain such clean speech estimates. Unlike previous imputation frameworks which work on a frame-by-frame basis, our method focuses on exploiting information from a large time-context. Using a sliding window approach, denoised speech representations are constructed using a sparse representation of the reliable features in an overcomplete basis of fixed-length exemplar fragments. We demonstrate the potential of our approach with experiments on the AURORA-2 connected digit database.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Blind Source Separation Techniques · Image and Signal Denoising Methods