Interpretable Representation Learning for Speech and Audio Signals Based   on Relevance Weighting

Purvi Agrawal; Sriram Ganapathy

arXiv:2011.02136·eess.AS·November 5, 2020

Interpretable Representation Learning for Speech and Audio Signals Based on Relevance Weighting

Purvi Agrawal, Sriram Ganapathy

PDF

1 Repo

TL;DR

This paper introduces a relevance weighting scheme within neural networks that enhances interpretability and improves performance in speech recognition and sound classification tasks by enabling feature selection during model training.

Contribution

The work presents a novel relevance weighting approach integrated into CNNs for interpretable speech and audio representation learning, applicable to noisy and reverberant environments.

Findings

01

Relevance weights capture meaningful speech/audio content.

02

Improved accuracy in speech recognition on noisy datasets.

03

Enhanced sound classification performance.

Abstract

The learning of interpretable representations from raw data presents significant challenges for time series data like speech. In this work, we propose a relevance weighting scheme that allows the interpretation of the speech representations during the forward propagation of the model itself. The relevance weighting is achieved using a sub-network approach that performs the task of feature selection. A relevance sub-network, applied on the output of first layer of a convolutional neural network model operating on raw speech signals, acts as an acoustic filterbank (FB) layer with relevance weighting. A similar relevance sub-network applied on the second convolutional layer performs modulation filterbank learning with relevance weighting. The full acoustic model consisting of relevance sub-networks, convolutional layers and feed-forward layers is trained for a speech recognition task on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PurviAgrawal/Interpretable_rawWaveform_relevanceWeighted
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.