Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding

Suyoung Kim; Jiyeon Hwang; Ho-Young Jung

arXiv:2405.15097·cs.CL·May 27, 2024

TL;DR

This paper introduces a two-stage Contrastive and Consistency Learning method to enhance neural noisy-channel models for spoken language understanding, improving robustness against ASR errors in noisy environments.

Contribution

The paper proposes CCL, a two-stage method that correlates error patterns between clean and noisy ASR transcripts and enforces consistency between their latent features, making SLU systems more robust to ASR errors.

Findings

01 CCL outperforms existing methods on benchmark datasets.

02 CCL improves the robustness of SLU models in noisy environments.

03 CCL improves the handling of transcription inconsistencies caused by ASR errors.

Abstract

Recently, deep end-to-end learning has been studied for intent classification in Spoken Language Understanding (SLU). However, end-to-end models require a large amount of speech data with intent labels, and highly optimized models are generally sensitive to the inconsistency between the training and evaluation conditions. Therefore, a natural language understanding approach based on Automatic Speech Recognition (ASR) remains attractive because it can utilize a pre-trained general language model and adapt to the mismatch of the speech input environment. Using this module-based approach, we improve a noisy-channel model to handle transcription inconsistencies caused by ASR errors. We propose a two-stage method, Contrastive and Consistency Learning (CCL), that correlates error patterns between clean and noisy ASR transcripts and emphasizes the consistency of the latent features of the two…
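
The two ideas named in the abstract, contrasting clean and noisy transcript features and keeping their latent features consistent, can be illustrated with a minimal sketch. This is not the authors' formulation: the abstract is truncated and does not give the loss definitions, so the sketch assumes a generic InfoNCE-style contrastive loss and a mean-squared-error consistency loss. The function names, the temperature value, and the toy feature vectors are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(clean, noisy, temperature=0.1):
    """InfoNCE-style loss (an assumption, not taken from the paper).

    For clean feature i, noisy feature i is the positive and every other
    noisy feature in the batch is a negative.
    """
    total = 0.0
    for i, c in enumerate(clean):
        logits = [cosine(c, n) / temperature for n in noisy]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / len(clean)


def consistency_loss(clean, noisy):
    """Mean squared error between paired clean and noisy features
    (again an assumed choice of distance)."""
    total = 0.0
    for c, n in zip(clean, noisy):
        total += sum((a - b) ** 2 for a, b in zip(c, n)) / len(c)
    return total / len(clean)


# Toy latent features: row i of `noisy` is the ASR-corrupted version
# of row i of `clean` (hypothetical values).
clean = [[1.0, 0.0], [0.0, 1.0]]
noisy = [[0.9, 0.1], [0.2, 0.8]]

print("contrastive:", contrastive_loss(clean, noisy))
print("consistency:", consistency_loss(clean, noisy))
```

Both terms shrink as each noisy feature moves toward its clean counterpart; the contrastive term additionally pushes it away from the features of other utterances in the batch. How the paper schedules the two objectives across its two stages is not stated in the truncated abstract, so the sketch leaves them as separate functions.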

Code & Models

Repositories

syoung7388/ccl (PyTorch, official)

Videos

Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding · Underline

Taxonomy

Topics: Neural Networks and Applications · Speech Recognition and Synthesis