Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations   with Subwords

Prashanth Gurunath Shivakumar; Panayiotis Georgiou; Shrikanth; Narayanan

arXiv:2102.02270·cs.CL·May 4, 2022

Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords

Prashanth Gurunath Shivakumar, Panayiotis Georgiou, Shrikanth, Narayanan

PDF

1 Repo

TL;DR

This paper introduces Confusion2vec 2.0, a novel word embedding that incorporates subword information to better capture acoustic ambiguities in spoken language, improving performance on various linguistic tasks with ASR errors.

Contribution

It proposes an unsupervised subword-based extension of Confusion2vec that effectively models acoustic ambiguities directly from ASR lattices, outperforming existing embeddings.

Findings

01

Significantly better performance on semantic, syntactic, and acoustic analogy tasks.

02

Enhanced spoken language intent detection accuracy.

03

Reduces need for retraining NLU models on ASR transcripts.

Abstract

Word vector representations enable machines to encode human language for spoken language understanding and processing. Confusion2vec, motivated from human speech production and perception, is a word vector representation which encodes ambiguities present in human spoken language in addition to semantics and syntactic information. Confusion2vec provides a robust spoken language representation by considering inherent human language ambiguities. In this paper, we propose a novel word vector space estimation by unsupervised learning on lattices output by an automatic speech recognition (ASR) system. We encode each word in confusion2vec vector space by its constituent subword character n-grams. We show the subword encoding helps better represent the acoustic perceptual ambiguities in human spoken language via information modeled on lattice structured ASR output. The usefulness of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pgurunath/confusion2vec_2.0
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.