Exploring the Advantages of Dense-Vector to One-Hot Encoding of Intent   Classes in Out-of-Scope Detection Tasks

Claudio Pinhanez; Paulo Cavalin

arXiv:2205.09021·cs.LG·May 19, 2022·1 cites

Exploring the Advantages of Dense-Vector to One-Hot Encoding of Intent Classes in Out-of-Scope Detection Tasks

Claudio Pinhanez, Paulo Cavalin

PDF

Open Access

TL;DR

This paper investigates how dense-vector encodings of intent classes can significantly improve out-of-scope detection in intent classification tasks, surpassing traditional one-hot encoding methods, and introduces a new algorithm for optimizing these encodings.

Contribution

It demonstrates that dense-vector encodings, even random ones, can outperform one-hot encodings in OOS detection and proposes a novel search algorithm for effective dense-vector encoding selection.

Findings

01

Dense-vector encodings create richer OOS space topologies.

02

Random dense-vector encodings outperform one-hot encodings by over 20%.

03

The proposed search algorithm shows promising initial results.

Abstract

This work explores the intrinsic limitations of the popular one-hot encoding method in classification of intents when detection of out-of-scope (OOS) inputs is required. Although recent work has shown that there can be significant improvements in OOS detection when the intent classes are represented as dense-vectors based on domain specific knowledge, we argue in this paper that such gains are more likely due to advantages of dense-vector to one-hot encoding methods in representing the complexity of the OOS space. We start by showing how dense-vector encodings can create OOS spaces with much richer topologies than one-hot encoding methods. We then demonstrate empirically, using four standard intent classification datasets, that knowledge-free, randomly generated dense-vector encodings of intent classes can yield massive, over 20% gains over one-hot encodings, and also outperform the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Machine Learning and Data Classification