A new approach for fine-tuning sentence transformers for intent   classification and out-of-scope detection tasks

Tianyi Zhang; Atta Norouzian; Aanchan Mohan; Frederick Ducatelle

arXiv:2410.13649·cs.CL·October 22, 2024

A new approach for fine-tuning sentence transformers for intent classification and out-of-scope detection tasks

Tianyi Zhang, Atta Norouzian, Aanchan Mohan, Frederick Ducatelle

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel fine-tuning method for sentence transformers that improves out-of-scope detection in virtual assistants by combining intent classification with embedding reconstruction, leading to better OOS rejection without harming classification accuracy.

Contribution

The paper proposes a regularization technique using auto-encoder based embedding reconstruction to enhance OOS detection in sentence transformers.

Findings

01

Achieved 1-4% improvement in OOS rejection performance.

02

Maintained comparable intent classification accuracy.

03

Enhanced embedding space separation for in-scope and OOS queries.

Abstract

In virtual assistant (VA) systems it is important to reject or redirect user queries that fall outside the scope of the system. One of the most accurate approaches for out-of-scope (OOS) rejection is to combine it with the task of intent classification on in-scope queries, and to use methods based on the similarity of embeddings produced by transformer-based sentence encoders. Typically, such encoders are fine-tuned for the intent-classification task, using cross-entropy loss. Recent work has shown that while this produces suitable embeddings for the intent-classification task, it also tends to disperse in-scope embeddings over the full sentence embedding space. This causes the in-scope embeddings to potentially overlap with OOS embeddings, thereby making OOS rejection difficult. This is compounded when OOS data is unknown. To mitigate this issue our work proposes to regularize the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

slanglab-nu/autoencoder-oos
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning