A new approach for fine-tuning sentence transformers for intent classification and out-of-scope detection tasks
Tianyi Zhang, Atta Norouzian, Aanchan Mohan, Frederick Ducatelle

TL;DR
This paper introduces a novel fine-tuning method for sentence transformers that improves out-of-scope detection in virtual assistants by combining intent classification with embedding reconstruction, leading to better OOS rejection without harming classification accuracy.
Contribution
The paper proposes a regularization technique using auto-encoder based embedding reconstruction to enhance OOS detection in sentence transformers.
Findings
Achieved 1-4% improvement in OOS rejection performance.
Maintained comparable intent classification accuracy.
Enhanced embedding space separation for in-scope and OOS queries.
Abstract
In virtual assistant (VA) systems it is important to reject or redirect user queries that fall outside the scope of the system. One of the most accurate approaches for out-of-scope (OOS) rejection is to combine it with the task of intent classification on in-scope queries, and to use methods based on the similarity of embeddings produced by transformer-based sentence encoders. Typically, such encoders are fine-tuned for the intent-classification task, using cross-entropy loss. Recent work has shown that while this produces suitable embeddings for the intent-classification task, it also tends to disperse in-scope embeddings over the full sentence embedding space. This causes the in-scope embeddings to potentially overlap with OOS embeddings, thereby making OOS rejection difficult. This is compounded when OOS data is unknown. To mitigate this issue our work proposes to regularize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning
