Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning   Framework for OOD Intent Discovery

Yutao Mou; Keqing He; Pei Wang; Yanan Wu; Jingang Wang; Wei Wu; Weiran; Xu

arXiv:2210.08909·cs.CL·October 18, 2022·1 cites

Watch the Neighbors: A Unified K-Nearest Neighbor Contrastive Learning Framework for OOD Intent Discovery

Yutao Mou, Keqing He, Pei Wang, Yanan Wu, Jingang Wang, Wei Wu, Weiran, Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a unified K-nearest neighbor contrastive learning framework that improves out-of-domain intent discovery in dialogue systems by addressing in-domain overfitting and bridging representation learning with clustering.

Contribution

It proposes a novel KCL and KCC method that jointly enhances in-domain discriminative features and out-of-domain clustering, outperforming existing approaches.

Findings

01

Significant performance improvements on three benchmark datasets.

02

Effective mitigation of in-domain overfitting.

03

Enhanced clustering quality through hard negative mining.

Abstract

Discovering out-of-domain (OOD) intent is important for developing new skills in task-oriented dialogue systems. The key challenges lie in how to transfer prior in-domain (IND) knowledge to OOD clustering, as well as jointly learn OOD representations and cluster assignments. Previous methods suffer from in-domain overfitting problem, and there is a natural gap between representation learning and clustering objectives. In this paper, we propose a unified K-nearest neighbor contrastive learning framework to discover OOD intents. Specifically, for IND pre-training stage, we propose a KCL objective to learn inter-class discriminative features, while maintaining intra-class diversity, which alleviates the in-domain overfitting problem. For OOD clustering stage, we propose a KCC method to form compact clusters by mining true hard negative samples, which bridges the gap between clustering and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

myt517/kcod
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Topic Modeling · Natural Language Processing Techniques

MethodsContrastive Learning