ProtoInfoMax: Prototypical Networks with Mutual Information Maximization for Out-of-Domain Detection

Iftitahu Ni'mah; Meng Fang; Vlado Menkovski; Mykola Pechenizkiy

arXiv:2108.12229·cs.CL·January 20, 2022

TL;DR

ProtoInfoMax extends Prototypical Networks with a mutual information maximization (InfoMax) objective for out-of-domain (OOD) detection in NLP, improving detection performance by up to 20% in low-resource text classification and reducing overconfidence errors.

Contribution

The paper introduces ProtoInfoMax, a new architecture that extends Prototypical Networks with a mutual information maximization (InfoMax) objective so that in-domain and OOD sentences are processed simultaneously.

Findings

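01. Improves OOD detection performance by up to 20% in low-resource text classification settings.

02. Is less prone to the overconfidence errors typical of neural networks.

03. Enhances the reliability of predictions in NLP applications such as intent classification in dialogue systems.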

Abstract

The ability to detect Out-of-Domain (OOD) inputs is a critical requirement in many real-world NLP applications, such as intent classification in dialogue systems, because unsupported OOD inputs may lead to catastrophic system failure. However, it remains an empirical question whether current methods can tackle such problems reliably in a realistic scenario where zero OOD training data is available. In this study, we propose ProtoInfoMax, a new architecture that extends Prototypical Networks to simultaneously process in-domain and OOD sentences via a Mutual Information Maximization (InfoMax) objective. Experimental results show that our proposed method can substantially improve OOD detection performance, by up to 20%, in low-resource text classification settings. We also show that ProtoInfoMax is less prone to typical overconfidence errors of…
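
The general idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes sentence embeddings are already available as plain lists of floats, builds one prototype per domain as the mean of its support embeddings, and flags a query as OOD when its best cosine similarity falls below a threshold. The loss function shows one common way to estimate a mutual information lower bound, the Jensen-Shannon style estimator. Whether ProtoInfoMax uses exactly this estimator, cosine similarity, or a fixed threshold is an assumption here, and all function names, the toy data, and the 0.5 threshold are hypothetical.

```python
import math


def mean_vector(vectors):
    """Average equal-length embedding vectors into a single prototype."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def build_prototypes(support):
    """support maps a domain name to a list of sentence embeddings."""
    return {domain: mean_vector(vecs) for domain, vecs in support.items()}


def nearest_prototype(query, prototypes):
    """Return (domain, similarity) for the most similar prototype."""
    best = max(prototypes, key=lambda d: cosine(query, prototypes[d]))
    return best, cosine(query, prototypes[best])


def is_ood(query, prototypes, threshold=0.5):
    """Assumed decision rule: OOD if no prototype is similar enough."""
    _, similarity = nearest_prototype(query, prototypes)
    return similarity < threshold


def infomax_jsd_loss(pos_scores, neg_scores):
    """Negated Jensen-Shannon MI lower bound.

    pos_scores: scores for (sentence, own-domain prototype) pairs.
    neg_scores: scores for (sentence, other-domain prototype) pairs.
    Minimizing this pushes positive scores up and negative scores down.
    """
    pos_term = sum(softplus(-s) for s in pos_scores) / len(pos_scores)
    neg_term = sum(softplus(s) for s in neg_scores) / len(neg_scores)
    return pos_term + neg_term


# Toy 2-dimensional embeddings (hypothetical data, for illustration only).
support = {
    "banking": [[1.0, 0.0], [0.9, 0.1]],
    "travel": [[0.0, 1.0], [0.1, 0.9]],
}
prototypes = build_prototypes(support)

in_domain_query = [1.0, 0.1]
unrelated_query = [-1.0, -1.0]

print(nearest_prototype(in_domain_query, prototypes))
print(is_ood(in_domain_query, prototypes), is_ood(unrelated_query, prototypes))
```

In a trained model the scores passed to `infomax_jsd_loss` would come from a learned encoder and similarity function, and the threshold would be tuned on held-out data rather than fixed by hand.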

Code & Models

Repositories

inimah/protoinfomax (PyTorch, official)