Pseudo-OOD training for robust language models

Dhanasekar Sundararaman; Nikhil Mehta; Lawrence Carin

arXiv:2210.09132·cs.CL·October 18, 2022

Pseudo-OOD training for robust language models

Dhanasekar Sundararaman, Nikhil Mehta, Lawrence Carin

PDF

Open Access

TL;DR

This paper introduces POORE, a post hoc method that generates pseudo-OOD samples from in-distribution data to improve out-of-distribution detection in language models, achieving state-of-the-art results.

Contribution

The paper proposes a novel post hoc regularization framework that creates pseudo-OOD samples for better OOD detection without needing prior OOD data.

Findings

01

Significant improvements in OOD detection accuracy.

02

Achieved new state-of-the-art results on three dialogue systems.

03

Effective in real-world NLP applications.

Abstract

While pre-trained large-scale deep models have garnered attention as an important topic for many downstream natural language processing (NLP) tasks, such models often make unreliable predictions on out-of-distribution (OOD) inputs. As such, OOD detection is a key component of a reliable machine-learning model for any industry-scale application. Common approaches often assume access to additional OOD samples during the training stage, however, outlier distribution is often unknown in advance. Instead, we propose a post hoc framework called POORE - POsthoc pseudo-Ood REgularization, that generates pseudo-OOD samples using in-distribution (IND) data. The model is fine-tuned by introducing a new regularization loss that separates the embeddings of IND and OOD data, which leads to significant gains on the OOD prediction task during testing. We extensively evaluate our framework on three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsHigh-Order Consensuses