Your Finetuned Large Language Model is Already a Powerful   Out-of-distribution Detector

Andi Zhang; Tim Z. Xiao; Weiyang Liu; Robert Bamler; Damon Wischik

arXiv:2404.08679·cs.CL·March 7, 2025·1 cites

Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector

Andi Zhang, Tim Z. Xiao, Weiyang Liu, Robert Bamler, Damon Wischik

PDF

Open Access

TL;DR

This paper proposes using the likelihood ratio between pretrained and finetuned large language models as an effective, easy-to-implement criterion for out-of-distribution detection across various tasks, enhancing model reliability.

Contribution

It introduces a novel OOD detection method based on likelihood ratios of LLMs that requires no additional training and can be applied directly using existing models and frameworks.

Findings

01

Likelihood ratio effectively detects OOD data in multiple scenarios.

02

The method is simple to implement with existing neural network loss functions.

03

Evaluation shows strong performance across diverse OOD detection tasks.

Abstract

We revisit the likelihood ratio between a pretrained large language model (LLM) and its finetuned variant as a criterion for out-of-distribution (OOD) detection. The intuition behind such a criterion is that, the pretrained LLM has the prior knowledge about OOD data due to its large amount of training data, and once finetuned with the in-distribution data, the LLM has sufficient knowledge to distinguish their difference. Leveraging the power of LLMs, we show that, the likelihood ratio can serve as an effective OOD detection criterion. Moreover, we apply the proposed LLM-based likelihood ratio to detect OOD questions in question-answering (QA) systems, which can be used to improve the performance of specialized LLMs for general questions. Given that likelihood can be easily obtained by the loss functions within contemporary neural network frameworks, it is straightforward to implement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling