Unsupervised Calibration through Prior Adaptation for Text Classification using Large Language Models

Lautaro Estienne; Luciana Ferrer; Matías Vera; Pablo Piantanida

arXiv:2307.06713 · cs.CL · October 9, 2023 · 1 citation

TL;DR

This paper introduces an unsupervised method to calibrate large language models for text classification by adapting prior class distributions using minimal in-domain data, improving performance without labeled samples.

Contribution

The work presents a prior adaptation technique that calibrates LLM posteriors for text classification without labeled data, treating the model as a black box, and that outperforms a previous calibration approach that uses no adaptation data.

Findings

01 Outperforms the unadapted model across different numbers of training shots in the prompt

02 Calibrates effectively with only a few unlabeled in-domain queries

03 Surpasses a previous approach in which calibration is performed without any adaptation data

Abstract

A wide variety of natural language tasks are currently being addressed with large-scale language models (LLMs). These models are usually trained with a very large amount of unsupervised text data and adapted to perform a downstream natural language task using methods like fine-tuning, calibration or in-context learning. In this work, we propose an approach to adapt the prior class distribution to perform text classification tasks without the need for labelled samples, using only a few in-domain sample queries. The proposed approach treats the LLM as a black box, adding a stage where the model posteriors are calibrated to the task. Results show that these methods outperform the un-adapted model for different numbers of training shots in the prompt, as well as a previous approach where calibration is performed without using any adaptation data.
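The black-box calibration stage described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a simplified, one-step prior correction: the class prior implied by the model is estimated as the mean posterior over a few unlabelled in-domain queries, and each posterior is then rescaled towards a target prior and renormalized. The function name `adapt_priors`, the uniform target prior, and the toy posterior values are all assumptions made for illustration.

```python
import numpy as np


def adapt_priors(posteriors, target_prior=None, eps=1e-12):
    """Rescale black-box posteriors towards a target class prior.

    Hypothetical helper, not taken from the paper or its repository.
    posteriors: array of shape (n_queries, n_classes), rows summing to 1,
                obtained from the LLM on unlabelled in-domain queries.
    target_prior: desired class prior; uniform if None (an assumption).
    """
    posteriors = np.asarray(posteriors, dtype=float)
    n_classes = posteriors.shape[1]
    if target_prior is None:
        target_prior = np.full(n_classes, 1.0 / n_classes)
    target_prior = np.asarray(target_prior, dtype=float)

    # Estimate the prior implied by the model as the mean posterior
    # over the unlabelled queries (no labels are used anywhere).
    source_prior = posteriors.mean(axis=0)

    # Divide out the estimated prior, multiply in the target prior,
    # then renormalize each row so it is a distribution again.
    scaled = posteriors * (target_prior / np.maximum(source_prior, eps))
    return scaled / scaled.sum(axis=1, keepdims=True)


# Toy posteriors from a model that is biased towards class 0.
raw = np.array([
    [0.9, 0.1],
    [0.8, 0.2],
    [0.7, 0.3],
    [0.6, 0.4],
])
adapted = adapt_priors(raw)
print(raw.argmax(axis=1), adapted.argmax(axis=1))
```

In this toy example every raw prediction is class 0, while after the correction the two least confident queries switch to class 1. Only the posteriors are needed, so the model itself is never modified, which is what treating the LLM as a black box amounts to in this sketch.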

Code & Models

Repositories

LautaroEst/efficient-reestimation (PyTorch)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis