Bag of Tricks for In-Distribution Calibration of Pretrained Transformers

Jaeyoung Kim; Dongbin Na; Sungchul Choi; Sungbin Lim

arXiv:2302.06690·cs.CL·February 15, 2023

Bag of Tricks for In-Distribution Calibration of Pretrained Transformers

Jaeyoung Kim, Dongbin Na, Sungchul Choi, Sungbin Lim

PDF

Open Access 1 Repo

TL;DR

This paper empirically studies confidence calibration in pre-trained language models, revealing limitations of existing methods and proposing the Calibrated PLM (CALL) to improve both calibration and accuracy.

Contribution

It introduces CALL, a combined calibration technique for PLMs, and provides extensive analysis of calibration methods' effects on PLM performance.

Findings

01

Ensemble models overfit training data and have poor calibration.

02

Confidence penalty loss introduces a trade-off between calibration and accuracy.

03

CALL effectively improves both calibration and classification performance.

Abstract

While pre-trained language models (PLMs) have become a de-facto standard promoting the accuracy of text classification tasks, recent studies find that PLMs often predict over-confidently. Although various calibration methods have been proposed, such as ensemble learning and data augmentation, most of the methods have been verified in computer vision benchmarks rather than in PLM-based text classification tasks. In this paper, we present an empirical study on confidence calibration for PLMs, addressing three categories, including confidence penalty losses, data augmentations, and ensemble methods. We find that the ensemble model overfitted to the training set shows sub-par calibration performance and also observe that PLMs trained with confidence penalty loss have a trade-off between calibration and accuracy. Building on these observations, we propose the Calibrated PLM (CALL), a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kimjeyoung/plm_call
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification