Bag of Tricks for In-Distribution Calibration of Pretrained Transformers
Jaeyoung Kim, Dongbin Na, Sungchul Choi, Sungbin Lim

TL;DR
This paper empirically studies confidence calibration in pre-trained language models, revealing limitations of existing methods and proposing the Calibrated PLM (CALL) to improve both calibration and accuracy.
Contribution
It introduces CALL, a combined calibration technique for PLMs, and provides extensive analysis of calibration methods' effects on PLM performance.
Findings
Ensemble models overfit training data and have poor calibration.
Confidence penalty loss introduces a trade-off between calibration and accuracy.
CALL effectively improves both calibration and classification performance.
Abstract
While pre-trained language models (PLMs) have become a de-facto standard promoting the accuracy of text classification tasks, recent studies find that PLMs often predict over-confidently. Although various calibration methods have been proposed, such as ensemble learning and data augmentation, most of the methods have been verified in computer vision benchmarks rather than in PLM-based text classification tasks. In this paper, we present an empirical study on confidence calibration for PLMs, addressing three categories, including confidence penalty losses, data augmentations, and ensemble methods. We find that the ensemble model overfitted to the training set shows sub-par calibration performance and also observe that PLMs trained with confidence penalty loss have a trade-off between calibration and accuracy. Building on these observations, we propose the Calibrated PLM (CALL), a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification
