Shift Happens: Adjusting Classifiers

Theodore James Thibault Heiser; Mari-Liis Allikivi; Meelis Kull

arXiv:2111.02529·cs.LG·November 5, 2021

Shift Happens: Adjusting Classifiers

Theodore James Thibault Heiser, Mari-Liis Allikivi, Meelis Kull

PDF

TL;DR

This paper introduces methods to adjust probabilistic classifiers for dataset shift by transforming predictions to better match new class distributions, with theoretical guarantees and experimental validation.

Contribution

It proposes unbounded and bounded adjustment methods that improve classifier calibration under dataset shift, extending proper scoring rule minimization.

Findings

01

Methods reduce loss when class distribution is known

02

Adjustments improve performance under approximate class distribution knowledge

03

Theoretical guarantees support practical effectiveness

Abstract

Minimizing expected loss measured by a proper scoring rule, such as Brier score or log-loss (cross-entropy), is a common objective while training a probabilistic classifier. If the data have experienced dataset shift where the class distributions change post-training, then often the model's performance will decrease, over-estimating the probabilities of some classes while under-estimating the others on average. We propose unbounded and bounded general adjustment (UGA and BGA) methods that transform all predictions to (re-)equalize the average prediction and the class distribution. These methods act differently depending on which proper scoring rule is to be minimized, and we have a theoretical guarantee of reducing loss on test data, if the exact class distribution is known. We also demonstrate experimentally that, when in practice the class distribution is known only approximately,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.