Bayesian-based Online Label Shift Estimation with Dynamic Dirichlet Priors
Jiawei Hu, Javier A. Barria

TL;DR
This paper introduces Bayesian methods with dynamic Dirichlet priors for online label shift estimation, significantly improving test prior accuracy and classifier performance in streaming and imbalanced data scenarios.
Contribution
It proposes FMAPLS and online-FMAPLS, novel Bayesian algorithms with closed-form hyperparameter updates for real-time label shift estimation.
Findings
Achieves up to 40% lower KL divergence on CIFAR100 and ImageNet.
Improves post-shift classification accuracy under severe class imbalance.
Demonstrates robustness and scalability in large-scale, dynamic environments.
Abstract
Label shift, a prevalent challenge in supervised learning, arises when the class prior distribution of test data differs from that of training data, leading to significant degradation in classifier performance. To accurately estimate the test priors and enhance classification accuracy, we propose a Bayesian framework for label shift estimation, termed Full Maximum A Posterior Label Shift (FMAPLS), along with its online version, online-FMAPLS. Leveraging batch and online Expectation-Maximization (EM) algorithms, these methods jointly and dynamically optimize Dirichlet hyperparameters and class priors , thereby overcoming the rigid constraints of the existing Maximum A Posterior Label Shift (MAPLS) approach. Moreover, we introduce a linear surrogate function (LSF) to replace gradient-based hyperparameter updates, yielding closed-form solutions that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
