Addressing Label Shift in Distributed Learning via Entropy   Regularization

Zhiyuan Wu; Changkyu Choi; Xiangcheng Cao; Volkan Cevher; Ali; Ramezani-Kebrya

arXiv:2502.02544·cs.LG·February 5, 2025

Addressing Label Shift in Distributed Learning via Entropy Regularization

Zhiyuan Wu, Changkyu Choi, Xiangcheng Cao, Volkan Cevher, Ali, Ramezani-Kebrya

PDF

Open Access

TL;DR

This paper introduces VRLS, a novel entropy regularization method for mitigating label shift in distributed learning, significantly improving model accuracy under challenging label distribution changes across nodes.

Contribution

The paper proposes VRLS, a new entropy-based regularization technique that enhances label shift adaptation in distributed learning environments, with theoretical guarantees and practical improvements.

Findings

01

VRLS outperforms baselines by up to 20% on benchmark datasets.

02

It effectively mitigates label shift in multi-node distributed learning.

03

Theoretical bounds support the robustness of VRLS.

Abstract

We address the challenge of minimizing true risk in multi-node distributed learning. These systems are frequently exposed to both inter-node and intra-node label shifts, which present a critical obstacle to effectively optimizing model performance while ensuring that data remains confined to each node. To tackle this, we propose the Versatile Robust Label Shift (VRLS) method, which enhances the maximum likelihood estimation of the test-to-train label density ratio. VRLS incorporates Shannon entropy-based regularization and adjusts the density ratio during training to better handle label shifts at the test time. In multi-node learning environments, VRLS further extends its capabilities by learning and adapting density ratios across nodes, effectively mitigating label shifts and improving overall model performance. Experiments conducted on MNIST, Fashion MNIST, and CIFAR-10 demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Adaptive Filtering Techniques