Generalized Data Weighting via Class-level Gradient Manipulation
Can Chen, Shuhao Zheng, Xi Chen, Erqun Dong, Xue Liu, Hao Liu, Dejing, Dou

TL;DR
This paper introduces Generalized Data Weighting (GDW), a novel method that manipulates class-level gradients to simultaneously address label noise and class imbalance without extra computational cost.
Contribution
GDW unrolls loss gradients to class-level gradients and reweights them, improving performance on noisy and imbalanced datasets with efficient class-level weighting.
Findings
GDW outperforms state-of-the-art methods by 2.56% on CIFAR10 with 60% uniform noise.
GDW effectively mitigates label noise and class imbalance simultaneously.
GDW achieves these improvements without additional computational overhead.
Abstract
Label noise and class imbalance are two major issues coexisting in real-world datasets. To alleviate the two issues, state-of-the-art methods reweight each instance by leveraging a small amount of clean and unbiased data. Yet, these methods overlook class-level information within each instance, which can be further utilized to improve performance. To this end, in this paper, we propose Generalized Data Weighting (GDW) to simultaneously mitigate label noise and class imbalance by manipulating gradients at the class level. To be specific, GDW unrolls the loss gradient to class-level gradients by the chain rule and reweights the flow of each gradient separately. In this way, GDW achieves remarkable performance improvement on both issues. Aside from the performance gain, GDW efficiently obtains class-level weights without introducing any extra computational cost compared with instance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Anomaly Detection Techniques and Applications
