Seesaw Loss for Long-Tailed Instance Segmentation
Jiaqi Wang, Wenwei Zhang, Yuhang Zang, Yuhang Cao, Jiangmiao Pang, Tao, Gong, Kai Chen, Ziwei Liu, Chen Change Loy, Dahua Lin

TL;DR
Seesaw Loss is a novel loss function designed to improve long-tailed instance segmentation by dynamically balancing positive and negative sample gradients, leading to better performance on datasets with skewed category distributions.
Contribution
The paper introduces Seesaw Loss, which adaptively re-balances gradients for long-tailed data, significantly enhancing segmentation accuracy without complex modifications.
Findings
Achieves state-of-the-art results on LVIS dataset.
Significant improvements over Cross-Entropy Loss in long-tailed scenarios.
Effective across various frameworks and sampling strategies.
Abstract
Instance segmentation has witnessed a remarkable progress on class-balanced benchmarks. However, they fail to perform as accurately in real-world scenarios, where the category distribution of objects naturally comes with a long tail. Instances of head classes dominate a long-tailed dataset and they serve as negative samples of tail categories. The overwhelming gradients of negative samples on tail classes lead to a biased learning process for classifiers. Consequently, objects of tail categories are more likely to be misclassified as backgrounds or head categories. To tackle this problem, we propose Seesaw Loss to dynamically re-balance gradients of positive and negative samples for each category, with two complementary factors, i.e., mitigation factor and compensation factor. The mitigation factor reduces punishments to tail categories w.r.t. the ratio of cumulative training instances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsSeesaw Loss
