Asymmetric Loss For Multi-Label Classification
Emanuel Ben-Baruch, Tal Ridnik, Nadav Zamir, Asaf Noy, Itamar, Friedman, Matan Protter, Lihi Zelnik-Manor

TL;DR
This paper introduces an asymmetric loss function for multi-label classification that dynamically balances positive and negative samples, leading to improved accuracy and state-of-the-art results across multiple datasets.
Contribution
The paper proposes a novel asymmetric loss (ASL) that differentially weights positive and negative samples, enhancing training effectiveness in imbalanced multi-label tasks.
Findings
Achieved state-of-the-art mAP scores on MS-COCO, Pascal-VOC, NUS-WIDE, and Open Images.
Demonstrated ASL's effectiveness in single-label classification and object detection.
ASL is easy to implement and does not increase training complexity.
Abstract
In a typical multi-label setting, a picture contains on average few positive labels, and many negative ones. This positive-negative imbalance dominates the optimization process, and can lead to under-emphasizing gradients from positive labels during training, resulting in poor accuracy. In this paper, we introduce a novel asymmetric loss ("ASL"), which operates differently on positive and negative samples. The loss enables to dynamically down-weights and hard-thresholds easy negative samples, while also discarding possibly mislabeled samples. We demonstrate how ASL can balance the probabilities of different samples, and how this balancing is translated to better mAP scores. With ASL, we reach state-of-the-art results on multiple popular multi-label datasets: MS-COCO, Pascal-VOC, NUS-WIDE and Open Images. We also demonstrate ASL applicability for other tasks, such as single-label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Text and Document Classification Technologies
