SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks with Multi-Part Loss Functions
A. Ali Heydari, Craig A. Thompson, Asif Mehmood

TL;DR
SoftAdapt introduces a dynamic weighting scheme for multi-part loss functions in neural networks, improving convergence and performance by adjusting weights based on real-time loss statistics.
Contribution
It proposes a novel, mathematically intuitive, and efficient method for adaptive loss weighting called SoftAdapt, applicable to various neural network tasks.
Findings
Improved convergence speed in neural network training.
Enhanced performance in image reconstruction and data generation tasks.
Demonstrated effectiveness across different neural network architectures.
Abstract
Adaptive loss function formulation is an active area of research and has gained a great deal of popularity in recent years, following the success of deep learning. However, existing frameworks of adaptive loss functions often suffer from slow convergence and poor choice of weights for the loss components. Traditionally, the elements of a multi-part loss function are weighted equally or their weights are determined through heuristic approaches that yield near-optimal (or sub-optimal) results. To address this problem, we propose a family of methods, called SoftAdapt, that dynamically change function weights for multi-part loss functions based on live performance statistics of the component losses. SoftAdapt is mathematically intuitive, computationally efficient and straightforward to implement. In this paper, we present the mathematical formulation and pseudocode for SoftAdapt, along with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis
MethodsAdaptive Robust Loss
