SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks   with Multi-Part Loss Functions

A. Ali Heydari; Craig A. Thompson; Asif Mehmood

arXiv:1912.12355·cs.LG·January 1, 2020·68 cites

SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks with Multi-Part Loss Functions

A. Ali Heydari, Craig A. Thompson, Asif Mehmood

PDF

Open Access 2 Repos

TL;DR

SoftAdapt introduces a dynamic weighting scheme for multi-part loss functions in neural networks, improving convergence and performance by adjusting weights based on real-time loss statistics.

Contribution

It proposes a novel, mathematically intuitive, and efficient method for adaptive loss weighting called SoftAdapt, applicable to various neural network tasks.

Findings

01

Improved convergence speed in neural network training.

02

Enhanced performance in image reconstruction and data generation tasks.

03

Demonstrated effectiveness across different neural network architectures.

Abstract

Adaptive loss function formulation is an active area of research and has gained a great deal of popularity in recent years, following the success of deep learning. However, existing frameworks of adaptive loss functions often suffer from slow convergence and poor choice of weights for the loss components. Traditionally, the elements of a multi-part loss function are weighted equally or their weights are determined through heuristic approaches that yield near-optimal (or sub-optimal) results. To address this problem, we propose a family of methods, called SoftAdapt, that dynamically change function weights for multi-part loss functions based on live performance statistics of the component losses. SoftAdapt is mathematically intuitive, computationally efficient and straightforward to implement. In this paper, we present the mathematical formulation and pseudocode for SoftAdapt, along with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis

MethodsAdaptive Robust Loss