Hybridised Loss Functions for Improved Neural Network Generalisation

Matthew C. Dickson; Anna S. Bosman; Katherine M. Malan

arXiv:2204.12244·cs.LG·April 27, 2022

Hybridised Loss Functions for Improved Neural Network Generalisation

Matthew C. Dickson, Anna S. Bosman, Katherine M. Malan

PDF

TL;DR

This paper investigates hybrid loss functions combining cross entropy and sum squared error to enhance neural network generalisation, demonstrating improved performance across various problems.

Contribution

It introduces and evaluates hybrid loss functions that switch from sum squared error to cross entropy during training, showing improved generalisation in neural networks.

Findings

01

Hybrid loss functions improve neural network generalisation.

02

Switching from sum squared error to cross entropy yields best results.

03

Hybrid approach outperforms individual loss functions in experiments.

Abstract

Loss functions play an important role in the training of artificial neural networks (ANNs), and can affect the generalisation ability of the ANN model, among other properties. Specifically, it has been shown that the cross entropy and sum squared error loss functions result in different training dynamics, and exhibit different properties that are complementary to one another. It has previously been suggested that a hybrid of the entropy and sum squared error loss functions could combine the advantages of the two functions, while limiting their disadvantages. The effectiveness of such hybrid loss functions is investigated in this study. It is shown that hybridisation of the two loss functions improves the generalisation ability of the ANNs on all problems considered. The hybrid loss function that starts training with the sum squared error loss function and later switches to the cross…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.