# Visualising Basins of Attraction for the Cross-Entropy and the Squared   Error Neural Network Loss Functions

**Authors:** Anna Sergeevna Bosman, Andries Engelbrecht, Mard\'e Helbig

arXiv: 1901.02302 · 2019-01-10

## TL;DR

This paper introduces a new gradient-based visualization method to analyze the basins of attraction in neural network loss surfaces, comparing quadratic and entropic loss functions and revealing their different landscape properties.

## Contribution

It presents a novel visualization technique for neural network loss surfaces and provides empirical insights into how different loss functions shape these landscapes.

## Key findings

- Entropic loss has stronger gradients and fewer stationary points.
- Quadratic loss is more resistant to overfitting.
- Number of local minima decreases with higher dimensionality.

## Abstract

Quantification of the stationary points and the associated basins of attraction of neural network loss surfaces is an important step towards a better understanding of neural network loss surfaces at large. This work proposes a novel method to visualise basins of attraction together with the associated stationary points via gradient-based random sampling. The proposed technique is used to perform an empirical study of the loss surfaces generated by two different error metrics: quadratic loss and entropic loss. The empirical observations confirm the theoretical hypothesis regarding the nature of neural network attraction basins. Entropic loss is shown to exhibit stronger gradients and fewer stationary points than quadratic loss, indicating that entropic loss has a more searchable landscape. Quadratic loss is shown to be more resilient to overfitting than entropic loss. Both losses are shown to exhibit local minima, but the number of local minima is shown to decrease with an increase in dimensionality. Thus, the proposed visualisation technique successfully captures the local minima properties exhibited by the neural network loss surfaces, and can be used for the purpose of fitness landscape analysis of neural networks.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.02302/full.md

## Figures

107 figures with captions in the complete paper: https://tomesphere.com/paper/1901.02302/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/1901.02302/full.md

---
Source: https://tomesphere.com/paper/1901.02302