Exploring and Exploiting the Asymmetric Valley of Deep Neural Networks

Xin-Chun Li; Jin-Lin Tang; Bo Zhang; Lan Li; De-Chuan Zhan

arXiv:2405.12489·cs.LG·October 10, 2024

Exploring and Exploiting the Asymmetric Valley of Deep Neural Networks

Xin-Chun Li, Jin-Lin Tang, Bo Zhang, Lan Li, De-Chuan Zhan

PDF

Open Access 1 Video

TL;DR

This paper investigates the asymmetry in the loss landscape of deep neural networks, revealing how factors like noise and activation functions influence valley symmetry, with implications for model fusion and federated learning.

Contribution

It systematically analyzes the causes of valley asymmetry in DNNs and introduces the concept of sign consistency as a key factor affecting model interpolation and alignment.

Findings

01

Sign consistency correlates with valley symmetry.

02

Imposing sign alignment improves federated learning.

03

Theoretical insights explain the asymmetry phenomenon.

Abstract

Exploring the loss landscape offers insights into the inherent principles of deep neural networks (DNNs). Recent work suggests an additional asymmetry of the valley beyond the flat and sharp ones, yet without thoroughly examining its causes or implications. Our study methodically explores the factors affecting the symmetry of DNN valleys, encompassing (1) the dataset, network architecture, initialization, and hyperparameters that influence the convergence point; and (2) the magnitude and direction of the noise for 1D visualization. Our major observation shows that the {\it degree of sign consistency} between the noise and the convergence point is a critical indicator of valley symmetry. Theoretical insights from the aspects of ReLU activation and softmax function could explain the interesting phenomenon. Our discovery propels novel understanding and applications in the scenario of Model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Exploring and Exploiting the Asymmetric Valley of Deep Neural Networks· slideslive

Taxonomy

TopicsNeural Networks and Applications

MethodsSoftmax