Designing a Linearized Potential Function in Neural Network Optimization Using Csisz\'{a}r Type of Tsallis Entropy
Keito Akiyama

TL;DR
This paper introduces a novel framework using Csiszár type of Tsallis entropy to design a linearized potential function, leading to exponential convergence in neural network optimization.
Contribution
It proposes a new approach employing generalized Tsallis entropy for potential functions, addressing technical challenges in convergence analysis.
Findings
Established a framework with Tsallis entropy for potential functions.
Derived exponential convergence results for neural network optimization.
Enhanced understanding of entropy's role in convergence behavior.
Abstract
In recent years, learning for neural networks can be viewed as optimization in the space of probability measures. To obtain the exponential convergence to the optimizer, the regularizing term based on Shannon entropy plays an important role. Even though an entropy function heavily affects convergence results, there is almost no result on its generalization, because of the following two technical difficulties: one is the lack of sufficient condition for generalized logarithmic Sobolev inequality, and the other is the distributional dependence of the potential function within the gradient flow equation. In this paper, we establish a framework that utilizes a linearized potential function via Csisz\'{a}r type of Tsallis entropy, which is one of the generalized entropies. We also show that our new framework enable us to derive an exponential convergence result.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Neural Networks and Applications · Model Reduction and Neural Networks
