Why Rectified Power Unit Networks Fail and How to Improve It: An Effective Field Theory Perspective
Taeyoung Kim, Myungjoo Kang

TL;DR
This paper analyzes the failure modes of Rectified Power Unit networks using effective field theory and introduces MRePU, a new activation function that improves training stability and performance in deep neural networks.
Contribution
It provides a theoretical understanding of RePU failures and proposes MRePU, a novel activation function that enhances stability and universality in deep networks.
Findings
MRePU improves training stability across tasks
Theoretical analysis confirms MRePU's criticality conditions
Experiments show enhanced performance in vision and physics tasks
Abstract
The Rectified Power Unit (RePU) activation function, a differentiable generalization of the Rectified Linear Unit (ReLU), has shown promise in constructing neural networks due to its smoothness properties. However, deep RePU networks often suffer from critical issues such as vanishing or exploding values during training, rendering them unstable regardless of hyperparameter initialization. Leveraging the perspective of effective field theory, we identify the root causes of these failures and propose the Modified Rectified Power Unit (MRePU) activation function. MRePU addresses RePU's limitations while preserving its advantages, such as differentiability and universal approximation properties. Theoretical analysis demonstrates that MRePU satisfies criticality conditions necessary for stable training, placing it in a distinct universality class. Extensive experiments validate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPower System Reliability and Maintenance
