Embedding principle of homogeneous neural network for classification problem
Jiahan Zhang, Yaoyu Zhang, Tao Luo

TL;DR
This paper introduces the KKT point embedding principle for homogeneous neural networks, showing how solutions in smaller networks can be embedded into larger ones and how this relates to training dynamics and network structure.
Contribution
It formalizes the KKT point embedding principle for homogeneous neural networks and connects static embeddings to gradient flow dynamics during training.
Findings
KKT points of smaller networks embed into larger networks via linear isometric transformations.
Training trajectories preserve the embedding, maintaining solution alignment.
Insights into network width effects and parameter redundancy in homogeneous networks.
Abstract
In this paper, we study the Karush-Kuhn-Tucker (KKT) points of the associated maximum-margin problem in homogeneous neural networks, including fully-connected and convolutional neural networks. In particular, We investigates the relationship between such KKT points across networks of different widths generated. We introduce and formalize the \textbf{KKT point embedding principle}, establishing that KKT points of a homogeneous network's max-margin problem () can be embedded into the KKT points of a larger network's problem () via specific linear isometric transformations. We rigorously prove this principle holds for neuron splitting in fully-connected networks and channel splitting in convolutional neural networks. Furthermore, we connect this static embedding to the dynamics of gradient flow training with smooth losses. We demonstrate that trajectories…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Data Processing Techniques · Industrial Technology and Control Systems
