
TL;DR
This paper introduces a method to approximate convexity in neural graphs by decomposing complex structures and applying a scale mechanism, enabling more effective optimization of neural architectures.
Contribution
It proposes a novel scale mechanism to convexify neural graph components, facilitating better optimization of complex neural structures.
Findings
Nearly convex in each variable when others are fixed
Scale mechanism transforms non-convex parts into convex forms
Experimental validation supports theoretical claims
Abstract
Traditionally, most complex intelligence architectures are extremely non-convex, which could not be well performed by convex optimization. However, this paper decomposes complex structures into three types of nodes: operators, algorithms and functions. Iteratively, propagating from node to node along edge, we prove that "regarding the tree-structured neural graph, it is nearly convex in each variable, when the other variables are fixed." In fact, the non-convex properties stem from circles and functions, which could be transformed to be convex with our proposed \textit{\textbf{scale mechanism}}. Experimentally, we justify our theoretical analysis by two practical applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and Algorithms
