Structure of Artificial Neural Networks -- Empirical Investigations
Julian Stier

TL;DR
This paper investigates the structural properties of deep neural networks, formalizes graph-induced architectures, and empirically evaluates automated neural architecture search methods to improve efficiency and understanding.
Contribution
It introduces a formal framework for graph-induced neural networks and analyzes how structure impacts various objectives, advancing neural architecture search techniques.
Findings
Structural properties influence correctness, robustness, and energy consumption.
New predictive models reduce evaluation costs in architecture search.
Informed sampling improves neural architecture search efficiency.
Abstract
Within one decade, Deep Learning overtook the dominating solution methods of countless problems of artificial intelligence. ``Deep'' refers to the deep architectures with operations in manifolds of which there are no immediate observations. For these deep architectures some kind of structure is pre-defined -- but what is this structure? With a formal definition for structures of neural networks, neural architecture search problems and solution methods can be formulated under a common framework. Both practical and theoretical questions arise from closing the gap between applied neural architecture search and learning theory. Does structure make a difference or can it be chosen arbitrarily? This work is concerned with deep structures of artificial neural networks and examines automatic construction methods under empirical principles to shed light on to the so called ``black-box…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
