STEERAGE: Synthesis of Neural Networks Using Architecture Search and Grow-and-Prune Methods
Shayan Hassantabar, Xiaoliang Dai, Niraj K. Jha

TL;DR
STEERAGE introduces a novel methodology combining efficient architecture search and grow-and-prune techniques to synthesize neural networks, reducing redundancy and improving accuracy on datasets like MNIST and CIFAR-10.
Contribution
The paper presents a new synthesis approach that integrates architecture search with grow-and-prune methods, achieving state-of-the-art results and more efficient neural network designs.
Findings
Achieved 0.66% error on MNIST with 8.6x fewer parameters.
Improved ResNet-18 accuracy by 2.52% on CIFAR-10.
Highest reported accuracy for ResNet-based architectures on CIFAR-10.
Abstract
Neural networks (NNs) have been successfully deployed in many applications. However, architectural design of these models is still a challenging problem. Moreover, neural networks are known to have a lot of redundancy. This increases the computational cost of inference and poses an obstacle to deployment on Internet-of-Thing sensors and edge devices. To address these challenges, we propose the STEERAGE synthesis methodology. It consists of two complementary approaches: efficient architecture search, and grow-and-prune NN synthesis. The first step, covered in a global search module, uses an accuracy predictor to efficiently navigate the architectural search space. The predictor is built using boosted decision tree regression, iterative sampling, and efficient evolutionary search. The second step involves local search. By using various grow-and-prune methodologies for synthesizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Non-Destructive Testing Techniques
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection
