Growing Efficient Accurate and Robust Neural Networks on the Edge

Vignesh Sundaresha; Naresh Shanbhag

arXiv:2410.07691·cs.LG·October 11, 2024

Growing Efficient Accurate and Robust Neural Networks on the Edge

Vignesh Sundaresha, Naresh Shanbhag

PDF

Open Access 3 Reviews

TL;DR

This paper introduces GEARnn, a method for growing, training, and robustifying neural networks directly on resource-limited Edge devices, reducing reliance on cloud-based training and transmission.

Contribution

GEARnn enables in-situ growth and robust training of neural networks on Edge devices, combining one-shot growth and efficient augmentation for resource-constrained environments.

Findings

01

Achieves robust, accurate networks on NVIDIA Jetson Xavier NX

02

Balances accuracy, robustness, and energy efficiency effectively

03

Reduces need for cloud-based training and data transmission

Abstract

The ubiquitous deployment of deep learning systems on resource-constrained Edge devices is hindered by their high computational complexity coupled with their fragility to out-of-distribution (OOD) data, especially to naturally occurring common corruptions. Current solutions rely on the Cloud to train and compress models before deploying to the Edge. This incurs high energy and latency costs in transmitting locally acquired field data to the Cloud while also raising privacy concerns. We propose GEARnn (Growing Efficient, Accurate, and Robust neural networks) to grow and train robust networks in-situ, i.e., completely on the Edge device. Starting with a low-complexity initial backbone network, GEARnn employs One-Shot Growth (OSG) to grow a network satisfying the memory constraints of the Edge device using clean data, and robustifies the network using Efficient Robust Augmentation (ERA) to…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 8Confidence 3

Strengths

1. It is critical to produce and optimize an accurate neural network on resource-constrained edge devices, the paper provides a good solution that addressing the challenge of deploying efficient, accurate, and robust deep learning models on the Edge. 2. The two-phase growth approach can provide efficiency and robustness. 3. The proposed method is verified on real devices NVIDIA Jetson Xavier NX 4. The proposed approach addresses the key challenges of high computational complexity and fragility t

Weaknesses

1. I am expecting more real edge devices to demonstrate the feasibility and also the performance difference. 2. Is it possible to include other datasets such as multimodal cases. 3. Is it possible to show some analysis regarding on the performance of the neural training method?

Reviewer 02Rating 5Confidence 4

Strengths

- The paper addresses a real-world problem of training robust networks on resource-constrained edge devices - The paper also provides comprehensive empirical evaluation results.

Weaknesses

- More theoretical analysis is expected. For example, the rationale provided in Section 8 (referenced but not shown in the excerpt) appears mostly empirical. What is the fundamental challenge and uniqueness of the technical contribution? - More implementation details can be used. For example, for the ERA method, more specifics about transform operations and hyperparameter choices can help the readers. - More analysis analysis for some designs can help. For example, the hyperparameter growth r

Reviewer 03Rating 6Confidence 3

Strengths

The paper is very well written. The problem is interesting. The performance analysis shows the benefit of applying GEAR and provides interesting insights on how grow techniques work.

Weaknesses

The paper lacks the novelty, the proposed solution is a combination of different techniques proposed before. Also, it has limited applicability, as the evaluation results are shown for CCN models only, and it is not clear how the proposed solution can be extended to more recent models, such as transformers. Besides, I found isolated on-device training rather impractical: i) it is very slow (as also mentioned by the authors), and ii) in many cases, there might not be enough data at the device t

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications