Search-time Efficient Device Constraints-Aware Neural Architecture   Search

Oshin Dutta; Tanu Kanvar; Sumeet Agarwal

arXiv:2307.04443·cs.CV·July 11, 2023

Search-time Efficient Device Constraints-Aware Neural Architecture Search

Oshin Dutta, Tanu Kanvar, Sumeet Agarwal

PDF

Open Access

TL;DR

This paper introduces DCA-NAS, a fast neural architecture search method that efficiently generates device-constraint-aware models suitable for edge computing, outperforming manual designs and matching mobile architectures.

Contribution

The paper presents DCA-NAS, a novel NAS approach that incorporates device constraints and uses weight sharing and channel bottlenecks for rapid search.

Findings

01

DCA-NAS outperforms manual architectures on image classification tasks.

02

It achieves comparable performance to popular mobile architectures.

03

It generalizes well across different search spaces and hardware benchmarks.

Abstract

Edge computing aims to enable edge devices, such as IoT devices, to process data locally instead of relying on the cloud. However, deep learning techniques like computer vision and natural language processing can be computationally expensive and memory-intensive. Creating manual architectures specialized for each device is infeasible due to their varying memory and computational constraints. To address these concerns, we automate the construction of task-specific deep learning architectures optimized for device constraints through Neural Architecture Search (NAS). We present DCA-NAS, a principled method of fast neural network architecture search that incorporates edge-device constraints such as model size and floating-point operations. It incorporates weight sharing and channel bottleneck techniques to speed up the search time. Based on our experiments, we see that DCA-NAS outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · IoT and Edge/Fog Computing

MethodsDifferentiable Architecture Search · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings