AutoMCU: Feasibility-First MCU Neural Network Customization via LLM-based Multi-Agent Systems

Penglin Dai; Zijie Zhou; Xincao Xu; Junhua Wang; Xiao Wu; Lixin Duan

arXiv:2605.21560·cs.LG·May 22, 2026

AutoMCU: Feasibility-First MCU Neural Network Customization via LLM-based Multi-Agent Systems

Penglin Dai, Zijie Zhou, Xincao Xu, Junhua Wang, Xiao Wu, Lixin Duan

PDF

TL;DR

AutoMCU introduces an LLM-based multi-agent system for rapid, feasible neural network customization on MCUs, significantly reducing development time while ensuring deployability and performance.

Contribution

It presents a novel feasibility-first approach combining LLMs and multi-agent coordination for efficient neural network deployment on resource-constrained MCUs.

Findings

01

AutoMCU achieves competitive accuracy under strict MCU constraints.

02

Reduces customization time to 1-2 hours from hundreds of GPU hours.

03

Demonstrates practical deployment on STM32 microcontrollers.

Abstract

Deploying neural networks on microcontroller units (MCUs) is critical for edge intelligence but remains challenging due to tight memory, storage, and computation constraints. Existing approaches, such as model compression and hardware-aware neural architecture search (HW-NAS), often depend on proxy metrics, incur high search cost, and do not fully bridge the gap between architecture design and verified deployment. This paper presents AutoMCU, a feasibility-first large language model (LLM)-based multi-agent system for automated neural network customization under MCU constraints. Given natural-language task requirements and hardware specifications, AutoMCU iteratively generates structured architecture candidates, filters infeasible designs through vendor toolchain feedback before training, evaluates feasible models under a controlled protocol, and verifies deployability through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.