AutoMCU: Feasibility-First MCU Neural Network Customization via LLM-based Multi-Agent Systems
Penglin Dai, Zijie Zhou, Xincao Xu, Junhua Wang, Xiao Wu, Lixin Duan

TL;DR
AutoMCU introduces an LLM-based multi-agent system for rapid, feasible neural network customization on MCUs, significantly reducing development time while ensuring deployability and performance.
Contribution
It presents a novel feasibility-first approach combining LLMs and multi-agent coordination for efficient neural network deployment on resource-constrained MCUs.
Findings
AutoMCU achieves competitive accuracy under strict MCU constraints.
Reduces customization time to 1-2 hours from hundreds of GPU hours.
Demonstrates practical deployment on STM32 microcontrollers.
Abstract
Deploying neural networks on microcontroller units (MCUs) is critical for edge intelligence but remains challenging due to tight memory, storage, and computation constraints. Existing approaches, such as model compression and hardware-aware neural architecture search (HW-NAS), often depend on proxy metrics, incur high search cost, and do not fully bridge the gap between architecture design and verified deployment. This paper presents AutoMCU, a feasibility-first large language model (LLM)-based multi-agent system for automated neural network customization under MCU constraints. Given natural-language task requirements and hardware specifications, AutoMCU iteratively generates structured architecture candidates, filters infeasible designs through vendor toolchain feedback before training, evaluates feasible models under a controlled protocol, and verifies deployability through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
