Skilled AI Agents for Embedded and IoT Systems Development
Yiming Li, Yuhan Cheng, Mingchen Ma, Yihang Zou, Ningyuan Yang, Wei Cheng, Hai "Helen" Li, Yiran Chen, and Tingjun Chen

TL;DR
This paper presents a skills-based AI agent framework and a comprehensive benchmark for developing and evaluating AI systems in embedded and IoT hardware environments, addressing deployment challenges.
Contribution
It introduces a novel skills-based agentic framework and IoT-SkillsBench benchmark for systematic evaluation of AI agents in real embedded hardware settings.
Findings
Human-expert skills achieve near-perfect success rates.
Structured expert knowledge significantly improves AI agent performance.
Benchmark covers diverse platforms, peripherals, and tasks.
Abstract
Large language models (LLMs) and agentic systems have shown promise for automated software development, but applying them to hardware-in-the-loop (HIL) embedded and Internet-of-Things (IoT) systems remains challenging due to the tight coupling between software logic and physical hardware behavior. Code that compiles successfully may still fail when deployed on real devices because of timing constraints, peripheral initialization requirements, or hardware-specific behaviors. To address this challenge, we introduce a skills-based agentic framework for HIL embedded development together with IoT-SkillsBench, a benchmark designed to systematically evaluate AI agents in real embedded programming environments. IoT-SkillsBench spans three representative embedded platforms, 23 peripherals, and 42 tasks across three difficulty levels, where each task is evaluated under three agent configurations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Software Engineering Methodologies · AI-based Problem Solving and Planning
