Octopus Protocol: One-Shot Hardware Discovery and Control for AI Agents via Infrastructure-as-Prompts
Quilee Simeon, Justin M. Wei, Yile Fan

TL;DR
Octopus Protocol enables rapid hardware discovery and control for AI agents using a prompt-based, five-stage pipeline that minimizes engineering effort and works across diverse platforms and devices.
Contribution
It introduces a prompt-based system that automates hardware onboarding and control, reducing setup time to about 10-15 minutes with minimal manual coding.
Findings
Hardware onboarding takes 10-15 minutes per device.
System successfully controls heterogeneous hardware including a robotic arm.
Enables closed-loop visual-motor control via generated tools.
Abstract
Recent agentic-robotics systems, from Code-asPolicies to modern vision-language-action (VLA) foundation models, presuppose that drivers, SDKs, or ROS-style primitives for the target hardware already exist. Writing those primitives is the dominant engineering cost of bringing up new hardware for agent control. We present Octopus Protocol, a system that collapses that cost to a single shell command. Given only raw OS access and a language-model API key, a coding agent executes a five-stage pipeline--PROBE, IDENTIFY, INTERFACE, SERVE, DEPLOY--to discover connected devices, infer their capabilities, generate a Model Context Protocol (MCP) server with typed tools, and deploy it as a live HTTP endpoint. A persistent daemon then monitors the system, heals broken code, and perceives physical state through the camera tools it generated for itself. Two architectural principles make this work:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
