TL;DR
RoboNeuron is a middleware layer that seamlessly connects LLM agents with robot middleware like ROS2, enabling modular, reusable, and adaptable control of physical robots in embodied AI applications.
Contribution
It introduces RoboNeuron, a middleware that derives agent-callable tools from ROS schemas, supporting flexible backend transitions and modular orchestration for embodied AI.
Findings
Enables modular system orchestration with a unified interface.
Supports backend transitions without system rewiring.
Demonstrates effectiveness in simulation and hardware tasks.
Abstract
Vision-language-action (VLA) models and LLM agents have advanced rapidly, yet reliable deployment on physical robots is often hindered by an interface mismatch between agent tool APIs and robot middleware. Current implementations typically rely on ad-hoc wrappers that are difficult to reuse, and changes to the VLA backend or serving stack often necessitate extensive re-integration. We introduce RoboNeuron, a middleware layer that connects the Model Context Protocol (MCP) for LLM agents with robot middleware such as ROS2. RoboNeuron bridges these ecosystems by deriving agent-callable tools directly from ROS schemas, providing a unified execution abstraction that supports both direct commands and modular composition, and localizing backend, runtime, and acceleration-preset changes within a stable inference boundary. We evaluate RoboNeuron in simulation and on hardware through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
