Structured Task Solving via Modular Embodied Intelligence: A Case Study on Rubik's Cube
Chongshan Fan, Shenghai Yuan

TL;DR
Auto-RubikAI is a modular framework combining symbolic reasoning, vision-language understanding, and large language models to solve Rubik's Cube tasks with high success rates and interpretability, demonstrating effective real-world deployment.
Contribution
The paper introduces Auto-RubikAI, a novel modular system integrating KB, VLM, and LLM for interpretable, minimal-data structured task solving in robotics.
Findings
Achieved 79% success rate in real-world Rubik's Cube solving.
Reduced solution steps compared to baseline methods.
Demonstrated effective sim-to-real transfer without retraining.
Abstract
This paper presents Auto-RubikAI, a modular autonomous planning framework that integrates a symbolic Knowledge Base (KB), a vision-language model (VLM), and a large language model (LLM) to solve structured manipulation tasks exemplified by Rubik's Cube restoration. Unlike traditional robot systems based on predefined scripts, or modern approaches relying on pretrained networks and large-scale demonstration data, Auto-RubikAI enables interpretable, multi-step task execution with minimal data requirements and no prior demonstrations. The proposed system employs a KB module to solve group-theoretic restoration steps, overcoming LLMs' limitations in symbolic reasoning. A VLM parses RGB-D input to construct a semantic 3D scene representation, while the LLM generates structured robotic control code via prompt chaining. This tri-module architecture enables robust performance under spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotic Path Planning Algorithms · Multimodal Machine Learning Applications
