AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation
Chuyan Xiong, Chengyu Shen, Xiaoqi Li, Kaichen Zhou, Jeremy Liu,, Ruiping Wang, Hao Dong

TL;DR
This paper introduces AIC MLLM, a novel approach that uses low-level interaction experiences and adaptive prompting to improve robotic manipulation robustness, especially in correcting articulated object poses.
Contribution
The paper presents a new autonomous interactive correction method for MLLMs that leverages failure feedback and test-time adaptation to enhance low-level robotic manipulation accuracy.
Findings
Effective correction of failure samples in manipulation tasks
Improved robustness in simulated and real-world environments
Utilization of feedback prompts for adaptive pose correction
Abstract
The ability to reflect on and correct failures is crucial for robotic systems to interact stably with real-life objects.Observing the generalization and reasoning capabilities of Multimodal Large Language Models (MLLMs), previous approaches have aimed to utilize these models to enhance robotic systems accordingly.However, these methods typically focus on high-level planning corrections using an additional MLLM, with limited utilization of failed samples to correct low-level contact poses which is particularly prone to occur during articulated object manipulation.To address this gap, we propose an Autonomous Interactive Correction (AIC) MLLM, which makes use of previous low-level interaction experiences to correct SE(3) pose predictions for articulated object. Specifically, AIC MLLM is initially fine-tuned to acquire both pose prediction and feedback prompt comprehension abilities.We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotic Mechanisms and Dynamics · Robotic Path Planning Algorithms
MethodsFocus
