AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic   Manipulation

Chuyan Xiong; Chengyu Shen; Xiaoqi Li; Kaichen Zhou; Jeremy Liu,; Ruiping Wang; Hao Dong

arXiv:2406.11548·cs.RO·November 19, 2024·2 cites

AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation

Chuyan Xiong, Chengyu Shen, Xiaoqi Li, Kaichen Zhou, Jeremy Liu,, Ruiping Wang, Hao Dong

PDF

Open Access

TL;DR

This paper introduces AIC MLLM, a novel approach that uses low-level interaction experiences and adaptive prompting to improve robotic manipulation robustness, especially in correcting articulated object poses.

Contribution

The paper presents a new autonomous interactive correction method for MLLMs that leverages failure feedback and test-time adaptation to enhance low-level robotic manipulation accuracy.

Findings

01

Effective correction of failure samples in manipulation tasks

02

Improved robustness in simulated and real-world environments

03

Utilization of feedback prompts for adaptive pose correction

Abstract

The ability to reflect on and correct failures is crucial for robotic systems to interact stably with real-life objects.Observing the generalization and reasoning capabilities of Multimodal Large Language Models (MLLMs), previous approaches have aimed to utilize these models to enhance robotic systems accordingly.However, these methods typically focus on high-level planning corrections using an additional MLLM, with limited utilization of failed samples to correct low-level contact poses which is particularly prone to occur during articulated object manipulation.To address this gap, we propose an Autonomous Interactive Correction (AIC) MLLM, which makes use of previous low-level interaction experiences to correct SE(3) pose predictions for articulated object. Specifically, AIC MLLM is initially fine-tuned to acquire both pose prediction and feedback prompt comprehension abilities.We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotic Mechanisms and Dynamics · Robotic Path Planning Algorithms

MethodsFocus