Intent-Driven LLM Ensemble Planning for Flexible Multi-Robot Disassembly: Demonstration on EV Batteries
Cansu Erdogan, Cesar Alan Contreras, Alireza Rastegarpanah, Manolis Chiou, Rustam Stolkin

TL;DR
This paper presents an intent-driven planning pipeline utilizing ensemble large language models for flexible multi-robot disassembly of EV batteries, demonstrating reliable, low-effort task execution in unstructured scenes.
Contribution
The novel pipeline integrates perception, LLM ensembles, verification, and filtering to robustly generate and verify multi-robot action sequences from simple human instructions.
Findings
Achieved high sequence correctness in 200 real scenes
Demonstrated low user effort and efficient task execution
Validated pipeline's robustness and reliability
Abstract
This paper addresses the problem of planning complex manipulation tasks, in which multiple robots with different end-effectors and capabilities, informed by computer vision, must plan and execute concatenated sequences of actions on a variety of objects that can appear in arbitrary positions and configurations in unstructured scenes. We propose an intent-driven planning pipeline which can robustly construct such action sequences with varying degrees of supervisory input from a human using simple language instructions. The pipeline integrates: (i) perception-to-text scene encoding, (ii) an ensemble of large language models (LLMs) that generate candidate removal sequences based on the operator's intent, (iii) an LLM-based verifier that enforces formatting and precedence constraints, and (iv) a deterministic consistency filter that rejects hallucinated objects. The pipeline is evaluated on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · AI-based Problem Solving and Planning · Multimodal Machine Learning Applications
