Iterative Shaping of Multi-Particle Aggregates based on Action Trees and   VLM

Hoi-Yin Lee; Peng Zhou; Anqing Duan; Chenguang Yang; and David; Navarro-Alarcon

arXiv:2501.13507·cs.RO·January 24, 2025

Iterative Shaping of Multi-Particle Aggregates based on Action Trees and VLM

Hoi-Yin Lee, Peng Zhou, Anqing Duan, Chenguang Yang, and David, Navarro-Alarcon

PDF

Open Access

TL;DR

This paper presents a novel robotic manipulation approach that combines vision language models and Fourier series-based shape representation to autonomously shape and transport multi-particle aggregates.

Contribution

It introduces a new method integrating VLMs for task planning and Fourier series for shape control in multi-particle manipulation tasks.

Findings

01

Effective autonomous shaping of particle aggregates demonstrated

02

High system cohesion maintained during manipulation

03

Trajectory adaptation based on aggregate geometry

Abstract

In this paper, we address the problem of manipulating multi-particle aggregates using a bimanual robotic system. Our approach enables the autonomous transport of dispersed particles through a series of shaping and pushing actions using robotically-controlled tools. Achieving this advanced manipulation capability presents two key challenges: high-level task planning and trajectory execution. For task planning, we leverage Vision Language Models (VLMs) to enable primitive actions such as tool affordance grasping and non-prehensile particle pushing. For trajectory execution, we represent the evolving particle aggregate's contour using truncated Fourier series, providing efficient parametrization of its closed shape. We adaptively compute trajectory waypoints based on group cohesion and the geometric centroid of the aggregate, accounting for its spatial distribution and collective motion.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Imaging for Blood Diseases · Advanced Clustering Algorithms Research · Image Processing and 3D Reconstruction