LVDiffusor: Distilling Functional Rearrangement Priors from Large Models into Diffusor
Yiming Zeng, Mingdong Wu, Long Yang, Jiyao Zhang, Hao Ding, Hui Cheng, Hao Dong

TL;DR
This paper introduces LVDiffusor, a novel method that distills functional rearrangement priors from large models into a diffusion model, enabling robots to generate goal configurations for object rearrangement tasks more effectively.
Contribution
The work presents a new approach that leverages large language and vision models to distill functional rearrangement priors into a diffusion model, improving scalability and generalization.
Findings
Outperforms baseline methods in multiple domains
Effective in real-world object rearrangement scenarios
Generates compatible goals meeting functional requirements
Abstract
Object rearrangement, a fundamental challenge in robotics, demands versatile strategies to handle diverse objects, configurations, and functional needs. To achieve this, the AI robot needs to learn functional rearrangement priors in order to specify precise goals that meet the functional requirements. Previous methods typically learn such priors from either laborious human annotations or manually designed heuristics, which limits scalability and generalization. In this work, we propose a novel approach that leverages large models to distill functional rearrangement priors. Specifically, our approach collects diverse arrangement examples using both LLMs and VLMs and then distills the examples into a diffusion model. During test time, the learned diffusion model is conditioned on the initial configuration and guides the positioning of objects to meet functional requirements. In this…
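The test-time step described in the abstract (sampling a goal layout conditioned on the initial configuration) can be illustrated with a toy sketch. This is not the authors' model: `toy_targets` is a hypothetical hand-written rule (line the objects up in a row through the centroid of the initial layout), and `toy_score` is an analytic Gaussian score standing in for the learned network. The noise schedule, step size, and all function names are assumptions made for illustration.

```python
import math
import random

def toy_targets(initial, spacing=0.2):
    """Hypothetical functional rule: place objects in a row through the
    centroid of the initial layout. Stands in for the learned prior."""
    n = len(initial)
    cx = sum(x for x, _ in initial) / n
    cy = sum(y for _, y in initial) / n
    return [(cx + (i - (n - 1) / 2) * spacing, cy) for i in range(n)]

def toy_score(positions, initial, sigma):
    """Analytic stand-in for a learned score network: gradient of the log
    density of a Gaussian centered on the rule's targets with std sigma.
    The initial configuration is the conditioning input."""
    targets = toy_targets(initial)
    return [((tx - x) / sigma**2, (ty - y) / sigma**2)
            for (x, y), (tx, ty) in zip(positions, targets)]

def sample_goal(initial, sigmas=(1.0, 0.5, 0.1, 0.02), steps=50, eps=0.1, seed=0):
    """Annealed Langevin sampling: follow the conditional score while the
    noise level shrinks, starting from the initial object positions."""
    rng = random.Random(seed)
    pos = list(initial)
    for sigma in sigmas:
        step = eps * sigma**2  # step size scaled to the current noise level
        noise_scale = math.sqrt(2 * step)
        for _ in range(steps):
            score = toy_score(pos, initial, sigma)
            pos = [(x + step * sx + noise_scale * rng.gauss(0, 1),
                    y + step * sy + noise_scale * rng.gauss(0, 1))
                   for (x, y), (sx, sy) in zip(pos, score)]
    return pos

initial_layout = [(0.0, 0.0), (1.0, 0.5), (0.2, 1.0)]
goal_layout = sample_goal(initial_layout)
```

In this sketch the sampled `goal_layout` ends up close to the row produced by `toy_targets(initial_layout)`. In the paper's setting, the analytic score would be replaced by a network trained on the arrangement examples collected from the LLMs and VLMs.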
Taxonomy
Topics: Domain Adaptation and Few-Shot Learning · Topic Modeling
