DiMSam: Diffusion Models as Samplers for Task and Motion Planning under Partial Observability
Xiaolin Fang, Caelan Reed Garrett, Clemens Eppner, Tom\'as, Lozano-P\'erez, Leslie Pack Kaelbling, Dieter Fox

TL;DR
DiMSam integrates diffusion models with task and motion planning to enable constraint-based reasoning in partially observable environments, improving multi-step planning for robot manipulation tasks.
Contribution
The paper introduces a novel approach combining diffusion models with TAMP using learned latent embeddings to handle unseen objects and complex constraints.
Findings
Effective in simulated articulated object manipulation tasks
Enables multi-step constraint reasoning with learned components
Successfully applied in real-world robot manipulation
Abstract
Generative models such as diffusion models, excel at capturing high-dimensional distributions with diverse input modalities, e.g. robot trajectories, but are less effective at multi-step constraint reasoning. Task and Motion Planning (TAMP) approaches are suited for planning multi-step autonomous robot manipulation. However, it can be difficult to apply them to domains where the environment and its dynamics are not fully known. We propose to overcome these limitations by composing diffusion models using a TAMP system. We use the learned components for constraints and samplers that are difficult to engineer in the planning model, and use a TAMP solver to search for the task plan with constraint-satisfying action parameter values. To tractably make predictions for unseen objects in the environment, we define the learned samplers and TAMP operators on learned latent embedding of changing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Topic Modeling · Human Pose and Action Recognition
MethodsDiffusion
