Fast Monte Carlo Tree Diffusion: 100x Speedup via Parallel Sparse Planning
Jaesik Yoon, Hyeonseo Cho, Yoshua Bengio, Sungjin Ahn

TL;DR
Fast-MCTD significantly accelerates Monte Carlo Tree Diffusion for trajectory planning by combining parallelization and sparsification, achieving up to 100x speedup while maintaining or improving planning performance.
Contribution
The paper introduces Fast-MCTD, a novel variant that enhances MCTD's speed and scalability through parallel rollouts and trajectory coarsening techniques.
Findings
Fast-MCTD achieves up to 100x speedup over standard MCTD.
Fast-MCTD maintains or improves planning performance.
Fast-MCTD outperforms Diffuser in inference speed on some tasks.
Abstract
Diffusion models have recently emerged as a powerful approach for trajectory planning. However, their inherently non-sequential nature limits their effectiveness in long-horizon reasoning tasks at test time. The recently proposed Monte Carlo Tree Diffusion (MCTD) offers a promising solution by combining diffusion with tree-based search, achieving state-of-the-art performance on complex planning problems. Despite its strengths, our analysis shows that MCTD incurs substantial computational overhead due to the sequential nature of tree search and the cost of iterative denoising. To address this, we propose Fast-MCTD, a more efficient variant that preserves the strengths of MCTD while significantly improving its speed and scalability. Fast-MCTD integrates two techniques: Parallel MCTD, which enables parallel rollouts via delayed tree updates and redundancy-aware selection; and Sparse MCTD,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Robotic Path Planning Algorithms · Reinforcement Learning in Robotics
MethodsDiffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
