Geometry-Aligned LLM Fine-Tuning for Sequential Narrow-Opening Planning
Al Jaber Mahmud, Xuan Wang

TL;DR
This paper introduces a geometry-aligned LLM fine-tuning framework for sequential narrow-opening motion planning, enabling long-horizon geometric reasoning and high success rates in complex environments.
Contribution
It presents a novel bi-level training pipeline combining failure-driven supervised fine-tuning and geometric verification for improved motion planning.
Findings
Achieves highest success rates in diverse environments.
Demonstrates effective long-horizon geometric reasoning.
Outperforms existing methods in sequential narrow-opening tasks.
Abstract
We study rigid-body motion planning through multiple sequential narrow openings, which requires long-horizon geometric reasoning because the configuration used to traverse an early opening constrains the set of reachable configurations for subsequent ones. To achieve this, we propose a geometry-aligned large language model (LLM) fine-tuning framework that generates fixed-length, machine-readable waypoint sequences that are both geometrically feasible and coordinated across openings. Our approach uses a bi-level training pipeline. First, we perform failure-driven LoRA supervised fine-tuning (SFT) on human demonstrations, which incorporates structured failure feedback to teach the model common failure modes and enforce the output format. Second, we refine the same LoRA adapters using Group Relative Policy Optimization (GRPO) with geometric verification: each sampled waypoint sequence is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Robot Manipulation and Learning
