Geometry-Aligned LLM Fine-Tuning for Sequential Narrow-Opening Planning

Al Jaber Mahmud; Xuan Wang

arXiv:2603.16028·cs.RO·March 18, 2026

Geometry-Aligned LLM Fine-Tuning for Sequential Narrow-Opening Planning

Al Jaber Mahmud, Xuan Wang

PDF

Open Access

TL;DR

This paper introduces a geometry-aligned LLM fine-tuning framework for sequential narrow-opening motion planning, enabling long-horizon geometric reasoning and high success rates in complex environments.

Contribution

It presents a novel bi-level training pipeline combining failure-driven supervised fine-tuning and geometric verification for improved motion planning.

Findings

01

Achieves highest success rates in diverse environments.

02

Demonstrates effective long-horizon geometric reasoning.

03

Outperforms existing methods in sequential narrow-opening tasks.

Abstract

We study rigid-body motion planning through multiple sequential narrow openings, which requires long-horizon geometric reasoning because the configuration used to traverse an early opening constrains the set of reachable configurations for subsequent ones. To achieve this, we propose a geometry-aligned large language model (LLM) fine-tuning framework that generates fixed-length, machine-readable waypoint sequences that are both geometrically feasible and coordinated across openings. Our approach uses a bi-level training pipeline. First, we perform failure-driven LoRA supervised fine-tuning (SFT) on human demonstrations, which incorporates structured failure feedback to teach the model common failure modes and enforce the output format. Second, we refine the same LoRA adapters using Group Relative Policy Optimization (GRPO) with geometric verification: each sampled waypoint sequence is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Robot Manipulation and Learning