SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems

Jialiang Fan; Weizhe Xu; Mengyu Liu; Oleg Sokolsky; Insup Lee; Fanxin Kong

arXiv:2602.24235·cs.RO·March 11, 2026

SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems

Jialiang Fan, Weizhe Xu, Mengyu Liu, Oleg Sokolsky, Insup Lee, Fanxin Kong

PDF

Open Access

TL;DR

SafeGen-LLM is a novel large language model designed to improve safety adherence and generalization in robotic task planning across multiple domains and input formats, addressing limitations of classical and RL-based methods.

Contribution

The paper introduces SafeGen-LLM, a two-stage training framework that enhances safety compliance and generalization in LLM-based robotic planning, using a new multi-domain safety benchmark and formal verification-guided training.

Findings

01

SafeGen-LLM outperforms existing methods in safety adherence across multiple domains.

02

The two-stage training improves safety generalization and planning accuracy.

03

SafeGen-LLM handles various input formats effectively.

Abstract

Safety-critical task planning in robotic systems remains challenging: classical planners suffer from poor scalability, Reinforcement Learning (RL)-based methods generalize poorly, and base Large Language Models (LLMs) cannot guarantee safety. To address this gap, we propose safety-generalizable large language models, named SafeGen-LLM. SafeGen-LLM can not only enhance the safety satisfaction of task plans but also generalize well to novel safety properties in various domains. We first construct a multi-domain Planning Domain Definition Language 3 (PDDL3) benchmark with explicit safety constraints. Then, we introduce a two-stage post-training framework: Supervised Fine-Tuning (SFT) on a constraint-compliant planning dataset to learn planning syntax and semantics, and Group Relative Policy Optimization (GRPO) guided by fine-grained reward machines derived from formal verification to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · AI-based Problem Solving and Planning