SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems
Weizhe Xu, Mengyu Liu, Fanxin Kong

TL;DR
SafePilot is a hierarchical neuro-symbolic framework designed to ensure safety and correctness in LLM-enabled cyber-physical systems by verifying and adjusting plans based on formal specifications.
Contribution
The paper introduces SafePilot, a novel framework that combines hierarchical planning, verification, and iterative prompt adjustment to assure LLM-enabled CPS.
Findings
Effective in managing complex tasks with attribute and temporal constraints
Demonstrated success in two case studies
Enhances safety and reliability of LLM-enabled CPS
Abstract
Large Language Models (LLMs), deep learning architectures with typically over 10 billion parameters, have recently begun to be integrated into various cyber-physical systems (CPS) such as robotics, industrial automation, and autopilot systems. The abstract knowledge and reasoning capabilities of LLMs are employed for tasks like planning and navigation. However, a significant challenge arises from the tendency of LLMs to produce "hallucinations" - outputs that are coherent yet factually incorrect or contextually unsuitable. This characteristic can lead to undesirable or unsafe actions in the CPS. Therefore, our research focuses on assuring the LLM-enabled CPS by enhancing their critical properties. We propose SafePilot, a novel hierarchical neuro-symbolic framework that provides end-to-end assurance for LLM-enabled CPS according to attribute-based and temporal specifications. Given a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Autonomous Vehicle Technology and Safety
