RESTL: Reinforcement Learning Guided by Multi-Aspect Rewards for Signal Temporal Logic Transformation

Yue Fang; Jin Zhi; Jie An; Hongshen Chen; Xiaohong Chen; Naijun Zhan

arXiv:2511.08555·cs.FL·November 12, 2025

RESTL: Reinforcement Learning Guided by Multi-Aspect Rewards for Signal Temporal Logic Transformation

Yue Fang, Jin Zhi, Jie An, Hongshen Chen, Xiaohong Chen, Naijun Zhan

PDF

Open Access 1 Video

TL;DR

RESTL is a reinforcement learning framework that improves the automatic translation of natural language into Signal Temporal Logic formulas by using multi-aspect reward models for better semantic fidelity, readability, and correctness.

Contribution

The paper introduces RESTL, a novel RL-based approach with multi-faceted reward models that enhance the accuracy and quality of natural language to STL transformations.

Findings

01

RESTL outperforms existing methods in automatic metrics.

02

RESTL achieves higher human evaluation scores.

03

The multi-aspect reward approach improves semantic and structural correctness.

Abstract

Signal Temporal Logic (STL) is a powerful formal language for specifying real-time specifications of Cyber-Physical Systems (CPS). Transforming specifications written in natural language into STL formulas automatically has attracted increasing attention. Existing rule-based methods depend heavily on rigid pattern matching and domain-specific knowledge, limiting their generalizability and scalability. Recently, Supervised Fine-Tuning (SFT) of large language models (LLMs) has been successfully applied to transform natural language into STL. However, the lack of fine-grained supervision on atomic proposition correctness, semantic fidelity, and formula readability often leads SFT-based methods to produce formulas misaligned with the intended meaning. To address these issues, we propose RESTL, a reinforcement learning (RL)-based framework for the transformation from natural language to STL.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

RESTL: Reinforcement Learning Guided by Multi-Aspect Rewards for Signal Temporal Logic Transformation· underline

Taxonomy

TopicsFormal Methods in Verification · Machine Learning and Algorithms · Topic Modeling