Enhancing Transformation from Natural Language to Signal Temporal Logic Using LLMs with Diverse External Knowledge
Yue Fang, Zhi Jin, Jie An, Hongshen Chen, Xiaohong Chen, Naijun Zhan

TL;DR
This paper introduces a large, diverse NL-STL dataset and a novel knowledge-guided framework for transforming natural language into Signal Temporal Logic, improving accuracy and diversity over existing methods.
Contribution
The paper presents STL-DivEn, a large diverse NL-STL dataset, and KGST, a knowledge-guided transformation framework leveraging external knowledge and a generate-then-refine process.
Findings
STL-DivEn dataset is more diverse than existing datasets.
KGST outperforms baseline models in transformation accuracy.
Both metric and human evaluations confirm the effectiveness of KGST.
Abstract
Temporal Logic (TL), especially Signal Temporal Logic (STL), enables precise formal specification, making it widely used in cyber-physical systems such as autonomous driving and robotics. Automatically transforming NL into STL is an attractive approach to overcome the limitations of manual transformation, which is time-consuming and error-prone. However, due to the lack of datasets, automatic transformation currently faces significant challenges and has not been fully explored. In this paper, we propose an NL-STL dataset named STL-Diversity-Enhanced (STL-DivEn), which comprises 16,000 samples enriched with diverse patterns. To develop the dataset, we first manually create a small-scale seed set of NL-STL pairs. Next, representative examples are identified through clustering and used to guide large language models (LLMs) in generating additional NL-STL pairs. Finally, diversity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
