Structure- and Event-Driven Frameworks for State Machine Modeling with Large Language Models
Samer Abdulkarim, Evan Boyd, Karl Bridi, Alec Tufenkjian, Boqi Chen, Gunter Mussbacher

TL;DR
This paper explores using large language models with specialized frameworks to automate UML state machine generation from natural language, evaluating their capabilities and limitations.
Contribution
It introduces a hybrid approach combining LLM outputs with structured frameworks, advancing automated state machine modeling from unstructured requirements.
Findings
Claude 3.5 Sonnet achieves high F1-scores for states and transitions.
GPT-4o's performance is limited in generating guards and actions.
The hybrid approach enhances GPT-4o's performance to match Claude 3.5 Sonnet.
Abstract
UML state machine design is a critical process in software engineering. Traditionally, state machines are manually crafted by experienced engineers based on natural language requirements-a time-consuming and error-prone procedure. Many automated approaches exist but they require structured NL requirements. In this paper, we investigate the capabilities of current Large Language Models to fully automate UML state machine generation via specialized State Machine Frameworks (SMFs) from non-structured NL requirements. We evaluate two types of state-of-the-art LLMs using single-step and multi-step prompting approaches: a non-reasoning LLM GPT-4o and a reasoning-focused LLM Claude 3.5 Sonnet, and introduce a novel Hybrid Approach that uses the output from a Single-Prompt Baseline as an initial draft state machine, which is then refined through an SMF. In our study, two distinct SMFs are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
