Instruction-Tuning Open-Weight Language Models for BPMN Model Generation
G\"okberk \c{C}elikmasat, Atay \"Ozg\"ovde, Fatma Ba\c{s}ak Aydemir

TL;DR
This paper demonstrates that instruction-tuned open-weight language models can generate high-quality BPMN process models from natural language, reducing resource needs and improving structural accuracy compared to untuned models.
Contribution
It introduces InstruBPM, a cost-effective, privacy-preserving approach for fine-tuning open-source language models to generate BPMN diagrams from natural language descriptions.
Findings
Tuned models outperform untuned baselines in structural fidelity and similarity metrics.
Generated diagrams largely follow BPMN best practices and are useful as starting points.
Instruction tuning enhances robustness and reduces resource requirements.
Abstract
Domain models are central to software engineering, as they enable a shared understanding, guide implementation, and support automated analyses and model-driven development. Yet, despite these benefits, practitioners often skip modeling because it is time-consuming and demands scarce expertise. We address this barrier by investigating whether open-weight large language models, adapted via instruction tuning, can generate high-quality BPMN process models directly from natural language descriptions in a cost-effective and privacy-preserving way. We introduce InstruBPM, a reproducible approach that prepares paired text-diagram data and instruction tunes an open source large language model with parameter-efficient fine-tuning and quantization for on-prem deployment. We evaluate the tuned model through complementary perspectives: (i) text/code similarity using BLEU, ROUGE-L, and METEOR, (ii)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Software Engineering Research · Software Engineering Techniques and Practices
