From Text to DSL: Evaluating Grammar-Based Model Generation Using Open LLMs

Junaid Baber; Nicolas Hili; Didier Schwab; L\'eo Challier; C\'ecilia Satrin

arXiv:2605.15865·cs.SE·May 18, 2026

From Text to DSL: Evaluating Grammar-Based Model Generation Using Open LLMs

Junaid Baber, Nicolas Hili, Didier Schwab, L\'eo Challier, C\'ecilia Satrin

PDF

TL;DR

This paper evaluates the ability of open-source LLMs of various sizes to generate DSL-conformant models from natural language without fine-tuning, focusing on syntactic validity, semantic completeness, and consistency.

Contribution

It extends prior work by generating both UI and data models from scratch, demonstrating smaller open-source LLMs can effectively produce grammar-conformant models for MDE tasks.

Findings

01

Several compact models approach or match larger models in quality.

02

Smaller open-source LLMs are feasible for grammar-conformant DSL generation.

03

Evaluation combines automatic parsing and expert feedback across 39 LLMs.

Abstract

Large Language Models (LLMs) have shown increasing potential in automating model-driven software engineering tasks, particularly in generating models conforming to Domain Specific Languages (DSLs) from natural language. While most existing approaches rely on large proprietary models, their high cost and limited deployability hinder broader adoption. In this paper, we evaluate whether open-source LLMs of varying sizes (0.5B to 32B parameters) can generate DSL-conformant models using only few-shot prompting, without any fine-tuning. Our evaluation focuses on key model-driven engineering (MDE) requirements, including syntactic validity, semantic completeness, and inter-model reference consistency. We extend our prior work by moving from generating user interface models (referred to as "UI models" in this paper) over fixed, predefined data schemas ("data models") to generating both the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.