On the Limitations of Large Language Models for Conceptual Database Modeling
Arthur F. Siqueira, Carlos D. S. Nogueira, Eduarda Farias, Claudio E. C. Campelo, J\'ulia Menezes

TL;DR
This paper evaluates the capabilities and limitations of Large Language Models in automatically generating Entity-Relationship diagrams from natural language requirements, highlighting their decreasing reliability with increasing complexity.
Contribution
It provides an empirical analysis of LLMs' effectiveness in conceptual database modeling and identifies current limitations in handling complex scenarios.
Findings
LLMs perform reasonably in simple scenarios
Reliability decreases as complexity increases
Inconsistencies and ambiguities rise with complexity
Abstract
This article analyzes the use of Large Language Models (LLMs) as support for the conceptual modeling of relational databases through the automatic generation of Entity-Relationship (ER) diagrams from natural language requirements. The approach combines different language models with prompt engineering techniques to evaluate their ability to identify entities, relationships, and attributes in a conceptually consistent manner. The experimental evaluation involved three LLMs, each subjected to three prompting techniques (Zero-Shot, Chain of Thought, and Chain of Thought + Verifier), applied to the same requirements scenario with progressively increasing complexity. The generated diagrams were qualitatively analyzed through direct comparison with the textual requirements, considering the structural and semantic adherence of the modeled elements. The results indicate that, although LLMs show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
