Assessing UML Diagrams by GPT: Implications for Education
Chong Wang, Beian Wang, Peng Liang, Jie Liang

TL;DR
This study explores the potential of GPT to evaluate UML diagrams in software engineering education, proposing evaluation criteria and testing GPT's grading performance against human experts.
Contribution
The paper introduces a set of evaluation criteria for UML diagrams and assesses GPT's effectiveness in grading them, highlighting its current capabilities and limitations.
Findings
GPT can evaluate UML diagrams but cannot fully replace human experts.
There are five key discrepancies between GPT and human assessments.
GPT shows strengths and weaknesses depending on diagram type and evaluation criteria.
Abstract
In software engineering (SE) research and practice, UML is well known as an essential modeling methodology for requirements analysis and software modeling in both academia and industry. In particular, fundamental knowledge of UML modeling and practice in creating high-quality UML diagrams are included in SE-relevant courses in the undergraduate programs of many universities. This leads to a time-consuming and labor-intensive task for educators to review and grade a large number of UML diagrams created by the students. Recent advances in generative AI techniques, such as GPT, have paved new ways to automate many SE tasks. However, current research or tools seldom explore the capabilities of GPT in evaluating the quality of UML diagrams. This paper aims to investigate the feasibility and performance of GPT in assessing the quality of UML use case diagrams, class diagrams, and sequence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Online Learning and Analytics · AI in Service Interactions
