Large Language Models for Analyzing Enterprise Architecture Debt in Unstructured Documentation

Christin Pagels; Simon Hacks; Rob Henk Bemthuis

arXiv:2604.00046·cs.SE·April 2, 2026

Large Language Models for Analyzing Enterprise Architecture Debt in Unstructured Documentation

Christin Pagels, Simon Hacks, Rob Henk Bemthuis

PDF

TL;DR

This paper presents an LLM-based approach to automatically detect and quantify Enterprise Architecture Smells in unstructured documentation, enhancing EA governance practices.

Contribution

It introduces a novel LLM-based prototype for identifying EA Smells in unstructured documents, evaluated through a case study and benchmark comparison.

Findings

01

LLMs can detect multiple EA Smells in unstructured text.

02

Benchmark models achieve higher precision and speed.

03

Fine-tuned models offer data protection benefits.

Abstract

Enterprise Architecture Debt (EA Debt) arises from suboptimal design decisions and misaligned components that can degrade an organization's IT landscape over time. Early indicators, Enterprise Architecture Smells (EA Smells), are currently mainly detected manually or only from structured artifacts, leaving much unstructured documentation under-analyzed. This study proposes an approach using a large language model (LLM) to identify and quantify EA Debt in unstructured architectural documentation. Following a design science research approach, we design and evaluate an LLM-based prototype for automated EA Smell detection. The artifact ingests unstructured documents (e.g., process descriptions, strategy papers), applies fine-tuned detection models, and outputs identified smells. We evaluate the prototype through a case study using synthetic yet realistic business documents, benchmarking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.