Enhancing COBOL Code Explanations: A Multi-Agents Approach Using Large Language Models

Fangjian Lei; Jiawen Liu; Shayan Noei; Ying Zou; Derek Truong; William Alexander

arXiv:2507.02182·cs.SE·July 4, 2025

Enhancing COBOL Code Explanations: A Multi-Agents Approach Using Large Language Models

Fangjian Lei, Jiawen Liu, Shayan Noei, Ying Zou, Derek Truong, William Alexander

PDF

TL;DR

This paper introduces a multi-agent system leveraging large language models to improve explanations of COBOL code, effectively handling its complexity and size limitations, and demonstrating significant performance gains over baseline methods.

Contribution

It presents a novel multi-agent LLM-based approach that enhances COBOL code explanations by incorporating contextual information, addressing size limitations, and outperforming existing methods.

Findings

01

Significant improvements in explanation quality metrics (METEOR, chrF, SentenceBERT)

02

Effective explanation of large COBOL files exceeding token limits

03

82% of projects successfully explained in terms of functionality and purpose

Abstract

Common Business Oriented Language (COBOL) is a programming language used to develop business applications that are widely adopted by financial, business, and government agencies. Due to its age, complexity, and declining number of COBOL developers, maintaining COBOL codebases is becoming increasingly challenging. In particular, the lack of documentation makes it difficult for new developers to effectively understand and maintain COBOL systems. Existing research utilizes large language models (LLMs) to explain the functionality of code snippets. However, COBOL presents unique challenges due to its architectural and syntactical differences, which often cause its code to exceed the token window size of LLMs. In this work, we propose a multi-agent approach that leverages two LLM-based agents working collaboratively to generate explanations for functions, files, and the overall project.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.