Deterministic vs. LLM-Controlled Orchestration for COBOL-to-Python Modernization

Naing Oo Lwin; Rajesh Kumar

arXiv:2605.09894·cs.SE·May 12, 2026

Deterministic vs. LLM-Controlled Orchestration for COBOL-to-Python Modernization

Naing Oo Lwin, Rajesh Kumar

PDF

TL;DR

This study compares deterministic and LLM-controlled orchestration in COBOL-to-Python modernization, finding that deterministic control offers better robustness and cost efficiency without sacrificing correctness.

Contribution

It provides the first controlled empirical evaluation isolating orchestration strategies, demonstrating the advantages of deterministic control in structured software workflows.

Findings

01

Deterministic orchestration matches LLM-controlled accuracy.

02

Deterministic control improves worst-case robustness.

03

It reduces token consumption by up to 3.5x.

Abstract

Modernizing legacy COBOL systems remains difficult due to scarce expertise, large and long-lived codebases, and strict correctness requirements. Recent large language model (LLM)-based modernization systems increasingly rely on agentic workflows in which the model controls multi-step tool execution. However, it remains unclear whether delegating execution control to the LLM improves correctness, robustness, or efficiency in structured software engineering workflows. We present a controlled empirical study of deterministic and LLM-controlled orchestration for COBOL-to-Python modernization. Using a unified experimental framework, we hold the language models, prompts, tools, configurations, and source programs constant while varying only the execution control strategy. This isolates orchestration as the sole experimental variable. We evaluate both approaches using functional correctness,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.