Deterministic vs. LLM-Controlled Orchestration for COBOL-to-Python Modernization
Naing Oo Lwin, Rajesh Kumar

TL;DR
This study compares deterministic and LLM-controlled orchestration in COBOL-to-Python modernization, finding that deterministic control offers better robustness and cost efficiency without sacrificing correctness.
Contribution
It provides the first controlled empirical evaluation isolating orchestration strategies, demonstrating the advantages of deterministic control in structured software workflows.
Findings
Deterministic orchestration matches LLM-controlled accuracy.
Deterministic control improves worst-case robustness.
It reduces token consumption by up to 3.5x.
Abstract
Modernizing legacy COBOL systems remains difficult due to scarce expertise, large and long-lived codebases, and strict correctness requirements. Recent large language model (LLM)-based modernization systems increasingly rely on agentic workflows in which the model controls multi-step tool execution. However, it remains unclear whether delegating execution control to the LLM improves correctness, robustness, or efficiency in structured software engineering workflows. We present a controlled empirical study of deterministic and LLM-controlled orchestration for COBOL-to-Python modernization. Using a unified experimental framework, we hold the language models, prompts, tools, configurations, and source programs constant while varying only the execution control strategy. This isolates orchestration as the sole experimental variable. We evaluate both approaches using functional correctness,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
