Evaluating perturbation robustness of generative systems that use COBOL code inputs
Samuel Ackerman, Wesam Ibraheem, Orna Raz, Marcel Zalmanovici

TL;DR
This paper introduces a framework to evaluate and visualize the robustness of LLM-based systems processing COBOL code, addressing sensitivity issues in critical legacy applications and providing tools for debugging and improvement.
Contribution
It develops perturbation methods for COBOL code, creates a dataset with variants, and provides visualization dashboards to analyze and enhance system robustness.
Findings
Robustness metrics reveal sensitivity to input variations
Visualization tools assist in debugging and understanding system behavior
Framework applicable to other code-related tasks
Abstract
Systems incorporating large language models (LLMs) as a component are known to be sensitive (i.e., non-robust) to minor input variations that do not change the meaning of the input; such sensitivity may reduce the system's usefulness. Here, we present a framework to evaluate robustness of systems using COBOL code as input; our application is translation between COBOL and Java programming languages, but the approach extends to other tasks such as code generation or explanation. Targeting robustness of systems with COBOL as input is essential yet challenging. Many business-critical applications are written in COBOL, yet these are typically proprietary legacy applications and their code is unavailable to LLMs for training. We develop a library of COBOL paragraph and full-program perturbation methods, and create variant-expanded versions of a benchmark dataset of examples for a specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Parallel Computing and Optimization Techniques · Advanced Software Engineering Methodologies
