BlackBoxToBlueprint: Extracting Interpretable Logic from Legacy Systems using Reinforcement Learning and Counterfactual Analysis
Vidhi Rathore

TL;DR
This paper introduces a method to automatically extract interpretable decision rules from legacy black-box systems using reinforcement learning and counterfactual analysis, aiding system understanding and modernization.
Contribution
It presents a novel pipeline combining RL exploration, counterfactual analysis, and decision tree learning to extract human-readable logic from complex legacy systems.
Findings
RL effectively identifies decision boundaries
Extracted rules closely match original system logic
Method works on various dummy legacy systems
Abstract
Modernizing legacy software systems is a critical but challenging task, often hampered by a lack of documentation and understanding of the original system's intricate decision logic. Traditional approaches like behavioral cloning merely replicate input-output behavior without capturing the underlying intent. This paper proposes a novel pipeline to automatically extract interpretable decision logic from legacy systems treated as black boxes. The approach uses a Reinforcement Learning (RL) agent to explore the input space and identify critical decision boundaries by rewarding actions that cause meaningful changes in the system's output. These counterfactual state transitions, where the output changes, are collected and clustered using K-Means. Decision trees are then trained on these clusters to extract human-readable rules that approximate the system's decision logic near the identified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Software Engineering Research · Software Testing and Debugging Techniques
