Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents
Sanderson Oliveira de Macedo, Ronaldo Martins da Costa

TL;DR
Reversa is a framework that converts legacy software into operational specifications for AI agents, enhancing traceability, confidence marking, and human validation in software migration tasks.
Contribution
It introduces a multi-agent pipeline for reverse documentation engineering, integrating traceability, confidence, and gap preservation in legacy system modernization.
Findings
Produced 517 claims with confidence indices in a case study
Identified 10 gaps and generated 53 Gherkin scenarios
Reconstructed a migration plan with 9 of 11 tasks completed
Abstract
Legacy systems concentrate business rules, architectural decisions, and operational exceptions that often remain implicit in code, data, configuration, and maintenance practices. At the same time, language-model-based coding agents depend on reliable context, correctness criteria, and behavioral contracts to modify real systems with lower risk. This paper presents Reversa, a reverse documentation engineering framework for converting legacy software into traceable operational specifications for AI agents. Reversa organizes this process as a multi-agent pipeline: specialized agents map the project surface, analyze modules, extract implicit rules, synthesize architecture, write unit-level specifications, and review generated claims. The proposal emphasizes three mechanisms: traceability between code and specification, explicit confidence marking, and preservation of gaps for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
