Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents

Sanderson Oliveira de Macedo; Ronaldo Martins da Costa

arXiv:2605.18684·cs.SE·May 19, 2026

Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents

Sanderson Oliveira de Macedo, Ronaldo Martins da Costa

PDF

TL;DR

Reversa is a framework that converts legacy software into operational specifications for AI agents, enhancing traceability, confidence marking, and human validation in software migration tasks.

Contribution

It introduces a multi-agent pipeline for reverse documentation engineering, integrating traceability, confidence, and gap preservation in legacy system modernization.

Findings

01

Produced 517 claims with confidence indices in a case study

02

Identified 10 gaps and generated 53 Gherkin scenarios

03

Reconstructed a migration plan with 9 of 11 tasks completed

Abstract

Legacy systems concentrate business rules, architectural decisions, and operational exceptions that often remain implicit in code, data, configuration, and maintenance practices. At the same time, language-model-based coding agents depend on reliable context, correctness criteria, and behavioral contracts to modify real systems with lower risk. This paper presents Reversa, a reverse documentation engineering framework for converting legacy software into traceable operational specifications for AI agents. Reversa organizes this process as a multi-agent pipeline: specialized agents map the project surface, analyze modules, extract implicit rules, synthesize architecture, write unit-level specifications, and review generated claims. The proposal emphasizes three mechanisms: traceability between code and specification, explicit confidence marking, and preservation of gaps for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.