HiRAS: A Hierarchical Multi-Agent Framework for Paper-to-Code Generation and Execution

Hanhua Hong; Yizhi LI; Jiaoyan Chen; Sophia Ananiadou; Xiaoli Li; Jung-jae Kim; Chenghua Lin

arXiv:2604.17745·cs.CL·April 28, 2026

TL;DR

HiRAS is a hierarchical multi-agent framework for paper-to-code generation and execution in which supervisory manager agents coordinate specialised agents, improving robustness and performance over fixed sequential agent pipelines.

Contribution

The paper introduces HiRAS, a hierarchical multi-agent system with supervisory coordination, and Paper2Code-Extra (P2C-Ex), a refined reference-free evaluation protocol for the Paper2Code benchmark that incorporates repository-level information.

Findings

01

Achieved a relative performance gain of over 10% compared with the previous state of the art.

02

Significantly reduced hallucination in code generation evaluation.

03

Validated effectiveness and robustness through extensive experiments.

Abstract

Recent advances in large language models have highlighted their potential to automate computational research, particularly reproducing experimental results. However, existing approaches still use fixed sequential agent pipelines with weak global coordination, which limits their robustness and overall performance. In this work, we propose Hierarchical Research Agent System (HiRAS), a hierarchical multi-agent framework for end-to-end experiment reproduction that employs supervisory manager agents to coordinate specialised agents across fine-grained stages. We also identify limitations in the reference-free evaluation of the Paper2Code benchmark and introduce Paper2Code-Extra (P2C-Ex), a refined protocol that incorporates repository-level information and better aligns with the original reference-based metric. We conduct extensive evaluation, validating the effectiveness and robustness of…
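
The contrast the abstract draws between a fixed sequential pipeline and supervisory coordination can be illustrated with a small sketch. This is not the HiRAS implementation: the `Manager` class, the stage names (`plan`, `code`, `execution`), the retry policy, and the stub workers are all hypothetical, chosen only to show how a manager can check a specialised agent's output and re-run a stage, and how managers can nest because a manager's `run` has the same signature as a worker.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# A worker takes the shared state and returns an updated copy.
Worker = Callable[[dict], dict]


@dataclass
class Manager:
    """Hypothetical supervisory agent: runs stages in order, re-running any
    stage whose output fails the check, up to max_retries extra attempts."""
    name: str
    workers: Dict[str, Worker]
    check: Callable[[str, dict], bool]
    max_retries: int = 2
    log: List[str] = field(default_factory=list)

    def run(self, state: dict) -> dict:
        for stage, worker in self.workers.items():
            for attempt in range(self.max_retries + 1):
                candidate = worker(dict(state))  # work on a shallow copy
                self.log.append(f"{self.name}/{stage}#{attempt}")
                if self.check(stage, candidate):
                    state = candidate  # accept the output and move on
                    break
            else:
                raise RuntimeError(f"{stage} failed after retries")
        return state


def planner(state: dict) -> dict:
    state["plan"] = ["load data", "train", "evaluate"]
    return state


def make_flaky_coder() -> Worker:
    calls = {"n": 0}

    def coder(state: dict) -> dict:
        calls["n"] += 1
        # The first attempt drops the last step, so the check rejects it.
        steps = state["plan"][:-1] if calls["n"] == 1 else state["plan"]
        state["code"] = [f"def {s.replace(' ', '_')}(): pass" for s in steps]
        return state

    return coder


def executor(state: dict) -> dict:
    state["ran"] = len(state["code"])  # stand-in for actually running code
    return state


def check(stage: str, state: dict) -> bool:
    # Only the "code" stage is validated here: one function per plan step.
    if stage == "code":
        return len(state.get("code", [])) == len(state["plan"])
    return True


# A lower-level manager is itself a worker of the top-level manager.
generation = Manager("generation", {"plan": planner, "code": make_flaky_coder()}, check)
top = Manager("top", {"generation": generation.run, "execution": executor}, check)
result = top.run({"paper": "placeholder text"})
```

In a fixed sequential pipeline the incomplete first output of the coder would flow straight into execution; here the `generation` manager rejects it and retries before the `top` manager ever sees the result, which is the kind of coordination the abstract attributes to supervisory agents.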

Code & Models

Repositories

KOU-199024/HiRAS (GitHub)
