Software Engineering Agents for Embodied Controller Generation : A Study in Minigrid Environments

Timoth\'e Boulet; Xavier Hinaut; Cl\'ement Moulin-Frier

arXiv:2510.21902·cs.SE·October 28, 2025

Software Engineering Agents for Embodied Controller Generation : A Study in Minigrid Environments

Timoth\'e Boulet, Xavier Hinaut, Cl\'ement Moulin-Frier

PDF

Open Access

TL;DR

This paper evaluates Software Engineering Agents (SWE-Agents) in generating controllers for embodied tasks in Minigrid environments, analyzing how information access impacts their performance and establishing a new evaluation domain.

Contribution

First extended evaluation of SWE-Agents on embodied tasks, comparing static and dynamic information access, and establishing a new benchmark for future research.

Findings

01

Information access significantly affects SWE-Agent performance.

02

Dynamic exploration improves task-solving success.

03

Static code analysis alone is often insufficient.

Abstract

Software Engineering Agents (SWE-Agents) have proven effective for traditional software engineering tasks with accessible codebases, but their performance for embodied tasks requiring well-designed information discovery remains unexplored. We present the first extended evaluation of SWE-Agents on controller generation for embodied tasks, adapting Mini-SWE-Agent (MSWEA) to solve 20 diverse embodied tasks from the Minigrid environment. Our experiments compare agent performance across different information access conditions: with and without environment source code access, and with varying capabilities for interactive exploration. We quantify how different information access levels affect SWE-Agent performance for embodied tasks and analyze the relative importance of static code analysis versus dynamic exploration for task solving. This work establishes controller generation for embodied…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Software Engineering Methodologies · Multi-Agent Systems and Negotiation · Reinforcement Learning in Robotics