Develop AI Agents for System Engineering in Factorio

Neel Kant

arXiv:2502.01492·cs.AI·February 4, 2025

Develop AI Agents for System Engineering in Factorio

Neel Kant

PDF

Open Access

TL;DR

This paper advocates for using the game Factorio as a sandbox environment to train and evaluate AI agents' system engineering skills, emphasizing dynamic system management and long-term planning.

Contribution

It proposes a new benchmark approach using automation-oriented sandbox games to better assess AI agents' system engineering capabilities.

Findings

01

Highlights limitations of current static benchmarks.

02

Suggests sandbox games as effective training environments.

03

Encourages research on AI reasoning and planning in engineering contexts.

Abstract

Continuing advances in frontier model research are paving the way for widespread deployment of AI agents. Meanwhile, global interest in building large, complex systems in software, manufacturing, energy and logistics has never been greater. Although AI driven system engineering holds tremendous promise, the static benchmarks dominating agent evaluations today fail to capture the crucial skills required for implementing dynamic systems, such as managing uncertain trade-offs and ensuring proactive adaptability. This position paper advocates for training and evaluating AI agents' system engineering abilities through automation-oriented sandbox games-particularly Factorio. By directing research efforts in this direction, we can equip AI agents with the specialized reasoning and long-horizon planning necessary to design, maintain, and optimize tomorrow's most demanding engineering projects.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Manufacturing Process and Optimization · Assembly Line Balancing Optimization