Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics

Youhe Jiang; Ran Yan; You Peng; Wenshuang Li; Taiyi Wang; Fangcheng Fu; Binhang Yuan

arXiv:2604.07144·cs.DC·April 9, 2026

Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics

Youhe Jiang, Ran Yan, You Peng, Wenshuang Li, Taiyi Wang, Fangcheng Fu, Binhang Yuan

PDF

TL;DR

Autopoiesis introduces an online, self-evolving system for LLM serving that uses LLM-driven program synthesis to adapt policies in real-time, outperforming static approaches under dynamic runtime conditions.

Contribution

It presents a novel paradigm where LLM serving policies are continuously evolved during deployment, enabling autonomous adaptation to changing runtime dynamics.

Findings

01

Achieves up to 53% performance improvement over state-of-the-art systems.

02

Demonstrates continuous policy evolution effectively adapts to diverse runtime dynamics.

03

Transforms policy design from offline to ongoing, autonomous system component.

Abstract

Modern Large Language Model (LLM) serving operates in highly volatile environments characterized by severe runtime dynamics, such as workload fluctuations and elastic cluster autoscaling. Traditional serving systems rely on static, human-engineered serving policies (e.g., scheduling algorithms and rescheduling strategies) to manage these dynamics. However, these policies must navigate deeply intertwined runtime trade-offs (e.g., scheduling overhead vs. execution efficiency, rescheduling frequency vs. reconfiguration overhead), whose optimal balance is workload-specific and shifts continuously as runtime conditions evolve, rendering any fixed policy fundamentally unable to adapt. We propose Autopoiesis, a novel online self-evolving system that shifts LLM serving from static policy deployment to continuous online policy evolution. First, Autopoiesis introduces an LLM-driven program…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.