Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics
Youhe Jiang, Ran Yan, You Peng, Wenshuang Li, Taiyi Wang, Fangcheng Fu, Binhang Yuan

TL;DR
Autopoiesis introduces an online, self-evolving system for LLM serving that uses LLM-driven program synthesis to adapt policies in real-time, outperforming static approaches under dynamic runtime conditions.
Contribution
It presents a novel paradigm where LLM serving policies are continuously evolved during deployment, enabling autonomous adaptation to changing runtime dynamics.
Findings
Achieves up to 53% performance improvement over state-of-the-art systems.
Demonstrates continuous policy evolution effectively adapts to diverse runtime dynamics.
Transforms policy design from offline to ongoing, autonomous system component.
Abstract
Modern Large Language Model (LLM) serving operates in highly volatile environments characterized by severe runtime dynamics, such as workload fluctuations and elastic cluster autoscaling. Traditional serving systems rely on static, human-engineered serving policies (e.g., scheduling algorithms and rescheduling strategies) to manage these dynamics. However, these policies must navigate deeply intertwined runtime trade-offs (e.g., scheduling overhead vs. execution efficiency, rescheduling frequency vs. reconfiguration overhead), whose optimal balance is workload-specific and shifts continuously as runtime conditions evolve, rendering any fixed policy fundamentally unable to adapt. We propose Autopoiesis, a novel online self-evolving system that shifts LLM serving from static policy deployment to continuous online policy evolution. First, Autopoiesis introduces an LLM-driven program…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
