Adaptive AI Agent Placement and Migration in Edge Intelligence Systems

Xingdan Wang; Jiayi He; Zhiqing Tang; Jianxiong Guo; Jiong Lou; Liping Qian; Tian Wang; Weijia Jia

arXiv:2508.03345·cs.AI·August 6, 2025

Open Access

TL;DR

This paper introduces an adaptive framework for placing and migrating LLM-based AI agents at the edge. It uses ant colony algorithms and LLM-based optimization to improve resource utilization and quality of service (QoS) in dynamic environments.

Contribution

The authors present what they describe as the first systematic deployment and management solution for LLM-based AI agents in dynamic edge environments, covering both agent placement and agent migration.

Findings

01. Reduces deployment latency significantly

02. Lowers migration costs effectively

03. Improves resource utilization and QoS

Abstract

The rise of LLMs such as ChatGPT and Claude fuels the need for AI agents capable of real-time task handling. However, migrating data-intensive, multi-modal edge workloads to cloud data centers, traditionally used for agent deployment, introduces significant latency. Deploying AI agents at the edge improves efficiency and reduces latency. However, edge environments present challenges due to limited and heterogeneous resources. Maintaining QoS for mobile users necessitates agent migration, which is complicated by the complexity of AI agents coordinating LLMs, task planning, memory, and external tools. This paper presents the first systematic deployment and management solution for LLM-based AI agents in dynamic edge environments. We propose a novel adaptive framework for AI agent placement and migration in edge intelligence systems. Our approach models resource constraints and…
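
To make the placement idea more concrete, here is a minimal sketch of how an ant colony heuristic could assign agents to capacity-limited edge servers. It is not the paper's algorithm: the function name `aco_placement`, the single scalar resource demand per agent, the latency matrix, and all parameter values are assumptions chosen for illustration only.

```python
import random


def aco_placement(demand, capacity, latency,
                  n_ants=20, n_iters=50, rho=0.1, seed=0):
    """Toy ant colony search for agent-to-server placement (illustrative only).

    demand[i]     -- hypothetical resource demand of agent i
    capacity[j]   -- hypothetical capacity of edge server j
    latency[i][j] -- hypothetical latency cost of running agent i on server j
    Returns (assignment, cost); assignment[i] is the server chosen for agent i,
    or (None, inf) if no feasible placement was found.
    """
    rng = random.Random(seed)
    n_agents, n_servers = len(demand), len(capacity)
    # Pheromone level for every (agent, server) pair.
    tau = [[1.0] * n_servers for _ in range(n_agents)]
    best, best_cost = None, float("inf")

    for _ in range(n_iters):
        for _ in range(n_ants):
            free = list(capacity)
            assign = []
            for i in range(n_agents):
                # Only servers with enough remaining capacity are candidates.
                feasible = [j for j in range(n_servers) if free[j] >= demand[i]]
                if not feasible:
                    assign = None  # this ant built an infeasible placement
                    break
                # Prefer strong pheromone and low latency.
                weights = [tau[i][j] / (1.0 + latency[i][j]) for j in feasible]
                j = rng.choices(feasible, weights=weights, k=1)[0]
                free[j] -= demand[i]
                assign.append(j)
            if assign is None:
                continue
            cost = sum(latency[i][j] for i, j in enumerate(assign))
            if cost < best_cost:
                best, best_cost = assign, cost

        # Evaporate pheromone, then reinforce the best placement seen so far.
        for row in tau:
            for j in range(n_servers):
                row[j] *= 1.0 - rho
        if best is not None:
            for i, j in enumerate(best):
                tau[i][j] += 1.0 / (1.0 + best_cost)

    return best, best_cost
```

A call such as `aco_placement([2, 2, 1], [3, 3], [[1, 5], [4, 1], [2, 2]])` places three agents on two servers. A realistic model would track several resource types per server and would add a migration cost term for mobile users, both of which this sketch leaves out.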

Taxonomy

Topics: IoT and Edge/Fog Computing · Cloud Computing and Resource Management · Big Data and Digital Economy