Networking-Aware Energy Efficiency in Agentic AI Inference: A Survey

Xiaojing Chen; Haiqi Yu; Wei Ni; Dusit Niyato; Ruichen Zhang; Xin Wang; Shunqing Zhang; and Shugong Xu

arXiv:2604.07857·eess.SY·April 10, 2026

Networking-Aware Energy Efficiency in Agentic AI Inference: A Survey

Xiaojing Chen, Haiqi Yu, Wei Ni, Dusit Niyato, Ruichen Zhang, Xin Wang, Shunqing Zhang, and Shugong Xu

PDF

TL;DR

This survey reviews energy efficiency challenges in agentic AI, emphasizing computational and communication costs, and proposes a framework and taxonomy for optimizing energy use in autonomous systems.

Contribution

It introduces an energy accounting framework and a unified taxonomy for energy-efficient agentic AI, highlighting cross-layer co-design strategies and future research challenges.

Findings

01

Identifies computational and communication energy bottlenecks in agentic AI.

02

Proposes a unified taxonomy for energy optimization strategies.

03

Outlines open challenges and a roadmap for scalable autonomous intelligence.

Abstract

The rapid emergence of Large Language Models (LLMs) has catalyzed Agentic artificial intelligence (AI), autonomous systems integrating perception, reasoning, and action into closed-loop pipelines for continuous adaptation. While unlocking transformative applications in mobile edge computing, autonomous systems, and next-generation wireless networks, this paradigm creates fundamental energy challenges through iterative inference and persistent data exchange. Unlike traditional AI where bottlenecks are computational Floating Point Operations (FLOPs), Agentic AI faces compounding computational and communication energy costs. In this survey, we propose an energy accounting framework identifying computational and communication costs across the Perception-Reasoning-Action cycle. We establish a unified taxonomy spanning model simplification, computation control, input and attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.