Toward Agentic AI: Task-Oriented Communication for Hierarchical Planning of Long-Horizon Tasks
Sin-Yu Huang, Lele Wang, Vincent W.S. Wong

TL;DR
This paper introduces HiTOC, a hierarchical task-oriented communication framework for agentic AI that efficiently transmits only task-relevant information, enabling better performance in complex, long-horizon tasks.
Contribution
The paper develops a novel hierarchical communication framework with a conditional variational information bottleneck for adaptive, minimal information transmission in agentic AI.
Findings
HiTOC outperforms existing schemes in success rate on MAP-THOR benchmark.
The cVIB approach effectively reduces communication overhead.
Hierarchical structure improves handling of long-horizon tasks.
Abstract
Agentic artificial intelligence (AI) is an AI paradigm that can perceive the environment, reason over observations, and execute actions to achieve specific goals. Task-oriented communication supports agentic AI by transmitting only the task-related information instead of full raw data in order to reduce the bandwidth requirement. In real-world scenarios, AI agents often need to perform a sequence of actions to complete complex tasks. Completing these long-horizon tasks requires a hierarchical agentic AI architecture, where a high-level planner module decomposes a task into subtasks, and a low-level actor module executes each subtask sequentially. Since each subtask has a distinct goal, the existing task-oriented communication schemes are not designed to handle different goals for different subtasks. To address this challenge, in this paper, we develop a hierarchical task-oriented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Social Robot Interaction and HRI · Multimodal Machine Learning Applications
