AI Agent Systems: Architectures, Applications, and Evaluation
Bin Xu

TL;DR
This survey reviews the architecture, applications, and evaluation methods of AI agent systems that integrate foundation models with reasoning, planning, and tool use, highlighting design trade-offs and open challenges.
Contribution
It provides a unified taxonomy of AI agent architectures, orchestration patterns, and deployment settings, along with analysis of evaluation practices and open research challenges.
Findings
Organized prior work into a comprehensive taxonomy.
Identified key design trade-offs in AI agent systems.
Summarized current evaluation and benchmarking practices.
Abstract
AI agents -- systems that combine foundation models with reasoning, planning, memory, and tool use -- are rapidly becoming a practical interface between natural-language intent and real-world computation. This survey synthesizes the emerging landscape of AI agent architectures across: (i) deliberation and reasoning (e.g., chain-of-thought-style decomposition, self-reflection and verification, and constraint-aware decision making), (ii) planning and control (from reactive policies to hierarchical and multi-step planners), and (iii) tool calling and environment interaction (retrieval, code execution, APIs, and multimodal perception). We organize prior work into a unified taxonomy spanning agent components (policy/LLM core, memory, world models, planners, tool routers, and critics), orchestration patterns (single-agent vs.\ multi-agent; centralized vs.\ decentralized coordination), and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning
