Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning
Yanfei Zhang

TL;DR
This paper introduces a hierarchical framework called Agent-as-Tool that separates tool calling from reasoning in reinforcement learning agents, improving reasoning efficiency and performance in complex tasks.
Contribution
The study proposes a novel hierarchical approach that decouples tool calling from reasoning, enhancing reasoning clarity and efficiency in LLM-based reinforcement learning agents.
Findings
Achieved 63.2% exact match on Bamboogle, surpassing previous methods.
Only slight reinforcement fine-tuning needed on 180 samples.
Outperformed Search-R1 by 4.8% in exact match.
Abstract
Large Language Models (LLMs) have emerged as one of the most significant technological advancements in artificial intelligence in recent years. Their ability to understand, generate, and reason with natural language has transformed how we interact with AI systems. With the development of LLM-based agents and reinforcement-learning-based reasoning models, the study of applying reinforcement learning in agent frameworks has become a new research focus. However, all previous studies face the challenge of deciding the tool calling process and the reasoning process simultaneously, and the chain of reasoning was solely relied on the unprocessed raw result with redundant information and symbols unrelated to the task from the tool, which impose a heavy burden on the model's capability to reason. Therefore, in our research, we proposed a hierarchical framework Agent-as-tool that detach the tool…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
