TL;DR
ToolOmni is a novel framework that enhances open-world tool use in LLMs through proactive retrieval and grounded execution, significantly improving accuracy and robustness.
Contribution
It introduces a unified agentic approach with a new dataset and a decoupled multi-objective algorithm for better tool retrieval and execution in open-world scenarios.
Findings
Achieves +10.8% end-to-end execution success rate over baselines.
Demonstrates superior robustness and generalization in open-world tool use.
Develops a new multi-turn interaction dataset for training.
Abstract
Large Language Models (LLMs) enhance their problem-solving capability by utilizing external tools. However, in open-world scenarios with massive and evolving tool repositories, existing methods relying on static embedding retrieval or parameter memorization of tools struggle to align user intent with tool semantics or generalize to unseen tools, respectively, leading to suboptimal accuracy of open-world tool retrieval and execution. To address these, we present ToolOmni, a unified agentic framework that enables LLMs for open-world tool use by proactive retrieval and grounded execution within a reasoning loop. First, we construct a cold-start multi-turn interaction dataset to instill foundational agentic capabilities via Supervised Fine-Tuning (SFT). Then, we introduce open-world tool learning based on a Decoupled Multi-Objective GRPO algorithm, which simultaneously optimizes LLMs for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
