Open, Reliable, and Collective: A Community-Driven Framework for Tool-Using AI Agents

Hy Dang; Quang Dao; Meng Jiang

arXiv:2604.00137·cs.AI·April 2, 2026

Open, Reliable, and Collective: A Community-Driven Framework for Tool-Using AI Agents

Hy Dang, Quang Dao, Meng Jiang

PDF

TL;DR

OpenTools is a community-driven framework that standardizes, evaluates, and monitors external tools for AI agents, significantly improving reliability and task performance through collective contributions.

Contribution

It introduces a standardized, extensible toolbox with evaluation pipelines and a contribution protocol, emphasizing intrinsic tool accuracy for better AI agent reliability.

Findings

01

Community contributions lead to 6%-22% performance gains.

02

Standardized evaluation improves reproducibility and reliability.

03

Intrinsic tool accuracy is crucial for effective tool use by AI agents.

Abstract

Tool-integrated LLMs can retrieve, compute, and take real-world actions via external tools, but reliability remains a key bottleneck. We argue that failures stem from both tool-use accuracy (how well an agent invokes a tool) and intrinsic tool accuracy (the tool's own correctness), while most prior work emphasizes the former. We introduce OpenTools, a community-driven toolbox that standardizes tool schemas, provides lightweight plug-and-play wrappers, and evaluates tools with automated test suites and continuous monitoring. We also release a public web demo where users can run predefined agents and tools and contribute test cases, enabling reliability reports to evolve as tools change. OpenTools includes the core framework, an initial tool set, evaluation pipelines, and a contribution protocol. Experiments and evaluations show improved end-to-end reproducibility and task performance;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.