Asynchronous Tool Usage for Real-Time Agents

Antonio A. Ginart; Naveen Kodali; Jason Lee; Caiming Xiong; Silvio Savarese; John Emmons

arXiv:2410.21620·cs.AI·October 30, 2024

TL;DR

This paper introduces asynchronous AI agents that enable real-time, multitasking interactions by leveraging an event-driven architecture, improving upon traditional synchronous, turn-based systems.

Contribution

The paper presents a novel asynchronous, event-driven framework for AI agents, allowing parallel processing and real-time tool use, inspired by real-time operating systems.
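As a rough illustration of what parallel tool use means in practice, the sketch below uses Python's `asyncio` to dispatch a tool call in the background while the agent keeps handling user input. It is not the paper's implementation: `slow_tool`, `run_agent`, the query string, and the delays are all hypothetical names and values chosen for the example.

```python
import asyncio


async def slow_tool(query: str, delay: float = 0.05) -> str:
    """Hypothetical tool: stands in for a slow external API call."""
    await asyncio.sleep(delay)
    return f"result for {query!r}"


async def run_agent(user_utterances: list[str]) -> list[str]:
    """Handle user input while a tool call runs in the background."""
    log: list[str] = []
    # Dispatch the tool call without waiting for it to finish.
    tool_task = asyncio.create_task(slow_tool("weather"))
    log.append("tool dispatched")
    for utterance in user_utterances:
        # Simulated per-utterance work; the tool keeps running meanwhile.
        await asyncio.sleep(0.01)
        log.append(f"heard: {utterance}")
    # Collect the tool result once the agent is ready for it.
    log.append(await tool_task)
    return log


if __name__ == "__main__":
    for line in asyncio.run(run_agent(["hello", "also book a table"])):
        print(line)
```

In a strictly turn-based design the agent would block on `slow_tool` before hearing anything else; here the user utterances are processed while the call is still in flight.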

Findings

01. Developed an event-driven finite-state machine architecture for AI agents.

02. Integrated automatic speech recognition and text-to-speech for real-time interaction.

03. Demonstrated improved multitasking capabilities in AI agents.

Abstract

While frontier large language models (LLMs) are capable tool-using agents, current AI systems still operate in a strict turn-based fashion, oblivious to the passage of time. This synchronous design forces user queries and tool use to occur sequentially, preventing the systems from multitasking and reducing interactivity. To address this limitation, we introduce asynchronous AI agents capable of parallel processing and real-time tool use. Our key contribution is an event-driven finite-state machine architecture for agent execution and prompting, integrated with automatic speech recognition and text-to-speech. Drawing inspiration from concepts originally developed for real-time operating systems, this work presents both a conceptual framework and practical tools for creating AI agents capable of fluid, multitasking interactions.
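To make the idea of an event-driven finite-state machine concrete, here is a minimal sketch in Python. The state names, event names, and transition table are assumptions made for illustration; the abstract does not specify them, and the paper's actual design may differ.

```python
from enum import Enum, auto


class State(Enum):
    # Hypothetical agent states, not taken from the paper.
    IDLE = auto()
    LISTENING = auto()
    GENERATING = auto()
    SPEAKING = auto()


class Event(Enum):
    # Hypothetical events from speech recognition, tools, and speech output.
    USER_SPEECH_START = auto()
    USER_SPEECH_END = auto()
    TOOL_RESULT = auto()
    RESPONSE_READY = auto()
    PLAYBACK_DONE = auto()


# Transition table: (current state, event) -> next state.
TRANSITIONS = {
    (State.IDLE, Event.USER_SPEECH_START): State.LISTENING,
    (State.IDLE, Event.TOOL_RESULT): State.GENERATING,
    (State.LISTENING, Event.USER_SPEECH_END): State.GENERATING,
    (State.GENERATING, Event.RESPONSE_READY): State.SPEAKING,
    (State.GENERATING, Event.USER_SPEECH_START): State.LISTENING,
    (State.SPEAKING, Event.PLAYBACK_DONE): State.IDLE,
    # The user interrupts the agent mid-response.
    (State.SPEAKING, Event.USER_SPEECH_START): State.LISTENING,
}


def step(state: State, event: Event) -> State:
    """Return the next state; pairs missing from the table leave it unchanged."""
    return TRANSITIONS.get((state, event), state)


def run(events: list[Event], start: State = State.IDLE) -> list[State]:
    """Feed events through the machine and return every state visited."""
    trace = [start]
    for event in events:
        trace.append(step(trace[-1], event))
    return trace
```

Because transitions are triggered by events rather than by conversational turns, a tool result or a user interruption can move the agent to a new state at any moment. In this simplified version an event with no matching entry is dropped; a fuller system would more likely queue it for later handling.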

Taxonomy

Topics: Distributed and Parallel Computing Systems · Robotic Path Planning Algorithms · Mobile Agent-Based Network Management