Conversational AI Multi-Agent Interoperability, Universal Open APIs for Agentic Natural Language Multimodal Communications
Diego Gosmar, Deborah A. Dahl, Emmett Coin

TL;DR
This paper introduces a novel architecture for interoperable multi-modal Conversational AI agents using Universal Open APIs and a Discovery framework, enhancing scalability and standardization of AI interactions across diverse platforms.
Contribution
It proposes a new architecture and standards for multi-agent interoperability in Conversational AI, including Universal APIs and a Discovery specification framework.
Findings
Designed a Universal API framework for multi-modal AI agents
Introduced a Discovery specification for efficient agent lookup
Enhanced scalability and interoperability of AI communications
Abstract
This paper analyses Conversational AI multi-agent interoperability frameworks and describes the novel architecture proposed by the Open Voice Interoperability initiative (Linux Foundation AI and DATA), also known briefly as OVON (Open Voice Network). The new approach is illustrated, along with the main components, delineating the key benefits and use cases for deploying standard multi-modal AI agency (or agentic AI) communications. Beginning with Universal APIs based on Natural Language, the framework establishes and enables interoperable interactions among diverse Conversational AI agents, including chatbots, voicebots, videobots, and human agents. Furthermore, a new Discovery specification framework is introduced, designed to efficiently look up agents providing specific services and to obtain accurate information about these services through a standard Manifest publication,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Natural Language Processing Techniques
MethodsSparse Evolutionary Training
