GRAIL: A Deep-Granularity Hybrid Resonance Framework for Real-Time Agent Discovery via SLM-Enhanced Indexing

Jinliang Xu

arXiv:2605.02489·cs.AI·May 7, 2026

GRAIL: A Deep-Granularity Hybrid Resonance Framework for Real-Time Agent Discovery via SLM-Enhanced Indexing

Jinliang Xu

PDF

TL;DR

GRAIL is a hybrid framework that enables real-time agent discovery with high accuracy and sub-400ms latency by combining specialized models, data augmentation, and fine-grained matching techniques.

Contribution

GRAIL introduces a novel hybrid approach with SLM-enhanced prediction, pseudo-document expansion, and MaxSim resonance for fast, accurate agent discovery.

Findings

01

Achieves over 79x reduction in discovery latency compared to LLM-based methods.

02

Outperforms traditional vector search in Recall@10 on AgentTaxo-9K dataset.

03

Validates effectiveness on a large-scale agent dataset with high accuracy and speed.

Abstract

As the ecosystem of Large Language Model (LLM)-based agents expands rapidly, efficient and accurate Agent Discovery becomes a critical bottleneck for large-scale multi-agent collaboration. Existing approaches typically face a dichotomy: either relying on heavy-weight LLMs for intent parsing, leading to prohibitive latency (often exceeding 30 seconds), or using monolithic vector retrieval that sacrifices semantic precision for speed. To bridge this gap, we propose \textbf{GRAIL} (Granular Resonance-based Agent/AI Link), a novel framework achieving sub-400ms discovery latency without compromising accuracy. GRAIL introduces three key innovations: (1) \textbf{SLM-Enhanced Prediction}, replacing the generalized LLM parser with a specialized, fine-tuned Small Language Model (SLM) for millisecond-level capability tag prediction; (2) \textbf{Pseudo-Document Expansion}, augmenting agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.