KIMAs: A Configurable Knowledge Integrated Multi-Agent System
Zitao Li, Fei Wei, Yuexiang Xie, Dawei Gao, Weirui Kuang, Zhijian Ma,, Bingchen Qian, Yaliang Li, Bolin Ding

TL;DR
KIMAs is a flexible, scalable multi-agent system designed to enhance knowledge-intensive applications with heterogeneous data, improved context management, and low-latency responses, facilitating real-world deployment of LLMs.
Contribution
This work introduces KIMAs, a configurable framework that integrates diverse knowledge sources with efficient retrieval, context handling, and parallel execution for practical LLM applications.
Findings
Successfully configured KIMAs for three real-world applications
Improved retrieval accuracy and conversational coherence
Achieved low-latency, scalable multi-agent deployment
Abstract
Knowledge-intensive conversations supported by large language models (LLMs) have become one of the most popular and helpful applications that can assist people in different aspects. Many current knowledge-intensive applications are centered on retrieval-augmented generation (RAG) techniques. While many open-source RAG frameworks facilitate the development of RAG-based applications, they often fall short in handling practical scenarios complicated by heterogeneous data in topics and formats, conversational context management, and the requirement of low-latency response times. This technical report presents a configurable knowledge integrated multi-agent system, KIMAs, to address these challenges. KIMAs features a flexible and configurable system for integrating diverse knowledge sources with 1) context management and query rewrite mechanisms to improve retrieval accuracy and multi-turn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Fuzzy Logic and Control Systems · Data Mining Algorithms and Applications
MethodsAttention Is All You Need · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · Attention Dropout · Byte Pair Encoding · Layer Normalization · Residual Connection · WordPiece · Linear Layer
