A Multi-LLM Orchestration Engine for Personalized, Context-Rich Assistance
Sumedh Rasal

TL;DR
This paper introduces a multi-LLM orchestration system that combines graph and vector databases to deliver personalized, context-aware AI assistance with reduced hallucinations and improved relevance.
Contribution
It presents a novel architecture integrating multiple LLMs with graph and vector databases for enhanced personalization and contextual understanding in AI assistants.
Findings
Improved contextual relevance in AI responses.
Reduced hallucination rates through orchestration.
Enhanced personalization via user interaction graphs.
Abstract
In recent years, large language models have demonstrated remarkable capabilities in natural language understanding and generation. However, these models often struggle with hallucinations and maintaining long term contextual relevance, particularly when dealing with private or local data. This paper presents a novel architecture that addresses these challenges by integrating an orchestration engine that utilizes multiple LLMs in conjunction with a temporal graph database and a vector database. The proposed system captures user interactions, builds a graph representation of conversations, and stores nodes and edges that map associations between key concepts, entities, and behaviors over time. This graph based structure allows the system to develop an evolving understanding of the user preferences, providing personalized and contextually relevant answers. In addition to this, a vector…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies
