A Multi-LLM Orchestration Engine for Personalized, Context-Rich   Assistance

Sumedh Rasal

arXiv:2410.10039·cs.MA·October 15, 2024

A Multi-LLM Orchestration Engine for Personalized, Context-Rich Assistance

Sumedh Rasal

PDF

Open Access

TL;DR

This paper introduces a multi-LLM orchestration system that combines graph and vector databases to deliver personalized, context-aware AI assistance with reduced hallucinations and improved relevance.

Contribution

It presents a novel architecture integrating multiple LLMs with graph and vector databases for enhanced personalization and contextual understanding in AI assistants.

Findings

01

Improved contextual relevance in AI responses.

02

Reduced hallucination rates through orchestration.

03

Enhanced personalization via user interaction graphs.

Abstract

In recent years, large language models have demonstrated remarkable capabilities in natural language understanding and generation. However, these models often struggle with hallucinations and maintaining long term contextual relevance, particularly when dealing with private or local data. This paper presents a novel architecture that addresses these challenges by integrating an orchestration engine that utilizes multiple LLMs in conjunction with a temporal graph database and a vector database. The proposed system captures user interactions, builds a graph representation of conversations, and stores nodes and edges that map associations between key concepts, entities, and behaviors over time. This graph based structure allows the system to develop an evolving understanding of the user preferences, providing personalized and contextually relevant answers. In addition to this, a vector…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies