EACO-RAG: Towards Distributed Tiered LLM Deployment using Edge-Assisted and Collaborative RAG with Adaptive Knowledge Update
Jiaxing Li, Chi Xu, Lianchen Jia, Feng Wang, Cong Zhang, Jiangchuan, Liu

TL;DR
EACO-RAG is a novel framework that enables distributed, efficient, and adaptive deployment of large language models on edge devices by leveraging collaborative retrieval and knowledge updates, matching cloud accuracy at lower costs.
Contribution
This paper introduces EACO-RAG, a lightweight, distributed RAG framework with adaptive knowledge updates and hierarchical retrieval strategies for edge-based LLM deployment.
Findings
Achieves cloud-level accuracy with significantly reduced costs.
Reduces total costs by up to 84.6% under relaxed delay constraints.
Demonstrates effective hierarchical retrieval and adaptive knowledge updating.
Abstract
Large language models (LLMs) have demonstrated impressive capabilities in language tasks, but they require high computing power and rely on static knowledge. To overcome these limitations, Retrieval-Augmented Generation (RAG) incorporates up-to-date external information into LLMs without extensive fine-tuning. Meanwhile, small language models (SLMs) deployed on edge devices offer efficiency and low latency but often struggle with complex reasoning tasks. Unfortunately, current RAG approaches are predominantly based on centralized databases and have not been adapted to address the distinct constraints associated with deploying SLMs in edge environments. To bridge this gap, we propose Edge-Assisted and Collaborative RAG (EACO-RAG), a lightweight framework that leverages distributed edge nodes for adaptive knowledge updates and retrieval. EACO-RAG also employs a hierarchical collaborative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Distributed and Parallel Computing Systems · Image Retrieval and Classification Techniques
