GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Zhuokang Shen; Yifan Wang; Hanyu Chen; Yunhang Shen; Wenxuan Huang; Gaoqi He; Jiao Xie; Rongrong Ji; Shaohui Lin

arXiv:2603.01059·cs.CL·April 10, 2026

GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

Zhuokang Shen, Yifan Wang, Hanyu Chen, Yunhang Shen, Wenxuan Huang, Gaoqi He, Jiao Xie, Rongrong Ji, Shaohui Lin

PDF

1 Repo 2 Models

TL;DR

GroupGPT is a novel framework that enhances multi-user chat assistants by being token-efficient, privacy-preserving, and capable of handling multimodal inputs through an edge-cloud architecture.

Contribution

It introduces GroupGPT, a new edge-cloud framework for multi-user chat that reduces token usage, preserves privacy, and supports multimodal inputs, along with a benchmark dataset MUIR.

Findings

01

GroupGPT achieves high response accuracy with an average score of 4.72/5.

02

Token usage is reduced by up to 3 times compared to baselines.

03

GroupGPT is well-received in diverse chat scenarios.

Abstract

Recent advances in large language models (LLMs) have enabled increasingly capable chatbots. However, most existing systems focus on single-user settings and do not generalize well to multi-user group chat interactions, where agents require more proactive and accurate intervention under complex, evolving contexts. Existing approaches typically rely on LLMs for both intervention reasoning and response generation, leading to high token consumption, limited scalability, and potential privacy risks. To address these challenges, we propose GroupGPT, a token-efficient and privacy-preserving agentic framework for multi-user chat assistant. GroupGPT adopts an edge-cloud model collaboration architecture to decouple intervention timing from response generation, enabling efficient and accurate decision-making while preserving user privacy through on-device processing of sensitive information. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Eliot-Shen/GroupGPT
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.