TeamLLM: A Human-Like Team-Oriented Collaboration Framework for Multi-Step Contextualized Tasks

Xiangyu Wang; Jin Wu; Haoran Shi; Wei Xia; Jiarui Yu; Chanjin Zheng

arXiv:2604.06765·cs.CL·April 9, 2026

TL;DR

TeamLLM is a human-like multi-LLM collaboration framework that assigns distinct team roles to improve performance on multi-step contextualized tasks. It is evaluated on a new benchmark, CGPST, which uses process-oriented, multi-dimensional assessment.

Contribution

The paper proposes a team-oriented collaboration framework for LLMs that explicitly emulates human team role division, and it introduces the CGPST benchmark for multi-step contextualized tasks.

Findings

01 TeamLLM significantly enhances LLM performance on the CGPST benchmark.

02 The CGPST benchmark has four core features: contextual grounding, procedural structure, process-oriented evaluation, and multi-dimensional assessment.

03 Evaluation of ten LLMs shows improved results with the TeamLLM framework.

Abstract

Recently, multi-Large Language Model (LLM) frameworks have been proposed to solve contextualized tasks. However, these frameworks do not explicitly emulate human team role division, which may lead to a single perspective, thereby weakening performance on multi-step contextualized tasks. To address this issue, we propose TeamLLM, a human-like Team-Oriented Multi-LLM Collaboration Framework. TeamLLM adopts four team roles with distinct division and employs a three-phase multi-LLM collaboration for multi-step contextualized tasks. To evaluate the effectiveness of TeamLLM on multi-step contextualized tasks, we propose Contextually-Grounded and Procedurally-Structured tasks (CGPST) and construct the CGPST benchmark. This benchmark has four core features: contextual grounding, procedural structure, process-oriented evaluation and multi-dimensional assessment. We evaluate ten popular LLMs on…
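The abstract describes four team roles working through a three-phase collaboration, but the excerpt here does not name the roles or the phases, nor does it say how turns are ordered. The following is a minimal sketch of that general shape only, not the authors' implementation. Every identifier in it (`role_1` to `role_4`, `phase_1` to `phase_3`, `run_team`, `echo_model`) is a hypothetical placeholder, and the shared-transcript turn order is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical placeholders: the abstract does not name the roles or phases.
ROLES = ["role_1", "role_2", "role_3", "role_4"]
PHASES = ["phase_1", "phase_2", "phase_3"]

# A "model" here is any callable that maps a prompt string to a reply string.
Model = Callable[[str], str]


@dataclass
class Turn:
    phase: str
    role: str
    text: str


def run_team(task: str, models: Dict[str, Model]) -> List[Turn]:
    """Assumed protocol: each role speaks once per phase and sees all prior turns."""
    transcript: List[Turn] = []
    for phase in PHASES:
        for role in ROLES:
            history = "\n".join(
                f"[{t.phase}/{t.role}] {t.text}" for t in transcript
            )
            prompt = (
                f"Task: {task}\nPhase: {phase}\nRole: {role}\n"
                f"History:\n{history}"
            )
            transcript.append(Turn(phase, role, models[role](prompt)))
    return transcript


def echo_model(name: str) -> Model:
    # Offline stand-in for a real LLM call, so the sketch runs without an API.
    return lambda prompt: f"{name} received {len(prompt)} characters"


if __name__ == "__main__":
    team = {role: echo_model(role) for role in ROLES}
    for turn in run_team("plan a school science fair", team):
        print(turn.phase, turn.role, turn.text)
```

Mapping each role to its own callable makes it possible to back different roles with different LLMs, or with the same LLM under different system prompts; which of these the paper does is not stated in this excerpt.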

Code & Models

Repositories

https://anonymous.4open.science/r/TeamLLM-anonymous-C50E