Communicating Activations Between Language Model Agents
Vignav Ramesh, Kenneth Li

TL;DR
This paper introduces a novel activation-based communication method for language model agents that improves scalability and efficiency over natural language communication, achieving significant performance gains in reasoning tasks.
Contribution
It proposes a simple activation communication technique that scales LMs without extra parameters or data, reducing compute costs and enhancing performance over traditional natural language methods.
Findings
Achieves up to 27% improvement over natural language communication.
Requires less than a quarter of the compute used by language-based communication.
Demonstrates robustness across multiple tasks and experimental setups.
Abstract
Communication between multiple language model (LM) agents has been shown to scale up the reasoning ability of LMs. While natural language has been the dominant medium for inter-LM communication, it is not obvious this should be the standard: not only does natural language communication incur high inference costs that scale quickly with the number of both agents and messages, but also the decoding process abstracts away too much rich information that could be otherwise accessed from the internal activations. In this work, we propose a simple technique whereby LMs communicate via activations; concretely, we pause an LM 's computation at an intermediate layer, combine its current activation with another LM 's intermediate activation via some function , then pass 's output into the next layer of and continue the forward pass till…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Natural Language Processing Techniques · Semantic Web and Ontologies
