Reasoning, Memorization, and Fine-Tuning Language Models for Non-Cooperative Games
Yunhao Yang, Leonard Berthellemy, Ufuk Topcu

TL;DR
This paper introduces a novel multi-agent, tree-of-thought framework combined with fine-tuning to improve language models' ability to solve complex non-cooperative games efficiently, achieving high success rates with minimal training data.
Contribution
The paper presents a new integrated approach that combines reasoning, memorization, and fine-tuning in language models for game solving, demonstrating significant performance improvements.
Findings
Achieved a 65% winning rate against benchmark algorithms.
Fine-tuning improved performance by an additional 10%.
Method requires only about 1000 training samples, showing high efficiency.
Abstract
We develop a method that integrates the tree of thoughts and multi-agent framework to enhance the capability of pre-trained language models in solving complex, unfamiliar games. The method decomposes game-solving into four incremental tasks -- game summarization, area selection, action extraction, and action validation -- each assigned to a specific language-model agent. By constructing a tree of thoughts, the method simulates reasoning paths and allows agents to collaboratively distill game representations and tactics, mitigating the limitations of language models in reasoning and long-term memorization. Additionally, an automated fine-tuning process further optimizes the agents' performance by ranking query-response pairs based on game outcomes, e.g., winning or losing. We apply the method to a non-cooperative game and demonstrate a 65 percent winning rate against benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multi-Agent Systems and Negotiation
