Reasoning, Memorization, and Fine-Tuning Language Models for   Non-Cooperative Games

Yunhao Yang; Leonard Berthellemy; Ufuk Topcu

arXiv:2410.14890·cs.AI·October 22, 2024

Reasoning, Memorization, and Fine-Tuning Language Models for Non-Cooperative Games

Yunhao Yang, Leonard Berthellemy, Ufuk Topcu

PDF

Open Access

TL;DR

This paper introduces a novel multi-agent, tree-of-thought framework combined with fine-tuning to improve language models' ability to solve complex non-cooperative games efficiently, achieving high success rates with minimal training data.

Contribution

The paper presents a new integrated approach that combines reasoning, memorization, and fine-tuning in language models for game solving, demonstrating significant performance improvements.

Findings

01

Achieved a 65% winning rate against benchmark algorithms.

02

Fine-tuning improved performance by an additional 10%.

03

Method requires only about 1000 training samples, showing high efficiency.

Abstract

We develop a method that integrates the tree of thoughts and multi-agent framework to enhance the capability of pre-trained language models in solving complex, unfamiliar games. The method decomposes game-solving into four incremental tasks -- game summarization, area selection, action extraction, and action validation -- each assigned to a specific language-model agent. By constructing a tree of thoughts, the method simulates reasoning paths and allows agents to collaboratively distill game representations and tactics, mitigating the limitations of language models in reasoning and long-term memorization. Additionally, an automated fine-tuning process further optimizes the agents' performance by ranking query-response pairs based on game outcomes, e.g., winning or losing. We apply the method to a non-cooperative game and demonstrate a 65 percent winning rate against benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multi-Agent Systems and Negotiation