TL;DR
This paper introduces a Strategy-Augmented Planning framework with a Strategy Evaluation Network to improve opponent modeling and exploitation in large language model-based agents, demonstrating significant performance gains in adversarial environments.
Contribution
The paper presents a novel two-stage SAP framework that enhances LLM-based opponent exploitation by explicitly modeling strategies and dynamically responding to unseen opponents.
Findings
SAP achieves 85.35% performance improvement over baselines.
SAP generalizes well to unseen opponent strategies.
SAP matches reinforcement learning approaches against SOTA rule-based AI.
Abstract
Efficiently modeling and exploiting opponents is a long-standing challenge in adversarial domains. Large Language Models (LLMs) trained on extensive textual data have recently demonstrated outstanding performance in general tasks, introducing new research directions for opponent modeling. Some studies primarily focus on directly using LLMs to generate decisions based on the elaborate prompt context that incorporates opponent descriptions, while these approaches are limited to scenarios where LLMs possess adequate domain expertise. To address that, we introduce a two-stage Strategy-Augmented Planning (SAP) framework that significantly enhances the opponent exploitation capabilities of LLM-based agents by utilizing a critical component, the Strategy Evaluation Network (SEN). Specifically, in the offline stage, we construct an explicit strategy space and subsequently collect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
