Navigating Human Language Models with Synthetic Agents
Philip Feldman, Antonio Bucchiarone

TL;DR
This paper explores how large language models like GPT-2 can be used as sociological tools by training them on historical chess data and analyzing the generated text to reflect human beliefs and patterns in chess play.
Contribution
The study demonstrates that GPT-2 can accurately reflect human chess strategies and create latent representations of the game, offering a novel sociological approach using synthetic agents.
Findings
Model-generated move patterns align with human chess strategies
The model accurately reconstructs the chessboard state from text
Trajectory plots reveal legal move sequences across the board
Abstract
Modern natural language models such as the GPT-2/GPT-3 contain tremendous amounts of information about human belief in a consistently testable form. If these models could be shown to accurately reflect the underlying beliefs of the human beings that produced the data used to train these models, then such models become a powerful sociological tool in ways that are distinct from traditional methods, such as interviews and surveys. In this study, We train a version of the GPT-2 on a corpora of historical chess games, and then "launch" clusters of synthetic agents into the model, using text strings to create context and orientation. We compare the trajectories contained in the text generated by the agents/model and compare that to the known ground truth of the chess board, move legality, and historical patterns of play. We find that the percentages of moves by piece using the model are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Games · Sports Analytics and Performance
MethodsLinear Layer · Cosine Annealing · Multi-Head Attention · Byte Pair Encoding · Dropout · Adam · Weight Decay · Attention Dropout · Softmax · Dense Connections
