Cross-Entropy Games for Language Models: From Implicit Knowledge to General Capability Measures
Cl\'ement Hongler, Andrew Emil

TL;DR
This paper introduces Cross-Entropy (Xent) Games as a framework to evaluate and measure the capabilities of Large Language Models through a variety of tasks beyond simple text generation, using game-theoretic principles.
Contribution
It formulates a novel class of tasks called Xent Games based on LLM probability measures, enabling new capability benchmarks and evaluation methods.
Findings
Xent Games encompass diverse tasks like summarization and debating.
They can be represented as simple computational graphs and programs.
The framework allows constructing comprehensive capability benchmarks.
Abstract
Large Language Models (LLMs) define probability measures on text. By considering the implicit knowledge question of what it means for an LLM to know such a measure and what it entails algorithmically, we are naturally led to formulate a series of tasks that go beyond generative sampling, involving forms of summarization, counterfactual thinking, anomaly detection, originality search, reverse prompting, debating, creative solving, etc. These tasks can be formulated as games based on LLM measures, which we call Cross-Entropy (Xent) Games. Xent Games can be single-player or multi-player. They involve cross-entropy scores and cross-entropy constraints, and can be expressed as simple computational graphs and programs. We show the Xent Game space is large enough to contain a wealth of interesting examples, while being constructible from basic game-theoretic consistency axioms. We then discuss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Topic Modeling · Text Readability and Simplification
