Score: 0

Cross-Entropy Games for Language Models: From Implicit Knowledge to General Capability Measures

Published: June 7, 2025 | arXiv ID: 2506.06832v2

By: Clément Hongler, Andrew Emil

Potential Business Impact:

Tests computers to see how smart they are.

Business Areas:

Casual Games Gaming

Large Language Models (LLMs) define probability measures on text. By considering the implicit knowledge question of what it means for an LLM to know such a measure and what it entails algorithmically, we are naturally led to formulate a series of tasks that go beyond generative sampling, involving forms of summarization, counterfactual thinking, anomaly detection, originality search, reverse prompting, debating, creative solving, etc. These tasks can be formulated as games based on LLM measures, which we call Cross-Entropy (Xent) Games. Xent Games can be single-player or multi-player. They involve cross-entropy scores and cross-entropy constraints, and can be expressed as simple computational graphs and programs. We show the Xent Game space is large enough to contain a wealth of interesting examples, while being constructible from basic game-theoretic consistency axioms. We then discuss how the Xent Game space can be used to measure the abilities of LLMs. This leads to the construction of Xent Game measures: finite families of Xent Games that can be used as capability benchmarks, built from a given scope, by extracting a covering measure. To address the unbounded scope problem associated with the challenge of measuring general abilities, we propose to explore the space of Xent Games in a coherent fashion, using ideas inspired by evolutionary dynamics.

GuessingGame: Measuring the Informativeness of Open-Ended Questions in Large Language Models

Computation and Language

Teaches computers to ask smart questions to guess things.

23 Sep 2025 1

87%

Measuring and Analyzing Intelligence via Contextual Uncertainty in Large Language Models using Information-Theoretic Metrics

Artificial Intelligence

Shows how AI thinks by tracking its guessing.

21 Jul 2025 1

87%

GIFT: Games as Informal Training for Generalizable LLMs

Computation and Language

Teaches computers to learn like humans by playing games.

9 Jan 2026 2

View PDF Login to Bookmark

Page Count

42 pages

Cross-Entropy Games for Language Models: From Implicit Knowledge to General Capability Measures

Tests computers to see how smart they are.

Technical Abstract

GuessingGame: Measuring the Informativeness of Open-Ended Questions in Large Language Models

Measuring and Analyzing Intelligence via Contextual Uncertainty in Large Language Models using Information-Theoretic Metrics

GIFT: Games as Informal Training for Generalizable LLMs