Toward a Metrology for Artificial Intelligence: Hidden-Rule Environments and Reinforcement Learning

Published: September 7, 2025 | arXiv ID: 2509.06213v2

By: Christo Mathew, Wentian Wang, Jacob Feldman and more

Potential Business Impact:

Teaches computers to solve puzzles by guessing rules.

Business Areas:

Artificial Intelligence, Data and Analytics, Science and Engineering, Software

We investigate reinforcement learning in the Game Of Hidden Rules (GOHR) environment, a complex puzzle in which an agent must infer and execute hidden rules to clear a 6$\times$6 board by placing game pieces into buckets. We explore two state representation strategies, namely Feature-Centric (FC) and Object-Centric (OC), and employ a Transformer-based Advantage Actor-Critic (A2C) algorithm for training. The agent has access only to partial observations and must simultaneously infer the governing rule and learn the optimal policy through experience. We evaluate our models across multiple rule-based and trial-list-based experimental setups, analyzing transfer effects and the impact of representation on learning efficiency.
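
As a rough illustration of the Advantage Actor-Critic (A2C) objective named in the abstract, the sketch below computes discounted returns, advantages, and the two loss terms for one short episode. It is a generic textbook form in plain Python, not the authors' Transformer-based implementation; the function names, the discount factor `gamma`, and the toy numbers are assumptions made for this example only.

```python
def discounted_returns(rewards, gamma=0.99, bootstrap=0.0):
    """Return G_t = r_t + gamma * G_{t+1} for each step of one episode.

    `bootstrap` is the value estimate used after the last step
    (0.0 when the episode terminated). Hypothetical helper name.
    """
    returns = []
    running = bootstrap
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def a2c_losses(log_probs, values, rewards, gamma=0.99, bootstrap=0.0):
    """Compute generic A2C policy and value losses for one episode.

    log_probs: log pi(a_t | s_t) of the actions actually taken (actor output)
    values:    V(s_t) predicted by the critic
    rewards:   r_t received from the environment
    """
    returns = discounted_returns(rewards, gamma, bootstrap)
    # Advantage: how much better the observed return was than the critic expected.
    advantages = [g - v for g, v in zip(returns, values)]
    n = len(rewards)
    # Actor: raise the log-probability of actions with positive advantage.
    policy_loss = -sum(lp * adv for lp, adv in zip(log_probs, advantages)) / n
    # Critic: mean squared error between predicted values and returns.
    value_loss = sum(adv ** 2 for adv in advantages) / n
    return policy_loss, value_loss, advantages


if __name__ == "__main__":
    # Toy episode with made-up numbers, purely for illustration.
    p_loss, v_loss, adv = a2c_losses(
        log_probs=[-1.0, -1.0, -1.0],
        values=[1.0, 0.5, 0.0],
        rewards=[1.0, 0.0, 1.0],
        gamma=0.5,
    )
    print(p_loss, v_loss, adv)
```

In a full agent the advantages would be treated as constants when differentiating the policy loss, and an entropy bonus is often added; both are omitted here to keep the sketch short.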

87%

First-Order Representation Languages for Goal-Conditioned RL

Artificial Intelligence

Teaches robots to learn from trying and failing.

22 Dec 2025 0

87%

Subgoal Graph-Augmented Planning for LLM-Guided Open-World Reinforcement Learning

Machine Learning (CS)

Helps robots follow plans by checking steps.

26 Nov 2025 0

Page Count

38 pages
